Hardware
High Gear for Llama 3.1 405B: SambaNova boosts Llama 3.1 performance with fast, free access to largest model
SambaNova raised the speed limit for access to the largest model in the Llama 3.1 family — and it’s free.
Hardware
SambaNova raised the speed limit for access to the largest model in the Llama 3.1 family — and it’s free.
Hardware
Nvidia’s latest chip promises to boost AI’s speed and energy efficiency.
Hardware
An upstart chip company dramatically accelerates pretrained large language models. Groq offers cloud access to Meta’s Llama 2 and Mistral.ai’s Mixtral at speeds an order of magnitude greater than other AI platforms. Registered users can try it.
Hardware
Huawei is emerging as an important supplier of AI chips. Amid a U.S. ban on exports of advanced chips to China, demand for Huawei’s AI chips is so intense that the company is limiting production of the chip that powers one of its most popular smartphones so it can serve the AI market.
Hardware
The AI boom is taxing power grids and pushing builders of data centers to rethink their sources of electricity.
Hardware
ChatGPT is pitching in on the assembly line. Siemens and Microsoft launched a joint pilot program of a GPT-powered model for controlling manufacturing machinery. German automotive parts manufacturer Schaeffler is testing the system in its factories, as is Siemens itself.
Hardware
A new cloud-computing company promises to provide scarce AI processing power to startups and researchers. Voltage Park, a nonprofit north of Silicon Valley, will offer processing power from 24,000 top-of-the-line Nvidia H100 graphics processing units (GPUs)...
Tech & Society
The state of California pulled the parking brake on Cruise driverless vehicles. The DMV suspended Cruise’s permit to operate vehicles in the state without safety drivers. The General Motors subsidiary responded by halting its robotaxi operations across the United States.
Hardware
Nvidia’s top-of-the-line chips are in high demand and short supply. There aren’t enough H100 graphics processing units (GPUs) to meet the crush of demand brought on by the vogue for generative AI, VentureBeat reported.
Machine Learning Research
TinyML shows promise for bringing deep learning to applications where electrical power is scarce, processing in the cloud is impractical, and/or data privacy is paramount.
Business
Chinese companies have found loopholes to sidestep United States limits on AI chips. Facing severe limits on U.S. exports of high-performance chips, Chinese AI firms are purchasing them through subsidiaries and using them through cloud services, the Financial Times reported.
Hardware
A new computing cluster delivers more bang per chip. Cerebras unveiled Andromeda, a supercomputer based on its processors. Unlike conventional clusters, the system’s processing speed rises linearly with additional processors.