
Business
The Geopolitics of GPUs: US Bans Nvidia and AMD Chip Sales to China
The ban impacts Nvidia's A100 and H100 chips, and AMD's MI250.
Business
The ban impacts Nvidia's A100 and H100 chips, and AMD's MI250.
Hardware
Is your colossal text generator bogged down in training? Nvidia announced a chip designed to accelerate the transformer architecture, the basis of large language models such as GPT-3.
Interviews & Essays
I believe that natural language processing in 2022 will re-embrace symbolic reasoning, harmonizing it with the statistical operation of modern neural networks. Let me explain what I mean by this.
Tech & Society
The trend toward ever-larger models crossed the threshold from immense to ginormous. Google kicked off 2021 with Switch Transformer, the first published work to exceed a trillion parameters, weighing in at 1.6 trillion.
Machine Learning Research
DeepMind released three papers that push the boundaries — and examine the issues — of large language models.
Tech & Society
Chinese researchers for the first time swept a competition to develop AI systems that monitor urban traffic. Chinese universities and companies won first and second place place in all five categories of the 2021 AI City Challenge.
Machine Learning Research
In some animated games, different characters can perform the same actions — say, walking, jumping, or casting spells. A new system learned from unlabeled data to transfer such motions from one character to another.
Tech & Society
How much processing power do various nations have on hand to drive their AI strategy? An international trade group aims to find out. The Organisation for Economic Co-operation and Development (OECD) is launching an effort to measure the computing capacity available in countries around the world.
Machine Learning Research
Large transformer networks work wonders with natural language, but they require enormous amounts of computation. New research slashes processor cycles without compromising performance.
Business
In this work-from-home era, who hasn’t spent a video conference wishing they could read an onscreen document without turning their eyes from the person they’re talking with? Or simply hoping the stream wouldn’t stutter or stall? Deep learning can fill in the missing pieces.
Machine Learning Research
Trained on a small dataset, generative adversarial networks (GANs) tend to generate either replicas of the training data or noisy output. A new method spurs them to produce satisfying variations.
Machine Learning Research
Recognizing actions performed in a video requires understanding each frame and relationships between the frames. Previous research devised a way to analyze individual images efficiently known as Active Shift Layer (ASL). New research extends this technique to the steady march of video frames.