
Machine Learning Research
DeepSeek Sharpens Its Reasoning: DeepSeek-R1, an affordable rival to OpenAI’s o1
A new open model rivals OpenAI’s o1, and it’s free to use or modify.
Data Points
Codestral matches or beats mid-sized fill-in-the-middle models. MiniMax’s “Lightning Attention” tries to improve on the transformer. Co-STORM now available for AI collaboration on encyclopedia articles. LlamaIndex uses agentic RAG to retrieve info from complex documents.
Data Points
Moondream’s lightweight vision model adds gaze detection. Fine-tuning Flux Pro’s generative model with just a few images. Copilot Chat brings simple agents to Microsoft 365. Google adds Gemini to all its Workspace plans.
Letters
Writing software, especially prototypes, is becoming cheaper. This will lead to increased demand for people who can decide what to build. AI Product Management has a bright future!
Machine Learning Research
Contrastive loss functions make it possible to produce good embeddings without labeled data. A twist on this idea makes even more useful embeddings.
Hardware
Nvidia’s new desktop computer is built specifically to run large AI models.
Tech & Society
The United States proposed limits on exports of AI technology that would dramatically expand previous restrictions, creating a new international hierarchy for access to advanced chips and models.
Machine Learning Research
A new model from Hangzhou upstart DeepSeek delivers outstanding performance and may change the equation for training costs.
Data Points
Court filings show Meta pirated model training data. Stability’s SPAR3D speeds up 3D image generation. How robots aid nursing care workers in Japan. Deliberative alignment uses more compute to ensure safety.
Data Points
AI careers remain just as hot as you might expect. Columbia’s GET model predicts gene expression. Cohere’s North brings easy and secure automation to enterprise. Meta pauses older AI characters but will introduce new ones this year.
Letters
Using AI-assisted coding to build software prototypes is an important way to quickly explore many ideas and invent new things.