Machine Learning Research
Getting the Facts Right: A memory method that reduces hallucinations in LLMs
Large language models that remember more hallucinate less.
Machine Learning Research
Large language models that remember more hallucinate less.
Machine Learning Research
A new model improves on recent progress in generating interactive virtual worlds from still images.
Machine Learning Research
OpenAI launched not only its highly anticipated o1 model but also an operating mode that enables the model to deliver higher performance — at a hefty price.
Machine Learning Research
Jailbreak prompts can prod a large language model (LLM) to overstep built-in boundaries, leading it to do things like respond to queries it was trained to refuse to answer. Researchers devised a way to further boost the probability that LLMs will respond in ways that respect such limits.
Machine Learning Research
Mistral AI unveiled Pixtral Large, which rivals top models at processing combinations of text and images.
Hardware
An open source model is designed to perform sophisticated object detection on edge devices like phones, cars, medical equipment, and smart doorbells.
Machine Learning Research
An up-and-coming Hangzhou AI lab unveiled a model that implements run-time reasoning similar to OpenAI o1 and delivers competitive performance. Unlike o1, it displays its reasoning steps.
Machine Learning Research
Researchers cut the processing required to train transformers by around 20 percent with only a slight degradation in performance.
Machine Learning Research
A real-time video generator lets you explore an open-ended, interactive virtual world — a video game without a game engine.
Machine Learning Research
Builders of large AI models have relied on the idea that bigger neural networks trained on more data and given more processing power would show steady improvements. Recent developments are challenging that idea.
Machine Learning Research
An open source package inspired by the commercial agentic code generator Devin aims to automate computer programming and more.
Machine Learning Research
A new open source large language model outperforms competitors, including the open-weights Llama 3.1 405B, on a variety of benchmarks.