
Machine Learning Research
More Factual LLMs: FactTune, a method to fine-tune LLMs for factual accuracy without human feedback
Large language models sometimes generate false statements. New work makes them more likely to produce factual output.
Tech & Society
Benchmarks provide a scientific basis for evaluating model performance, but they don’t necessarily map well to human cognitive abilities. Facebook aims to close the gap through a dynamic benchmarking method that keeps humans in the loop.