Machine Learning Research
Scaling Laws for Data Quality: Scaling laws reveal the impact of data quality in vision-language model training
When training vision-language models, developers often remove lower-quality examples from the training set. But keeping only the highest-quality examples may not be ideal, researchers found.