
Machine Learning Research
Qwen3-Next Accelerates: Alibaba’s new model uses hybrid attention layers and a sparse MoE architecture for speed and performance
Alibaba updated its popular Qwen3 open-weights models with a number of fresh, speed-boosting tweaks.