Alibaba’s reaction to DeepSeek is Qwen 2.5-Max, the corporate’s newest Aggregate-of-Mavens (MoE) large-scale type.
Qwen 2.5-Max boasts pretraining on over 20 trillion tokens and fine-tuning via state-of-the-art ways like Supervised High quality-Tuning (SFT) and Reinforcement Finding out from Human Comments (RLHF).
With the API now to be had via Alibaba Cloud and the type available for exploration by way of Qwen Chat, the Chinese language tech large is inviting builders and researchers to peer its breakthroughs firsthand.
Outperforming friends
When evaluating Qwen 2.5-Max’s efficiency towards one of the crucial maximum distinguished AI fashions on plenty of benchmarks, the consequences are promising.
Reviews integrated fashionable metrics just like the MMLU-Professional for college-level problem-solving, LiveCodeBench for coding experience, LiveBench for total functions, and Enviornment-Exhausting for assessing fashions towards human personal tastes.
Consistent with Alibaba, “Qwen 2.5-Max outperforms DeepSeek V3 in benchmarks equivalent to Enviornment-Exhausting, LiveBench, LiveCodeBench, and GPQA-Diamond, whilst additionally demonstrating aggressive leads to different checks, together with MMLU-Professional.”

The instruct type – designed for downstream duties like chat and coding – competes without delay with main fashions equivalent to GPT-4o, Claude-3.5-Sonnet, and DeepSeek V3. Amongst those, Qwen 2.5-Max controlled to outperform competitors in different key spaces.
Comparisons of base fashions additionally yielded promising results. Whilst proprietary fashions like GPT-4o and Claude-3.5-Sonnet remained out of achieve because of get entry to restrictions, Qwen 2.5-Max used to be assessed towards main public choices equivalent to DeepSeek V3, Llama-3.1-405B (the biggest open-weight dense type), and Qwen2.5-72B. Once more, Alibaba’s newcomer demonstrated outstanding efficiency around the board.
“Our base fashions have demonstrated vital benefits throughout maximum benchmarks,” Alibaba said, “and we’re positive that developments in post-training ways will carry the following model of Qwen 2.5-Max to new heights.”
Making Qwen 2.5-Max available
To make the type extra available to the worldwide group, Alibaba has built-in Qwen 2.5-Max with its Qwen Chat platform, the place customers can engage without delay with the type in quite a lot of capacities—whether or not exploring its seek functions or checking out its working out of advanced queries.
For builders, the Qwen 2.5-Max API is now to be had via Alibaba Cloud below the type identify “qwen-max-2025-01-25”. customers can get began by means of registering an Alibaba Cloud account, activating the Type Studio provider, and producing an API key.
The API is even suitable with OpenAI’s ecosystem, making integration simple for present initiatives and workflows. This compatibility lowers the barrier for the ones keen to check their packages with the type’s functions.
Alibaba has made a robust remark of intent with Qwen 2.5-Max. The corporate’s ongoing dedication to scaling AI fashions isn’t just about bettering efficiency benchmarks but additionally about improving the basic considering and reasoning talents of those techniques.
“The scaling of knowledge and type measurement no longer simplest showcases developments in type intelligence but additionally displays our unwavering dedication to pioneering analysis,” Alibaba famous.
Having a look forward, the crew objectives to push the bounds of reinforcement studying to foster much more complex reasoning talents. This, they are saying, may just allow their fashions not to simplest fit however surpass human intelligence in fixing intricate issues.
The consequences for the business might be profound. As scaling strategies strengthen and Qwen fashions ruin new floor, we’re more likely to see additional ripples throughout AI-driven fields globally that we’ve noticed in contemporary weeks.
(Photograph by means of Maico Amorim)
See additionally: ChatGPT Gov objectives to modernise US govt businesses

Need to be told extra about AI and massive information from business leaders? Take a look at AI & Large Information Expo going down in Amsterdam, California, and London. The great match is co-located with different main occasions together with Clever Automation Convention, BlockX, Virtual Transformation Week, and Cyber Safety & Cloud Expo.
Discover different upcoming endeavor era occasions and webinars powered by means of TechForge right here.
ai,alibaba,synthetic intelligence,fashions,qwen,qwen 2.5
Supply hyperlink