Alibaba unveils Qwen3, challenges industry leaders

April 29, 2025 08:29 AM AEST | By Investing
 Alibaba unveils Qwen3, challenges industry leaders
Alibaba unveils Qwen3, challenges industry leaders

Investing.com -- Alibaba Group Holdings Ltd ADR (NYSE:BABA) introduced its next-generation large language model series, Qwen3, on Tuesday, expanding its AI offerings across a range of model sizes and architectures. The release includes eight open-weight models, six dense and two mixture-of-experts (MoE), ranging from 0.6 billion to 235 billion parameters.

The flagship, Qwen3-235B-A22B, has demonstrated competitive performance in benchmarks across coding, mathematics, and general tasks when compared to leading models such as DeepSeek-R1, Grok-3, and Gemini-2.5-Pro. Smaller models like Qwen3-30B-A3B also outpaced more parameter-intensive models, indicating efficiency gains in structure and training.

All models—including pre-trained and post-trained variants—are publicly accessible via Hugging Face, ModelScope, and Kaggle. For deployment, Alibaba recommends SGLang and vLLM, while local users can run Qwen3 using tools like LMStudio, llama.cpp, and KTransformers.

Qwen3 offers scalable and adaptive performance, letting users tailor computational reasoning budgets to balance accuracy and resource cost. This flexibility aims to meet the increasingly diverse demands of developers integrating AI into consumer or enterprise-level workflows.

The models support 119 languages and dialects, tripling the coverage of their predecessor, Qwen2.5. This broad multilingual capability positions Qwen3 for adoption in global markets, including emerging regions with rich linguistic diversity.

Qwen3 models exhibit advances in coding and agentic functions, enhanced with deeper integration for model-conditional prompting (MCP). These refinements support sophisticated applications, such as autonomous agents and developer tooling with higher precision.

The series is trained on 36 trillion tokens, including high-quality sources from STEM, reasoning, books, and synthetic datasets. The data upgrade contributes to notable gains in language understanding, programming proficiency, and long-context memory.

Qwen3 employs architectural and training innovations such as qk layernorm and global-batch load balancing for MoE models. This leads to greater training stability and consistent performance improvements across model scales.

Its three-stage pretraining approach targets language comprehension, reasoning, and long-context processing separately, with token sequences extended up to 32,000. This modular strategy enhances Qwen3’s ability to handle complex, multi-turn interactions and larger documents.

With optimized hyperparameters guided by scaling laws for each model type, Qwen3 represents Alibaba’s most deliberate and technically comprehensive release to date. Industry observers say its open-weight strategy and multilingual reach could make it a significant contender in the global AI race.

This article first appeared in Investing.com


Disclaimer

The content, including but not limited to any articles, news, quotes, information, data, text, reports, ratings, opinions, images, photos, graphics, graphs, charts, animations and video (Content) is a service of Kalkine Media Pty Ltd (“Kalkine Media, we or us”), ACN 629 651 672 and is available for personal and non-commercial use only. The principal purpose of the Content is to educate and inform. The Content does not contain or imply any recommendation or opinion intended to influence your financial decisions and must not be relied upon by you as such. Some of the Content on this website may be sponsored/non-sponsored, as applicable, but is NOT a solicitation or recommendation to buy, sell or hold the stocks of the company(s) or engage in any investment activity under discussion. Kalkine Media is neither licensed nor qualified to provide investment advice through this platform. Users should make their own enquiries about any investments and Kalkine Media strongly suggests the users to seek advice from a financial adviser, stockbroker or other professional (including taxation and legal advice), as necessary.
The content published on Kalkine Media also includes feeds sourced from third-party providers. Kalkine does not assert any ownership rights over the content provided by these third-party sources. The inclusion of such feeds on the Website is for informational purposes only. Kalkine does not guarantee the accuracy, completeness, or reliability of the content obtained from third-party feeds. Furthermore, Kalkine Media shall not be held liable for any errors, omissions, or inaccuracies in the content obtained from third-party feeds, nor for any damages or losses arising from the use of such content.
Kalkine Media hereby disclaims any and all the liabilities to any user for any direct, indirect, implied, punitive, special, incidental or other consequential damages arising from any use of the Content on this website, which is provided without warranties. The views expressed in the Content by the guests, if any, are their own and do not necessarily represent the views or opinions of Kalkine Media. Some of the images/music that may be used on this website are copyrighted to their respective owner(s). Kalkine Media does not claim ownership of any of the pictures displayed/music used on this website unless stated otherwise. The images/music that may be used on this website are taken from various sources on the internet, including paid subscriptions or are believed to be in public domain. We have made reasonable efforts to accredit the source wherever it was indicated as or found to be necessary.
This disclaimer is subject to change without notice. Users are advised to review this disclaimer periodically for any updates or modifications.


AU_advertise

Advertise your brand on Kalkine Media

Sponsored Articles


Investing Ideas

Previous Next
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.