Qwen2.5 is a family of open-source, dense, decoder-only large language models, including specialized versions for coding and mathematics.
Qwen2.5 is the latest iteration in the Qwen family of foundation models, building upon feedback from previous versions. This release includes the general-purpose LLMs Qwen2.5, along with specialized models: Qwen2.5-Coder for coding tasks and Qwen2.5-Math for mathematical problem-solving. These models are available in various sizes, ranging from 0.5B to 72B parameters, with most open-source variants licensed under Apache 2.0. The Qwen2.5 models are pretrained on a large-scale dataset of up to 18 trillion tokens, demonstrating improved capabilities in knowledge, coding, mathematics, instruction following, and long text generation. They support up to 128K tokens context window and can generate up to 8K tokens, with multilingual support for over 29 languages. The specialized models, Qwen2.5-Coder and Qwen2.5-Math, have undergone significant enhancements, with Qwen2.5-Coder trained on 5.5 trillion tokens of code-related data and Qwen2.5-Math incorporating various reasoning methods like Chain-of-Thought (CoT), Program-of-Thought (PoT), and Tool-Integrated Reasoning (TIR). In addition to the open-source models, Qwen offers API access to flagship models like Qwen-Plus and Qwen-Turbo through Model Studio. The models are designed for developers, supporting integration with Hugging Face Transformers, vLLM, and Ollama, including tool-calling functionalities. Benchmarking shows Qwen2.5-72B and Qwen-Plus performing competitively against other leading open-source and proprietary models.
Integrations: Hugging Face Transformers, vLLM, Ollama, Peft, ChatLearn, Llama-Factory, Axolotl, Firefly, Swift, XTuner
Platforms: Web
View full Qwen2.5 profile on Tools-Radar | Browse Coding & Developer Tools tools | Alternatives to Qwen2.5
Tools-Radar is a free directory of 10,000+ AI tools — discover, compare, and choose the right AI software for your needs. Visit tools-radar.com