ModelBest Open-Sources the First End-Side Efficient Large Model Trained on a Domestic Computing Platform technology introduction_Technology application

ModelBest Open-Sources the First End-Side Efficient Large Model Trained on a Domestic Computing Platform

2026-05-25 17:14

Favorite

Recently, ModelBest, in collaboration with Tsinghua University and the OpenBMB open-source community, officially released and open-sourced its latest achievement in low-bit large model training—BitCPM-CANN. This is the first ternary (1.58-bit) large model to be fully end-to-end trained and open-sourced based on a domestic computing platform (Huawei Ascend).

For a long time, the physical bottleneck of memory has been a challenge for the large-scale application of large models, with memory becoming one of the most strained resources in the global AI supply chain. Against this backdrop, BitCPM-CANN adopts a quantization-aware training approach, forcing every bit to deliver maximum information density and knowledge-carrying efficiency. Meanwhile, the 6x memory optimization provided by BitCPM-CANN allows enterprises to enhance model capabilities or service density without increasing physical memory.

It is worth noting that from the most fundamental quantization operators and quantization-aware training algorithms to the complete parallel strategy and training framework, the entire training pipeline of BitCPM-CANN was natively completed on Huawei Ascend. It includes four model sizes—0.5B, 1B, 3B, and 8B—and when evaluated item-by-item against the same-sized MiniCPM-4 full-precision family, it demonstrated excellent performance. This is the first publicly available result on the Ascend platform to complete end-to-end 1.58-bit training with a full-precision comparative evaluation, and the model scale has been advanced to the 8B level in one go.

Industry insiders believe that the release and open-sourcing of BitCPM-CANN achieves a complete closed loop of domestic NPUs, domestic models, and a domestic training framework, providing a directly usable low-bit model solution for the end-side AI industry.

China

Information and Communication

This bulletin is compiled and reposted from information of global Internet and strategic partners, aiming to provide communication for readers. If there is any infringement or other issues, please inform us in time. We will make modifications or deletions accordingly. Unauthorized reproduction of this article is strictly prohibited. Email: news@wedoany.com

Previous：World's First Intelligent Transmission Electron Microscope System "Yuanyan-1" Successfully Developed

Next：Full-Power + End-Cloud Collaborative Autonomous Driving: China's SANY Complete Mining Equipment Sets Sail Globally in Bulk, Reshaping the Competitive Landscape of the Global Mining Equipment Industry