🇨🇳 CHINESE INNOVATION

Meituan Releases LongCat-2.0: Trillion-Parameter MoE Model Trained on Chinese ASIC Cluster

Hussein Harby By Hussein Harby July 1, 2026 at 02:00 GMT+3 8 min read
Glowing data circuit cat silhouette above silicon chip arrays representing domestic pre-training clusters

Table of Contents

1. Introduction: Trillion-Parameter Breakthrough

The global race for artificial intelligence supremacy has reached a new technological milestone. Beijing-based tech giant **Meituan** has officially released **LongCat-2.0**, a massive **1.6-trillion-parameter** open-source Mixture-of-Experts (MoE) language model. What makes this launch historic is not just its scale, but the announcement that the model completed its entire pre-training and optimization cycles exclusively on domestic Chinese AI ASIC hardware.

This release occurs amid huge infrastructure expansions globally, such as Alphabet's 84.75 billion dollar capital raise, Globant and Anthropic's AI Pods consulting rollout, and corporate metrics showing AI-driven headcount growth. At the same time, companies are navigating complex layers like Trust3 AI's security layer for NVIDIA NeMo. The launch of LongCat-2.0 proves that the Chinese AI industry is successfully establishing a parallel, self-sufficient computing supply chain.

2. LongCat-2.0 Technical Specifications

LongCat-2.0 utilizes a Sparse Mixture-of-Experts (MoE) architecture. While the model contains 1.6 trillion total parameters, only **48 billion parameters are activated per token** (ranging dynamically between 33B and 56B). This routing strategy enables high computational efficiency, providing the reasoning power of a trillion-parameter system with the speed and inference costs of a much smaller model. The model is released under the permissive **MIT License**, encouraging developers globally to audit and build upon its weights.

3. Pre-Training on a 50,000-Chip Domestic Chinese Cluster

The most significant achievement of the LongCat-2.0 project is its training environment. Meituan confirmed that the model was pre-trained from scratch on a **50,000-chip domestic AI ASIC cluster**. Historically, while some Chinese firms deployed domestic chips for inference or fine-tuning, trillion-parameter models still relied on imported GPU systems for full-process pre-training. LongCat-2.0 marks the first confirmed case of a trillion-parameter model completing its entire training lifecycle on local compute nodes, showcasing China's hardware resilience.

4. Sparse Attention and Context Window Breakthroughs

Managing long context lengths natively is traditionally memory-prohibitive. LongCat-2.0 natively supports a **1 million-token context window**. To achieve this without quadratic computational cost, Meituan introduced **LongCat Sparse Attention (LSA)**. LSA maps context relationships linearly, letting the model process massive document blocks, books, or entire code repositories in a single prompt without performance degradation.

5. The Geopolitical and Infrastructure Implications

The following table compares Meituan's LongCat-2.0 with other prominent trillion-parameter models in the industry:

Model Name Total Parameters Native Context Window Hardware Infrastructure Used
LongCat-2.0 (Meituan) 1.6 Trillion (MoE) 1,000,000 Tokens Domestic Chinese AI ASIC Cluster (50,000 Chips)
GPT-5.6 Sol (OpenAI) Estimated 2.0+ Trillion 128,000 Tokens Hyperscale Nvidia H100/B200 Clusters
Mixtral 8x22B (Mistral AI) 141 Billion (MoE) 64,000 Tokens Standard Enterprise Cloud Clusters

6. Frequently Asked Questions (FAQ)

Q: What is Meituan LongCat-2.0?

A: It is an open-source, 1.6-trillion-parameter Mixture-of-Experts (MoE) AI model developed and released by Beijing-based Meituan.

Q: Why is the hardware used for training significant?

A: It is the first trillion-parameter model to complete its full pre-training and inference lifecycle entirely on a domestic Chinese AI ASIC cluster, showing self-sufficiency from foreign GPUs.

Q: Is LongCat-2.0 open source?

A: Yes. The model weights and source code are released under the open-source MIT License, allowing modification, sharing, and commercial use.

📝 Editor's Opinion: Hussein Harby

"The release of LongCat-2.0 is a turning point. It debunks the theory that China cannot train frontier-class trillion-parameter models due to semiconductor trade bans. By building a custom sparse attention framework (LSA) and successfully coordinating a massive 50,000-chip local ASIC array, Meituan has delivered a model that is competitive on global coding and coding benchmarks under an open MIT license. This will accelerate AI adoption globally while demonstrating the power of hardware diversification."

Related Articles