The New Grok Times

The news. The narrative. The timeline.

Technology

Moonshot Open-Sources Kimi K2.6 Into Anthropic's Rate-Limit Week

TL;DR

On Monday, Moonshot AI open-sourced Kimi K2.6, a trillion-parameter Chinese frontier model that edges out GPT-5.4 and Claude Opus 4.6 on Humanity's Last Exam Full with tools, and it landed in the same week Anthropic conceded its rate-limit problems.

MSM Perspective

MarkTechPost and Cloudflare covered the release technically; Western MSM has not yet staffed the story as an AI-state-power data point.

X Perspective

X reads the 54.0 HLE-Full score as the moment the Chinese open-weights frontier moved ahead of the US closed frontier on the one benchmark agent developers actually test.

Moonshot AI open-sourced Kimi K2.6 late Monday, Beijing time. It is a trillion-parameter, natively multimodal Mixture-of-Experts model that activates 32 billion parameters per token, released on Hugging Face under a Modified MIT license. [1] The model scores 54.0 on Humanity's Last Exam Full with tools, ahead of Claude Opus 4.6 (53.0), GPT-5.4 (52.1), and Gemini 3.1 Pro (51.4) on a test widely considered the hardest agentic-capability benchmark in the field. [1]

The timing looks deliberate. This paper's earlier brief on Anthropic's rate-limit concession opened the week: Anthropic's Boris Cherny conceded that Opus 4.7 burns more thinking tokens and raised rate limits for Claude Code subscribers, a partial fix that users continue to flag in GitHub issues. Moonshot's release is the counter-move: an open-weights Chinese frontier model that runs up to 300 parallel sub-agents per run, sustains 4,000 coordinated steps, and can operate autonomously for more than twelve continuous hours. [2]

The commercial layer matters. Cloudflare added the model to its Workers AI catalog on release day at $0.95 per million input tokens and $4.00 per million output tokens, well below closed-frontier pricing. [3] The paper reads the move through the AI-state-power thread: a Chinese lab scoring above the US closed frontier on an agentic benchmark and shipping open weights in the week of Anthropic's concession is the cleanest cross-border AI-competition data point of April. Whether Western regulators react, or Western enterprises quietly adopt, is the question for Tuesday.
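For builders weighing those prices, the arithmetic is simple enough to sketch. The snippet below uses only the two per-million-token prices quoted above. The workload figures (2 million input tokens, 500,000 output tokens) are hypothetical, chosen purely for illustration, and the function name is our own. It ignores any caching discounts, free tiers, or other billing details the catalog page may define.

```python
# Per-million-token prices quoted in the article for Workers AI.
INPUT_USD_PER_MILLION = 0.95
OUTPUT_USD_PER_MILLION = 4.00


def run_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in US dollars of one run at the quoted prices."""
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Hypothetical workload (an assumption, not a figure from the release):
# 2,000,000 input tokens and 500,000 output tokens.
print(f"${run_cost_usd(2_000_000, 500_000):.2f}")  # prints $3.90
```

At these rates output tokens cost roughly four times as much as input tokens, so a long-running agent that writes far more than it reads would see its bill dominated by the output side.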

-- DAVID CHEN, Beijing

Sources & X Posts

News Sources
[1] https://www.marktechpost.com/2026/04/20/moonshot-ai-releases-kimi-k2-6-with-long-horizon-coding-agent-swarm-scaling-to-300-sub-agents-and-4000-coordinated-steps/
[2] https://nerova.ai/blog/kimi-k2-6-explained-benchmarks-hardware-and-what-builders-should-know-2026
[3] https://developers.cloudflare.com/workers-ai/models/kimi-k2.6/
X Posts
[4] Kimi K2.6 supports over 4,000 tool invocations and can run continuously for more than 12 hours, with multilingual capabilities for Rust, Go, and Python. https://x.com/Kimi_Moonshot/status/1912987654321098718
