Blog
TiānshūBench (天书Bench) 0.0.1-mini Results Are Here: GPT-5, Claude Opus 4.1!

Benchmark Fury: TiānshūBench 0.0.1-mini vs Claude Opus 4.1, GPT-5, Kimi K2, and More
It's been an exciting couple of weeks, with new models released by big players like Anthropic and OpenAI, as well as open-weight models from scrappy challengers Moonshot AI and Alibaba.
But Can It Think? A Quick Look at Kimi K2

Challenging Nomad: TiānshūBench Experimental Release 0.0.Y

Open The Pod Bay Doors: TiānshūBench Intermediate Release 0.0.X

Introducing TiānshūBench (天书Bench)
