Blog

TiānshūBench (天书Bench) 0.0.1-mini Results Are Here: GPT-5, Claude Opus 4.1!

TiānshūBench 0.0.1 Mini Results



Benchmark Fury: TiānshūBench 0.0.1-mini vs Claude Opus 4.1, GPT-5, Kimi K2, and More

It's been an exciting couple of weeks, with new models released by big players like Anthropic and OpenAI, as well as open-weight models from the scrappy challengers Moonshot AI and Alibaba.



But Can It Think? A Quick Look at Kimi K2

Heavenly Robot



Challenging Nomad: TiānshūBench Experimental Release 0.0.Y

Arguing With AI



Open The Pod Bay Doors: TiānshūBench Intermediate Release 0.0.X

Mysterious AI Compute



Introducing TiānshūBench (天书Bench)

TiānshūBench Logo
