Z.ai Launches GLM-5.2 With a Usable 1M-Token Context, Two Thinking-Effort Levels, and No Benchmarks at Launch
GLM-5.2 is the latest large language model from Z.ai, becoming the third major release in the GLM-5 line. It follows GLM-5 (February 11), GLM-5-Turbo (March 15), and GLM-5.1 (April 7). That makes four flagship-tier coding releases in roughly four months. Usable 1M-Token Context Window GLM-5.2’s standout spec is a 1,000,000-token context window. Z.ai labels the variant glm-5.2[1m] in its own configuration. Each response can return up to 131,072 output tokens. That is roughly a 5x jump from GLM-5.1’s 200,000-token window. A 1M-token window changes how a coding agent works in practice. The agent can hold an entire mid-sized repository in working memory. That includes source files, tests, configuration, and conversation history. It avoids the constant summarization that smaller windows force. The release also adds two thinking-effort levels: High and Max. Z.ai recommends Max effort for complex, multi-step coding work. In Claude Code, the /effort command contro...
