GLM-4.7 Flash

The ultra-fast "Agent-Loop" engine for high-volume automation.

About the Model

GLM-4.7 Flash is the lightweight, high-speed variant of Z.ai's 4.7 series. It is engineered for "Action-First" scenarios where a model needs to make hundreds of small decisions per minute. It is one of the most affordable models in the 2026 market, making it the favorite for "Agent Swarms" where dozens of instances run in parallel.

Model Key Capabilities

  • Interleaved Thinking:

    Can output its reasoning steps while performing tasks without a major speed penalty.


  • Bilingual Optimization:

    Optimized for 0.75 token-to-word ratio in English and 1.5 in Chinese.


  • Agentic Tool-Use:

    Specifically tuned for repetitive "Search-and-Extract" workflows.


  • Extreme Low Latency:

    Designed for real-time chat and interactive UI components.

Applications & Use Cases

  • Real-time Data Entry:

    Processing thousands of invoices into databases.


  • Massive Web Scrapers:

    Summarizing hundreds of search results in parallel.


  • Bilingual Customer Support:

    Instant, context-aware translation and support in English/Mandarin.

Recomended Models based on your needs

Qwen (DeepMask)

Versatile model with reasoning and tool use. Strong at document and image analysis & multilingual chat.

Qwen (DeepMask)

Versatile model with reasoning and tool use. Strong at document and image analysis & multilingual chat.

Qwen3 (StackIT)

Versatile model with reasoning and tool use. Strong at document and image analysis and multilingual chat.

Qwen3 (StackIT)

Versatile model with reasoning and tool use. Strong at document and image analysis and multilingual chat.

Kimi K2 (DeepMask)

Best for deep reasoning and tool use. Ideal for long, multi-step tasks and document analysis.

Kimi K2 (DeepMask)

Best for deep reasoning and tool use. Ideal for long, multi-step tasks and document analysis.

Model Specifications

General


Model Provider

Z.ai

Main Use Cases

Real-time Agents Local UI Gen High-Speed Translation

Intelligence


Reasoning Effort

Standard

GPQA Diamond

58.1%
Memory

Max Context

203K Tokens
Speed

Latency (TTFT)

0.59s

Throughput

91 Tokens/Sec

Find the Smarter Way to Work With AI

One workspace for all leading AI models. Think faster. Create smarter.

Haiku 4.5

New Chat

Chats

Projects

Recents

Show

Jonas has joined!

How can I help you today?

AI can make mistakes. Please double-check responses.

Models

Qwen (DeepMask)

Kimi K2 (DeepMask)

GPT-OSS 120B (Stack IT)

Haiku 4.5

Gemma 3 27B (Stack IT)

Gemini 2.2 Flash

Gemini 2.5 Flash

GPT-4o

GPT-4.1

Mistral large 2.1

DeepSeek V3

GPT-5.3

Opus 4.5

Sonnet 4.5

GPT-o3 Mini

Grok 3 Mini

Grok 4 Fast

Haiku 4.5

New Chat

Chats

Projects

AI Automation Product

Summer Campaign Research

PR Project Agents

Blog Post Daily Content

Ads Banners on Main Lander

Recents

Show

Jonas Müller

Paid plan

Models

Qwen (DeepMask)

Kimi K2 (DeepMask)

Qwen3 (Stack IT)

GPT 5.2

GPT-OSS 120B (Stack IT)

Haiku 4.5

Gemma 3 27B (Stack IT)

Gemini 2.0 Flash

Gemini 2.5 Flash

GPT-4o

GPT-4.1

Mistral large 2.1

DeepSeek V3

GPT-5.3

Opus 4.5

Sonnet 4.5

GPT-o3 Mini

Grok 3 Mini

Grok 4 Fast

Jonas has joined!

How can I help you today?

AI can make mistakes. Please double-check responses.

Find the Smarter Way to Work With AI

One workspace for all leading AI models. Think faster. Create smarter.

Haiku 4.5

New Chat

Chats

Projects

Recents

Show

Jonas has joined!

How can I help you today?

AI can make mistakes. Please double-check responses.

Models

Qwen (DeepMask)

Kimi K2 (DeepMask)

GPT-OSS 120B (Stack IT)

Haiku 4.5

Gemma 3 27B (Stack IT)

Gemini 2.2 Flash

Gemini 2.5 Flash

GPT-4o

GPT-4.1

Mistral large 2.1

DeepSeek V3

GPT-5.3

Opus 4.5

Sonnet 4.5

GPT-o3 Mini

Grok 3 Mini

Grok 4 Fast

Haiku 4.5

New Chat

Chats

Projects

AI Automation Product

Summer Campaign Research

PR Project Agents

Blog Post Daily Content

Ads Banners on Main Lander

Recents

Show

Jonas Müller

Paid plan

Models

Qwen (DeepMask)

Kimi K2 (DeepMask)

Qwen3 (Stack IT)

GPT 5.2

GPT-OSS 120B (Stack IT)

Haiku 4.5

Gemma 3 27B (Stack IT)

Gemini 2.0 Flash

Gemini 2.5 Flash

GPT-4o

GPT-4.1

Mistral large 2.1

DeepSeek V3

GPT-5.3

Opus 4.5

Sonnet 4.5

GPT-o3 Mini

Grok 3 Mini

Grok 4 Fast

Jonas has joined!

How can I help you today?

AI can make mistakes. Please double-check responses.