LATEST MODEL

DeepSeek-V4-Flash

DeepSeek ⚡ The Runner Released April 2026

DeepSeek's smaller, fast variant of V4 — same architecture at a fraction of the cost and latency

DeepSeek-V4-Flash

DeepSeekApril 2026

Latest

Training Data

Up to early 2026

DeepSeek-V4-Flash

April 2026

Parameters

284 billion (13B active)

Training Method

MoE with hybrid attention, Muon optimizer

Context Window

1,000,000 tokens

Knowledge Cutoff

Not disclosed

Key Features

1M Context • Sparse MoE (13B Active) • Aggressive Pricing • Open Weights (MIT)

Capabilities

Speed: Outstanding

Cost Efficiency: Outstanding

Reasoning: Excellent

What's New in This Version

Most affordable model at frontier-comparable quality; 1M context retained from V4-Pro at ~5× lower cost ($0.14 in / $0.28 out per MTok)

DeepSeek's smaller, fast variant of V4 — same architecture at a fraction of the cost and latency

What's New in This Version

Most affordable model at frontier-comparable quality; 1M context retained from V4-Pro at ~5× lower cost ($0.14 in / $0.28 out per MTok)

Technical Specifications

Parameters 284 billion (13B active)
Context Window 1,000,000 tokens
Training Method MoE with hybrid attention, Muon optimizer
Knowledge Cutoff Not disclosed
Training Data Up to early 2026

Key Features

1M Context Sparse MoE (13B Active) Aggressive Pricing Open Weights (MIT)

Capabilities

Speed: Outstanding
Cost Efficiency: Outstanding
Reasoning: Excellent
Theme
Language
Support
© funclosure 2025