ainewsblitz.com

Breaking

Cerebras Brings Gemma 4 31B Online at Over 1,500 Tokens Per Second

  • Foundation Models
  • Infra & Chips
  • AI Agents

Cerebras Systems has begun serving Google DeepMind's open-weight model Gemma 4 31B on its inference platform at over 1,500 tokens per second, marking the company's first multimodal model.

Continue reading

The rest of this article is for AI News Blitz readers. Choose an option below to keep reading.

$20
Read this article
$29/month
Unlimited — all 2,629 articles, the full archive, and comprehension quizzes
Save 72%
$98/year
≈ $8.17/month
Unlimited, billed once a year