Archive2026.07.04

AI Industry Daily News

A roundup of the AI industry's day, centered on Codex Windows support, grok-build-0.1, Claude Opus 4.8, Command A+, and Rosalind Biodefense.

Today's highlights

Anthropic / Claude

Anthropic Redeploys Claude Fable 5 / Mythos 5 After Export Clearance, Ships New Sonnet 5

Anthropic's new model lineup was redeployed following U.S. regulatory approval, alongside an official new version of Claude Sonnet 5.

OpenAI / Product

OpenAI Begins 12-Day Run of New Features and Demos

OpenAI launched a campaign to release new features and demonstrations continuously over 12 days.

Google DeepMind / Weather

Google DeepMind Releases Weather Forecasting Model It Says Exceeds Existing Systems

Google DeepMind announced a high-accuracy weather prediction model that it says outperforms existing systems.

Vidu / Video

Vidu S1 Adds Voice-Driven Real-Time Character Video Generation

Vidu S1 lets users control an avatar's expressions, gestures, and actions in real time using voice input.

Together AI / Open Models

Together AI Says GLM 5.2 Reaches 80% of Sonnet 5 on DeepSWE at 20% of the Cost

Together AI compared GLM 5.2 with Sonnet 5 on software engineering tasks and highlighted open-model economics and data sovereignty.

Key topics and reactions

Anthropic / Claude

Anthropic Redeploys Claude Fable 5 / Mythos 5 After Export Clearance, Ships New Sonnet 5

Anthropic redeployed its Fable 5 and Mythos 5 models after receiving clearance from U.S. authorities and the lifting of export restrictions, and formally released a new version of Sonnet 5. The company also raised Claude Platform API rate limits for all users, removing usage-tier scaling; at the top tier, limits for the latest Sonnet and Haiku are now five times higher.

Developers report using the models across full-project coding, game development, and creative workflows, including generating an ink-wash-style calligraphy defense game with roguelike elements from a PRD in one shot and running automated Playwright tests and bug fixes. Andrej Karpathy described the release as a major qualitative jump since Claude 4.5.

Some limitations were noted: older prompting patterns can backfire, the model sometimes adds unrequested features on its own initiative, token consumption runs nearly double that of Opus, and security-related tasks fall back to Opus 4.8. Users also cited difficulty timing interruptions in long sessions and lengthy official playbooks. Score comparisons before and after the redeployment were published in response to community requests.

OpenAI / Product

OpenAI Begins 12-Day Run of New Features and Demos

OpenAI began a campaign to unveil new features and demos across 12 consecutive days, signaling further expansion of its generative AI product line.

The rollout is expected to introduce additional capabilities and product updates over the course of the run.

Google DeepMind / Weather

Google DeepMind Releases Weather Forecasting Model It Says Exceeds Existing Systems

Google DeepMind unveiled an AI weather forecasting model that the company says surpasses the accuracy of existing systems, drawing attention as an applied use case for AI in a practical domain.

The announcement adds to a growing set of AI applications aimed at real-world operational tasks.

Vidu / Video

Vidu S1 Adds Voice-Driven Real-Time Character Video Generation

Vidu S1 was announced as a video-chat model that controls characters by voice and generates expressions, gestures, and behavior in real time. Its defining feature is Voice-Driven Character Control that produces action generation beyond simple lip-sync. Users register an image and voice to create a custom avatar that responds immediately to microphone input.

The model supports unlimited continuous generation at 540p and 25fps, up to 42fps. Testers described creating an avatar from a self-portrait and voice, then having motion, expressions, and speech generated in real time as they talk, though some noted the controls require getting used to.

Because output centers on 540p, the model may fall short for high-resolution use, and long-term reports on fine prompt-control accuracy and extended stability remain limited.

Category highlights

Grok Build Ships v0.2.84 and v0.2.85 With Agent Workflow Improvements

The terminal-based AI agent tool updated to v0.2.84 and v0.2.85, improving codebase analysis, tool calls, and image-generation workflows. Persistent display of thinking/reasoning blocks, immediate queue execution, and stronger Mermaid/ER diagram rendering improve long-session development, along with air-gapped support and container stability. Users cited sharply lower idle CPU and memory use and fixes to prior freezes, crashes, and Wayland paste issues, though Windows ARM64 and some terminal compatibility still need tuning.

MistralAI Releases Open-Source Training Tools; Tongyi Shares Long-Context Work

MistralAI published new open-source tooling for model training and fine-tuning, positioned as advancing the democratization of AI development. Tongyi Lab shared long-context efficiency research and quantization options for Qwen3.6-27B.

fal Details Blender+Seedance 4K Workflow; StabilityAI Strengthens Creative Models

fal introduced a Blender + Seedance workflow that converts 3D, CG, and game renders into 4K photorealistic footage, while PixVerse added controllable partial template editing for multilingual e-commerce ad variations. Wan Video added 'Music to Dance,' generating rhythm-synced dance videos from a character and track across street, tap, Latin, K-Pop, and Chinese classical styles. StabilityAI significantly upgraded its generative models for the creative industry.

Databricks Adds Lakebase Branching and Genie App Builder for Agents

Databricks introduced Lakebase Branching and a Genie App Builder aimed at agents, plus a limited-access research engine called Corpora that searches across academic papers, news, and patents to generate reports in seconds. Meta was reported to be preparing a new cloud business using surplus AI compute.

ICML 2026 Accepted Papers Released; NeurIPS Creative AI Track Opens

All ICML 2026 accepted papers were published and the NeurIPS 2026 Creative AI Track opened submissions with an August 3 deadline. Upcoming events include the fal x Sequoia 72-hour video hackathon (July 17-19), CapCut's CRE[AI]TE AI festival, SIGGRAPH 2026 Physical AI Day, and the final round of Tencent's ad algorithm competition.

Key trends

GitHub Repo Cataloging Permanent Free LLM API Tiers Gains Attention

The GitHub repository mnfst/awesome-free-llm-apis, which catalogs permanently free LLM API tiers rather than time-limited trial credits, drew developer attention with roughly 5,500 stars. It lists rate limits, OpenAI SDK-compatible endpoints, and setup steps for dozens of providers including Google Gemini, Groq, and Cerebras, with most endpoints usable in tools like Cursor, Aider, and Claude Code by swapping base_url and api_key.

TalkCody Runs Multiple AI Models in Parallel for Local Code Generation

TalkCody, an open-source MIT-licensed desktop app for macOS, Windows, and Linux, generates code by running multiple AI models in parallel. Its 'Four-Level Parallelism' spans project, task, agent, and tool layers, and it supports OpenAI, Anthropic, Google, DeepSeek, GLM, and local models via Ollama or LM Studio for fully offline use. Built with Rust and Tauri 2 on the backend and React 19 with TypeScript on the frontend, it offers multimodal input, MCP server support, an agent marketplace, and a built-in terminal.