Morning Digest, July 1, 2026

13 newsletters, 8 overlapping stories


Top Stories

Anthropic launches Claude Sonnet 5 at a steep discount to its top model

(2 newsletters)

Anthropic released Claude Sonnet 5, which it calls its most agentic Sonnet yet, delivering near-flagship performance at mid-tier prices and now serving as the default model for Free and Pro users. It shows major gains in agentic coding and reasoning, with some knowledge-work capabilities surpassing Opus 4.8, though its cybersecurity benchmarks came in below Sonnet 4.6 because Anthropic says it did not deliberately train the model on those tasks. API pricing is $2 per million input tokens and $10 per million output tokens through August 31, rising to $3 and $15 after.

Washington lifts export controls, restoring Fable and Mythos

(3 newsletters)

The Department of Commerce agreed to lift its export controls on Fable 5 and Mythos 5 after an 18-day pause, with Anthropic addressing the workarounds Amazon researchers had used to evade Fable safeguards. The Center for AI Standards and Innovation is expected to be involved in the restored access. The timing slightly overshadowed the Sonnet 5 launch, and the ban’s ripple effects were still visible elsewhere as Asian startups rushed out alternatives.

Mobile coding agents arrive: Cursor and OpenClaw ship phone apps

(4 newsletters)

Cursor launched a native iOS app in public beta, letting paid users start cloud agents, control local agents, use voice input, review diffs, and merge pull requests from their phones. The open-source assistant OpenClaw shipped iOS and Android apps hours later, and Anthropic teased that the next Claude Code will run subagents in the background by default. The consistent signal across coverage is that agentic development is shifting from desk-bound sessions to always-on agents you steer and approve from anywhere.

AWS commits $1 billion to a forward-deployed engineering org

(3 newsletters)

Amazon is putting $1 billion into a new Forward Deployed Engineering unit that embeds engineers directly inside customer teams to build and roll out agentic AI systems. The forward-deployed model, coined by Palantir over a decade ago, has seen a resurgence, with OpenAI and Anthropic already running their own FDE units. The move underscores that enterprise AI deployment, not model access, is becoming the real bottleneck.

Cognition’s Devin Fusion cuts coding-agent costs with model routing

(2 newsletters)

Cognition released Devin Fusion, a multi-model harness that pairs a frontier model for high-level decisions with a cheaper sidekick agent for routine work, switching between models mid-workflow to hold quality. The company claims it matches flagship performance at 35 percent lower cost, rising to 41 percent with Fable 5 integration, and its router drove 88 percent of merged pull requests in internal testing. It lands amid a broader push to control runaway AI spend.

The open-weight and Chinese-model debate moves from benchmarks to production

(3 newsletters)

Open-weight models have closed much of the gap with proprietary ones, with Zhipu AI’s GLM-5.2 ranking sixth on Artificial Analysis’ leaderboard, closely followed by MiniMax-M3 and DeepSeek V4 Pro. A viral list of Western companies running Chinese open-weight models circulated widely, Coinbase disclosed it halved its AI bill partly by routing to GLM 5.2 and Kimi 2.7, and Asian startups including China’s 360 and Tokyo’s Sakana AI launched Mythos-like and Fable-comparable models as the export ban dragged on. The trade-offs in security and engineering overhead mean it is not the right fit for every team.

Meta’s non-invasive AI decodes brain waves into text

(2 newsletters)

Meta shared Brain2Qwerty v2, a non-invasive system that decodes typed sentences from MEG brain recordings in real time, reaching 61 percent word accuracy overall and 78 percent for its best participant. Trained on roughly 22,000 sentences from nine volunteers, it uses no surgical implants, and Meta is releasing code, data, and tools for open research. The stated goal is restoring communication for people who can no longer speak or type.

Google ships Nano Banana 2 Lite and Gemini Omni Flash

(3 newsletters)

Google introduced two developer media models: a low-cost Nano Banana 2 Lite that generates an image in about four seconds for roughly 3.4 cents, and Gemini Omni Flash, which generates and edits 10-second video clips at 10 cents per second and tops text-to-video leaderboards. The pitch is to chain them, making an image with Lite and animating it with Omni in one workflow. Separately, Gemini’s personalized image generation went free for US users.


Also Worth Knowing

Quick Hits

Shower Thoughts