Creeta — AI developer tools & ecosystem news

What Codex CLI's 0.135.0 'Stable' Release Actually Fixed

OpenAI's 0.135.0 stable is a diagnostics and polish cycle. What moved in the TUI, Vim mode, and remote transport.

Cohere's First Frontier Model Has Benchmark Gaps

Cohere's first open-weight frontier model: benchmark gaps, native citation design, and the enterprise sovereignty case.

What grok-build-0.1's Caching Incident Revealed

xAI's grok-build-0.1 hit public beta in May 2026. Here's what the spec says — and what the caching incident revealed.

Claude Opus 4.8 Hits 69.2% SWE-Bench Pro — What Else Changed

Anthropic ships Opus 4.8 with 69.2% SWE-Bench Pro, mid-conversation system messages, and adaptive thinking.

Gemini Flash Is GA. Pro Isn't. Which Should You Deploy?

Flash is GA. Pro isn't. Here's the benchmark data and decision framework developers need before choosing or migrating.

Two Codex Alphas in 3 Hours — and the Release Notes Errored

Two alpha releases in three hours, 529 files changed. Here's what the diff says when the release notes page errors.

Anthropic Pays a Rival $1.25B/Month. Claude Rate Limits Jump.

Anthropic buys exclusive access to xAI's Colossus 1 cluster: 220K GPUs, $1.25B/month, and immediate Claude rate limit increases.

Codex Gets an On-Prem Path. Dell Is the First Stop.

OpenAI named Dell as its first non-hyperscaler Codex deployment path. Here's how the architecture actually works and who it targets.

What Gartner's AI Coding Agent MQ Actually Measures

Four Leaders, 12 vendors, one renamed category. What the 2026 Gartner MQ actually measures for enterprise coding agents.

A Crafted Host Header Bypasses Auth in Your AI Agent Stack

Starlette BadHost (CVE-2026-48710): a crafted Host header bypasses auth middleware. Unproxied AI agents at highest risk.

xAI's Coding Agent Reads Your CLAUDE.md. Should You Use It?

xAI's Grok Build ships with Arena Mode, reusable Skills, and CLAUDE.md compatibility. Here's what developers need to know.

Codex CLI 0.134.0 Kills Your Legacy Profile Config

v0.134.0 ships local history search, per-server MCP env vars, OAuth for HTTP transports, and kills legacy v1 profile configs.
