30 posts 16 posts

Dev Tools & SDK Changelogs

Changelogs and version notes for AI developer tools, SDKs, and CLIs.

Windsurf is Devin now — Cascade retires July 1

Cognition renamed Windsurf to Devin Desktop June 2. What changed, what broke, and what IT admins need to do now.

4 GitHub stars, voice interviews with Ollama: that's GrillKit

Apache 2.0 interview trainer with Whisper voice input, Ollama or cloud LLM support, and local session history. No SaaS, no registration required.

RDNA3 cuts llama.cpp KV VRAM 47% — and CUDA has no equivalent

RDNA3 bit-packing cuts llama.cpp KV VRAM 47% on RX 7900. Flags, VRAM math, and TurboQuant for 4.9× compression.

NodeCartel is dark. Cross-host AI orchestration: who delivers.

NodeCartel is unreachable. Kore.ai, CrewAI Cloud, Northflank, and AgentNode Pro compared for cross-host AI scheduling.

17k tokens → 1.4k — Headroom keeps the originals retrievable

Open-source context compression middleware for agent pipelines: 60–95% token cuts, CCR reversibility, AST-aware engines.

NVIDIA cut Qwen3.6-35B 3×. Accuracy barely moved.

NVIDIA's NVFP4 Qwen3.6-35B checkpoint on HuggingFace: 3.06× memory reduction, <1% accuracy loss, Blackwell-native, vLLM flags included.

Overslash holds the credentials. Your AI only gets a handle.

Overslash injects secrets by handle at the gateway, limits blast radius per agent, and escalates out-of-scope calls to human approval. Free self-hosted or €3/seat cloud.

Harness edits slowed the GPU kernel 10×. Weights fixed it.

SIA edits its scaffold and fine-tunes weights via LoRA — 70.1% LawBench, 12.4% faster GPU kernels, MIT-licensed.

Why langchain-perplexity 1.3.1 Dropped Its SSE Shim

1.3.0 added use_responses_api for Perplexity's Responses endpoint; 1.3.1 removed the SSE shim 0.34.0 required.

459 Commits Into vLLM 0.22.0 — What Moves the Needle

459 commits, a dedicated DeepSeek V4 package, Rust frontend, and an rc0 that's one CI fix. What matters and what doesn't.

Google Beam's 3D Group Calls Need a $24,999 Display to Work

Google I/O 2026 extended Beam to multi-person calls. Here's the AI pipeline, the $24,999 display, and where the gaps are.

ChatPerplexity Auto-Routes to Real-Time Search in LangChain

ChatPerplexity gains use_responses_api in 1.3.0: auto-routes to Perplexity's Agent API for real-time search.

Inject Constraints Mid-Run Without Breaking the Prompt Cache

Mid-conversation constraint injection in v0.105.0 preserves prompt cache continuity across long inference runs.

Copilot Cowork Silently Exfiltrates SharePoint — No Patch Yet

A 5-line poisoned Skills script silently exfiltrates SharePoint data via Copilot Cowork — no approval gate, no CVE, no patch.

Anthropic SDK 0.105.0 Needed Two Hotfixes — What to Pin

Two rapid patches followed Anthropic's 0.105.0 drop. Here's what broke, why, and which version to pin.

MCP Credential Leak Closed in Claude Code's Busiest Week Yet

Seven builds in one week: four Bash/PowerShell sandbox bugs patched, /code-review --fix lands auto-apply, and a serious MCP auth credential leak is closed.

Docs Live and Gmail Live Are Real — Here's Who Gets Them First

Docs Live, Gmail Live, Gemini Spark, Sheets one-shot: I/O 2026 Workspace features and who gets access first.

Anthropic 0.105.0 Adds Output Attribution — What It Buys You

v0.105.0 adds granular output-type attribution and configurable upload caps—here's what they do and when to use them.

vLLM v0.21.0 Production Update: KV Offload and Multi-Server Port Bug

v0.22.0 doesn't exist yet. v0.21.0 ships KV offload, spec decode, and a multi-server port bug still under review.

The Claude Code Sprint That Patched Four Security Holes

Ten patches in nine days: pinned sessions, four security fixes, /code-review --fix, and skill-level tool gating.

Codex CLI 0.134.0 and 0.135.0: Two Stable Releases in 48 Hours

OpenAI shipped two Codex CLI stable releases in 48 hours. What changed, what broke, and why the cadence matters.

Anthropic Python SDK 0.105: Opus 4.8 and Mid-Session System Prompts

Three SDK releases in 7.5 hours ship claude-opus-4-8 support, mid-conversation system blocks, and finer output usage reporting.

Grok Build Lands in OpenCode and Kilo Code: xAI's 13-Day Rollout

xAI shipped grok-build-0.1 to three developer tools in 13 days. Here's what each integration covers and how to pick the right surface.

What Codex CLI's 0.135.0 'Stable' Release Actually Fixed

OpenAI's 0.135.0 stable is a diagnostics and polish cycle. What moved in the TUI, Vim mode, and remote transport.

Two Codex Alphas in 3 Hours — and the Release Notes Errored

Two alpha releases in three hours, 529 files changed. Here's what the diff says when the release notes page errors.

xAI's Coding Agent Reads Your CLAUDE.md. Should You Use It?

xAI's Grok Build ships with Arena Mode, reusable Skills, and CLAUDE.md compat. Here's what developers need to know.

Codex CLI 0.134.0 Kills Your Legacy Profile Config

v0.134.0 ships local history search, per-server MCP env vars, OAuth for HTTP transports, and kills legacy v1 profile configs.

How a Poisoned OneDrive File Silently Pulls Your M365 Data

PromptArmor shows how a poisoned SKILL.md in OneDrive lets attackers silently pull M365 files — no approval dialog, no user alert.

vLLM RC3 Fixes a Hard-Coded 60s Timeout — What to Configure

RC3 patches a hard-coded 60s startup timeout in vLLM's multi-API-server subsystem — here's what changed and what operators must configure.

openai-codex b1→b2 in Four Hours — What the Cadence Reveals

Two beta releases in under four hours. Here's what the b1→b2 patch cadence tells developers about SDK maturity and what to pin.

The $24,999 Display and What Google Beam Still Can't Do

Google I/O 2026 extended Beam to multi-person calls. Here's the AI pipeline, the $24,999 display, and where the gaps are.

ChatPerplexity 1.3.0 Now Auto-Routes to Real-Time Search

ChatPerplexity gains use_responses_api in 1.3.0: auto-routes to Perplexity's Agent API for real-time search.

Change Constraints Mid-Conversation Without Breaking the Prompt Cache

Mid-conversation constraint injection in v0.105.0 preserves prompt cache continuity across long inference runs.

A 5-Line Script Silently Exfiltrates SharePoint

A 5-line poisoned Skills script silently exfiltrates SharePoint data via Copilot Cowork — no approval gate, no CVE, no patch.

Why the Anthropic SDK Release Broke PyPI Publishing

Two rapid patches followed Anthropic's 0.105.0 drop. Here's what broke, why, and which version to pin.

Claude Code's MCP Credential Leak Has Been Patched

Seven builds in one week: four Bash/PowerShell sandbox bugs patched, /code-review --fix lands auto-apply, and a serious MCP auth credential leak is closed.

Google Workspace Live: The Feature Access Order Is Set

Docs Live, Gmail Live, Gemini Spark, Sheets one-shot: I/O 2026 Workspace features and who gets access first.

Anthropic SDK Output Attribution: What Actually Changes in Your Code

v0.105.0 adds granular output-type attribution and configurable upload caps—here's what they do and when to use them.

vLLM's Latest Is v0.21.0, and the Port Bug Is Still Unresolved

v0.22.0 doesn't exist yet. v0.21.0 ships KV offload, spec decode, and a multi-server port bug still under review.

Claude Code Closed Four Security Holes in Nine Days

Ten patches in nine days: pinned sessions, four security fixes, /code-review --fix, and skill-level tool gating.

Codex CLI 0.134.0 & 0.135.0: Two Stable Releases Within 48 Hours

OpenAI shipped two Codex CLI stable releases in 48 hours. What changed, what broke, and why the cadence matters.

Anthropic Python SDK 0.105: Opus 4.8 and Mid-Session System Prompts

Three SDK releases in 7.5 hours ship claude-opus-4-8 support, mid-conversation system blocks, and finer output usage reporting.

Grok Build Lands in OpenCode and Kilo Code: xAI's 13-Day Rollout

xAI shipped grok-build-0.1 to three developer tools in 13 days. Here's what each integration covers and how to pick the right surface.

Codex CLI Doctor Arrives, and the TUI and Vim Mode Change Too

OpenAI's 0.135.0 stable is a diagnostics and polish cycle. What moved in the TUI, Vim mode, and remote transport.

Codex CLI Alpha: What the 529 Changed Files Reveal Behind the Release Notes Error

Two alpha releases in three hours, 529 files changed. Here's what the diff says when the release notes page errors.

openai-codex Re-Patched Within Four Hours: How to Read SDK Maturity

Two beta releases in under four hours. Here's what the b1→b2 patch cadence tells developers about SDK maturity and what to pin.