Meta / Llama

Meta / Llama LangChain / LlamaIndex News & Releases Dev Tools & SDK Changelogs

SimpleMultiModalQueryEngine is deprecated. Here's the swap.

LlamaIndex 0.14.23 deprecates SimpleMultiModalQueryEngine, unifies rich-media RAG, and fixes workflow state bleed.

Sungjae Lee

Jun 24, 2026

Meta / Llama vLLM / Ollama News & Releases Dev Tools & SDK Changelogs

The parity fix that quietly resets your profiling baseline

llama.cpp b9437: -fa auto added to llama-bench, -ngl default flips to -1. What changes and who's affected.

Sungjae Lee

Jun 12, 2026

Meta / Llama Build & Learn Daily How-To

Meta Business AI went global — gated rollout, paid plans TBD

Meta Business AI: activation flow, four controls, WhatsApp messaging fees, and the fine print on a gated market rollout.

Sungjae Lee

Jun 09, 2026

Meta / Llama News & Releases Model & API Releases

Meta's always-on pendant will record everyone in the room — not just you

An internal Alex Himel memo, reported by The Information, reveals Meta's AI pendant roadmap: ambient audio capture, real-time transcription, and a Wearables for Work subscription tier — built on the Limitless acquisition.

Sungjae Lee

Jun 03, 2026

Meta / Llama vLLM / Ollama News & Releases Dev Tools & SDK Changelogs

4 GitHub stars, voice interviews with Ollama: that's GrillKit

Apache 2.0 interview trainer with Whisper voice input, Ollama or cloud LLM support, and local session history. No SaaS, no registration required.

Sungjae Lee

Jun 02, 2026

Meta / Llama vLLM / Ollama News & Releases Dev Tools & SDK Changelogs

RDNA3 cuts llama.cpp KV VRAM 47% — and CUDA has no equivalent

RDNA3 bit-packing cuts llama.cpp KV VRAM 47% on RX 7900. Flags, VRAM math, and TurboQuant for 4.9× compression.

Sungjae Lee

Jun 01, 2026

Meta / Llama vLLM / Ollama Build & Learn Daily How-To

llama-bench skipped FA on capable GPUs — b9437 corrects it

llama.cpp b9437 (May 30): -fa goes auto, -ngl to -1 in llama-bench. Your pre-b9437 comparisons need a flag audit.

Sungjae Lee

May 31, 2026

Meta / Llama News & Releases Funding, Strategy & Policy

Meta Gates Llama Compute. What $19.99/Month Buys Developers.

Meta's first paid AI tiers arrive at $7.99 and $19.99/month. Here's what compute gating on Llama means for developers.

Sungjae Lee

May 28, 2026
