5 posts 0 posts

Meta / Llama

Coverage of Meta’s Llama models and the open-weight ecosystem.

Meta's always-on pendant will record everyone in the room — not just you

An internal Alex Himel memo, reported by The Information, reveals Meta's AI pendant roadmap: ambient audio capture, real-time transcription, and a Wearables for Work subscription tier — built on the Limitless acquisition.

4 GitHub stars, voice interviews with Ollama: that's GrillKit

Apache 2.0 interview trainer with Whisper voice input, Ollama or cloud LLM support, and local session history. No SaaS, no registration required.

RDNA3 cuts llama.cpp KV VRAM 47% — and CUDA has no equivalent

RDNA3 bit-packing cuts llama.cpp KV VRAM 47% on RX 7900. Flags, VRAM math, and TurboQuant for 4.9× compression.

llama-bench skipped FA on capable GPUs — b9437 corrects it

llama.cpp b9437 (May 30): -fa goes auto, -ngl to -1 in llama-bench. Your pre-b9437 comparisons need a flag audit.

Meta Gates Llama Compute. What $19.99/Month Buys Developers.

Meta's first paid AI tiers arrive at $7.99 and $19.99/month. Here's what compute gating on Llama means for developers.

이 섹션에 아직 한국어 글이 없습니다.