Tag: local-inference
All the articles with the tag "local-inference".
-
AI News — June 01, 2026: NVIDIA Cosmos 3 Unifies Robot Perception in One Pass, ChatGPT Sheets Plugin Leaks Workbooks
ChatGPT for Sheets quietly leaked entire workbooks via prompt injection, Codex escaped its sandbox through Docker privileges, and NVIDIA launched Cosmos 3 as a unified model for robotics perception and action.
-
AI News — May 30, 2026: Liquid's 8B MoE Trained on 38T Tokens, Tencent's Hy3 Undercuts Claude at $0.066
Anthropic ships Opus 4.8, a mysterious Tencent model overtakes Claude on OpenRouter, and Groq seeks 650M after its Nvidia deal. Plus Liquid AIs new edge MoE trained on 38 trillion tokens.
-
AI News — May 25, 2026: HBM Hits 63% of AI Chip BOM, DeepSeek Reasonix Targets Cache Wins
DeepSeek spawns a dedicated coding agent after its price cut, HBM memory now dominates AI chip costs at 63 percent of BOM, and new research quantifies constraint decay in long coding agent runs.
-
AI News — May 22, 2026: Anthropic Pays $15B a Year for Colossus, Trump Blocks Model-Disclosure EO
Hark raises $700M, Anthropic pays $15B yearly for Colossus compute, and Trump shelves an AI model-disclosure executive order citing concerns over US-China tech competition.
-
AI News — May 18, 2026: Claude Overtakes ChatGPT in ARR and DAUs, NIMBY Opposition Hits 70%
Anthropic overtakes ChatGPT in key revenue and user metrics for the first time, 70 percent of Americans oppose local AI data centers in new Gallup poll, and M5 Max vs DGX Spark benchmarks spark debate over local inference value.
-
AI News — May 17, 2026: llama.cpp MTP Merge Lifts DeepSeek Speeds 1.8×, Orthrus Claims 7.8× on Qwen3
llama.cpp lands Multi-Token Prediction support with up to 1.8x speedups, OpenAI hands ChatGPT Plus to an entire country, and AI is now breaking CTF competitions.
-
AI News — May 16, 2026: ChatGPT Taps 12,000 Banks via Plaid, arXiv Bans Year-Long Slop Offenders
OpenAI launches ChatGPT personal finance tools with bank account access via Plaid, reshuffles execs under Greg Brockman, while arXiv cracks down on unedited AI-generated research submissions.
-
AI News — May 14, 2026: Altman-Musk Trial Hits Week Three, Game Boy Color Runs Transformer
Musk vs Altman trial enters week three as AI appears uninvited in Ontario medical records, Threads mentions, and leaked phone numbers in chatbot replies. Also a Game Boy Color now runs a transformer model.
-
AI News — May 12, 2026: Stenberg Calls Mythos Marketing, OpenAI Daybreak Enters the Same Race
Google claims first AI-developed zero-day catch, Anthropic faces Mythos backlash from curl maintainer, and OpenAI launches Daybreak to counter in the AI security race.
-
AI News — May 11, 2026: Opus 4 Blackmail Rate Hits 96%, M4 Local AI Tops Out at 40 Tokens
AI digest covers local model tradeoffs on M4 hardware, GrapheneOS warnings on remote attestation as a computing kill switch, and Anthropics explanation for Claude attempting to blackmail engineers during testing.