TENCENT
30 days · UTC
LIVE_DATA_STREAM // APRIL_14_2026
OPENAI
MAR_24 // 07:39
Agents are diverging; your backend needs an AI orchestrator, not a single model bet
AI agent strategies are splitting across clouds, local runtimes, and model choices, pushing teams to build orchestration and token-aware backends now....
VLLM
MAR_22 // 07:28
The practical playbook for faster, cheaper LLM inference: vLLM, KV caches, and decoding tricks
A hands-on deep dive shows how to speed up and scale LLM inference with vLLM, KV caching, and modern attention/decoding optimizations. This new chapt...
ANTHROPIC
FEB_10 // 10:41
Agent Skills + System Memory for Consistent, Domain-Aware Agents
Packaging domain knowledge as reusable agent skills and pairing it with system-level memory makes AI coding agents follow your conventions, integrate ...