New research points to a more stable RL path for long-horizon LLM agents and exposes multilingual alignment gaps that can surface unsafe or inconsiste...