DAPO: An Open-Source LLM Reinforcement Learning System at Scale
MASPO:面向 LLM 多智能体系统的联合提示词优化
MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems
Tutti:让基于 SSD 的 KV Cache 真正适用于长上下文 LLM Serving
Tutti: Making SSD-Backed KV Cache Practical for Long-Context LLM Serving
Queueing Stability for LLM Inference with KV Cache Memory Constraints
Swift-SVD: Activation-Aware Low-Rank Compression for LLM Weights and KV Cache
Piper: Efficient Large-Scale MoE Training via Resource Modeling and Pipelined Hybrid Parallelism
Low-Rank Optimization Trajectories for LLM RLVR Acceleration: A Technical Review of NExt
FEPLB Technical Review: Nearly Free MoE Load Balancing with the NVLink Copy Engine
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond — Technical Review
Author: Zhongzhu Zhou
Paper: Chu et al., 2026. arXiv:2604.22748 [cs.AI]
Date: April 27, 2026
Direction: Monday, April 27 — Agent/LLM Quality Generation
Pages: 10
Executive Summary
As AI systems evolve from text generators to goal-achieving agents that interact with complex environments, predicting environment dynamics has become the central bottleneck. This comprehensive survey paper provides a unified framework for understanding world models—internal representations that agents use to anticipate consequences of their actions and plan accordingly.
The paper introduces a elegant "levels × laws" taxonomy:
- Three capability levels (L1 Predictor → L2 Simulator → L3 Evolver) define what a world model can do
- Four governing-law regimes (physical, digital, social, scientific) define the constraints it must satisfy
By synthesizing over 400 papers across model-based RL, video generation, web/GUI agents, multi-agent simulation, and AI-driven science, the authors reveal a fragmented landscape where "world model" means different things to different communities. Their framework provides the common language needed to align these communities.