04-16 PipeDream: Turning Pipeline Parallelism into a Practical Training System — Deep Technical Review
04-09 DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving — Deep Technical Review