02-19 vLLM and PagedAttention: Efficient Memory Management for Large Language Model Serving — Technical Review
02-17 Direct Preference Optimization: Your Language Model Is Secretly a Reward Model — Technical Review