04-10 SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression — Deep Technical Review
03-25 GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers — In-Depth Technical Review