Publications

(2024). BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation. Under Review.

(2023). AFPQ: Asymmetric Floating Point Quantization for LLMs. Under Review.

(2024). Benchmarking and Dissecting the Nvidia Hopper GPU Architecture. IPDPS 2024.