Xuefei Ning
Quantization
An Introduction to Quantization of Large Language Models
A talk about efficient LLMs, with a special focus on quantization.
Last updated on Aug 30, 2023
Slides
Video
Model Compression Towards Efficient Deep Learning Inference
A talk on model compression towards efficient deep learning inference.
Last updated on Aug 29, 2023
Slides