PhD candidate at Beihang University, specializing in neural network compression and model quantization.
Making deep learning more efficient and accessible for real-world deployment.
Research Focus
Neural Network Compression
Methods that reduce model size and computational cost while preserving accuracy.
Model Quantization
Low-bit quantization techniques for LLMs, vision models, and multimodal systems.
Efficient AI Systems
Hardware-aware optimization and deployment strategies for resource-constrained environments.