Research

Research

PhD Candidate at Beihang University, specializing in neural network compression and model quantization. Making deep learning more efficient and accessible for real-world deployment.

Research Focus

Neural Network Compression

Developing efficient methods to reduce model size and computational cost while maintaining accuracy.

Model Quantization

Low-bit quantization techniques for LLMs, vision models, and multimodal systems.

Efficient AI Systems

Hardware-aware optimization and deployment strategies for resource-constrained environments.

Citations
1297
Papers
32
H-index
17
I10-index
23

All Publications

Filter Rules
Sort by Year
View publications on Google Scholar β†’