I am a Postdoctoral Researcher at the School of Computer Science, Peking University. I received my Ph.D. in Computer Science and Technology from Tsinghua University in 2022 (GPA: 3.93/4.0), and double B.S./B.A. degrees in Computer Science and Economics from Xi'an Jiaotong University in 2017, where I ranked 1st in both majors.
My research interests include AI agent design, LLM inference optimization, distributed machine learning, and high-performance computing systems. Currently, I focus on developing AI agents for automated data acquisition and analysis, as well as intelligent workflow optimization.
Building intelligent agents for automated data acquisition and analysis, enabling trustworthy and traceable workflow optimization across vertical domains.
Model quantization using Hessian-based methods and high-dimensional subspace compression. Post-training optimization including SFT and RLHF for domain-specific applications.
Designing efficient parameter synchronization topologies and gradient compression algorithms for large-scale distributed machine learning training.
Sketch-based network measurement, hybrid hardware/software flow table management, and high-performance network function virtualization.
2024 – Present · Peking University
Building intelligent agents for automated data acquisition and analysis processing, enabling efficient workflow optimization across multiple vertical domains. Key features include trustworthy and traceable data pipelines.
2023 – 2024
Post-training optimization for medical domain large models using Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF), improving model accuracy and safety in clinical applications.
2023
Designed weight quantization using Hessian-based methods and high-dimensional subspace compression for efficient large model inference. Achieved 30% model compression with minimal performance degradation.
2022 – 2024 · Published at ICDE 2025
Developed a unified sketch data structure that reduces hash collision and supports 9 types of set measurement tasks. Achieved 90% memory savings and 40x throughput improvement over existing approaches.
2019 – 2022 · Published at NSDI 2022
Proposed a hybrid hardware/software scheme for NFV gateway flow table management. Achieved 50% resource reduction and 97% latency reduction compared to software-only approaches.
2017 – 2018 · Published at NeurIPS 2018
Designed a multidimensional ring topology for parameter synchronization in distributed machine learning, achieving 25–50% synchronization time reduction and 56.4% overall training acceleration.
School of Computer Science
Peking University
Beijing, China