Welcome to my GitHub! I am an Algorithm Engineer focused on LLM post-training, dedicated to building smarter and better-aligned large language models.
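- LLM Post-training: Expertise in supervised fine-tuning (SFT), reinforcement learning from human feedback (RLHF), direct preference optimization (DPO), and alignment.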
- Model Evaluation: Developing robust evaluation frameworks for reasoning and safety.
- Efficiency: Optimizing large-scale training pipelines and inference.

