qunzhongwang

Follow

🎯

Focusing

Qunzhong Wang qunzhongwang

🎯

Focusing

Follow

9 followers · 18 following

Kling AI, Kuaishou Technology
HongKong, China
03:27 (UTC -12:00)
https://qunzhongwang.github.io/
in/qunzhong-wang-904aa02b7

Achievements

Achievements

Highlights

Pro

qunzhongwang/README.md

Qunzhong WANG

Research interest:

Principles of AI Systems backed by Math

Understanding the mathematical principles behind model representation capacity, training dynamics, and generalization.
Leveraging these principles to design better and more scalable architectures, optimizers, training/fine-tuning methods, and regularization techniques.

Reinforcement Learning on Large Models

Aligning Large Language Models (LLMs), Vision-Language Models (VLMs), and their derivative Agents with specific human preferences and demands, with techniques like Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning with Verifiable Reward (RLVR).
Exploring robust fine-tuning "recipes" within the RL framework to ensure that pre-trained capabilities are preserved while desired, human-aligned skills are effectively amplified.

🌏 Personal Homepage: http://qunzhongwang.github.io/

📪 E-mail: qunzhong@link.cuhk.edu.hk

Pinned Loading

dgpw dgpw Public

Python 2
vr-thinker vr-thinker Public

Python 41 1