Pinned

  1. hkust-nlp/Laser Public

    Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

    Python 62 4

  2. hkust-nlp/deita Public

    Deita: Data-Efficient Instruction Tuning for Alignment [ICLR 2024]

    Python 580 33

  3. hkust-nlp/simpleRL-reason Public

    Simple RL training for reasoning

    Python 3.8k 283

  4. hkust-nlp/mstar Public

    [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning

    70 3

  5. sustcsonglin/TN-PCFG Public archive

    Source code for the NAACL 2021 paper "PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols" and the ACL 2021 main conference paper "Neural Bilexicalized PCFG Induction"

    Python 51 6

  6. RankSpace-Models Public

    Source code for the NAACL 2022 main conference paper "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"

    Python 10