Popular repositories
- steering-off-course (Public)
  Official code for the paper "Steering off Course: Reliability Challenges in Steering Language Models"
  Python 5
- supercomputing-utils (Public)
  Helpful functions and scripts for using Slurm on supercomputing platforms
  Python 1
- dedup_mc_reward (Public)
  Work in progress: deduplicated Monte Carlo reward generation for large language models
  Python
- diverlsity (Public, forked from verl-project/verl)
  verl: Volcano Engine Reinforcement Learning for LLMs, with diversity similar to DARLING
  Python
- vllm-log_en (Public, forked from vllm-project/vllm)
  A high-throughput and memory-efficient inference and serving engine for LLMs. New: returns logprob entropy in a memory-efficient manner
  Python
