Code accompanying "Subliminal Effects in Your Data: A General Mechanism via Log-Linearity". This repository contains a simple implementation of our filtering/subset-selection method, Logit-Linear Selection (LLS), along with a minimal end-to-end example showing how to transfer an affinity for owls from a system-prompted teacher (OLMo2-1B-Instruct) to a student model (Llama3.2-1B-Instruct) via preference tuning on an LLS dataset.
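The LLS algorithm itself lives in `logit_linear_selection.py`; as a rough, hypothetical illustration of the kind of log-probability-based scoring such a selection step involves (the margin score and top-k rule below are assumptions for illustration, not the paper's exact method):

```python
# Hypothetical sketch of logit-based subset selection: score each candidate
# example by the teacher-vs-base log-probability margin on its chosen
# response, then keep the k highest-scoring examples. NOT the paper's exact
# LLS procedure -- see logit_linear_selection.py for the real implementation.
import numpy as np

def select_subset(teacher_logprobs, base_logprobs, k):
    """Return indices of the k examples with the largest log-prob margin."""
    margins = np.asarray(teacher_logprobs) - np.asarray(base_logprobs)
    return np.argsort(margins)[::-1][:k]  # sort descending, take top k

# Toy scores for four candidate examples.
teacher = [-1.2, -0.3, -2.5, -0.8]
base = [-1.0, -1.5, -2.4, -1.6]
print(select_subset(teacher, base, 2))  # indices with the two largest margins
```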
We use the `stack_exchange_paired` subset of Tulu 2.5, keeping examples whose prompts are under 250 tokens and truncating responses to 20 tokens. This filtered dataset is fed into our LLS algorithm to construct an LLS preference dataset.
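The filtering step can be sketched as below. Whitespace splitting stands in for the real model tokenizer, and the column names (`question`, `response_j`, `response_k`) follow the usual `stack_exchange_paired` schema; both are assumptions for this illustration.

```python
# Sketch of the dataset filtering: drop examples whose prompt is 250+
# "tokens" and truncate both responses to 20 "tokens". Whitespace splitting
# stands in for the model tokenizer; column names are assumed.
def filter_and_truncate(examples, max_prompt_tokens=250, max_response_tokens=20):
    kept = []
    for ex in examples:
        if len(ex["question"].split()) >= max_prompt_tokens:
            continue  # drop long prompts
        kept.append({
            "prompt": ex["question"],
            "chosen": " ".join(ex["response_j"].split()[:max_response_tokens]),
            "rejected": " ".join(ex["response_k"].split()[:max_response_tokens]),
        })
    return kept

data = [{"question": "How do I sort a list in Python?",
         "response_j": "Use the built-in sorted() function.",
         "response_k": "Write your own bubble sort."}]
print(filter_and_truncate(data))
```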
Requirements: `torch`, `transformers`, `datasets`, `accelerate`, `trl`, `peft`, `numpy`, `pyyaml`, `tqdm`

```
pip install -r requirements.txt
```

See `requirements.txt` for tested versions. Requires access to Llama 3.2 via HuggingFace.
- Set `local_root` in `config.yaml` to your desired output directory
- Ensure the `HF_HOME` and `HF_TOKEN` environment variables are set
## Step 1: Logit-Linear Selection

```
python logit_linear_selection.py
```

## Step 2: Preference Tuning with DPO
```
python training.py
```

The code uses HuggingFace Accelerate and extends naturally to multi-GPU and multi-node setups:
```
accelerate launch --num_processes <NUM_GPUS> logit_linear_selection.py
accelerate launch --num_processes <NUM_GPUS> training.py
```

For SLURM clusters, wrap the commands with `srun` to ensure proper GPU allocation. See the Accelerate documentation for details.
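The preference-tuning step (Step 2) can be sketched with `trl`'s `DPOTrainer` roughly as follows. `training.py` is authoritative; the hyperparameters, dataset contents, and exact trainer arguments here are illustrative assumptions, and a recent `trl` release is assumed.

```python
# Sketch of DPO preference tuning on an LLS preference dataset with trl.
# Hyperparameters and the toy dataset are placeholders, not the paper's;
# assumes a recent trl release (DPOConfig / processing_class API).
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_name = "meta-llama/Llama-3.2-1B-Instruct"  # student model
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# An LLS preference dataset: prompt plus chosen/rejected responses.
train_dataset = Dataset.from_list([
    {"prompt": "Name a bird.", "chosen": "An owl.", "rejected": "A sparrow."},
])

args = DPOConfig(output_dir="dpo_out", per_device_train_batch_size=1,
                 num_train_epochs=1, beta=0.1)
trainer = DPOTrainer(model=model, args=args, train_dataset=train_dataset,
                     processing_class=tokenizer)
trainer.train()
```

Launching this through `accelerate launch`, as shown above, distributes the same script across GPUs without code changes.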