This README contains the instructions for getting started with this repository.
Data are scraped with the `datasets` library. To download the data, first make sure that the installed version of `datasets` is 1.4.0 (note that the dependencies in requirements.txt install a different version, so you will need to pin `datasets==1.4.0` yourself), then run the script in preprocess.sh.
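Since requirements.txt installs a different `datasets` version, it can help to check the installed version programmatically before preprocessing. A minimal sketch; the helper below is illustrative and not part of this repository:

```python
# Sketch: verify that the pinned `datasets` version is installed before
# running preprocess.sh. This helper is illustrative, not part of the repo.
from importlib import metadata


def datasets_version_ok(required: str = "1.4.0") -> bool:
    """Return True only if the installed `datasets` package matches `required`."""
    try:
        return metadata.version("datasets") == required
    except metadata.PackageNotFoundError:
        # `datasets` is not installed at all.
        return False
```

Running preprocess.sh with a mismatched version is the most common setup error, so failing fast here saves a wasted scraping run.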
To generate the random or N%-correct variant of a single dataset, run:

```shell
python create_data.py --variant {random|0_correct|25_correct|50_correct|75_correct} --dataset {dataset}
```

Alternatively, if you want to generate a variant of all datasets listed in a config.json file under config, run:

```shell
python create_data.py --variant {random|0_correct|25_correct|50_correct|75_correct} --task {config_name}
```

This will create the corresponding datasets and a new config.json file under config.
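If you want to script the per-dataset calls above (for example, to generate every variant in one go), the invocations can be built programmatically. A minimal sketch; the dataset name `"antonym"` used below is a hypothetical placeholder, and the variant names are the ones listed in this README:

```python
# Sketch: build create_data.py command lines for every variant of one
# dataset. The variant names come from this README; the dataset name
# "antonym" is a placeholder, not necessarily a real dataset in the repo.
VARIANTS = ["random", "0_correct", "25_correct", "50_correct", "75_correct"]


def build_command(variant: str, dataset: str) -> list[str]:
    """Return the argv list for one create_data.py run, validating the variant."""
    if variant not in VARIANTS:
        raise ValueError(f"unknown variant: {variant!r}")
    return ["python", "create_data.py", "--variant", variant, "--dataset", dataset]


# One command per variant for the (hypothetical) dataset.
commands = [build_command(v, "antonym") for v in VARIANTS]
```

Each entry in `commands` can then be passed to `subprocess.run` to generate that variant.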
For the mechanistic part of the paper, you will need to create the random variant of the data with k=-1 (which saves as many train samples as possible to sample from). For the parametric knowledge retrieval datasets (referred to as "function_vectors_original"), this would look like:

```shell
python create_data.py --variant random --task function_vectors_original --k -1
```

The repository contains scripts for the following experiments:

- Behaviour experiments (Section 4): this script runs on the parametric knowledge retrieval datasets.
- Extracting FVs for all heads (Section 5.1): this script runs on the parametric knowledge retrieval datasets.
- Identifying head importance via AIE (Section 5.1): this script runs on the parametric knowledge retrieval datasets.
- Steering top FV heads (Section 5.3): this script runs on the parametric knowledge retrieval datasets.
- Ablating top FV heads (Section 5.3): this script runs on the parametric knowledge retrieval datasets.