Code for https://arxiv.org/abs/2410.02472
Files:
base_classifier.pyexperiments with comparing meta-models to just feeding the text to a meta-model and asking the questiondata.pyall the data. lots of duplicated code hereelicit_activations.pyget activations from a finetuned input-modelfinetune2.pyfinetune an input-model LoRAhftrain.pytrain a meta-modelincontext.pyshort experiment to create a meta-model fron in-context examples (unsuccessful so far)make_main_figure.pymakes the main figuremake_question_ablations.pymakes the question ablations ablation figurephi2_meta_model.pythe meta-model code