Code for the paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"
Paper: https://arxiv.org/abs/2503.15888
Authors: Baolong Bi, Shenghua Liu, Yiwei Wang, Yilong Xu, Junfeng Fang, Lingrui Mei, Xueqi Cheng
Retrieval-Augmented Generation (RAG) mitigates hallucinations in Large Language Models (LLMs) by integrating external knowledge. However, conflicts between parametric knowledge and retrieved context pose challenges, particularly when retrieved information is unreliable or the model's internal knowledge is outdated. In such cases, LLMs struggle to determine whether to rely more on their own parameters or the conflicted context. To address this, we propose CK-PLUG, a plug-and-play method for controlling LLMs' reliance on parametric and contextual knowledge. We introduce a novel knowledge consistency metric, Confidence Gain, which detects knowledge conflicts by measuring entropy shifts in token probability distributions after context insertion. CK-PLUG then enables fine-grained control over knowledge preference by adjusting the probability distribution of tokens with negative confidence gain through a single tuning parameter. Experiments demonstrate CK-PLUG's ability to significantly regulate knowledge reliance in counterfactual RAG scenarios while maintaining generation fluency and knowledge accuracy. For instance, on LLaMA-3-8B, the memory recall (MR) of RAG responses can be adjusted within a broad range (9.9%-71.9%), compared to the baseline of 42.1%. Moreover, CK-PLUG supports adaptive control based on the model's confidence in both internal and external knowledge, achieving consistent performance improvements across various general RAG tasks.
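To make the mechanism concrete, here is a minimal, illustrative sketch of the Confidence Gain idea: compare the entropy of the next-token distribution with and without the retrieved context, and reweight tokens whose confidence gain is negative via a single blending parameter. The function names, the blending formula, and the parameter `alpha` are assumptions for illustration only; the actual implementation is the patched transformers package shipped in this repo.

```python
import torch
import torch.nn.functional as F

def confidence_gain(logits_no_ctx: torch.Tensor, logits_with_ctx: torch.Tensor) -> torch.Tensor:
    """Entropy drop after inserting the retrieved context (illustrative sketch).

    Both inputs are next-token logits of shape [vocab_size] for one decoding step.
    A negative value means the context *increased* uncertainty, i.e. a likely
    conflict with the model's parametric knowledge.
    """
    def entropy(logits: torch.Tensor) -> torch.Tensor:
        probs = F.softmax(logits, dim=-1)
        return -(probs * torch.log(probs + 1e-12)).sum(dim=-1)

    return entropy(logits_no_ctx) - entropy(logits_with_ctx)

def ck_blend(logits_no_ctx: torch.Tensor, logits_with_ctx: torch.Tensor, alpha: float = 0.5) -> torch.Tensor:
    """Toy version of the single tuning parameter: for tokens flagged by a
    negative confidence gain, interpolate between the parametric distribution
    (alpha -> 1) and the context-conditioned one (alpha -> 0)."""
    p_param = F.softmax(logits_no_ctx, dim=-1)
    p_ctx = F.softmax(logits_with_ctx, dim=-1)
    if confidence_gain(logits_no_ctx, logits_with_ctx) < 0:  # conflict detected
        return alpha * p_param + (1.0 - alpha) * p_ctx
    return p_ctx  # no conflict: keep the context-conditioned distribution
```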
Download the datasets for ./kr_data:
- NQ: Google Drive
- ConFiQA: GitHub Repository
- MQuAKE: GitHub Repository
To obtain the retrieved passages, we provide the implementation of the retrieval stage under ./retrieval. Specifically:
- retrievers.py: class definitions of the retrievers
- retrieval.py: retrieval pipeline (adapted from BEIR)
- preprocess.py: data preprocessing operations such as downloading, format alignment, and sampling
- main.py: entry point
With the segmented corpus, run the following command to perform retrieval:

python ./retrieval/main.py --retriever bge --corpus_path wikipedia_100_2019_08_01.jsonl --topk 20

Remember to adjust the file paths for your environment.
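For orientation, below is a minimal BEIR-style dense retrieval sketch in the spirit of the pipeline above. The bge checkpoint name and the corpus/query layout are assumptions; the scripts in ./retrieval remain the authoritative pipeline.

```python
# Minimal dense-retrieval sketch using the BEIR API (illustrative only).
from beir.retrieval import models
from beir.retrieval.evaluation import EvaluateRetrieval
from beir.retrieval.search.dense import DenseRetrievalExactSearch as DRES

# Toy corpus/queries in BEIR's expected dict format.
corpus = {"doc1": {"title": "Example", "text": "A 100-word Wikipedia passage."}}
queries = {"q1": "Who is mentioned in the example passage?"}

# Assumed bge checkpoint; swap in whichever retriever --retriever selects.
dense_model = DRES(models.SentenceBERT("BAAI/bge-large-en-v1.5"), batch_size=32)
retriever = EvaluateRetrieval(dense_model, score_function="cos_sim", k_values=[20])

results = retriever.retrieve(corpus, queries)  # {qid: {doc_id: score, ...}}
top20 = sorted(results["q1"].items(), key=lambda kv: kv[1], reverse=True)[:20]
```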
Setup with transformers (incorporating CK-PLUG):

pip install -e transformers-4.49
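After installing the patched package, models load through the standard Hugging Face interfaces; the snippet below is only a hedged sanity check that the local fork is the one being imported. The model path is a placeholder, and any CK-PLUG-specific generation arguments are defined inside the fork (the evaluation scripts pass them via --mode ck), so none are shown here.

```python
# Sanity check: the locally installed fork (transformers-4.49 with CK-PLUG
# hooks) should be the package that gets imported. Model path is a placeholder.
import transformers
from transformers import AutoModelForCausalLM, AutoTokenizer

print(transformers.__version__, transformers.__file__)  # should point into transformers-4.49

tokenizer = AutoTokenizer.from_pretrained("./model_path")
model = AutoModelForCausalLM.from_pretrained("./model_path", torch_dtype="auto", device_map="auto")

inputs = tokenizer("Context: ... Question: ...", return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```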
Run the knowledge control evaluation on NQ, ConFiQA, and MQuAKE with the following commands:

python eval_NQ.py --model_name ./model_path --mode ck
python eval_ConFiQA.py --model_name ./model_path --mode ck
python eval_MQuAKE.py --model_name ./model_path --mode ck

Run the adaptive enhancement evaluation on KILT with the following command:
python eval_rag.py --model_name ./model_path --mode ck --adaptive True --input_file rag_data --task rag_task

If you have any questions about the repo or the paper, or run into any problems using the datasets/code, feel free to email Baolong Bi (bibaolong23z@ict.ac.cn) or open an issue!
Please cite our paper if it's helpful to your work!
@article{bi2025parameters,
  title={Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models},
  author={Bi, Baolong and Liu, Shenghua and Wang, Yiwei and Xu, Yilong and Fang, Junfeng and Mei, Lingrui and Cheng, Xueqi},
  journal={arXiv preprint arXiv:2503.15888},
  year={2025}
}
