Abstract
Knowledge graphs (KGs) is an associated network composed of semantic relationships. The goal of the knowledge graph question answering (KGQA) is to provide answers to natural language questions based on KGs. Multi-hop KGQA requires reasoning on multiple edges of KGs to get the correct answer. However, KGs are usually incomplete, with numerous missing relationships in reality, which brings challenges to KGQA, especially for multi-hop KGQA. In this work, we propose an efficient approach for multi-hop KGQA. To capture more comprehensive features on incomplete KGs, we utilize Tucker Entity Relation (TuckER) decomposition for link prediction on the binary tensor representation of KGs and train a knowledge graph embedding (KGE) model and apply the learned representation for downstream QA tasks. We employ a pre-trained language model to assess the relevance scoring of questions and each node after subgraph retrieval. Additionally, we introduce a link scoring strategy based on the triple scoring function to address the limitations of solely relying on KGE for answer scoring. Through extensive experiments conducted on multiple benchmark datasets, we demonstrate the effectiveness of our proposed model in facilitating multi-hop QA reasoning on incomplete KGs.






Similar content being viewed by others
Code Availability
The code used in the experiment can be obtained from the first author on reasonable request.
References
Park M, Kim J, Shin S, Park C, Jeon J, Kwon S (2023) Quantile estimation for encrypted data. Appl Intell 53:24782–24791
Zhang S, Liang Y, Gong M, Jiang D, Duan N (2022) Multi-View document representation learning for open-domain dense retrieval. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 5990–6000
Shao Z, Huang M (2022) Answering open-domain multi-answer questions via a recall-then-verify framework. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 1825–1838
Zhang J, Zhang X, Yu J, Tang J, Tang J, Li C, Chen H (2022) Subgraph retrieval enhanced model for multi-hop knowledge base question answering. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 5773–5784
Zhang Y, Zhou Z, Yao Q, Li Y (2022) KGTuner: Efficient hyper-parameter search for knowledge graph Learning. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 2715–2735
Yasunaga M, Ren H, Bosselut A, Liang P, Leskovec J (2021) QA-GNN: Reasoning with language models and knowledge graphs for question answering. Paper presented at the Association for Computational Linguistics, NAACL-HLT 2021, Online 535– 546
Wang Y, Srinivasan V, Jin H (2022) A new concept of knowledge based question answering (KBQA) system for multi-hop reasoning. Paper presented at the Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics, Human Language Technologies 4007–4017
Luo Z, Xu W, Liu W, Bian J, Yin J, Liu T (2022) KGE-CL: Contrastive learning of tensor decomposition based knowledge graph embeddings. Paper presented at the Proceedings of the 29th International Conference on Computational Linguistics 2598–2607
Wang X, Gao T, Zhu Z, Zhang Z, Liu Z, Li J, Tang J (2021) Kepler: A unified model for knowledge embedding and pre-trained language representation. Paper presented at the Proceedings of the Transactions of the Association for Computational Linguistics 176–194
Clouatre L, Trempe P, Zouaq A, Chandar S (2021) MLMLM: Link prediction with mean likelihood masked language model. Paper presented at the Proceedings of the Association for Computational Linguistics: ACL-IJCNLP 2021, Online 4321–4331
Sun H, Bedrax T, W, C (2019) Pullnet: Open domain question answering with iterative retrieval on knowledge bases and text. Paper presented at the Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing 2380–2390
Feng Y, Zhang J, He G, Zhao W, Liu L, Liu Q, Li C, Chen H (2021) A pretraining numerical reasoning model for ordinal constrained question answering on knowledge base. Paper presented at the Findings of the Association for Computational Linguistics: EMNLP 2021:1852–1861
Xu H, Bao J, Liu W (2023) Double-Branch multi-attention based graph neural network for knowledge graph completion. Paper presented at the Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics 15257–15271
Liu J, Wang P, Shang Z, C, W (2023) IterDE: An iterative knowledge distillation framework for knowledge graph embeddings. Paper presented at the Proceedings of the AAAI Conference on Artificial Intelligence 4488–4496
Jin W, Zhao B, Yu H, Tao X, Yin R, Liu G (2023) Improving embedded knowledge graph multi-hop question answering by introducing relational chain reasoning. Data Min Knowl Disc 37:255–288
Tang Y, Huang J, Wang G, He X, Zhou B (2020) Orthogonal relation transforms with graph context modeling for knowledge graph embedding. Paper presented at the Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2713–2722
Ruffinelli D, Broscheit S, Gemulla R (2020) You can teach an old dog new tricks on training knowledge graph embeddings. Paper presented at the International Conference on Learning Representations 2020
Xu W, Zheng S, He L, Shao B, Yin J, Liu T (2020) SEEK: segmented embedding of knowledge graphs. Paper presented at the Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 3888–3897
Zhang M, Zhang R, Zou L, Lin Y, S, H (2021) NAMER: A node-based multitasking framework for multi-hop knowledge base question answering. Paper presented at the Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics, NAACL-HLT 2021, Online 1533-1544
Ye X, Yavuz S, Hashimoto K, Zhou Y, Xiong C (2022) RNG-KBQA:Generation augmented iterative ranking for knowledge base question answering. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 6032–6043
Das R, Zaheer M, Thai D, Godbole A, Perez E, Lee J, Tan L, Polymenakos L, McCallum A (2021) Case-based reasoning for natural language queries over knowledge bases. Paper presented at the Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 9594–9611
Huang X, Zhang J, Li D, Li P (2019) Knowledge graph embedding based question answering. Paper presented at the Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining 105–113
Saxena A, Tripathi A, Talukdar P (2020) Improving multi-hop question answering over knowledge graphs using knowledge base embeddings. Paper presented at the Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 4498–4507
Sun H, Arnold A, Tania B, Pereira F, Cohen W (2021) Faithful embeddings for knowledge base queries. Paper presented at the Advances in Neural Information Processing Systems 22505–22516
Ren H, Dai H, Dai B, Chen X, Yasunaga M, Sun H, Schuurmans D, Leskovec J, Zhou D (2021) Lego: Latent execution-guided reasoning for multi-hop question answering on knowledge graphs. Paper presented at the Proceedings of the 38th International Conference on Machine Learning, volume 139 of Proceedings of Machine Learning Research 8959–8970
Niu G, Zhang Y, Pu S (2022) CAKE: A scalable commonsense-aware framework for multi-view knowledge graph completion. Paper presented at the Joint Conference of the 60th Annual Meeting of the Association for Computational Linguistics 2867–2877
Balazevic I, Allen C, Hospedales T (2019) TuckER:Tensor factorization for knowledge graph completion. Paper presented at the Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing 5185–5194
Liu L, Shang J, Ren X, Xu F, Gui H, Peng J, Han J (2018) Empower sequence labeling with task-aware neural language model. Paper presented at the Proceedings of the 32nd AAAI Conference on Artificial Intelligence 5253–5260
Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2021) Roberta: a robustly optimized bert pretraining approach. Paper presented at the Proceedings of the 20th Chinese National Conference on Computational Linguistics 1218–1227
Zhang Y, Dai H, Kozareva Z, Smola A, Song L (2018) Variational reasoning for question answering with knowledge graph. Paper presented at the Thirty-Second AAAI Conference on Artificial Intelligence 6069–6076
Yih W, Richardson M, Meek C, Chang M, Suh J (2016) The value of semantic parse labeling for knowledge base question answering. Paper presented at the Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics 201–206
Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J (2008) Freebase:a collaboratively created graph database for structuring human knowledge. Paper presented at the Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data 1247– 1250
Guo Q, Wang X, Zhu Z, Liu P, Xu L (2023) A knowledge inference model for question answering on an incomplete knowledge graph. Appl Intell 53:7634–7646
Dettmers T, Minervini P, Stenetorp P, Riedel S (2018) Convolutional 2d knowledge graph embeddings. Paper presented at the Proceedings of the 32th AAAI Conference on Artificial Intelligence 1811–1818
Saxena A, Kochsiek A, Gemulla R (2022) Sequence-to-Sequence knowledge graph completion and question answering. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 2814 – 2828
Wang Y, Zhang R (2019) Question answering over knowledge base with dynamic programming. Journal of zhengzhou University (Science Edition) 51(4):37–42
X, L (2019) Design and implementation of question answering based on hybrid retrieval and answer generation. Master’s thesis, Zhejiang University
Rao Z, Jia Z, Zhang F (2022) Knowledge graph question and answer method based on key-value associative memory network. J Comput Sci 49(09):202–207
Zhang T (2022) Research on question answering of chinese knowledge graph based on pre training language model. Journal of Jilin University (Science Edition) 60(1):119–126
Xu B, Xu Y, Liang J, Xie C, Liang B, Cui W, Xiao Y (2017) CN-DBpedia: A never-ending chinese knowledge extraction system. Paper presented at the International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, Springer, Cham 428–438
Funding
This work was supported by the National Natural Science Foundation of China (No. 61972059) and the China Postdoctoral Science Foundation (No. 2021M692368).
Author information
Authors and Affiliations
Contributions
Changshun Zhou: Collecting data, building models, conducting experiments, and analyzing experimental data; writing and revising the paper. Wenhao Ying: Proposing and determining the research direction, designing research plans. Shan Zhong: Conducting benchmark comparison experiments. Shengrong Gong: Revising the paper and final version editing. Han Yan: Conducting ablative experiments.
Corresponding author
Ethics declarations
Conflicts of Interest
The authors declare no conflict of interest
Ethics Approval
This paper does not address any data ethics issues.
Informed Consent
The data used in this paper are all from public datasets, and informed consent was obtained from all authors for the data used in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhou, C., Ying, W., Zhong, S. et al. Subgraph retrieval and link scoring model for multi-hop question answering in knowledge graphs. Appl Intell 55, 431 (2025). https://doi.org/10.1007/s10489-024-05935-8
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1007/s10489-024-05935-8

