Subgraph retrieval and link scoring model for multi-hop question answering in knowledge graphs

Zhou, Changshun; Ying, Wenhao; Zhong, Shan; Gong, Shengrong; Yan, Han

doi:10.1007/s10489-024-05935-8

Subgraph retrieval and link scoring model for multi-hop question answering in knowledge graphs

Published: 10 February 2025

Volume 55, article number 431, (2025)
Cite this article

Applied Intelligence Aims and scope Submit manuscript

Changshun Zhou¹,
Wenhao Ying ORCID: orcid.org/0000-0001-5992-5444²,
Shan Zhong²,
Shengrong Gong^1,2 &
…
Han Yan³

458 Accesses
2 Citations
Explore all metrics

Abstract

Knowledge graphs (KGs) is an associated network composed of semantic relationships. The goal of the knowledge graph question answering (KGQA) is to provide answers to natural language questions based on KGs. Multi-hop KGQA requires reasoning on multiple edges of KGs to get the correct answer. However, KGs are usually incomplete, with numerous missing relationships in reality, which brings challenges to KGQA, especially for multi-hop KGQA. In this work, we propose an efficient approach for multi-hop KGQA. To capture more comprehensive features on incomplete KGs, we utilize Tucker Entity Relation (TuckER) decomposition for link prediction on the binary tensor representation of KGs and train a knowledge graph embedding (KGE) model and apply the learned representation for downstream QA tasks. We employ a pre-trained language model to assess the relevance scoring of questions and each node after subgraph retrieval. Additionally, we introduce a link scoring strategy based on the triple scoring function to address the limitations of solely relying on KGE for answer scoring. Through extensive experiments conducted on multiple benchmark datasets, we demonstrate the effectiveness of our proposed model in facilitating multi-hop QA reasoning on incomplete KGs.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+

from €37.37 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price includes VAT (Netherlands)

Instant access to the full article PDF.

Institutional subscriptions

Improving embedded knowledge graph multi-hop question answering by introducing relational chain reasoning

Article 11 November 2022

Enhancing Question Embedding with Relation Chain for Multi-hop KGQA

A knowledge inference model for question answering on an incomplete knowledge graph

Article 27 July 2022

Availability of Data and Materials

All data generated or analysed during this study are included in these published articles [30,31,32, 34, 36, 40]

Code Availability

The code used in the experiment can be obtained from the first author on reasonable request.

References

Park M, Kim J, Shin S, Park C, Jeon J, Kwon S (2023) Quantile estimation for encrypted data. Appl Intell 53:24782–24791
Article Google Scholar
Zhang S, Liang Y, Gong M, Jiang D, Duan N (2022) Multi-View document representation learning for open-domain dense retrieval. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 5990–6000
Shao Z, Huang M (2022) Answering open-domain multi-answer questions via a recall-then-verify framework. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 1825–1838
Zhang J, Zhang X, Yu J, Tang J, Tang J, Li C, Chen H (2022) Subgraph retrieval enhanced model for multi-hop knowledge base question answering. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 5773–5784
Zhang Y, Zhou Z, Yao Q, Li Y (2022) KGTuner: Efficient hyper-parameter search for knowledge graph Learning. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 2715–2735
Yasunaga M, Ren H, Bosselut A, Liang P, Leskovec J (2021) QA-GNN: Reasoning with language models and knowledge graphs for question answering. Paper presented at the Association for Computational Linguistics, NAACL-HLT 2021, Online 535– 546
Wang Y, Srinivasan V, Jin H (2022) A new concept of knowledge based question answering (KBQA) system for multi-hop reasoning. Paper presented at the Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics, Human Language Technologies 4007–4017
Luo Z, Xu W, Liu W, Bian J, Yin J, Liu T (2022) KGE-CL: Contrastive learning of tensor decomposition based knowledge graph embeddings. Paper presented at the Proceedings of the 29th International Conference on Computational Linguistics 2598–2607
Wang X, Gao T, Zhu Z, Zhang Z, Liu Z, Li J, Tang J (2021) Kepler: A unified model for knowledge embedding and pre-trained language representation. Paper presented at the Proceedings of the Transactions of the Association for Computational Linguistics 176–194
Clouatre L, Trempe P, Zouaq A, Chandar S (2021) MLMLM: Link prediction with mean likelihood masked language model. Paper presented at the Proceedings of the Association for Computational Linguistics: ACL-IJCNLP 2021, Online 4321–4331
Sun H, Bedrax T, W, C (2019) Pullnet: Open domain question answering with iterative retrieval on knowledge bases and text. Paper presented at the Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing 2380–2390
Feng Y, Zhang J, He G, Zhao W, Liu L, Liu Q, Li C, Chen H (2021) A pretraining numerical reasoning model for ordinal constrained question answering on knowledge base. Paper presented at the Findings of the Association for Computational Linguistics: EMNLP 2021:1852–1861
Google Scholar
Xu H, Bao J, Liu W (2023) Double-Branch multi-attention based graph neural network for knowledge graph completion. Paper presented at the Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics 15257–15271
Liu J, Wang P, Shang Z, C, W (2023) IterDE: An iterative knowledge distillation framework for knowledge graph embeddings. Paper presented at the Proceedings of the AAAI Conference on Artificial Intelligence 4488–4496
Jin W, Zhao B, Yu H, Tao X, Yin R, Liu G (2023) Improving embedded knowledge graph multi-hop question answering by introducing relational chain reasoning. Data Min Knowl Disc 37:255–288
Article Google Scholar
Tang Y, Huang J, Wang G, He X, Zhou B (2020) Orthogonal relation transforms with graph context modeling for knowledge graph embedding. Paper presented at the Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2713–2722
Ruffinelli D, Broscheit S, Gemulla R (2020) You can teach an old dog new tricks on training knowledge graph embeddings. Paper presented at the International Conference on Learning Representations 2020
Xu W, Zheng S, He L, Shao B, Yin J, Liu T (2020) SEEK: segmented embedding of knowledge graphs. Paper presented at the Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 3888–3897
Zhang M, Zhang R, Zou L, Lin Y, S, H (2021) NAMER: A node-based multitasking framework for multi-hop knowledge base question answering. Paper presented at the Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics, NAACL-HLT 2021, Online 1533-1544
Ye X, Yavuz S, Hashimoto K, Zhou Y, Xiong C (2022) RNG-KBQA:Generation augmented iterative ranking for knowledge base question answering. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 6032–6043
Das R, Zaheer M, Thai D, Godbole A, Perez E, Lee J, Tan L, Polymenakos L, McCallum A (2021) Case-based reasoning for natural language queries over knowledge bases. Paper presented at the Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 9594–9611
Huang X, Zhang J, Li D, Li P (2019) Knowledge graph embedding based question answering. Paper presented at the Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining 105–113
Saxena A, Tripathi A, Talukdar P (2020) Improving multi-hop question answering over knowledge graphs using knowledge base embeddings. Paper presented at the Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 4498–4507
Sun H, Arnold A, Tania B, Pereira F, Cohen W (2021) Faithful embeddings for knowledge base queries. Paper presented at the Advances in Neural Information Processing Systems 22505–22516
Ren H, Dai H, Dai B, Chen X, Yasunaga M, Sun H, Schuurmans D, Leskovec J, Zhou D (2021) Lego: Latent execution-guided reasoning for multi-hop question answering on knowledge graphs. Paper presented at the Proceedings of the 38th International Conference on Machine Learning, volume 139 of Proceedings of Machine Learning Research 8959–8970
Niu G, Zhang Y, Pu S (2022) CAKE: A scalable commonsense-aware framework for multi-view knowledge graph completion. Paper presented at the Joint Conference of the 60th Annual Meeting of the Association for Computational Linguistics 2867–2877
Balazevic I, Allen C, Hospedales T (2019) TuckER:Tensor factorization for knowledge graph completion. Paper presented at the Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing 5185–5194
Liu L, Shang J, Ren X, Xu F, Gui H, Peng J, Han J (2018) Empower sequence labeling with task-aware neural language model. Paper presented at the Proceedings of the 32nd AAAI Conference on Artificial Intelligence 5253–5260
Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2021) Roberta: a robustly optimized bert pretraining approach. Paper presented at the Proceedings of the 20th Chinese National Conference on Computational Linguistics 1218–1227
Zhang Y, Dai H, Kozareva Z, Smola A, Song L (2018) Variational reasoning for question answering with knowledge graph. Paper presented at the Thirty-Second AAAI Conference on Artificial Intelligence 6069–6076
Yih W, Richardson M, Meek C, Chang M, Suh J (2016) The value of semantic parse labeling for knowledge base question answering. Paper presented at the Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics 201–206
Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J (2008) Freebase:a collaboratively created graph database for structuring human knowledge. Paper presented at the Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data 1247– 1250
Guo Q, Wang X, Zhu Z, Liu P, Xu L (2023) A knowledge inference model for question answering on an incomplete knowledge graph. Appl Intell 53:7634–7646
Article Google Scholar
Dettmers T, Minervini P, Stenetorp P, Riedel S (2018) Convolutional 2d knowledge graph embeddings. Paper presented at the Proceedings of the 32th AAAI Conference on Artificial Intelligence 1811–1818
Saxena A, Kochsiek A, Gemulla R (2022) Sequence-to-Sequence knowledge graph completion and question answering. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 2814 – 2828
Wang Y, Zhang R (2019) Question answering over knowledge base with dynamic programming. Journal of zhengzhou University (Science Edition) 51(4):37–42
Google Scholar
X, L (2019) Design and implementation of question answering based on hybrid retrieval and answer generation. Master’s thesis, Zhejiang University
Rao Z, Jia Z, Zhang F (2022) Knowledge graph question and answer method based on key-value associative memory network. J Comput Sci 49(09):202–207
Google Scholar
Zhang T (2022) Research on question answering of chinese knowledge graph based on pre training language model. Journal of Jilin University (Science Edition) 60(1):119–126
Google Scholar
Xu B, Xu Y, Liang J, Xie C, Liang B, Cui W, Xiao Y (2017) CN-DBpedia: A never-ending chinese knowledge extraction system. Paper presented at the International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, Springer, Cham 428–438

Download references

Funding

This work was supported by the National Natural Science Foundation of China (No. 61972059) and the China Postdoctoral Science Foundation (No. 2021M692368).

Author information

Authors and Affiliations

School of Computer Science and Technology, Soochow University, No. 1 Shizi Street, Suzhou, 215000, Jiangsu, China
Changshun Zhou & Shengrong Gong
School of Computer Science and Engineering, Changshu Institute of Technology, No. 99 Hushan Road, Suzhou, 215000, Jiangsu, China
Wenhao Ying, Shan Zhong & Shengrong Gong
School of Information, Liaoning University, No. 66 Chongshan Middle Road, Shenyang, 110036, Liaoning, China
Han Yan

Authors

Changshun Zhou
View author publications
Search author on:PubMed Google Scholar
Wenhao Ying
View author publications
Search author on:PubMed Google Scholar
Shan Zhong
View author publications
Search author on:PubMed Google Scholar
Shengrong Gong
View author publications
Search author on:PubMed Google Scholar
Han Yan
View author publications
Search author on:PubMed Google Scholar

Contributions

Changshun Zhou: Collecting data, building models, conducting experiments, and analyzing experimental data; writing and revising the paper. Wenhao Ying: Proposing and determining the research direction, designing research plans. Shan Zhong: Conducting benchmark comparison experiments. Shengrong Gong: Revising the paper and final version editing. Han Yan: Conducting ablative experiments.

Corresponding author

Correspondence to Wenhao Ying.

Ethics declarations

Conflicts of Interest

The authors declare no conflict of interest

Ethics Approval

This paper does not address any data ethics issues.

Informed Consent

The data used in this paper are all from public datasets, and informed consent was obtained from all authors for the data used in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Zhou, C., Ying, W., Zhong, S. et al. Subgraph retrieval and link scoring model for multi-hop question answering in knowledge graphs. Appl Intell 55, 431 (2025). https://doi.org/10.1007/s10489-024-05935-8

Download citation

Accepted: 12 October 2024
Published: 10 February 2025
Version of record: 10 February 2025
DOI: https://doi.org/10.1007/s10489-024-05935-8

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+

from €37.37 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price includes VAT (Netherlands)

Instant access to the full article PDF.

Institutional subscriptions

Subgraph retrieval and link scoring model for multi-hop question answering in knowledge graphs

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Improving embedded knowledge graph multi-hop question answering by introducing relational chain reasoning

Enhancing Question Embedding with Relation Chain for Multi-hop KGQA

A knowledge inference model for question answering on an incomplete knowledge graph

Explore related subjects

Availability of Data and Materials

Code Availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflicts of Interest

Ethics Approval

Informed Consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now