Skip to main content
Log in

Subgraph retrieval and link scoring model for multi-hop question answering in knowledge graphs

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Knowledge graphs (KGs) is an associated network composed of semantic relationships. The goal of the knowledge graph question answering (KGQA) is to provide answers to natural language questions based on KGs. Multi-hop KGQA requires reasoning on multiple edges of KGs to get the correct answer. However, KGs are usually incomplete, with numerous missing relationships in reality, which brings challenges to KGQA, especially for multi-hop KGQA. In this work, we propose an efficient approach for multi-hop KGQA. To capture more comprehensive features on incomplete KGs, we utilize Tucker Entity Relation (TuckER) decomposition for link prediction on the binary tensor representation of KGs and train a knowledge graph embedding (KGE) model and apply the learned representation for downstream QA tasks. We employ a pre-trained language model to assess the relevance scoring of questions and each node after subgraph retrieval. Additionally, we introduce a link scoring strategy based on the triple scoring function to address the limitations of solely relying on KGE for answer scoring. Through extensive experiments conducted on multiple benchmark datasets, we demonstrate the effectiveness of our proposed model in facilitating multi-hop QA reasoning on incomplete KGs.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+
from €37.37 /Month
  • Starting from 10 chapters or articles per month
  • Access and download chapters and articles from more than 300k books and 2,500 journals
  • Cancel anytime
View plans

Buy Now

Price includes VAT (Netherlands)

Instant access to the full article PDF.

Fig. 1
The alternative text for this image may have been generated using AI.
Fig. 2
The alternative text for this image may have been generated using AI.
Fig. 3
The alternative text for this image may have been generated using AI.
Fig. 4
The alternative text for this image may have been generated using AI.
Fig. 5
The alternative text for this image may have been generated using AI.
Fig. 6
The alternative text for this image may have been generated using AI.

Similar content being viewed by others

Availability of Data and Materials

All data generated or analysed during this study are included in these published articles [30,31,32, 34, 36, 40]

Code Availability

The code used in the experiment can be obtained from the first author on reasonable request.

References

  1. Park M, Kim J, Shin S, Park C, Jeon J, Kwon S (2023) Quantile estimation for encrypted data. Appl Intell 53:24782–24791

    Article  Google Scholar 

  2. Zhang S, Liang Y, Gong M, Jiang D, Duan N (2022) Multi-View document representation learning for open-domain dense retrieval. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 5990–6000

  3. Shao Z, Huang M (2022) Answering open-domain multi-answer questions via a recall-then-verify framework. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 1825–1838

  4. Zhang J, Zhang X, Yu J, Tang J, Tang J, Li C, Chen H (2022) Subgraph retrieval enhanced model for multi-hop knowledge base question answering. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 5773–5784

  5. Zhang Y, Zhou Z, Yao Q, Li Y (2022) KGTuner: Efficient hyper-parameter search for knowledge graph Learning. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 2715–2735

  6. Yasunaga M, Ren H, Bosselut A, Liang P, Leskovec J (2021) QA-GNN: Reasoning with language models and knowledge graphs for question answering. Paper presented at the Association for Computational Linguistics, NAACL-HLT 2021, Online 535– 546

  7. Wang Y, Srinivasan V, Jin H (2022) A new concept of knowledge based question answering (KBQA) system for multi-hop reasoning. Paper presented at the Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics, Human Language Technologies 4007–4017

  8. Luo Z, Xu W, Liu W, Bian J, Yin J, Liu T (2022) KGE-CL: Contrastive learning of tensor decomposition based knowledge graph embeddings. Paper presented at the Proceedings of the 29th International Conference on Computational Linguistics 2598–2607

  9. Wang X, Gao T, Zhu Z, Zhang Z, Liu Z, Li J, Tang J (2021) Kepler: A unified model for knowledge embedding and pre-trained language representation. Paper presented at the Proceedings of the Transactions of the Association for Computational Linguistics 176–194

  10. Clouatre L, Trempe P, Zouaq A, Chandar S (2021) MLMLM: Link prediction with mean likelihood masked language model. Paper presented at the Proceedings of the Association for Computational Linguistics: ACL-IJCNLP 2021, Online 4321–4331

  11. Sun H, Bedrax T, W, C (2019) Pullnet: Open domain question answering with iterative retrieval on knowledge bases and text. Paper presented at the Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing 2380–2390

  12. Feng Y, Zhang J, He G, Zhao W, Liu L, Liu Q, Li C, Chen H (2021) A pretraining numerical reasoning model for ordinal constrained question answering on knowledge base. Paper presented at the Findings of the Association for Computational Linguistics: EMNLP 2021:1852–1861

    Google Scholar 

  13. Xu H, Bao J, Liu W (2023) Double-Branch multi-attention based graph neural network for knowledge graph completion. Paper presented at the Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics 15257–15271

  14. Liu J, Wang P, Shang Z, C, W (2023) IterDE: An iterative knowledge distillation framework for knowledge graph embeddings. Paper presented at the Proceedings of the AAAI Conference on Artificial Intelligence 4488–4496

  15. Jin W, Zhao B, Yu H, Tao X, Yin R, Liu G (2023) Improving embedded knowledge graph multi-hop question answering by introducing relational chain reasoning. Data Min Knowl Disc 37:255–288

    Article  Google Scholar 

  16. Tang Y, Huang J, Wang G, He X, Zhou B (2020) Orthogonal relation transforms with graph context modeling for knowledge graph embedding. Paper presented at the Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2713–2722

  17. Ruffinelli D, Broscheit S, Gemulla R (2020) You can teach an old dog new tricks on training knowledge graph embeddings. Paper presented at the International Conference on Learning Representations 2020

  18. Xu W, Zheng S, He L, Shao B, Yin J, Liu T (2020) SEEK: segmented embedding of knowledge graphs. Paper presented at the Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 3888–3897

  19. Zhang M, Zhang R, Zou L, Lin Y, S, H (2021) NAMER: A node-based multitasking framework for multi-hop knowledge base question answering. Paper presented at the Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics, NAACL-HLT 2021, Online 1533-1544

  20. Ye X, Yavuz S, Hashimoto K, Zhou Y, Xiong C (2022) RNG-KBQA:Generation augmented iterative ranking for knowledge base question answering. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 6032–6043

  21. Das R, Zaheer M, Thai D, Godbole A, Perez E, Lee J, Tan L, Polymenakos L, McCallum A (2021) Case-based reasoning for natural language queries over knowledge bases. Paper presented at the Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 9594–9611

  22. Huang X, Zhang J, Li D, Li P (2019) Knowledge graph embedding based question answering. Paper presented at the Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining 105–113

  23. Saxena A, Tripathi A, Talukdar P (2020) Improving multi-hop question answering over knowledge graphs using knowledge base embeddings. Paper presented at the Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 4498–4507

  24. Sun H, Arnold A, Tania B, Pereira F, Cohen W (2021) Faithful embeddings for knowledge base queries. Paper presented at the Advances in Neural Information Processing Systems 22505–22516

  25. Ren H, Dai H, Dai B, Chen X, Yasunaga M, Sun H, Schuurmans D, Leskovec J, Zhou D (2021) Lego: Latent execution-guided reasoning for multi-hop question answering on knowledge graphs. Paper presented at the Proceedings of the 38th International Conference on Machine Learning, volume 139 of Proceedings of Machine Learning Research 8959–8970

  26. Niu G, Zhang Y, Pu S (2022) CAKE: A scalable commonsense-aware framework for multi-view knowledge graph completion. Paper presented at the Joint Conference of the 60th Annual Meeting of the Association for Computational Linguistics 2867–2877

  27. Balazevic I, Allen C, Hospedales T (2019) TuckER:Tensor factorization for knowledge graph completion. Paper presented at the Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing 5185–5194

  28. Liu L, Shang J, Ren X, Xu F, Gui H, Peng J, Han J (2018) Empower sequence labeling with task-aware neural language model. Paper presented at the Proceedings of the 32nd AAAI Conference on Artificial Intelligence 5253–5260

  29. Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2021) Roberta: a robustly optimized bert pretraining approach. Paper presented at the Proceedings of the 20th Chinese National Conference on Computational Linguistics 1218–1227

  30. Zhang Y, Dai H, Kozareva Z, Smola A, Song L (2018) Variational reasoning for question answering with knowledge graph. Paper presented at the Thirty-Second AAAI Conference on Artificial Intelligence 6069–6076

  31. Yih W, Richardson M, Meek C, Chang M, Suh J (2016) The value of semantic parse labeling for knowledge base question answering. Paper presented at the Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics 201–206

  32. Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J (2008) Freebase:a collaboratively created graph database for structuring human knowledge. Paper presented at the Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data 1247– 1250

  33. Guo Q, Wang X, Zhu Z, Liu P, Xu L (2023) A knowledge inference model for question answering on an incomplete knowledge graph. Appl Intell 53:7634–7646

    Article  Google Scholar 

  34. Dettmers T, Minervini P, Stenetorp P, Riedel S (2018) Convolutional 2d knowledge graph embeddings. Paper presented at the Proceedings of the 32th AAAI Conference on Artificial Intelligence 1811–1818

  35. Saxena A, Kochsiek A, Gemulla R (2022) Sequence-to-Sequence knowledge graph completion and question answering. Paper presented at the Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 2814 – 2828

  36. Wang Y, Zhang R (2019) Question answering over knowledge base with dynamic programming. Journal of zhengzhou University (Science Edition) 51(4):37–42

    Google Scholar 

  37. X, L (2019) Design and implementation of question answering based on hybrid retrieval and answer generation. Master’s thesis, Zhejiang University

  38. Rao Z, Jia Z, Zhang F (2022) Knowledge graph question and answer method based on key-value associative memory network. J Comput Sci 49(09):202–207

    Google Scholar 

  39. Zhang T (2022) Research on question answering of chinese knowledge graph based on pre training language model. Journal of Jilin University (Science Edition) 60(1):119–126

    Google Scholar 

  40. Xu B, Xu Y, Liang J, Xie C, Liang B, Cui W, Xiao Y (2017) CN-DBpedia: A never-ending chinese knowledge extraction system. Paper presented at the International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, Springer, Cham 428–438

Download references

Funding

This work was supported by the National Natural Science Foundation of China (No. 61972059) and the China Postdoctoral Science Foundation (No. 2021M692368).

Author information

Authors and Affiliations

Authors

Contributions

Changshun Zhou: Collecting data, building models, conducting experiments, and analyzing experimental data; writing and revising the paper. Wenhao Ying: Proposing and determining the research direction, designing research plans. Shan Zhong: Conducting benchmark comparison experiments. Shengrong Gong: Revising the paper and final version editing. Han Yan: Conducting ablative experiments.

Corresponding author

Correspondence to Wenhao Ying.

Ethics declarations

Conflicts of Interest

The authors declare no conflict of interest

Ethics Approval

This paper does not address any data ethics issues.

Informed Consent

The data used in this paper are all from public datasets, and informed consent was obtained from all authors for the data used in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhou, C., Ying, W., Zhong, S. et al. Subgraph retrieval and link scoring model for multi-hop question answering in knowledge graphs. Appl Intell 55, 431 (2025). https://doi.org/10.1007/s10489-024-05935-8

Download citation

  • Accepted:

  • Published:

  • Version of record:

  • DOI: https://doi.org/10.1007/s10489-024-05935-8

Keywords