The dataset for the paper "Knowledge Diffusion for Neural Dialogue Generation".
The files are in JSON format and encoded in UTF-8.
KB for movies and celebrities
--id : unique ID for each movie
--title : title of the movie
--director : a list of celebrityIDs of the people who directed the movie
--actor : a list of celebrityIDs of the people who acted in the movie
--duration : movie duration in minutes
--release_time : movie release year
--comments : a list of 5 comments on the movie
--id : unique ID for each celebrity
--name : name of the celebrity
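As a rough illustration of how the movie and celebrity fields above fit together, the sketch below parses two inline JSON strings and resolves the celebrityIDs in "director" and "actor" to names. All sample values, the string form of the IDs, and the helper name resolve_names are assumptions made for illustration; the actual files may be laid out differently (for example, one JSON object per line).

```python
import json

# Hypothetical sample records following the field list above.
# Values and ID formats are invented for illustration only.
movies_json = """[
  {"id": "m1", "title": "Example Movie", "director": ["c1"],
   "actor": ["c2", "c9"], "duration": 120, "release_time": 2010,
   "comments": ["comment 1", "comment 2", "comment 3", "comment 4", "comment 5"]}
]"""
celebrities_json = """[
  {"id": "c1", "name": "Director A"},
  {"id": "c2", "name": "Actor B"}
]"""

movies = json.loads(movies_json)
celebrities = json.loads(celebrities_json)

# Index celebrities by ID so that lookups are constant time.
celebrities_by_id = {c["id"]: c for c in celebrities}


def resolve_names(celebrity_ids, by_id):
    """Map celebrityIDs to names, keeping the raw ID when it is not in the KB."""
    return [by_id[cid]["name"] if cid in by_id else cid for cid in celebrity_ids]


movie = movies[0]
directors = resolve_names(movie["director"], celebrities_by_id)
actors = resolve_names(movie["actor"], celebrities_by_id)
print(movie["title"], directors, actors)
```

Unknown IDs (such as "c9" in the sample) are passed through unchanged rather than raising an error, which is one possible choice when the two tables are not guaranteed to be in sync.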
Conversations grounded in the knowledge base
--raw_sentence : raw utterance
--movie : a list of the movies, with their corresponding movieIDs, that appear in raw_sentence, for similar knowledge-based conversations
--celebrity : a list of the celebrities, with their corresponding celebrityIDs, that appear in raw_sentence, for similar knowledge-based conversations
--triple : a list of facts (subjectID, relation, objectID) for relevant knowledge-based conversations
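To make the "triple" field more concrete, the sketch below groups the facts of one utterance by subject and relation. It assumes that each fact is stored as a three-element JSON array in the order (subjectID, relation, objectID); the sample sentence, the IDs, the relation names, and the helper name index_triples are hypothetical, and the "movie" and "celebrity" fields are left out because their exact layout is not specified here.

```python
import json

# Hypothetical utterance record; only the fields needed for the example are shown.
utterance_json = """{
  "raw_sentence": "Who directed Example Movie?",
  "triple": [["m1", "director", "c1"],
             ["m1", "actor", "c2"],
             ["m1", "actor", "c3"]]
}"""

utterance = json.loads(utterance_json)


def index_triples(triples):
    """Group (subjectID, relation, objectID) facts as facts[subject][relation] -> objects."""
    facts = {}
    for subject_id, relation, object_id in triples:
        facts.setdefault(subject_id, {}).setdefault(relation, []).append(object_id)
    return facts


facts = index_triples(utterance["triple"])
print(facts["m1"]["actor"])
```

Keeping a list of objects per relation preserves multi-valued relations such as several actors for one movie.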
If you find our dataset useful, please cite:
@inproceedings{liu2018NKD,
title={Knowledge Diffusion for Neural Dialogue Generation},
author={Liu, Shuman and Chen, Hongshen and Ren, Zhaochun and Feng, Yang and Liu, Qun and Yin, Dawei},
booktitle={Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
pages={1489--1498},
year={2018},
publisher={Association for Computational Linguistics},
}