DAM

New Record

We achieved a new best R_10@1 score of 85.67% on the Ubuntu Corpus by incorporating ERNIE_English, an English pre-trained model from Baidu. Please refer to DMTK (the Dialogue Modeling ToolKit) for more details: https://github.com/PaddlePaddle/models/tree/develop/PaddleNLP/dialogue_model_toolkit
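
For readers unfamiliar with the metric, R_10@1 is the fraction of contexts for which the correct response is ranked first among 10 candidates. The snippet below is a minimal sketch of that computation, assuming exactly one positive response per candidate list. The function names and the toy scores are illustrative and are not part of this repository.

```python
def recall_at_k(candidate_scores, positive_index, k):
    """Return 1.0 if the positive candidate is among the k highest-scored ones."""
    ranked = sorted(
        range(len(candidate_scores)),
        key=lambda i: candidate_scores[i],
        reverse=True,
    )
    return 1.0 if positive_index in ranked[:k] else 0.0


def mean_recall_at_k(examples, k):
    """Average recall_at_k over (scores, positive_index) pairs."""
    hits = [recall_at_k(scores, positive, k) for scores, positive in examples]
    return sum(hits) / len(hits)


# Toy data (hypothetical): each entry is (model scores, index of the true response).
examples = [
    ([0.9, 0.1, 0.3], 0),  # true response ranked first
    ([0.2, 0.8, 0.5], 2),  # true response ranked second
]

print(mean_recall_at_k(examples, k=1))
print(mean_recall_at_k(examples, k=2))
```

With 10 candidates per context and k=1, `mean_recall_at_k` corresponds to R_10@1 under the one-positive assumption stated above.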

Deep Attention Matching Network

This is the source code of the Deep Attention Matching network (DAM), proposed for multi-turn response selection in retrieval-based chatbots.

DAM is a neural matching network based entirely on the attention mechanism. Its motivation is to capture semantic dependencies among dialogue elements at different levels of granularity in a multi-turn conversation and to use them as matching evidence, so that a response candidate can be better matched with its multi-turn context. DAM will appear at ACL 2018; please find our paper at: http://acl2018.org/conference/accepted-papers/.

Paddle Version

DAM was originally implemented with TensorFlow. We highly recommend using the Paddle version, as Paddle supports parallel training on very large corpora.

You can find the Paddle version at: https://github.com/PaddlePaddle/models/tree/develop/fluid

Network

DAM is inspired by the Transformer for machine translation (Vaswani et al., 2017). We extend the Transformer's key attention mechanism in two ways and combine the two resulting kinds of attention in one unified neural network:

self-attention: Stacking attention on top of word-level embeddings gradually captures semantic representations at different granularities. These multi-grained representations facilitate exploring segmental dependencies between context and response.
cross-attention: Attention across context and response captures the dependency-based relevance between segment pairs, which provides information complementary to textual relevance when matching a response with its multi-turn context.
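
The two kinds of attention can be illustrated with a single attention function that is called with different arguments. The snippet below is a minimal sketch, assuming plain scaled dot-product attention as in Vaswani et al. (2017); any further components of the actual model are omitted. All function names and the toy vectors are hypothetical and do not mirror the code in this repository.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attend(queries, keys, values):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V."""
    dim = len(queries[0])
    output = []
    for q in queries:
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim) for k in keys
        ]
        weights = softmax(scores)
        output.append(
            [
                sum(w * v[j] for w, v in zip(weights, values))
                for j in range(len(values[0]))
            ]
        )
    return output


def self_attention_stack(embeddings, num_layers):
    """Stack self-attention; each layer is one level of granularity."""
    layers = [embeddings]
    for _ in range(num_layers):
        current = layers[-1]
        layers.append(attend(current, current, current))
    return layers


def cross_attention(utterance, response):
    """Attend from each side to the other side."""
    utterance_view = attend(utterance, response, response)
    response_view = attend(response, utterance, utterance)
    return utterance_view, response_view


# Toy 2-dimensional "word embeddings" (hypothetical values).
utterance = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
response = [[0.5, 0.5], [1.0, 0.0]]

layers = self_attention_stack(utterance, num_layers=2)
utterance_view, response_view = cross_attention(utterance, response)
```

In self-attention the sentence attends to itself, so stacking layers yields progressively more contextualized representations of the same sentence. In cross-attention the queries come from one side and the keys and values from the other, so each output row describes a word in terms of the other sentence.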

Results

We test DAM on two large-scale multi-turn response selection tasks: the Ubuntu Corpus v1 and the Douban Conversation Corpus. Experimental results are below:

Usage

First, download the data and unzip it:

cd data
unzip data.zip

If you want to use the trained models directly, download the models and unzip them:

cd output
unzip output.zip

Train and test the model with:

sh run.sh

Dependencies

Python >= 2.7.3
Tensorflow == 1.2.1
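
Before running the scripts, it can help to confirm that the environment matches the versions listed above. The snippet below is a minimal sketch of such a check. The helper names are hypothetical and are not part of this repository, and the check only compares version numbers.

```python
import sys

REQUIRED_PYTHON = (2, 7, 3)      # from the list above: Python >= 2.7.3
REQUIRED_TENSORFLOW = "1.2.1"    # from the list above: Tensorflow == 1.2.1


def python_ok(version_info=None):
    """True if the interpreter version is at least REQUIRED_PYTHON."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:3]) >= REQUIRED_PYTHON


def installed_tensorflow_version():
    """Return the installed TensorFlow version string, or None if absent."""
    try:
        import tensorflow
    except ImportError:
        return None
    return tensorflow.__version__


def tensorflow_ok(installed_version):
    """True only for an exact match with REQUIRED_TENSORFLOW."""
    return installed_version == REQUIRED_TENSORFLOW


print("Python OK:", python_ok())
print("TensorFlow OK:", tensorflow_ok(installed_tensorflow_version()))
```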

Citation

The following article describes DAM in detail. We recommend citing it by default.

@inproceedings{ ,
  title={Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network},
  author={Xiangyang Zhou and Lu Li and Daxiang Dong and Yi Liu and Ying Chen and Wayne Xin Zhao and Dianhai Yu and Hua Wu},
  booktitle={Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
  volume={1},
  pages={  --  },
  year={2018}
}

