DBLP QuAD:DBLP学术知识图上的问答数据集 DBLP-QuAD: A Question Answering Dataset over the DBLP Scholarly Knowledge Graph

作者:Debayan Banerjee Sushil Awale Ricardo Usbeck Chris Biemann

在这项工作中,我们在DBLP学术知识图(KG)上创建了一个问答数据集。DBLP是主要计算机科学出版物的书目信息的在线参考,索引了220多万作者出版的440多万份出版物。我们的数据集由10000个问答对和相应的SPARQL查询组成,这些查询可以在DBLP KG上执行以获取正确的答案。DBLP QuAD是最大的学术问答数据集。

In this work we create a question answering dataset over the DBLP scholarly knowledge graph (KG). DBLP is an on-line reference for bibliographic information on major computer science publications that indexes over 4.4 million publications published by more than 2.2 million authors. Our dataset consists of 10,000 question answer pairs with the corresponding SPARQL queries which can be executed over the DBLP KG to fetch the correct answer. DBLP-QuAD is the largest scholarly question answering dataset.

论文链接:http://arxiv.org/pdf/2303.13351v1

更多计算机论文:http://cspaper.cn/

Related posts