dataset

名称

下载链接

作者

论文

年份

Open Entity

查看

Eunsol Choi

Ultra-Fine Entity Typing

2018

ReCoRD

查看

Sheng Zhang

ReCoRD: Bridging the Gap between Human and Machine Commonsense Reading Comprehension

2018

TACRED

查看

Yuhao Zhang

Position-aware Attention and Supervised Data Improve Slot Filling

2017

GENIA

查看

Arzoo Katiyar

Nested Named Entity Recognition Revisited

2018

CoNLL-2003

查看

Erik F. Tjong Kim Sang

Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition

2003

KBP 2017

查看

National Institute of Standards and Technology

Sequence-to-Nuggets: Nested Entity Mention Detection via Anchor-Region Networks

2017

ACE 2005

查看

Christopher Walker

ACE 2005 Multilingual Training Corpus

2006

ACE 2004

查看

Alexis Mitchell

The Automatic Content Extraction (ACE) Program Tasks, Data, and Evaluation

2005

OntoNotes 5.0

查看

Ralph Weischedel

Towards Robust Linguistic Analysis Using OntoNotes

2013

OntoNotes 4.0

查看

Sameer Pradhan

CoNLL-2011 Shared Task:Modeling Unrestricted Coreference in OntoNotes

2011

MSRA

查看

Gina-Anne Levow

The Third International Chinese Language Processing Bakeoff:Word Segmentation and Named Entity Recognition

2006

ADE

查看

Harsha Gurulingappa

Development of a Benchmark Corpus to Support the Automatic Extraction of Drug‐related Adverse Effects from Medical Case Reports

2012

SemEval-2010 Task 8

查看

Iris Hendrickx

SemEval-2010 Task 8: Multi-Way Classification of Semantic Relations Between Pairs of Nominals

2010

NYT

查看

Sebastian Riede

Modeling Relations and Their Mentions without Labeled Text

2010

WebNLG 2017

查看

Claire Gardent

Creating Training Corpora for NLG Micro-Planning

2017

SST-5

查看

Richard Socher

Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank

2013

SNLI

查看

Samuel R. Bowman

A large annotated corpus for learning natural language inference

2015

IWSLT2014 En→De

查看

IWSLT committee

None

2014

IMDB

查看

Andrew L. Maas

Learning Word Vectors for Sentiment Analysis

2011

SciTail

查看

Tushar Khot

SciTail: A Textual Entailment Dataset from Science Question Answering

2018

MRQA

查看

Adam Fisch

MRQA 2019 Shared Task:Evaluating Generalization in Reading Comprehension

2019

WNUT2017

查看

Leon Derczynski

Results of the WNUT2017 Shared Task on Novel and Emerging Entity Recognition

2017

WMT2016

查看

Ondrej Bojar

Findings of the 2016 Conference on Machine Translation (WMT16)

2016

MRPC

查看

William B. Dolan

Automatically Constructing a Corpus of Sentential Paraphrases

2005

QQP

查看

Shankar Iyer

First Quora Dataset Release: Question Pairs

2017

MNLI

查看

Adina Williams

A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference

2018

RTE

查看

Luisa Bentivogli

The Sixth PASCAL Recognizing Textual Entailment Challenge

2009

COPA

查看

Melissa Roemmele

Choice of Plausible Alternatives:An Evaluation of Commonsense Causal Reasoning

2011

WiC

查看

Mohammad Taher Pilehvar

WiC: the Word-in-Context Datasetfor Evaluating Context-Sensitive Meaning Representations

2019

NewsQA

查看

Adam Trischler

NewsQA: A Machine Comprehension Dataset

2017

TriviaQA

查看

Mandar Joshi

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

2017

SearchQA

查看

Matt Dunn

SearchQA: A New Q&A DatasetAugmented with Context from a Search Engine

2017

Natural Questions

查看

Tom Kwiatkowski

Natural Questions: a Benchmark for Question Answering Research

2019

WikiHop

查看

Johannes Welbl

Constructing Datasets for Multi-hop Reading Comprehension Across Documents

2018

CLEVR

查看

Justin Johnson

Constructing Datasets for Multi-hop Reading Comprehension Across Documents

2016

CNN / Daily Mail

查看

Karl Moritz Hermann

Teaching Machines to Read and Comprehend

2015

Gigaword

查看

Rush Alexander M.

A Neural Attention Model for Abstractive Sentence Summarization

2015

bAbI

查看

Juan Pavez

Constructing Datasets for Multi-hop Reading Comprehension Across Documents

2018

X-Sum

查看

Shashi Narayan

Don’t Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization

2018

DUC 2004 Task 1

查看

N

None

2004

Webis-TLDR-17 Corpus

查看

Michael Volske

TL;DR: Mining Reddit to Learn Automatic Summarization

2017

Adversarial NLI

查看

Yixin Nie

Adversarial NLI: A New Benchmark for Natural Language Understanding

2020

boolean-questions

查看

Christopher Clark

BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions

2019

CLUTRR

查看

Koustuv Sinha

CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text

2019

Conceptual Captions

查看

Piyush Sharma

Constructing Datasets for Multi-hop Reading Comprehension Across Documents

2018

CODAH

查看

Michael Chen

CODAH: An Adversarially-Authored Question Answering Dataset for Common Sense

2019

WebQuestions

查看

Jonathan Berant

Constructing Datasets for Multi-hop Reading Comprehension Across Documents

2013

CommonsenseQA

查看

Alon Talmor

Constructing Datasets for Multi-hop Reading Comprehension Across Documents

2019

VizWiz

查看

Danna Gurari

Constructing Datasets for Multi-hop Reading Comprehension Across Documents

2018

RSICD

查看

Xiaoqiang Lu

Constructing Datasets for Multi-hop Reading Comprehension Across Documents

2018

ReferItGame

查看

Sahar Kazemzadeh

Constructing Datasets for Multi-hop Reading Comprehension Across Documents

2014

SentiCap

查看

Alexander Mathews

Constructing Datasets for Multi-hop Reading Comprehension Across Documents