-
Time-Aware Language Models as Temporal Knowledge Bases
-
https://github.com/salesforce/ctrl
-
Attention Is All You Need
-
https://github.com/chiphuyen/lazynlp
-
https://github.com/jcpeterson/openwebtext
-
Teaching Machines to Read and Comprehend
-
https://aclanthology.org/2020.wmt-1.1.pdf
-
2008-sandhaus.pdf
-
Newsroom: A Dataset of 1.3 Million Summaries with Diverse Extractive Strategies
-
Amazon Reviews: Image-based Recommendations on Styles and Substitutes
-
ELI5: Long Form Question Answering
-
https://github.com/mrqa/MRQA-Shared-Task-2019
-
SQuAD: 100,000+ Questions for Machine Comprehension of Text
-
NewsQA: A Machine Comprehension Dataset
-
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
-
SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine
-
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
-
Natural Questions: A Benchmark for Question Answering Research
-
2019-keskar-table7-datasetsandcontrolcodesmetadata.png
-
https://web.archive.org/web/20110109074405/http://www.urlesque.com/2011/01/06/whats-a-subreddit-how-reddit-works/
-
https://www.reddit.com/r/keto/
-
https://www.reddit.com/r/politics/
-