ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text Data

Dataset Statistics

ForecastQA dataset is composed of 10,392 questions, divided into the standard 80/10/10 split of train, dev, and test data. Our 10k questions are roughly evenly split between multiple-choice and yes-no binary questions. Over 17K distinct words were used to construct our questions and we have 218 unique time constraints associated with them.

Types of Questions

To better understand the types of questions in ForecastQA, we examined popular beginnings of sentences and created a treemap plot. As is shown, nearly half of our questions start with the word will (44%), a result of over half of our questions being yes-no questions.

Types of Reasoning

To examine the reasoning required to answer our questions we sampled 100 questions and manually annotated them with reasoning types. Due to the forecasting nature of our dataset, we are particularly interested in questions containing temporal reasoning and thus spend more time looking into these questions.

Analysis

Dataset Statistics

Types of Questions

Types of Reasoning