Stream of Stories

The data we created is composed of a stream of 10,000 sentences, organized into 564 stories. Each story is composed of a list of not-repeated facts, involving 130 entity and 27 relation instances that belong to a pre-designed ontology (\textit{not provided to the system}). Facts in a story mostly talk about a certain entity, usually referred as “main entity”, and that can also appear in other stories. Entities and relations are mentioned with different surface forms (synonyms, sub-portions of names, etc…).

You can download the dataset at the following link:
dataset

It is a .zip file containing a README and the dataset itself.

SAILab