SCROLLS

This repository contains the official code of the paper: "SCROLLS: Standardized CompaRison Over Long Language Sequences".

Setup instructions are in the baselines and evaluator folders.

For the live leaderboard, check out the official website.

Loading the SCROLLS Benchmark Datasets

via 🤗 Datasets (huggingface/datasets) library (recommended):

Installation: follow the 🤗 Datasets installation instructions.

Usage:

from datasets import load_dataset

# Load one SCROLLS task by passing its configuration name as the second argument.
# Options are: ["gov_report", "summ_screen_fd", "qmsum", "narrative_qa", "qasper", "quality", "contract_nli"]
qasper_dataset = load_dataset("tau/scrolls", "qasper")
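To work with several tasks, the same call can be wrapped in a small helper that checks the configuration name first. This is a minimal sketch, not part of the repository: `SCROLLS_TASKS`, `load_scrolls_task`, and `split_sizes` are hypothetical names introduced here, and the helper assumes the `datasets` library is installed and the Hub is reachable. Depending on the installed `datasets` version, `load_dataset` may need additional arguments.

```python
# Configuration names copied from the options list above.
SCROLLS_TASKS = [
    "gov_report", "summ_screen_fd", "qmsum", "narrative_qa",
    "qasper", "quality", "contract_nli",
]


def load_scrolls_task(name):
    """Load one SCROLLS task (hypothetical helper), rejecting unknown names early."""
    if name not in SCROLLS_TASKS:
        raise ValueError(f"Unknown SCROLLS task {name!r}; expected one of {SCROLLS_TASKS}")
    # Imported lazily so the name check runs even if the library is missing.
    from datasets import load_dataset
    return load_dataset("tau/scrolls", name)


def split_sizes(dataset):
    """Map each split name to its number of examples (hypothetical helper)."""
    return {split: len(rows) for split, rows in dataset.items()}
```

For example, `split_sizes(load_scrolls_task("qasper"))` would report how many examples each split of Qasper contains, once the data has been downloaded.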

via ZIP files, where each split is in a JSONL file:
- GovReport
- SummScreenFD
- QMSum
- NarrativeQA
- Qasper
- QuALITY
- ContractNLI
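A downloaded archive can be read with the Python standard library alone. The sketch below is an illustration under stated assumptions: `read_jsonl_splits` is a hypothetical helper, the internal layout and file names of the archives are not specified here, and the only assumption made is that each split is stored as a member ending in `.jsonl` with one JSON object per line.

```python
import io
import json
import zipfile


def read_jsonl_splits(zip_source):
    """Return {member name: list of parsed records} for every .jsonl member.

    zip_source may be a path to a ZIP file or a binary file-like object.
    """
    splits = {}
    with zipfile.ZipFile(zip_source) as archive:
        for member in archive.namelist():
            if not member.endswith(".jsonl"):
                continue  # skip anything that is not a JSONL split
            with archive.open(member) as raw:
                lines = io.TextIOWrapper(raw, encoding="utf-8")
                # One JSON object per non-empty line.
                splits[member] = [json.loads(line) for line in lines if line.strip()]
    return splits
```

Calling `read_jsonl_splits` on the path of a downloaded archive returns a dictionary keyed by member name, so the available splits can be discovered from the keys rather than assumed in advance.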

Citation

@inproceedings{shaham-etal-2022-scrolls,
    title = "{SCROLLS}: Standardized {C}ompa{R}ison Over Long Language Sequences",
    author = "Shaham, Uri  and
      Segal, Elad  and
      Ivgi, Maor  and
      Efrat, Avia  and
      Yoran, Ori  and
      Haviv, Adi  and
      Gupta, Ankit  and
      Xiong, Wenhan  and
      Geva, Mor  and
      Berant, Jonathan  and
      Levy, Omer",
    booktitle = "Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing",
    month = dec,
    year = "2022",
    address = "Abu Dhabi, United Arab Emirates",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.emnlp-main.823",
    pages = "12007--12021",
}

When citing SCROLLS, please make sure to cite all the original dataset papers; their BibTeX entries are in scrolls_datasets.bib.
