Nampdn-ai / tiny-peS2o

Best of peS2o

A small subset ~10% from original allenai/peS2o that have been curated for long-term human value. I’m making it more accessible for SLM development purpose.

@techreport{peS2o,
    author = {Luca Soldaini and Kyle Lo},
    year = 2023,
    title = {{peS2o (Pretraining Efficiently on S2ORC) Dataset}},
    institution = {{Allen Institute for AI}},
    note = {ODC-By, \url{https://github.com/allenai/pes2o}}
}