Extract dataset usage details from documents
Explore extraction model performance on the holdout test set