Prepares the Kleister NDA dataset for LLM-based extraction. Validates labels against a Pydantic schema and delivers partitioned Parquet with co-located PDFs.
Updated Apr 12, 2026 - Python
Benchmarks agentic and single-pass extraction strategies across LLM providers on the Kleister NDA dataset.