Profile Pictureasapworks
$299

Contract Clause Dataset v1.0

Add to cart

Contract Clause Dataset v1.0

🧾 Contract Clause Dataset v1.0 β€” 66K+ Enriched Legal Clauses from SEC Agreements

A premium, production-grade dataset featuring 66,813 unique legal contract clauses extracted from 10+ years of SEC filings (10-K, 10-Q, 8-K, Exhibit 10). Each clause is cleanly parsed and metadata-enriched, enabling immediate use in legal AI, contract analysis, and LLM training.


βœ… Key Stats

  • πŸ“… Years Covered: 2010–2024
  • 🧠 Clause Count: 66,813
  • 🏷️ Clause Types (15+ categories):
    • termination: 18,663
    • dispute_resolution: 14,474
    • payment_terms: 11,074
    • liability: 6,547
    • duration, assignment, warranty, IP, governing_law, and more
  • πŸ’Ό Top Companies by Clause Count:
    • EXC: 5,029
    • CCL: 2,971
    • AAL, F, MA, MPC, ZBRA, KHC, etc.

πŸ’‘ Designed For:

  • Legal AI training – Clause classification, NER, similarity, LLM fine-tuning
  • Contract compliance & clause retrieval systems
  • RAG (Retrieval-Augmented Generation) pipelines
  • Legal search engines and enterprise lawtech solutions

πŸ“‚ What’s Included:

  • clause_dataset_final.jsonl – The complete dataset (66K+ clauses), each entry includes:
    • clause_text, clause_type, ticker, accession, filing_date, line_num, year, and source_file
  • sample_clauses_preview.jsonl – 200 diverse clause samples
  • Readme.md – Field documentation, usage, structure
  • LICENSE.txt – Terms of use and redistribution rights

πŸ’Ό License Tiers

Tier Use Case Price Early Bird Solo use, no updates$299 Individual Solo use + lifetime updates $499 Enterprise Org-wide use + internal product rights $999

See LICENSE.txt for exact terms and attribution policy.


🎁 Sample file: Contract Clause Dataset Sample
πŸ“¬ Questions or enterprise licensing? Contact: Asapuaiworks@gmail.com

Add to cart

High-quality legal clause dataset for AI, legaltech, and contract understanding

66,813 unique legal clauses from real SEC Exhibit 10 contracts
Labeled with types: termination, IP, payment, dispute, more
JSONL format, ready for AI training, classification, and RAG
Covers 200 companies across 10+ years of filings
Includes preview set, license, and documentation
Copy product URL

No refunds allowed

Due to the digital nature of this product, all sales are final. Please review the sample dataset before purchase(Link shared in Product Description)

Last updated May 17, 2025