cuad

Referensi:

Gunakan perintah berikut untuk memuat dataset ini di TFDS:

ds = tfds.load('huggingface:cuad')
  • Deskripsi :
Contract Understanding Atticus Dataset (CUAD) v1 is a corpus of more than 13,000 labels in 510
commercial legal contracts that have been manually labeled to identify 41 categories of important
clauses that lawyers look for when reviewing contracts in connection with corporate transactions.
  • Lisensi : CUAD dilisensikan di bawah lisensi Creative Commons Attribution 4.0 (CC BY 4.0).
  • Versi : 1.0.0
  • Perpecahan :
Membelah Contoh
'test' 4182
'train' 22450
  • Fitur :
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "context": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "answers": {
        "feature": {
            "text": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "answer_start": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            }
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}