rvl_cdip

ข้อมูลอ้างอิง:

ใช้คำสั่งต่อไปนี้เพื่อโหลดชุดข้อมูลนี้ใน TFDS:

ds = tfds.load('huggingface:rvl_cdip')
  • คำอธิบาย :
The RVL-CDIP (Ryerson Vision Lab Complex Document Information Processing) dataset consists of 400,000 grayscale images in 16 classes, with 25,000 images per class. There are 320,000 training images, 40,000 validation images, and 40,000 test images.
แยก ตัวอย่าง
'test' 40000
'train' 320,000
'validation' 40000
  • คุณสมบัติ :
{
    "image": {
        "decode": true,
        "id": null,
        "_type": "Image"
    },
    "label": {
        "num_classes": 16,
        "names": [
            "letter",
            "form",
            "email",
            "handwritten",
            "advertisement",
            "scientific report",
            "scientific publication",
            "specification",
            "file folder",
            "news article",
            "budget",
            "invoice",
            "presentation",
            "questionnaire",
            "resume",
            "memo"
        ],
        "id": null,
        "_type": "ClassLabel"
    }
}