Visual_genome

อ้างอิง:

ภูมิภาค_descriptions_v1.0.0

ใช้คำสั่งต่อไปนี้เพื่อโหลดชุดข้อมูลนี้ใน TFDS:

ds = tfds.load('huggingface:visual_genome/region_descriptions_v1.0.0')
  • คำอธิบาย :
Visual Genome enable to model objects and relationships between objects.
They collect dense annotations of objects, attributes, and relationships within each image.
Specifically, the dataset contains over 108K images where each image has an average of 35 objects, 26 attributes, and 21 pairwise relationships between objects.
  • ใบอนุญาต : Creative Commons Attribution 4.0 ใบอนุญาตระหว่างประเทศ
  • เวอร์ชัน : 1.0.0
  • แยก :
แยก ตัวอย่าง
'train' 108077
  • คุณสมบัติ :
{
    "image": {
        "decode": true,
        "id": null,
        "_type": "Image"
    },
    "image_id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "width": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "height": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "coco_id": {
        "dtype": "int64",
        "id": null,
        "_type": "Value"
    },
    "flickr_id": {
        "dtype": "int64",
        "id": null,
        "_type": "Value"
    },
    "regions": [
        {
            "region_id": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            },
            "image_id": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            },
            "phrase": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "x": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            },
            "y": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            },
            "width": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            },
            "height": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            }
        }
    ]
}