컨셉_12분

참조:

다음 명령을 사용하여 TFDS에서 이 데이터세트를 로드합니다.

ds = tfds.load('huggingface:conceptual_12m')

설명 :

Conceptual 12M is a large-scale dataset of 12 million
image-text pairs specifically meant to be used for visionand-language pre-training.
Its data collection pipeline is a relaxed version of the one used in Conceptual Captions 3M.

라이선스 : 데이터 소스가 Google LLC("Google")임을 인정하더라도 데이터 세트는 어떤 목적으로든 자유롭게 사용할 수 있습니다. 데이터 세트는 명시적 또는 묵시적 보증 없이 "있는 그대로" 제공됩니다. Google은 데이터세트 사용으로 인해 발생하는 직간접적인 손해에 대해 모든 책임을 지지 않습니다.
버전 : 0.0.0
분할 :

나뉘다	예
`'train'`	12423374

특징 :

{
    "image_url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "caption": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}