imagenet2012_subset

설명 :

Imagenet2012Subset은 원본 ImageNet ILSVRC 2012 데이터세트의 하위 집합입니다. 데이터 세트는 원본 ImageNet ILSVRC 2012 데이터 세트와 동일한 검증 세트를 공유합니다. 그러나 훈련 세트는 레이블 균형 방식으로 서브샘플링됩니다. 1pct 구성, 1% 또는 12811에서 이미지가 샘플링되고 대부분의 클래스에는 동일한 수의 이미지(평균 12.8)가 있으며 일부 클래스에는 다른 클래스보다 무작위로 1개의 예가 더 있습니다. 10pct 구성, ~10% 또는 128116에서 대부분의 클래스는 동일한 수의 이미지(평균 128)를 가지며 일부 클래스는 무작위로 다른 클래스보다 1개의 더 많은 예를 갖습니다.

이는 준지도 학습의 벤치마크로 사용되며 원래 SimCLR 논문( https://arxiv.org/abs/2002.05709 )에서 사용되었습니다.

홈페이지 : http://image-net.org/
소스 코드 : tfds.datasets.imagenet2012_subset.Builder
버전 :
- 2.0.0 : 유효성 검사 라벨을 수정합니다.
- 2.0.1 : 인코딩 수정. 사용자 관점에서는 변화가 없습니다.
- 3.0.0 : ~12개 이미지(CMYK -> RGB)의 색상화를 수정합니다. 일관성을 위해 형식을 수정합니다(단일 png 이미지를 Jpeg로 변환). 아카이브에서 직접 읽는 속도가 빨라집니다.
- 4.0.0 : (미공개)
- 5.0.0 (기본값): 새로운 분할 API( https://tensorflow.org/datasets/splits )
- 5.1.0 : 테스트 분할이 추가되었습니다.
수동 다운로드 지침 : 이 데이터세트에서는 소스 데이터를 download_config.manual_dir 에 수동으로 다운로드해야 합니다(기본값은 ~/tensorflow_datasets/downloads/manual/ ).
manual_dir에는 ILSVRC2012_img_train.tar 및 ILSVRC2012_img_val.tar라는 두 개의 파일이 포함되어야 합니다. 데이터 세트를 다운로드할 수 있는 링크를 얻으려면 https://image-net.org/download-images 에 등록해야 합니다.
자동 캐시 ( 문서 ): 아니요
기능 구조 :

FeaturesDict({
    'file_name': Text(shape=(), dtype=string),
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=1000),
})

기능 문서 :

특징	수업	모양	Dtype
	특징Dict
파일 이름	텍스트		끈
영상	영상	(없음, 없음, 3)	uint8
상표	클래스 라벨		정수64

감독되는 키 ( as_supervised doc 참조): ('image', 'label')
인용 :

@article{chen2020simple,
  title={A Simple Framework for Contrastive Learning of Visual Representations},
  author={Chen, Ting and Kornblith, Simon and Norouzi, Mohammad and Hinton, Geoffrey},
  journal={arXiv preprint arXiv:2002.05709},
  year={2020}
}
@article{ILSVRC15,
  Author = {Olga Russakovsky and Jia Deng and Hao Su and Jonathan Krause and Sanjeev Satheesh and Sean Ma and Zhiheng Huang and Andrej Karpathy and Aditya Khosla and Michael Bernstein and Alexander C. Berg and Li Fei-Fei},
  Title = { {ImageNet Large Scale Visual Recognition Challenge} },
  Year = {2015},
  journal   = {International Journal of Computer Vision (IJCV)},
  doi = {10.1007/s11263-015-0816-y},
  volume={115},
  number={3},
  pages={211-252}
}

imagenet2012_subset/1pct(기본 구성)

구성 설명 : 전체 ImageNet 훈련 세트의 1pct.
다운로드 크기 : 254.22 KiB
데이터세트 크기 : 7.61 GiB
분할 :

나뉘다	예
`'train'`	12,811
`'validation'`	50,000

그림 ( tfds.show_examples ):

예 ( tfds.as_dataframe ):

imagenet2012_subset/10pct

구성 설명 : 전체 ImageNet 훈련 세트의 10%.
다운로드 크기 : 2.48 MiB
데이터세트 크기 : 19.91 GiB
분할 :

나뉘다	예
`'train'`	128,116
`'validation'`	50,000

그림 ( tfds.show_examples ):

예 ( tfds.as_dataframe ):