웹비드

설명 :

WebVid는 웹에서 가져온 텍스트 설명이 포함된 짧은 비디오의 대규모 데이터 세트입니다. 비디오는 콘텐츠가 다양하고 풍부합니다.

WebVid-10M에는 다음이 포함됩니다.

10.7M 비디오 캡션 쌍. 총 동영상 시간 52,000시간.

홈페이지 : https://m-bain.github.io/webvid-dataset/
소스 코드 : tfds.datasets.webvid.Builder
버전 :
- 1.0.0 (기본값): 최초 릴리스.
다운로드 크기 : Unknown size
데이터세트 크기 : Unknown size
수동 다운로드 지침 : 이 데이터 세트는 원본 데이터를 download_config.manual_dir 에 수동으로 다운로드해야 합니다(기본값은 ~/tensorflow_datasets/downloads/manual/ ).
https://m-bain.github.io/webvid-dataset/ 의 다운로드 지침에 따라 데이터를 가져옵니다. mp4 파일이 manual_dir/webvid manual_dir/webvid/*/*_*/*.mp4 에 배치하십시오.

첫 번째 디렉토리는 일반적으로 임의의 부분 디렉토리(샤드 다운로드용)이고 두 번째 디렉토리는 페이지 디렉토리(밑줄 주위에 두 개의 숫자)이며 그 안에 하나 이상의 mp4 파일이 있습니다.

자동 캐시됨 ( 문서 ): 알 수 없음
분할 :

나뉘다	예

기능 구조 :

FeaturesDict({
    'caption': Text(shape=(), dtype=string),
    'id': Text(shape=(), dtype=string),
    'url': Text(shape=(), dtype=string),
    'video': Video(Image(shape=(360, 640, 3), dtype=uint8)),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
표제	텍스트		끈
ID	텍스트		끈
URL	텍스트		끈
동영상	동영상(이미지)	(없음, 360, 640, 3)	uint8

감독된 키 ( as_supervised 문서 참조): None
그림 ( tfds.show_examples ): 지원되지 않습니다.
예 ( tfds.as_dataframe ): 누락.
인용 :

@misc{bain2021frozen,
      title={Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval},
      author={Max Bain and Arsha Nagrani and Gül Varol and Andrew Zisserman},
      year={2021},
      eprint={2104.00650},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

설명 :

WebVid는 웹에서 가져온 텍스트 설명이 있는 짧은 비디오의 대규모 데이터 세트입니다. 비디오는 콘텐츠가 다양하고 풍부합니다.

WebVid-10M에는 다음이 포함됩니다.

10.7M 비디오 캡션 쌍. 총 동영상 시간 52,000시간.

홈페이지 : https://m-bain.github.io/webvid-dataset/
소스 코드 : tfds.datasets.webvid.Builder
버전 :
- 1.0.0 (기본값): 최초 릴리스.
다운로드 크기 : Unknown size
데이터세트 크기 : Unknown size
수동 다운로드 지침 : 이 데이터 세트는 원본 데이터를 download_config.manual_dir 에 수동으로 다운로드해야 합니다(기본값은 ~/tensorflow_datasets/downloads/manual/ ).
https://m-bain.github.io/webvid-dataset/ 의 다운로드 지침에 따라 데이터를 가져옵니다. mp4 파일이 manual_dir/webvid manual_dir/webvid/*/*_*/*.mp4 에 배치하십시오.

자동 캐시됨 ( 문서 ): 알 수 없음
분할 :

나뉘다	예

기능 구조 :

FeaturesDict({
    'caption': Text(shape=(), dtype=string),
    'id': Text(shape=(), dtype=string),
    'url': Text(shape=(), dtype=string),
    'video': Video(Image(shape=(360, 640, 3), dtype=uint8)),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
표제	텍스트		끈
ID	텍스트		끈
URL	텍스트		끈
동영상	동영상(이미지)	(없음, 360, 640, 3)	uint8

감독된 키 ( as_supervised 문서 참조): None
그림 ( tfds.show_examples ): 지원되지 않습니다.
예 ( tfds.as_dataframe ): 누락.
인용 :

@misc{bain2021frozen,
      title={Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval},
      author={Max Bain and Arsha Nagrani and Gül Varol and Andrew Zisserman},
      year={2021},
      eprint={2104.00650},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}