common_voice

  • Description

Mozilla Common Voice dataset.

Splits

common_voice / en (default config)

  • Config description: Language code: en

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=17),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})
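
The config headings on this page follow a `common_voice / <language code>` pattern. As a minimal sketch, assuming TensorFlow Datasets is installed and still ships this dataset, the feature dictionary for a config could be inspected as shown below. The helper names `qualified_name` and `load_features` are hypothetical and exist only for this illustration.

```python
def qualified_name(config: str = "en") -> str:
    """Build the 'dataset/config' name string for a Common Voice config.

    'en' is used as the default because this page marks it as the default config.
    """
    return f"common_voice/{config}"


def load_features(config: str = "en"):
    """Return the feature description for one config (hypothetical helper).

    tensorflow_datasets is imported lazily so that qualified_name() can be
    used without the library installed. Assumption: the installed TFDS
    version still registers the 'common_voice' dataset.
    """
    import tensorflow_datasets as tfds

    builder = tfds.builder(qualified_name(config))
    return builder.info.features


if __name__ == "__main__":
    print(qualified_name())
    print(qualified_name("ga-IE"))
```

Calling `load_features("en")` is expected to return a structure like the `FeaturesDict` listed above, though the exact fields depend on the installed library version.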

common_voice / de

  • Config description: Language code: de

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=10),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / fr

  • Config description: Language code: fr

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=19),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / cy

  • Config description: Language code: cy

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=2),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / br

  • Config description: Language code: br

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=1),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / cv

  • Config description: Language code: cv

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=0),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / tr

  • Config description: Language code: tr

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=1),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / tt

  • Config description: Language code: tt

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=0),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / ky

  • Config description: Language code: ky

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=1),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / ga-IE

  • Config description: Language code: ga-IE

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / kab

  • Config description: Language code: kab

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=1),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / ca

  • Config description: Language code: ca

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=6),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / zh-TW

  • Config description: Language code: zh-TW

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=1),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / sl

  • Config description: Language code: sl

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=1),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / it

  • Config description: Language code: it

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=1),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / nl

  • Config description: Language code: nl

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / cnh

  • Config description: Language code: cnh

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=1),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})

common_voice / eo

  • Config description: Language code: eo

  • Features

FeaturesDict({
    'accent': ClassLabel(shape=(), dtype=tf.int64, num_classes=2),
    'age': Text(shape=(), dtype=tf.string),
    'client_id': Text(shape=(), dtype=tf.string),
    'downvotes': tf.int32,
    'gender': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
    'sentence': Text(shape=(), dtype=tf.string),
    'upvotes': tf.int32,
    'voice': Audio(shape=(None,), dtype=tf.int64),
})
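
Across the configs listed on this page, the only field that differs is the number of classes of the `accent` label. The sketch below collects those counts in a plain dictionary (values copied from the feature listings above) and shows one way to pick out configs whose accent label carries no information. The names `ACCENT_CLASSES`, `configs_without_accent_labels`, and `configs_with_multiple_accents` are hypothetical and are not part of any library.

```python
# num_classes of the 'accent' ClassLabel per config, copied from this page.
ACCENT_CLASSES = {
    "en": 17, "de": 10, "fr": 19, "cy": 2, "br": 1, "cv": 0,
    "tr": 1, "tt": 0, "ky": 1, "ga-IE": 3, "kab": 1, "ca": 6,
    "zh-TW": 1, "sl": 1, "it": 1, "nl": 3, "cnh": 1, "eo": 2,
}


def configs_without_accent_labels() -> list:
    """Configs whose accent label has zero classes."""
    return [name for name, n in ACCENT_CLASSES.items() if n == 0]


def configs_with_multiple_accents() -> list:
    """Configs where the accent label can distinguish at least two classes."""
    return [name for name, n in ACCENT_CLASSES.items() if n >= 2]


if __name__ == "__main__":
    print("no accent classes:", configs_without_accent_labels())
    print("two or more accent classes:", configs_with_multiple_accents())
```

A filter like this may be useful before training an accent classifier, since a config with zero or one accent class cannot supply a meaningful target for that task.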