wikipedia

  • Description:

Wikipedia dataset containing cleaned articles of all languages. The datasets are built from the Wikipedia dump (https://dumps.wikimedia.org/) with one split per language. Each example contains the content of one full Wikipedia article with cleaning to strip markdown and unwanted sections (references, etc.).

FeaturesDict({
    'text': Text(shape=(), dtype=string),
    'title': Text(shape=(), dtype=string),
})
  • Feature documentation:
Feature Class Shape Dtype Description
FeaturesDict
text Text string
title Text string
@ONLINE {wikidump,
    author = "Wikimedia Foundation",
    title  = "Wikimedia Downloads",
    url    = "https://dumps.wikimedia.org"
}

wikipedia/20230601.en (default config)

  • Config description: Wikipedia dataset for en, parsed from 20230601 dump.

  • Download size: 20.53 GiB

  • Dataset size: 19.88 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 6,672,479

wikipedia/20230601.ab

  • Config description: Wikipedia dataset for ab, parsed from 20230601 dump.

  • Download size: 3.25 MiB

  • Dataset size: 3.96 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 7,559

wikipedia/20230601.ace

  • Config description: Wikipedia dataset for ace, parsed from 20230601 dump.

  • Download size: 3.46 MiB

  • Dataset size: 4.32 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 14,011

wikipedia/20230601.ady

  • Config description: Wikipedia dataset for ady, parsed from 20230601 dump.

  • Download size: 1.04 MiB

  • Dataset size: 559.38 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 609

wikipedia/20230601.af

  • Config description: Wikipedia dataset for af, parsed from 20230601 dump.

  • Download size: 128.42 MiB

  • Dataset size: 216.77 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 132,782

wikipedia/20230601.ak

  • Config description: Wikipedia dataset for ak, parsed from 20230601 dump.

  • Download size: 255.26 KiB

  • Dataset size: 153 bytes

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1

wikipedia/20230601.als

  • Config description: Wikipedia dataset for als, parsed from 20230601 dump.

  • Download size: 58.20 MiB

  • Dataset size: 78.05 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 32,335

wikipedia/20230601.am

  • Config description: Wikipedia dataset for am, parsed from 20230601 dump.

  • Download size: 8.17 MiB

  • Dataset size: 21.97 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 13,847

wikipedia/20230601.an

  • Config description: Wikipedia dataset for an, parsed from 20230601 dump.

  • Download size: 41.43 MiB

  • Dataset size: 55.22 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 60,634

wikipedia/20230601.ang

  • Config description: Wikipedia dataset for ang, parsed from 20230601 dump.

  • Download size: 4.78 MiB

  • Dataset size: 2.70 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,026

wikipedia/20230601.ar

  • Config description: Wikipedia dataset for ar, parsed from 20230601 dump.

  • Download size: 1.52 GiB

  • Dataset size: 2.93 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 2,237,839

wikipedia/20230601.arc

  • Config description: Wikipedia dataset for arc, parsed from 20230601 dump.

  • Download size: 1.16 MiB

  • Dataset size: 885.92 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,704

wikipedia/20230601.arz

  • Config description: Wikipedia dataset for arz, parsed from 20230601 dump.

  • Download size: 232.11 MiB

  • Dataset size: 1.19 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 1,635,653

wikipedia/20230601.as

  • Config description: Wikipedia dataset for as, parsed from 20230601 dump.

  • Download size: 35.49 MiB

  • Dataset size: 81.79 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 12,041

wikipedia/20230601.ast

  • Config description: Wikipedia dataset for ast, parsed from 20230601 dump.

  • Download size: 222.42 MiB

  • Dataset size: 455.49 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 148,016

wikipedia/20230601.atj

  • Config description: Wikipedia dataset for atj, parsed from 20230601 dump.

  • Download size: 718.23 KiB

  • Dataset size: 957.72 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,965

wikipedia/20230601.av

  • Config description: Wikipedia dataset for av, parsed from 20230601 dump.

  • Download size: 8.17 MiB

  • Dataset size: 5.82 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,976

wikipedia/20230601.ay

  • Config description: Wikipedia dataset for ay, parsed from 20230601 dump.

  • Download size: 2.50 MiB

  • Dataset size: 4.26 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 5,453

wikipedia/20230601.az

  • Config description: Wikipedia dataset for az, parsed from 20230601 dump.

  • Download size: 249.24 MiB

  • Dataset size: 408.29 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 231,404

wikipedia/20230601.azb

  • Config description: Wikipedia dataset for azb, parsed from 20230601 dump.

  • Download size: 100.21 MiB

  • Dataset size: 162.59 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 271,703

wikipedia/20230601.ba

  • Config description: Wikipedia dataset for ba, parsed from 20230601 dump.

  • Download size: 97.34 MiB

  • Dataset size: 280.42 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 70,618

wikipedia/20230601.bar

  • Config description: Wikipedia dataset for bar, parsed from 20230601 dump.

  • Download size: 34.64 MiB

  • Dataset size: 35.56 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 41,136

wikipedia/20230601.bcl

  • Config description: Wikipedia dataset for bcl, parsed from 20230601 dump.

  • Download size: 16.95 MiB

  • Dataset size: 17.41 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 13,940

wikipedia/20230601.be

  • Config description: Wikipedia dataset for be, parsed from 20230601 dump.

  • Download size: 268.64 MiB

  • Dataset size: 568.27 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 231,631

wikipedia/20230601.bg

  • Config description: Wikipedia dataset for bg, parsed from 20230601 dump.

  • Download size: 417.03 MiB

  • Dataset size: 1.02 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 421,120

wikipedia/20230601.bh

  • Config description: Wikipedia dataset for bh, parsed from 20230601 dump.

  • Download size: 17.60 MiB

  • Dataset size: 14.83 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 8,446

wikipedia/20230601.bi

  • Config description: Wikipedia dataset for bi, parsed from 20230601 dump.

  • Download size: 627.92 KiB

  • Dataset size: 362.13 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,543

wikipedia/20230601.bjn

  • Config description: Wikipedia dataset for bjn, parsed from 20230601 dump.

  • Download size: 6.10 MiB

  • Dataset size: 6.09 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 10,912

wikipedia/20230601.bm

  • Config description: Wikipedia dataset for bm, parsed from 20230601 dump.

  • Download size: 763.34 KiB

  • Dataset size: 455.91 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,221

wikipedia/20230601.bn

  • Config description: Wikipedia dataset for bn, parsed from 20230601 dump.

  • Download size: 340.26 MiB

  • Dataset size: 889.35 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 226,385

wikipedia/20230601.bo

  • Config description: Wikipedia dataset for bo, parsed from 20230601 dump.

  • Download size: 14.15 MiB

  • Dataset size: 123.96 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 12,435

wikipedia/20230601.bpy

  • Config description: Wikipedia dataset for bpy, parsed from 20230601 dump.

  • Download size: 5.38 MiB

  • Dataset size: 37.74 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 25,699

wikipedia/20230601.br

  • Config description: Wikipedia dataset for br, parsed from 20230601 dump.

  • Download size: 57.99 MiB

  • Dataset size: 81.19 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 92,433

wikipedia/20230601.bs

  • Config description: Wikipedia dataset for bs, parsed from 20230601 dump.

  • Download size: 146.44 MiB

  • Dataset size: 191.33 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 205,927

wikipedia/20230601.bug

  • Config description: Wikipedia dataset for bug, parsed from 20230601 dump.

  • Download size: 2.10 MiB

  • Dataset size: 2.94 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 16,120

wikipedia/20230601.bxr

  • Config description: Wikipedia dataset for bxr, parsed from 20230601 dump.

  • Download size: 5.13 MiB

  • Dataset size: 6.43 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,601

wikipedia/20230601.ca

  • Config description: Wikipedia dataset for ca, parsed from 20230601 dump.

  • Download size: 1.11 GiB

  • Dataset size: 1.81 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 851,793

wikipedia/20230601.cdo

  • Config description: Wikipedia dataset for cdo, parsed from 20230601 dump.

  • Download size: 5.27 MiB

  • Dataset size: 4.32 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 17,671

wikipedia/20230601.ce

  • Config description: Wikipedia dataset for ce, parsed from 20230601 dump.

  • Download size: 93.21 MiB

  • Dataset size: 649.56 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 603,356

wikipedia/20230601.ceb

  • Config description: Wikipedia dataset for ceb, parsed from 20230601 dump.

  • Download size: 2.04 GiB

  • Dataset size: 4.10 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 6,124,071

wikipedia/20230601.ch

  • Config description: Wikipedia dataset for ch, parsed from 20230601 dump.

  • Download size: 768.73 KiB

  • Dataset size: 182.82 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 573

wikipedia/20230601.cho

  • Config description: Wikipedia dataset for cho, parsed from 20230601 dump.

  • Download size: 27.65 KiB

  • Dataset size: 7.44 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 14

wikipedia/20230601.chr

  • Config description: Wikipedia dataset for chr, parsed from 20230601 dump.

  • Download size: 727.82 KiB

  • Dataset size: 689.03 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,114

wikipedia/20230601.chy

  • Config description: Wikipedia dataset for chy, parsed from 20230601 dump.

  • Download size: 388.58 KiB

  • Dataset size: 121.07 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 801

wikipedia/20230601.ckb

  • Config description: Wikipedia dataset for ckb, parsed from 20230601 dump.

  • Download size: 52.62 MiB

  • Dataset size: 94.99 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 60,761

wikipedia/20230601.co

  • Config description: Wikipedia dataset for co, parsed from 20230601 dump.

  • Download size: 5.26 MiB

  • Dataset size: 7.64 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 7,491

wikipedia/20230601.cr

  • Config description: Wikipedia dataset for cr, parsed from 20230601 dump.

  • Download size: 317.96 KiB

  • Dataset size: 39.74 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 182

wikipedia/20230601.crh

  • Config description: Wikipedia dataset for crh, parsed from 20230601 dump.

  • Download size: 8.44 MiB

  • Dataset size: 7.98 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 25,647

wikipedia/20230601.cs

  • Config description: Wikipedia dataset for cs, parsed from 20230601 dump.

  • Download size: 1.04 GiB

  • Dataset size: 1.46 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 693,228

wikipedia/20230601.csb

  • Config description: Wikipedia dataset for csb, parsed from 20230601 dump.

  • Download size: 2.36 MiB

  • Dataset size: 3.58 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 5,846

wikipedia/20230601.cu

  • Config description: Wikipedia dataset for cu, parsed from 20230601 dump.

  • Download size: 802.56 KiB

  • Dataset size: 1.01 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 2,307

wikipedia/20230601.cv

  • Config description: Wikipedia dataset for cv, parsed from 20230601 dump.

  • Download size: 34.06 MiB

  • Dataset size: 75.23 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 56,342

wikipedia/20230601.cy

  • Config description: Wikipedia dataset for cy, parsed from 20230601 dump.

  • Download size: 123.06 MiB

  • Dataset size: 298.92 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 324,367

wikipedia/20230601.da

  • Config description: Wikipedia dataset for da, parsed from 20230601 dump.

  • Download size: 404.99 MiB

  • Dataset size: 519.15 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 291,814

wikipedia/20230601.de

  • Config description: Wikipedia dataset for de, parsed from 20230601 dump.

  • Download size: 6.50 GiB

  • Dataset size: 9.00 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 3,703,022

wikipedia/20230601.din

  • Config description: Wikipedia dataset for din, parsed from 20230601 dump.

  • Download size: 570.67 KiB

  • Dataset size: 534.96 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 506

wikipedia/20230601.diq

  • Config description: Wikipedia dataset for diq, parsed from 20230601 dump.

  • Download size: 12.13 MiB

  • Dataset size: 18.18 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 43,859

wikipedia/20230601.dsb

  • Config description: Wikipedia dataset for dsb, parsed from 20230601 dump.

  • Download size: 3.90 MiB

  • Dataset size: 3.21 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,604

wikipedia/20230601.dty

  • Config description: Wikipedia dataset for dty, parsed from 20230601 dump.

  • Download size: 7.20 MiB

  • Dataset size: 6.32 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,629

wikipedia/20230601.dv

  • Config description: Wikipedia dataset for dv, parsed from 20230601 dump.

  • Download size: 4.70 MiB

  • Dataset size: 12.91 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,347

wikipedia/20230601.dz

  • Config description: Wikipedia dataset for dz, parsed from 20230601 dump.

  • Download size: 1.14 MiB

  • Dataset size: 7.98 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 778

wikipedia/20230601.ee

  • Config description: Wikipedia dataset for ee, parsed from 20230601 dump.

  • Download size: 780.39 KiB

  • Dataset size: 795.50 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,165

wikipedia/20230601.el

  • Config description: Wikipedia dataset for el, parsed from 20230601 dump.

  • Download size: 493.25 MiB

  • Dataset size: 1.23 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 307,672

wikipedia/20230601.eml

  • Config description: Wikipedia dataset for eml, parsed from 20230601 dump.

  • Download size: 9.46 MiB

  • Dataset size: 3.37 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 15,249

wikipedia/20230601.eo

  • Config description: Wikipedia dataset for eo, parsed from 20230601 dump.

  • Download size: 334.95 MiB

  • Dataset size: 500.92 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 471,532

wikipedia/20230601.es

  • Config description: Wikipedia dataset for es, parsed from 20230601 dump.

  • Download size: 4.05 GiB

  • Dataset size: 5.67 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 3,267,500

wikipedia/20230601.et

  • Config description: Wikipedia dataset for et, parsed from 20230601 dump.

  • Download size: 261.00 MiB

  • Dataset size: 423.78 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 370,007

wikipedia/20230601.eu

  • Config description: Wikipedia dataset for eu, parsed from 20230601 dump.

  • Download size: 287.57 MiB

  • Dataset size: 533.18 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 524,083

wikipedia/20230601.ext

  • Config description: Wikipedia dataset for ext, parsed from 20230601 dump.

  • Download size: 3.05 MiB

  • Dataset size: 3.97 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,034

wikipedia/20230601.fa

  • Config description: Wikipedia dataset for fa, parsed from 20230601 dump.

  • Download size: 1.09 GiB

  • Dataset size: 1.89 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 2,786,413

wikipedia/20230601.ff

  • Config description: Wikipedia dataset for ff, parsed from 20230601 dump.

  • Download size: 1.35 MiB

  • Dataset size: 1.31 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,484

wikipedia/20230601.fi

  • Config description: Wikipedia dataset for fi, parsed from 20230601 dump.

  • Download size: 872.65 MiB

  • Dataset size: 1.07 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 762,166

wikipedia/20230601.fj

  • Config description: Wikipedia dataset for fj, parsed from 20230601 dump.

  • Download size: 1010.29 KiB

  • Dataset size: 558.58 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,283

wikipedia/20230601.fo

  • Config description: Wikipedia dataset for fo, parsed from 20230601 dump.

  • Download size: 15.18 MiB

  • Dataset size: 14.46 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 13,954

wikipedia/20230601.fr

  • Config description: Wikipedia dataset for fr, parsed from 20230601 dump.

  • Download size: 5.64 GiB

  • Dataset size: 7.44 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 2,527,241

wikipedia/20230601.frp

  • Config description: Wikipedia dataset for frp, parsed from 20230601 dump.

  • Download size: 4.31 MiB

  • Dataset size: 3.67 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 9,011

wikipedia/20230601.frr

  • Config description: Wikipedia dataset for frr, parsed from 20230601 dump.

  • Download size: 13.09 MiB

  • Dataset size: 10.04 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 19,586

wikipedia/20230601.fur

  • Config description: Wikipedia dataset for fur, parsed from 20230601 dump.

  • Download size: 2.70 MiB

  • Dataset size: 3.87 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,243

wikipedia/20230601.fy

  • Config description: Wikipedia dataset for fy, parsed from 20230601 dump.

  • Download size: 67.61 MiB

  • Dataset size: 125.68 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 51,517

wikipedia/20230601.ga

  • Config description: Wikipedia dataset for ga, parsed from 20230601 dump.

  • Download size: 36.02 MiB

  • Dataset size: 57.47 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 68,254

wikipedia/20230601.gag

  • Config description: Wikipedia dataset for gag, parsed from 20230601 dump.

  • Download size: 2.21 MiB

  • Dataset size: 2.31 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,135

wikipedia/20230601.gan

  • Config description: Wikipedia dataset for gan, parsed from 20230601 dump.

  • Download size: 4.44 MiB

  • Dataset size: 2.53 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 6,786

wikipedia/20230601.gd

  • Config description: Wikipedia dataset for gd, parsed from 20230601 dump.

  • Download size: 9.64 MiB

  • Dataset size: 13.53 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 16,021

wikipedia/20230601.gl

  • Config description: Wikipedia dataset for gl, parsed from 20230601 dump.

  • Download size: 320.32 MiB

  • Dataset size: 473.58 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 270,549

wikipedia/20230601.glk

  • Config description: Wikipedia dataset for glk, parsed from 20230601 dump.

  • Download size: 2.89 MiB

  • Dataset size: 5.64 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 8,430

wikipedia/20230601.gn

  • Config description: Wikipedia dataset for gn, parsed from 20230601 dump.

  • Download size: 4.86 MiB

  • Dataset size: 6.48 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 6,503

wikipedia/20230601.gom

  • Config description: Wikipedia dataset for gom, parsed from 20230601 dump.

  • Download size: 6.87 MiB

  • Dataset size: 29.14 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,253

wikipedia/20230601.gor

  • Config description: Wikipedia dataset for gor, parsed from 20230601 dump.

  • Download size: 4.07 MiB

  • Dataset size: 5.36 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 14,680

wikipedia/20230601.got

  • Config description: Wikipedia dataset for got, parsed from 20230601 dump.

  • Download size: 761.65 KiB

  • Dataset size: 1.38 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,005

wikipedia/20230601.gu

  • Config description: Wikipedia dataset for gu, parsed from 20230601 dump.

  • Download size: 33.01 MiB

  • Dataset size: 113.11 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 30,360

wikipedia/20230601.gv

  • Config description: Wikipedia dataset for gv, parsed from 20230601 dump.

  • Download size: 7.25 MiB

  • Dataset size: 5.78 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 5,955

wikipedia/20230601.ha

  • Config description: Wikipedia dataset for ha, parsed from 20230601 dump.

  • Download size: 32.30 MiB

  • Dataset size: 59.46 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 27,905

wikipedia/20230601.hak

  • Config description: Wikipedia dataset for hak, parsed from 20230601 dump.

  • Download size: 4.45 MiB

  • Dataset size: 4.21 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 12,928

wikipedia/20230601.haw

  • Config description: Wikipedia dataset for haw, parsed from 20230601 dump.

  • Download size: 1.24 MiB

  • Dataset size: 1.55 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 2,880

wikipedia/20230601.he

  • Config description: Wikipedia dataset for he, parsed from 20230601 dump.

  • Download size: 864.64 MiB

  • Dataset size: 1.80 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 527,620

wikipedia/20230601.hi

  • Config description: Wikipedia dataset for hi, parsed from 20230601 dump.

  • Download size: 196.49 MiB

  • Dataset size: 615.83 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 197,556

wikipedia/20230601.hif

  • Config description: Wikipedia dataset for hif, parsed from 20230601 dump.

  • Download size: 6.30 MiB

  • Dataset size: 5.23 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 10,976

wikipedia/20230601.ho

  • Config description: Wikipedia dataset for ho, parsed from 20230601 dump.

  • Download size: 19.90 KiB

  • Dataset size: 3.27 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3

wikipedia/20230601.hr

  • Config description: Wikipedia dataset for hr, parsed from 20230601 dump.

  • Download size: 306.78 MiB

  • Dataset size: 424.24 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 252,345

wikipedia/20230601.hsb

  • Config description: Wikipedia dataset for hsb, parsed from 20230601 dump.

  • Download size: 11.23 MiB

  • Dataset size: 15.13 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 15,283

wikipedia/20230601.ht

  • Config description: Wikipedia dataset for ht, parsed from 20230601 dump.

  • Download size: 20.15 MiB

  • Dataset size: 51.43 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 69,781

wikipedia/20230601.hu

  • Config description: Wikipedia dataset for hu, parsed from 20230601 dump.

  • Download size: 1.02 GiB

  • Dataset size: 1.41 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 736,016

wikipedia/20230601.hy

  • Config description: Wikipedia dataset for hy, parsed from 20230601 dump.

  • Download size: 419.70 MiB

  • Dataset size: 1.09 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 654,410

wikipedia/20230601.ia

  • Config description: Wikipedia dataset for ia, parsed from 20230601 dump.

  • Download size: 10.88 MiB

  • Dataset size: 14.90 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 27,940

wikipedia/20230601.id

  • Config description: Wikipedia dataset for id, parsed from 20230601 dump.

  • Download size: 872.37 MiB

  • Dataset size: 1.05 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 1,233,100

wikipedia/20230601.ie

  • Config description: Wikipedia dataset for ie, parsed from 20230601 dump.

  • Download size: 3.58 MiB

  • Dataset size: 5.95 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 11,706

wikipedia/20230601.ig

  • Config description: Wikipedia dataset for ig, parsed from 20230601 dump.

  • Download size: 23.43 MiB

  • Dataset size: 43.08 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 18,388

wikipedia/20230601.ii

  • Config description: Wikipedia dataset for ii, parsed from 20230601 dump.

  • Download size: 32.57 KiB

  • Dataset size: 8.31 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 14

wikipedia/20230601.ik

  • Config description: Wikipedia dataset for ik, parsed from 20230601 dump.

  • Download size: 347.67 KiB

  • Dataset size: 168.51 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 823

wikipedia/20230601.ilo

  • Config description: Wikipedia dataset for ilo, parsed from 20230601 dump.

  • Download size: 18.80 MiB

  • Dataset size: 15.96 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 15,380

wikipedia/20230601.inh

  • Config description: Wikipedia dataset for inh, parsed from 20230601 dump.

  • Download size: 5.04 MiB

  • Dataset size: 2.62 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,511

wikipedia/20230601.io

  • Config description: Wikipedia dataset for io, parsed from 20230601 dump.

  • Download size: 17.23 MiB

  • Dataset size: 35.74 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 40,808

wikipedia/20230601.is

  • Config description: Wikipedia dataset for is, parsed from 20230601 dump.

  • Download size: 55.73 MiB

  • Dataset size: 84.35 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 82,205

wikipedia/20230601.it

  • Config description: Wikipedia dataset for it, parsed from 20230601 dump.

  • Download size: 3.49 GiB

  • Dataset size: 4.48 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 2,226,602

wikipedia/20230601.iu

  • Config description: Wikipedia dataset for iu, parsed from 20230601 dump.

  • Download size: 377.69 KiB

  • Dataset size: 241.49 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 565

wikipedia/20230601.ja

  • Config description: Wikipedia dataset for ja, parsed from 20230601 dump.

  • Download size: 3.74 GiB

  • Dataset size: 6.44 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 1,714,906

wikipedia/20230601.jam

  • Config description: Wikipedia dataset for jam, parsed from 20230601 dump.

  • Download size: 976.34 KiB

  • Dataset size: 1.05 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,772

wikipedia/20230601.jbo

  • Config description: Wikipedia dataset for jbo, parsed from 20230601 dump.

  • Download size: 1.26 MiB

  • Dataset size: 2.39 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,390

wikipedia/20230601.jv

  • Config description: Wikipedia dataset for jv, parsed from 20230601 dump.

  • Download size: 56.17 MiB

  • Dataset size: 67.91 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 92,994

wikipedia/20230601.ka

  • Config description: Wikipedia dataset for ka, parsed from 20230601 dump.

  • Download size: 191.35 MiB

  • Dataset size: 660.52 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 209,358

wikipedia/20230601.kaa

  • Config description: Wikipedia dataset for kaa, parsed from 20230601 dump.

  • Download size: 3.89 MiB

  • Dataset size: 4.36 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,034

wikipedia/20230601.kab

  • Config description: Wikipedia dataset for kab, parsed from 20230601 dump.

  • Download size: 4.18 MiB

  • Dataset size: 4.15 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 5,801

wikipedia/20230601.kbd

  • Config description: Wikipedia dataset for kbd, parsed from 20230601 dump.

  • Download size: 1.78 MiB

  • Dataset size: 2.81 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,638

wikipedia/20230601.kbp

  • Config description: Wikipedia dataset for kbp, parsed from 20230601 dump.

  • Download size: 1.44 MiB

  • Dataset size: 3.41 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,918

wikipedia/20230601.kg

  • Config description: Wikipedia dataset for kg, parsed from 20230601 dump.

  • Download size: 571.71 KiB

  • Dataset size: 431.03 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,334

wikipedia/20230601.ki

  • Config description: Wikipedia dataset for ki, parsed from 20230601 dump.

  • Download size: 484.40 KiB

  • Dataset size: 407.56 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,635

wikipedia/20230601.kj

  • Config description: Wikipedia dataset for kj, parsed from 20230601 dump.

  • Download size: 18.15 KiB

  • Dataset size: 4.93 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 5

wikipedia/20230601.kk

  • Config description: Wikipedia dataset for kk, parsed from 20230601 dump.

  • Download size: 134.04 MiB

  • Dataset size: 452.13 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 277,834

wikipedia/20230601.kl

  • Config description: Wikipedia dataset for kl, parsed from 20230601 dump.

  • Download size: 577.84 KiB

  • Dataset size: 308.00 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 298

wikipedia/20230601.km

  • Config description: Wikipedia dataset for km, parsed from 20230601 dump.

  • Download size: 26.16 MiB

  • Dataset size: 97.70 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 14,778

wikipedia/20230601.kn

  • Config description: Wikipedia dataset for kn, parsed from 20230601 dump.

  • Download size: 87.92 MiB

  • Dataset size: 375.49 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 30,814

wikipedia/20230601.ko

  • Config description: Wikipedia dataset for ko, parsed from 20230601 dump.

  • Download size: 933.11 MiB

  • Dataset size: 1.33 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 1,364,337

wikipedia/20230601.koi

  • Config description: Wikipedia dataset for koi, parsed from 20230601 dump.

  • Download size: 2.50 MiB

  • Dataset size: 4.77 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,962

wikipedia/20230601.krc

  • Config description: Wikipedia dataset for krc, parsed from 20230601 dump.

  • Download size: 3.32 MiB

  • Dataset size: 4.39 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 2,371

wikipedia/20230601.ks

  • Config description: Wikipedia dataset for ks, parsed from 20230601 dump.

  • Download size: 3.87 MiB

  • Dataset size: 2.04 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,060

wikipedia/20230601.ksh

  • Config description: Wikipedia dataset for ksh, parsed from 20230601 dump.

  • Download size: 3.40 MiB

  • Dataset size: 3.01 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,480

wikipedia/20230601.ku

  • Config description: Wikipedia dataset for ku, parsed from 20230601 dump.

  • Download size: 31.56 MiB

  • Dataset size: 40.79 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 73,438

wikipedia/20230601.kv

  • Config description: Wikipedia dataset for kv, parsed from 20230601 dump.

  • Download size: 3.96 MiB

  • Dataset size: 8.82 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 6,973

wikipedia/20230601.kw

  • Config description: Wikipedia dataset for kw, parsed from 20230601 dump.

  • Download size: 4.12 MiB

  • Dataset size: 4.49 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 7,557

wikipedia/20230601.ky

  • Config description: Wikipedia dataset for ky, parsed from 20230601 dump.

  • Download size: 37.64 MiB

  • Dataset size: 154.64 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 80,599

wikipedia/20230601.la

  • Config description: Wikipedia dataset for la, parsed from 20230601 dump.

  • Download size: 98.26 MiB

  • Dataset size: 138.85 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 137,858

wikipedia/20230601.lad

  • Config description: Wikipedia dataset for lad, parsed from 20230601 dump.

  • Download size: 3.59 MiB

  • Dataset size: 4.76 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,978

wikipedia/20230601.lb

  • Config description: Wikipedia dataset for lb, parsed from 20230601 dump.

  • Download size: 53.34 MiB

  • Dataset size: 84.30 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 68,921

wikipedia/20230601.lbe

  • Config description: Wikipedia dataset for lbe, parsed from 20230601 dump.

  • Download size: 1.86 MiB

  • Dataset size: 713.86 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,636

wikipedia/20230601.lez

  • Config description: Wikipedia dataset for lez, parsed from 20230601 dump.

  • Download size: 6.23 MiB

  • Dataset size: 9.26 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 5,083

wikipedia/20230601.lfn

  • Config description: Wikipedia dataset for lfn, parsed from 20230601 dump.

  • Download size: 4.13 MiB

  • Dataset size: 8.38 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,807

wikipedia/20230601.lg

  • Config description: Wikipedia dataset for lg, parsed from 20230601 dump.

  • Download size: 3.66 MiB

  • Dataset size: 6.35 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,017

wikipedia/20230601.li

  • Config description: Wikipedia dataset for li, parsed from 20230601 dump.

  • Download size: 16.24 MiB

  • Dataset size: 28.32 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 16,969

wikipedia/20230601.lij

  • Config description: Wikipedia dataset for lij, parsed from 20230601 dump.

  • Download size: 7.59 MiB

  • Dataset size: 10.78 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 12,463

wikipedia/20230601.lmo

  • Config description: Wikipedia dataset for lmo, parsed from 20230601 dump.

  • Download size: 26.99 MiB

  • Dataset size: 39.10 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 80,175

wikipedia/20230601.ln

  • Config description: Wikipedia dataset for ln, parsed from 20230601 dump.

  • Download size: 2.18 MiB

  • Dataset size: 1.91 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,517

wikipedia/20230601.lo

  • Config description: Wikipedia dataset for lo, parsed from 20230601 dump.

  • Download size: 6.24 MiB

  • Dataset size: 13.59 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,929

wikipedia/20230601.lrc

  • Config description: Wikipedia dataset for lrc, parsed from 20230601 dump.

  • Download size: 24.67 KiB

  • Dataset size: 107 bytes

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1

wikipedia/20230601.lt

  • Config description: Wikipedia dataset for lt, parsed from 20230601 dump.

  • Download size: 214.45 MiB

  • Dataset size: 318.46 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 242,917

wikipedia/20230601.ltg

  • Config description: Wikipedia dataset for ltg, parsed from 20230601 dump.

  • Download size: 957.25 KiB

  • Dataset size: 876.27 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,044

wikipedia/20230601.lv

  • Config description: Wikipedia dataset for lv, parsed from 20230601 dump.

  • Download size: 173.29 MiB

  • Dataset size: 212.63 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 120,514

wikipedia/20230601.mai

  • Config description: Wikipedia dataset for mai, parsed from 20230601 dump.

  • Download size: 12.47 MiB

  • Dataset size: 19.14 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 15,104

wikipedia/20230601.mdf

  • Config description: Wikipedia dataset for mdf, parsed from 20230601 dump.

  • Download size: 8.84 MiB

  • Dataset size: 3.91 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,060

wikipedia/20230601.mg

  • Config description: Wikipedia dataset for mg, parsed from 20230601 dump.

  • Download size: 30.74 MiB

  • Dataset size: 70.06 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 134,110

wikipedia/20230601.mh

  • Config description: Wikipedia dataset for mh, parsed from 20230601 dump.

  • Download size: 29.30 KiB

  • Dataset size: 11.04 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 8

wikipedia/20230601.mhr

  • Config description: Wikipedia dataset for mhr, parsed from 20230601 dump.

  • Download size: 6.86 MiB

  • Dataset size: 17.88 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 13,173

wikipedia/20230601.mi

  • Config description: Wikipedia dataset for mi, parsed from 20230601 dump.

  • Download size: 2.50 MiB

  • Dataset size: 3.78 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 7,855

wikipedia/20230601.min

  • Config description: Wikipedia dataset for min, parsed from 20230601 dump.

  • Download size: 36.43 MiB

  • Dataset size: 106.59 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 231,064

wikipedia/20230601.mk

  • Config description: Wikipedia dataset for mk, parsed from 20230601 dump.

  • Download size: 223.27 MiB

  • Dataset size: 611.37 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 182,674

wikipedia/20230601.ml

  • Config description: Wikipedia dataset for ml, parsed from 20230601 dump.

  • Download size: 176.68 MiB

  • Dataset size: 466.46 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 153,341

wikipedia/20230601.mn

  • Config description: Wikipedia dataset for mn, parsed from 20230601 dump.

  • Download size: 38.06 MiB

  • Dataset size: 84.79 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 26,747

wikipedia/20230601.mr

  • Config description: Wikipedia dataset for mr, parsed from 20230601 dump.

  • Download size: 77.14 MiB

  • Dataset size: 244.72 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 156,544

wikipedia/20230601.mrj

  • Config description: Wikipedia dataset for mrj, parsed from 20230601 dump.

  • Download size: 3.52 MiB

  • Dataset size: 8.34 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 10,828

wikipedia/20230601.ms

  • Config description: Wikipedia dataset for ms, parsed from 20230601 dump.

  • Download size: 304.62 MiB

  • Dataset size: 389.80 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 416,348

wikipedia/20230601.mt

  • Config description: Wikipedia dataset for mt, parsed from 20230601 dump.

  • Download size: 15.11 MiB

  • Dataset size: 25.91 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 6,825

wikipedia/20230601.mus

  • Config description: Wikipedia dataset for mus, parsed from 20230601 dump.

  • Download size: 15.76 KiB

  • Dataset size: 875 bytes

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 2

wikipedia/20230601.mwl

  • Config description: Wikipedia dataset for mwl, parsed from 20230601 dump.

  • Download size: 9.43 MiB

  • Dataset size: 18.72 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,785

wikipedia/20230601.my

  • Config description: Wikipedia dataset for my, parsed from 20230601 dump.

  • Download size: 60.15 MiB

  • Dataset size: 275.29 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 108,796

wikipedia/20230601.myv

  • Config description: Wikipedia dataset for myv, parsed from 20230601 dump.

  • Download size: 11.54 MiB

  • Dataset size: 10.09 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 8,821

wikipedia/20230601.mzn

  • Config description: Wikipedia dataset for mzn, parsed from 20230601 dump.

  • Download size: 8.84 MiB

  • Dataset size: 13.54 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 22,374

wikipedia/20230601.nah

  • Config description: Wikipedia dataset for nah, parsed from 20230601 dump.

  • Download size: 4.50 MiB

  • Dataset size: 2.88 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 9,903

wikipedia/20230601.nap

  • Config description: Wikipedia dataset for nap, parsed from 20230601 dump.

  • Download size: 5.54 MiB

  • Dataset size: 6.08 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 15,440

wikipedia/20230601.nds

  • Config description: Wikipedia dataset for nds, parsed from 20230601 dump.

  • Download size: 43.78 MiB

  • Dataset size: 88.33 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 92,151

wikipedia/20230601.ne

  • Config description: Wikipedia dataset for ne, parsed from 20230601 dump.

  • Download size: 43.64 MiB

  • Dataset size: 98.35 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 33,040

wikipedia/20230601.new

  • Config description: Wikipedia dataset for new, parsed from 20230601 dump.

  • Download size: 16.04 MiB

  • Dataset size: 140.54 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 73,006

wikipedia/20230601.ng

  • Config description: Wikipedia dataset for ng, parsed from 20230601 dump.

  • Download size: 92.88 KiB

  • Dataset size: 66.12 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 21

wikipedia/20230601.nl

  • Config description: Wikipedia dataset for nl, parsed from 20230601 dump.

  • Download size: 1.73 GiB

  • Dataset size: 2.45 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 2,651,793

wikipedia/20230601.nn

  • Config description: Wikipedia dataset for nn, parsed from 20230601 dump.

  • Download size: 157.18 MiB

  • Dataset size: 231.37 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 244,587

wikipedia/20230601.no

  • Config description: Wikipedia dataset for no, parsed from 20230601 dump.

  • Download size: 742.29 MiB

  • Dataset size: 1008.71 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 941,304

wikipedia/20230601.nov

  • Config description: Wikipedia dataset for nov, parsed from 20230601 dump.

  • Download size: 1.26 MiB

  • Dataset size: 850.19 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,627

wikipedia/20230601.nrm

  • Config description: Wikipedia dataset for nrm, parsed from 20230601 dump.

  • Download size: 2.01 MiB

  • Dataset size: 2.97 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,891

wikipedia/20230601.nso

  • Config description: Wikipedia dataset for nso, parsed from 20230601 dump.

  • Download size: 2.97 MiB

  • Dataset size: 2.50 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 8,620

wikipedia/20230601.nv

  • Config description: Wikipedia dataset for nv, parsed from 20230601 dump.

  • Download size: 5.72 MiB

  • Dataset size: 13.91 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 22,219

wikipedia/20230601.ny

  • Config description: Wikipedia dataset for ny, parsed from 20230601 dump.

  • Download size: 2.44 MiB

  • Dataset size: 1.59 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,133

wikipedia/20230601.oc

  • Config description: Wikipedia dataset for oc, parsed from 20230601 dump.

  • Download size: 79.06 MiB

  • Dataset size: 114.46 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 99,037

wikipedia/20230601.olo

  • Config description: Wikipedia dataset for olo, parsed from 20230601 dump.

  • Download size: 2.31 MiB

  • Dataset size: 2.95 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 5,222

wikipedia/20230601.om

  • Config description: Wikipedia dataset for om, parsed from 20230601 dump.

  • Download size: 1.84 MiB

  • Dataset size: 2.89 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,574

wikipedia/20230601.or

  • Config description: Wikipedia dataset for or, parsed from 20230601 dump.

  • Download size: 33.23 MiB

  • Dataset size: 68.75 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 32,711

wikipedia/20230601.os

  • Config description: Wikipedia dataset for os, parsed from 20230601 dump.

  • Download size: 15.45 MiB

  • Dataset size: 11.90 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 19,999

wikipedia/20230601.pa

  • Config description: Wikipedia dataset for pa, parsed from 20230601 dump.

  • Download size: 73.51 MiB

  • Dataset size: 194.67 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 59,842

wikipedia/20230601.pag

  • Config description: Wikipedia dataset for pag, parsed from 20230601 dump.

  • Download size: 1.55 MiB

  • Dataset size: 1.23 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 2,638

wikipedia/20230601.pam

  • Config description: Wikipedia dataset for pam, parsed from 20230601 dump.

  • Download size: 9.54 MiB

  • Dataset size: 7.77 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 8,935

wikipedia/20230601.pap

  • Config description: Wikipedia dataset for pap, parsed from 20230601 dump.

  • Download size: 2.68 MiB

  • Dataset size: 3.44 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,238

wikipedia/20230601.pcd

  • Config description: Wikipedia dataset for pcd, parsed from 20230601 dump.

  • Download size: 5.44 MiB

  • Dataset size: 5.42 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 5,645

wikipedia/20230601.pdc

  • Config description: Wikipedia dataset for pdc, parsed from 20230601 dump.

  • Download size: 1.26 MiB

  • Dataset size: 1.17 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 2,508

wikipedia/20230601.pfl

  • Config description: Wikipedia dataset for pfl, parsed from 20230601 dump.

  • Download size: 3.73 MiB

  • Dataset size: 3.60 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,986

wikipedia/20230601.pi

  • Config description: Wikipedia dataset for pi, parsed from 20230601 dump.

  • Download size: 833.50 KiB

  • Dataset size: 968.65 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,061

wikipedia/20230601.pih

  • Config description: Wikipedia dataset for pih, parsed from 20230601 dump.

  • Download size: 958.98 KiB

  • Dataset size: 355.87 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 930

wikipedia/20230601.pl

  • Config description: Wikipedia dataset for pl, parsed from 20230601 dump.

  • Download size: 2.28 GiB

  • Dataset size: 2.75 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 1,959,303

wikipedia/20230601.pms

  • Config description: Wikipedia dataset for pms, parsed from 20230601 dump.

  • Download size: 14.70 MiB

  • Dataset size: 31.87 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 69,200

wikipedia/20230601.pnb

  • Config description: Wikipedia dataset for pnb, parsed from 20230601 dump.

  • Download size: 100.48 MiB

  • Dataset size: 284.16 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 82,359

wikipedia/20230601.pnt

  • Config description: Wikipedia dataset for pnt, parsed from 20230601 dump.

  • Download size: 609.76 KiB

  • Dataset size: 647.97 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 551

wikipedia/20230601.ps

  • Config description: Wikipedia dataset for ps, parsed from 20230601 dump.

  • Download size: 41.25 MiB

  • Dataset size: 98.23 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 19,604

wikipedia/20230601.pt

  • Config description: Wikipedia dataset for pt, parsed from 20230601 dump.

  • Download size: 2.08 GiB

  • Dataset size: 2.57 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 1,582,349

wikipedia/20230601.qu

  • Config description: Wikipedia dataset for qu, parsed from 20230601 dump.

  • Download size: 13.60 MiB

  • Dataset size: 16.90 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 33,180

wikipedia/20230601.rm

  • Config description: Wikipedia dataset for rm, parsed from 20230601 dump.

  • Download size: 7.27 MiB

  • Dataset size: 17.24 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,939

wikipedia/20230601.rmy

  • Config description: Wikipedia dataset for rmy, parsed from 20230601 dump.

  • Download size: 649.92 KiB

  • Dataset size: 470.20 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 930

wikipedia/20230601.rn

  • Config description: Wikipedia dataset for rn, parsed from 20230601 dump.

  • Download size: 938.15 KiB

  • Dataset size: 494.18 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 805

wikipedia/20230601.ro

  • Config description: Wikipedia dataset for ro, parsed from 20230601 dump.

  • Download size: 618.85 MiB

  • Dataset size: 795.10 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 440,494

wikipedia/20230601.ru

  • Config description: Wikipedia dataset for ru, parsed from 20230601 dump.

  • Download size: 4.87 GiB

  • Dataset size: 9.52 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 3,171,747

wikipedia/20230601.rue

  • Config description: Wikipedia dataset for rue, parsed from 20230601 dump.

  • Download size: 6.84 MiB

  • Dataset size: 12.17 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 9,414

wikipedia/20230601.rw

  • Config description: Wikipedia dataset for rw, parsed from 20230601 dump.

  • Download size: 7.47 MiB

  • Dataset size: 10.29 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 7,434

wikipedia/20230601.sa

  • Config description: Wikipedia dataset for sa, parsed from 20230601 dump.

  • Download size: 16.99 MiB

  • Dataset size: 67.07 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 22,844

wikipedia/20230601.sah

  • Config description: Wikipedia dataset for sah, parsed from 20230601 dump.

  • Download size: 16.35 MiB

  • Dataset size: 44.89 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 19,362

wikipedia/20230601.sat

  • Config description: Wikipedia dataset for sat, parsed from 20230601 dump.

  • Download size: 14.50 MiB

  • Dataset size: 32.48 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 8,264

wikipedia/20230601.sc

  • Config description: Wikipedia dataset for sc, parsed from 20230601 dump.

  • Download size: 7.66 MiB

  • Dataset size: 11.97 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 7,542

wikipedia/20230601.scn

  • Config description: Wikipedia dataset for scn, parsed from 20230601 dump.

  • Download size: 12.35 MiB

  • Dataset size: 16.80 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 31,656

wikipedia/20230601.sco

  • Config description: Wikipedia dataset for sco, parsed from 20230601 dump.

  • Download size: 54.04 MiB

  • Dataset size: 42.39 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 36,210

wikipedia/20230601.sd

  • Config description: Wikipedia dataset for sd, parsed from 20230601 dump.

  • Download size: 19.99 MiB

  • Dataset size: 34.99 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 22,577

wikipedia/20230601.se

  • Config description: Wikipedia dataset for se, parsed from 20230601 dump.

  • Download size: 4.04 MiB

  • Dataset size: 3.43 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 8,738

wikipedia/20230601.sg

  • Config description: Wikipedia dataset for sg, parsed from 20230601 dump.

  • Download size: 391.07 KiB

  • Dataset size: 116.56 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 548

wikipedia/20230601.sh

  • Config description: Wikipedia dataset for sh, parsed from 20230601 dump.

  • Download size: 436.58 MiB

  • Dataset size: 834.05 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 3,942,887

wikipedia/20230601.si

  • Config description: Wikipedia dataset for si, parsed from 20230601 dump.

  • Download size: 51.31 MiB

  • Dataset size: 129.54 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 32,762

wikipedia/20230601.simple

  • Config description: Wikipedia dataset for simple, parsed from 20230601 dump.

  • Download size: 263.29 MiB

  • Dataset size: 267.22 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 231,282

wikipedia/20230601.sk

  • Config description: Wikipedia dataset for sk, parsed from 20230601 dump.

  • Download size: 305.04 MiB

  • Dataset size: 393.08 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 266,385

wikipedia/20230601.sl

  • Config description: Wikipedia dataset for sl, parsed from 20230601 dump.

  • Download size: 282.01 MiB

  • Dataset size: 430.47 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 221,104

wikipedia/20230601.sm

  • Config description: Wikipedia dataset for sm, parsed from 20230601 dump.

  • Download size: 971.49 KiB

  • Dataset size: 846.60 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,144

wikipedia/20230601.sn

  • Config description: Wikipedia dataset for sn, parsed from 20230601 dump.

  • Download size: 4.86 MiB

  • Dataset size: 8.67 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 10,924

wikipedia/20230601.so

  • Config description: Wikipedia dataset for so, parsed from 20230601 dump.

  • Download size: 12.37 MiB

  • Dataset size: 14.00 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 10,830

wikipedia/20230601.sq

  • Config description: Wikipedia dataset for sq, parsed from 20230601 dump.

  • Download size: 115.08 MiB

  • Dataset size: 189.84 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 126,441

wikipedia/20230601.sr

  • Config description: Wikipedia dataset for sr, parsed from 20230601 dump.

  • Download size: 956.10 MiB

  • Dataset size: 1.85 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 3,183,435

wikipedia/20230601.srn

  • Config description: Wikipedia dataset for srn, parsed from 20230601 dump.

  • Download size: 684.53 KiB

  • Dataset size: 634.11 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,295

wikipedia/20230601.ss

  • Config description: Wikipedia dataset for ss, parsed from 20230601 dump.

  • Download size: 1.01 MiB

  • Dataset size: 826.41 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 720

wikipedia/20230601.st

  • Config description: Wikipedia dataset for st, parsed from 20230601 dump.

  • Download size: 2.37 MiB

  • Dataset size: 884.77 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,073

wikipedia/20230601.stq

  • Config description: Wikipedia dataset for stq, parsed from 20230601 dump.

  • Download size: 3.63 MiB

  • Dataset size: 4.84 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,596

wikipedia/20230601.su

  • Config description: Wikipedia dataset for su, parsed from 20230601 dump.

  • Download size: 28.99 MiB

  • Dataset size: 44.43 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 68,415

wikipedia/20230601.sv

  • Config description: Wikipedia dataset for sv, parsed from 20230601 dump.

  • Download size: 1.47 GiB

  • Dataset size: 2.11 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 4,314,124

wikipedia/20230601.sw

  • Config description: Wikipedia dataset for sw, parsed from 20230601 dump.

  • Download size: 44.71 MiB

  • Dataset size: 68.59 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 77,340

wikipedia/20230601.szl

  • Config description: Wikipedia dataset for szl, parsed from 20230601 dump.

  • Download size: 13.90 MiB

  • Dataset size: 19.16 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 57,960

wikipedia/20230601.ta

  • Config description: Wikipedia dataset for ta, parsed from 20230601 dump.

  • Download size: 205.59 MiB

  • Dataset size: 743.92 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 188,878

wikipedia/20230601.tcy

  • Config description: Wikipedia dataset for tcy, parsed from 20230601 dump.

  • Download size: 5.38 MiB

  • Dataset size: 11.28 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 2,177

wikipedia/20230601.te

  • Config description: Wikipedia dataset for te, parsed from 20230601 dump.

  • Download size: 138.71 MiB

  • Dataset size: 670.19 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 112,186

wikipedia/20230601.tet

  • Config description: Wikipedia dataset for tet, parsed from 20230601 dump.

  • Download size: 1.40 MiB

  • Dataset size: 1.39 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,499

wikipedia/20230601.tg

  • Config description: Wikipedia dataset for tg, parsed from 20230601 dump.

  • Download size: 52.40 MiB

  • Dataset size: 131.82 MiB

  • Auto-cached (documentation): Only when shuffle_files=False (train)

  • Splits:

Split Examples
'train' 121,808

wikipedia/20230601.th

  • Config description: Wikipedia dataset for th, parsed from 20230601 dump.

  • Download size: 368.24 MiB

  • Dataset size: 958.28 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 288,381

wikipedia/20230601.ti

  • Config description: Wikipedia dataset for ti, parsed from 20230601 dump.

  • Download size: 893.44 KiB

  • Dataset size: 625.02 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 433

wikipedia/20230601.tk

  • Config description: Wikipedia dataset for tk, parsed from 20230601 dump.

  • Download size: 6.42 MiB

  • Dataset size: 11.93 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 7,838

wikipedia/20230601.tl

  • Config description: Wikipedia dataset for tl, parsed from 20230601 dump.

  • Download size: 72.37 MiB

  • Dataset size: 78.95 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 44,818

wikipedia/20230601.tn

  • Config description: Wikipedia dataset for tn, parsed from 20230601 dump.

  • Download size: 2.67 MiB

  • Dataset size: 3.38 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,162

wikipedia/20230601.to

  • Config description: Wikipedia dataset for to, parsed from 20230601 dump.

  • Download size: 947.73 KiB

  • Dataset size: 998.04 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,848

wikipedia/20230601.tpi

  • Config description: Wikipedia dataset for tpi, parsed from 20230601 dump.

  • Download size: 1.53 MiB

  • Dataset size: 421.04 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,390

wikipedia/20230601.tr

  • Config description: Wikipedia dataset for tr, parsed from 20230601 dump.

  • Download size: 826.83 MiB

  • Dataset size: 951.97 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 796,137

wikipedia/20230601.ts

  • Config description: Wikipedia dataset for ts, parsed from 20230601 dump.

  • Download size: 1.83 MiB

  • Dataset size: 808.68 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 771

wikipedia/20230601.tt

  • Config description: Wikipedia dataset for tt, parsed from 20230601 dump.

  • Download size: 116.89 MiB

  • Dataset size: 622.23 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 545,625

wikipedia/20230601.tum

  • Config description: Wikipedia dataset for tum, parsed from 20230601 dump.

  • Download size: 12.19 MiB

  • Dataset size: 7.91 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 14,169

wikipedia/20230601.tw

  • Config description: Wikipedia dataset for tw, parsed from 20230601 dump.

  • Download size: 4.09 MiB

  • Dataset size: 6.17 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,610

wikipedia/20230601.ty

  • Config description: Wikipedia dataset for ty, parsed from 20230601 dump.

  • Download size: 603.73 KiB

  • Dataset size: 299.14 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,349

wikipedia/20230601.tyv

  • Config description: Wikipedia dataset for tyv, parsed from 20230601 dump.

  • Download size: 5.38 MiB

  • Dataset size: 13.24 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 4,179

wikipedia/20230601.udm

  • Config description: Wikipedia dataset for udm, parsed from 20230601 dump.

  • Download size: 3.97 MiB

  • Dataset size: 6.58 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 6,967

wikipedia/20230601.ug

  • Config description: Wikipedia dataset for ug, parsed from 20230601 dump.

  • Download size: 8.70 MiB

  • Dataset size: 39.40 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 8,575

wikipedia/20230601.uk

  • Config description: Wikipedia dataset for uk, parsed from 20230601 dump.

  • Download size: 2.04 GiB

  • Dataset size: 4.54 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 1,943,992

wikipedia/20230601.ur

  • Config description: Wikipedia dataset for ur, parsed from 20230601 dump.

  • Download size: 219.05 MiB

  • Dataset size: 388.95 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 389,321

wikipedia/20230601.uz

  • Config description: Wikipedia dataset for uz, parsed from 20230601 dump.

  • Download size: 243.85 MiB

  • Dataset size: 365.50 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 366,942

wikipedia/20230601.ve

  • Config description: Wikipedia dataset for ve, parsed from 20230601 dump.

  • Download size: 351.96 KiB

  • Dataset size: 323.14 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 836

wikipedia/20230601.vec

  • Config description: Wikipedia dataset for vec, parsed from 20230601 dump.

  • Download size: 27.83 MiB

  • Dataset size: 35.37 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 78,401

wikipedia/20230601.vep

  • Config description: Wikipedia dataset for vep, parsed from 20230601 dump.

  • Download size: 8.47 MiB

  • Dataset size: 11.02 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 8,397

wikipedia/20230601.vi

  • Config description: Wikipedia dataset for vi, parsed from 20230601 dump.

  • Download size: 941.06 MiB

  • Dataset size: 1.49 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 1,521,735

wikipedia/20230601.vls

  • Config description: Wikipedia dataset for vls, parsed from 20230601 dump.

  • Download size: 7.50 MiB

  • Dataset size: 10.97 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 8,303

wikipedia/20230601.vo

  • Config description: Wikipedia dataset for vo, parsed from 20230601 dump.

  • Download size: 12.62 MiB

  • Dataset size: 17.83 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 33,641

wikipedia/20230601.wa

  • Config description: Wikipedia dataset for wa, parsed from 20230601 dump.

  • Download size: 7.77 MiB

  • Dataset size: 11.34 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 11,872

wikipedia/20230601.war

  • Config description: Wikipedia dataset for war, parsed from 20230601 dump.

  • Download size: 265.28 MiB

  • Dataset size: 413.17 MiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 1,266,240

wikipedia/20230601.wo

  • Config description: Wikipedia dataset for wo, parsed from 20230601 dump.

  • Download size: 2.09 MiB

  • Dataset size: 3.33 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,719

wikipedia/20230601.wuu

  • Config description: Wikipedia dataset for wuu, parsed from 20230601 dump.

  • Download size: 16.54 MiB

  • Dataset size: 21.95 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 46,554

wikipedia/20230601.xal

  • Config description: Wikipedia dataset for xal, parsed from 20230601 dump.

  • Download size: 2.09 MiB

  • Dataset size: 1.24 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 2,825

wikipedia/20230601.xh

  • Config description: Wikipedia dataset for xh, parsed from 20230601 dump.

  • Download size: 1.97 MiB

  • Dataset size: 2.17 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 1,602

wikipedia/20230601.xmf

  • Config description: Wikipedia dataset for xmf, parsed from 20230601 dump.

  • Download size: 13.42 MiB

  • Dataset size: 34.69 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 21,496

wikipedia/20230601.yi

  • Config description: Wikipedia dataset for yi, parsed from 20230601 dump.

  • Download size: 13.27 MiB

  • Dataset size: 35.12 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 26,070

wikipedia/20230601.yo

  • Config description: Wikipedia dataset for yo, parsed from 20230601 dump.

  • Download size: 17.19 MiB

  • Dataset size: 16.26 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 33,183

wikipedia/20230601.za

  • Config description: Wikipedia dataset for za, parsed from 20230601 dump.

  • Download size: 1.15 MiB

  • Dataset size: 1.20 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,189

wikipedia/20230601.zea

  • Config description: Wikipedia dataset for zea, parsed from 20230601 dump.

  • Download size: 2.95 MiB

  • Dataset size: 4.97 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 7,045

wikipedia/20230601.zh

  • Config description: Wikipedia dataset for zh, parsed from 20230601 dump.

  • Download size: 2.62 GiB

  • Dataset size: 2.52 GiB

  • Auto-cached (documentation): No

  • Splits:

Split Examples
'train' 1,935,042

wikipedia/20230601.zu

  • Config description: Wikipedia dataset for zu, parsed from 20230601 dump.

  • Download size: 5.18 MiB

  • Dataset size: 6.35 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 11,381

wikipedia/20230201.en

  • Config description: Wikipedia dataset for en, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.aa

  • Config description: Wikipedia dataset for aa, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ab

  • Config description: Wikipedia dataset for ab, parsed from 20230201 dump.

  • Download size: 2.65 MiB

  • Dataset size: 3.90 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 7,502

wikipedia/20230201.ace

  • Config description: Wikipedia dataset for ace, parsed from 20230201 dump.

  • Download size: 3.44 MiB

  • Dataset size: 4.31 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 13,991

wikipedia/20230201.ady

  • Config description: Wikipedia dataset for ady, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.af

  • Config description: Wikipedia dataset for af, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ak

  • Config description: Wikipedia dataset for ak, parsed from 20230201 dump.

  • Download size: 670.43 KiB

  • Dataset size: 850.62 KiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 739

wikipedia/20230201.als

  • Config description: Wikipedia dataset for als, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.am

  • Config description: Wikipedia dataset for am, parsed from 20230201 dump.

  • Download size: 8.05 MiB

  • Dataset size: 21.56 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 13,741

wikipedia/20230201.an

  • Config description: Wikipedia dataset for an, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ang

  • Config description: Wikipedia dataset for ang, parsed from 20230201 dump.

  • Download size: 4.62 MiB

  • Dataset size: 2.65 MiB

  • Auto-cached (documentation): Yes

  • Splits:

Split Examples
'train' 3,906

wikipedia/20230201.ar

  • Config description: Wikipedia dataset for ar, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.arc

  • Config description: Wikipedia dataset for arc, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.arz

  • Config description: Wikipedia dataset for arz, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.as

  • Config description: Wikipedia dataset for as, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ast

  • Config description: Wikipedia dataset for ast, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.atj

  • Config description: Wikipedia dataset for atj, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.av

  • Config description: Wikipedia dataset for av, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ay

  • Config description: Wikipedia dataset for ay, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.az

  • Config description: Wikipedia dataset for az, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.azb

  • Config description: Wikipedia dataset for azb, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ba

  • Config description: Wikipedia dataset for ba, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bar

  • Config description: Wikipedia dataset for bar, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bcl

  • Config description: Wikipedia dataset for bcl, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.be

  • Config description: Wikipedia dataset for be, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bg

  • Config description: Wikipedia dataset for bg, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bh

  • Config description: Wikipedia dataset for bh, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bi

  • Config description: Wikipedia dataset for bi, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bjn

  • Config description: Wikipedia dataset for bjn, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bm

  • Config description: Wikipedia dataset for bm, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bn

  • Config description: Wikipedia dataset for bn, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bo

  • Config description: Wikipedia dataset for bo, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bpy

  • Config description: Wikipedia dataset for bpy, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.br

  • Config description: Wikipedia dataset for br, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bs

  • Config description: Wikipedia dataset for bs, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bug

  • Config description: Wikipedia dataset for bug, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.bxr

  • Config description: Wikipedia dataset for bxr, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ca

  • Config description: Wikipedia dataset for ca, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.cdo

  • Config description: Wikipedia dataset for cdo, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ce

  • Config description: Wikipedia dataset for ce, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ceb

  • Config description: Wikipedia dataset for ceb, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ch

  • Config description: Wikipedia dataset for ch, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.cho

  • Config description: Wikipedia dataset for cho, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.chr

  • Config description: Wikipedia dataset for chr, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.chy

  • Config description: Wikipedia dataset for chy, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ckb

  • Config description: Wikipedia dataset for ckb, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.co

  • Config description: Wikipedia dataset for co, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.cr

  • Config description: Wikipedia dataset for cr, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.crh

  • Config description: Wikipedia dataset for crh, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.cs

  • Config description: Wikipedia dataset for cs, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.csb

  • Config description: Wikipedia dataset for csb, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.cu

  • Config description: Wikipedia dataset for cu, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.cv

  • Config description: Wikipedia dataset for cv, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.cy

  • Config description: Wikipedia dataset for cy, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.da

  • Config description: Wikipedia dataset for da, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.de

  • Config description: Wikipedia dataset for de, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.din

  • Config description: Wikipedia dataset for din, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.diq

  • Config description: Wikipedia dataset for diq, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.dsb

  • Config description: Wikipedia dataset for dsb, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.dty

  • Config description: Wikipedia dataset for dty, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.dv

  • Config description: Wikipedia dataset for dv, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.dz

  • Config description: Wikipedia dataset for dz, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ee

  • Config description: Wikipedia dataset for ee, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.el

  • Config description: Wikipedia dataset for el, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.eml

  • Config description: Wikipedia dataset for eml, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.eo

  • Config description: Wikipedia dataset for eo, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.es

  • Config description: Wikipedia dataset for es, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.et

  • Config description: Wikipedia dataset for et, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.eu

  • Config description: Wikipedia dataset for eu, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ext

  • Config description: Wikipedia dataset for ext, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.fa

  • Config description: Wikipedia dataset for fa, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ff

  • Config description: Wikipedia dataset for ff, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.fi

  • Config description: Wikipedia dataset for fi, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.fj

  • Config description: Wikipedia dataset for fj, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.fo

  • Config description: Wikipedia dataset for fo, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.fr

  • Config description: Wikipedia dataset for fr, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.frp

  • Config description: Wikipedia dataset for frp, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.frr

  • Config description: Wikipedia dataset for frr, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.fur

  • Config description: Wikipedia dataset for fur, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.fy

  • Config description: Wikipedia dataset for fy, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ga

  • Config description: Wikipedia dataset for ga, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.gag

  • Config description: Wikipedia dataset for gag, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.gan

  • Config description: Wikipedia dataset for gan, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.gd

  • Config description: Wikipedia dataset for gd, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.gl

  • Config description: Wikipedia dataset for gl, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.glk

  • Config description: Wikipedia dataset for glk, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.gn

  • Config description: Wikipedia dataset for gn, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.gom

  • Config description: Wikipedia dataset for gom, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.gor

  • Config description: Wikipedia dataset for gor, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.got

  • Config description: Wikipedia dataset for got, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.gu

  • Config description: Wikipedia dataset for gu, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.gv

  • Config description: Wikipedia dataset for gv, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ha

  • Config description: Wikipedia dataset for ha, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.hak

  • Config description: Wikipedia dataset for hak, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.haw

  • Config description: Wikipedia dataset for haw, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.he

  • Config description: Wikipedia dataset for he, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.hi

  • Config description: Wikipedia dataset for hi, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.hif

  • Config description: Wikipedia dataset for hif, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ho

  • Config description: Wikipedia dataset for ho, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.hr

  • Config description: Wikipedia dataset for hr, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.hsb

  • Config description: Wikipedia dataset for hsb, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ht

  • Config description: Wikipedia dataset for ht, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.hu

  • Config description: Wikipedia dataset for hu, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.hy

  • Config description: Wikipedia dataset for hy, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ia

  • Config description: Wikipedia dataset for ia, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.id

  • Config description: Wikipedia dataset for id, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ie

  • Config description: Wikipedia dataset for ie, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ig

  • Config description: Wikipedia dataset for ig, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ii

  • Config description: Wikipedia dataset for ii, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ik

  • Config description: Wikipedia dataset for ik, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ilo

  • Config description: Wikipedia dataset for ilo, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.inh

  • Config description: Wikipedia dataset for inh, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.io

  • Config description: Wikipedia dataset for io, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.is

  • Config description: Wikipedia dataset for is, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.it

  • Config description: Wikipedia dataset for it, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.iu

  • Config description: Wikipedia dataset for iu, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ja

  • Config description: Wikipedia dataset for ja, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.jam

  • Config description: Wikipedia dataset for jam, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.jbo

  • Config description: Wikipedia dataset for jbo, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.jv

  • Config description: Wikipedia dataset for jv, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ka

  • Config description: Wikipedia dataset for ka, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.kaa

  • Config description: Wikipedia dataset for kaa, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.kab

  • Config description: Wikipedia dataset for kab, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.kbd

  • Config description: Wikipedia dataset for kbd, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.kbp

  • Config description: Wikipedia dataset for kbp, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.kg

  • Config description: Wikipedia dataset for kg, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ki

  • Config description: Wikipedia dataset for ki, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.kj

  • Config description: Wikipedia dataset for kj, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.kk

  • Config description: Wikipedia dataset for kk, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.kl

  • Config description: Wikipedia dataset for kl, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.km

  • Config description: Wikipedia dataset for km, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.kn

  • Config description: Wikipedia dataset for kn, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ko

  • Config description: Wikipedia dataset for ko, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.koi

  • Config description: Wikipedia dataset for koi, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.krc

  • Config description: Wikipedia dataset for krc, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ks

  • Config description: Wikipedia dataset for ks, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ksh

  • Config description: Wikipedia dataset for ksh, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ku

  • Config description: Wikipedia dataset for ku, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.kv

  • Config description: Wikipedia dataset for kv, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.kw

  • Config description: Wikipedia dataset for kw, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ky

  • Config description: Wikipedia dataset for ky, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.la

  • Config description: Wikipedia dataset for la, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.lad

  • Config description: Wikipedia dataset for lad, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.lb

  • Config description: Wikipedia dataset for lb, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.lbe

  • Config description: Wikipedia dataset for lbe, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.lez

  • Config description: Wikipedia dataset for lez, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.lfn

  • Config description: Wikipedia dataset for lfn, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.lg

  • Config description: Wikipedia dataset for lg, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.li

  • Config description: Wikipedia dataset for li, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.lij

  • Config description: Wikipedia dataset for lij, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.lmo

  • Config description: Wikipedia dataset for lmo, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ln

  • Config description: Wikipedia dataset for ln, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.lo

  • Config description: Wikipedia dataset for lo, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.lrc

  • Config description: Wikipedia dataset for lrc, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.lt

  • Config description: Wikipedia dataset for lt, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ltg

  • Config description: Wikipedia dataset for ltg, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.lv

  • Config description: Wikipedia dataset for lv, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mai

  • Config description: Wikipedia dataset for mai, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mdf

  • Config description: Wikipedia dataset for mdf, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mg

  • Config description: Wikipedia dataset for mg, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mh

  • Config description: Wikipedia dataset for mh, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mhr

  • Config description: Wikipedia dataset for mhr, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mi

  • Config description: Wikipedia dataset for mi, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.min

  • Config description: Wikipedia dataset for min, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mk

  • Config description: Wikipedia dataset for mk, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ml

  • Config description: Wikipedia dataset for ml, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mn

  • Config description: Wikipedia dataset for mn, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mr

  • Config description: Wikipedia dataset for mr, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mrj

  • Config description: Wikipedia dataset for mrj, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ms

  • Config description: Wikipedia dataset for ms, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mt

  • Config description: Wikipedia dataset for mt, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mus

  • Config description: Wikipedia dataset for mus, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mwl

  • Config description: Wikipedia dataset for mwl, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.my

  • Config description: Wikipedia dataset for my, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.myv

  • Config description: Wikipedia dataset for myv, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.mzn

  • Config description: Wikipedia dataset for mzn, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.na

  • Config description: Wikipedia dataset for na, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.nah

  • Config description: Wikipedia dataset for nah, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.nap

  • Config description: Wikipedia dataset for nap, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.nds

  • Config description: Wikipedia dataset for nds, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ne

  • Config description: Wikipedia dataset for ne, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.new

  • Config description: Wikipedia dataset for new, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ng

  • Config description: Wikipedia dataset for ng, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.nl

  • Config description: Wikipedia dataset for nl, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.nn

  • Config description: Wikipedia dataset for nn, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.no

  • Config description: Wikipedia dataset for no, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.nov

  • Config description: Wikipedia dataset for nov, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.nrm

  • Config description: Wikipedia dataset for nrm, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.nso

  • Config description: Wikipedia dataset for nso, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.nv

  • Config description: Wikipedia dataset for nv, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ny

  • Config description: Wikipedia dataset for ny, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.oc

  • Config description: Wikipedia dataset for oc, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.olo

  • Config description: Wikipedia dataset for olo, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.om

  • Config description: Wikipedia dataset for om, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.or

  • Config description: Wikipedia dataset for or, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.os

  • Config description: Wikipedia dataset for os, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pa

  • Config description: Wikipedia dataset for pa, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pag

  • Config description: Wikipedia dataset for pag, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pam

  • Config description: Wikipedia dataset for pam, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pap

  • Config description: Wikipedia dataset for pap, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pcd

  • Config description: Wikipedia dataset for pcd, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pdc

  • Config description: Wikipedia dataset for pdc, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pfl

  • Config description: Wikipedia dataset for pfl, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pi

  • Config description: Wikipedia dataset for pi, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pih

  • Config description: Wikipedia dataset for pih, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pl

  • Config description: Wikipedia dataset for pl, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pms

  • Config description: Wikipedia dataset for pms, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pnb

  • Config description: Wikipedia dataset for pnb, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pnt

  • Config description: Wikipedia dataset for pnt, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ps

  • Config description: Wikipedia dataset for ps, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.pt

  • Config description: Wikipedia dataset for pt, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.qu

  • Config description: Wikipedia dataset for qu, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.rm

  • Config description: Wikipedia dataset for rm, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.rmy

  • Config description: Wikipedia dataset for rmy, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.rn

  • Config description: Wikipedia dataset for rn, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ro

  • Config description: Wikipedia dataset for ro, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.ru

  • Config description: Wikipedia dataset for ru, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.rue

  • Config description: Wikipedia dataset for rue, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.rw

  • Config description: Wikipedia dataset for rw, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.sa

  • Config description: Wikipedia dataset for sa, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.sah

  • Config description: Wikipedia dataset for sah, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.sat

  • Config description: Wikipedia dataset for sat, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.sc

  • Config description: Wikipedia dataset for sc, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.scn

  • Config description: Wikipedia dataset for scn, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.sco

  • Config description: Wikipedia dataset for sco, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.sd

  • Config description: Wikipedia dataset for sd, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.se

  • Config description: Wikipedia dataset for se, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples

wikipedia/20230201.sg

  • Config description: Wikipedia dataset for sg, parsed from 20230201 dump.

  • Download size: Unknown size

  • Dataset size: Unknown size

  • Auto-cached