europa_eac_tm

参考文献:

en2bg

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2bg')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 4061
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "bg"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2cs

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2cs')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 3351
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "cs"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2da

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2da')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 3757
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "da"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

エンツーデ

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2de')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 4473
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

エンツーエル

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2el')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 2818
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

エンス

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2es')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 4303
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

エント

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2et')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 2270
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2fi

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2fi')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1458年
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2fr

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2fr')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 4476
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2hu

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2hu')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 3455
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2is

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2is')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 2206
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "is"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

エンイット

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2it')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 2170
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2lt

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2lt')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 3386
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2lv

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2lv')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 3880
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2mt

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2mt')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1722年
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "mt"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2nb

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2nb')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 642
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "nb"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2nl

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2nl')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1805年
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2pl

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2pl')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 4027
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2pt

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2pt')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 3501
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

エンロ

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2ro')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 3159
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2sk

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2sk')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 2972
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2sl

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2sl')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 4644
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2sv

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2sv')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 2909
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2tr

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:europa_eac_tm/en2tr')
  • 説明
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 3198
  • 特徴
{
    "translation": {
        "languages": [
            "en",
            "tr"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}