unnatural_instructions

  • Description:

Dataset introduced in the paper "Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor" (2022). Each example pairs a natural-language instruction with its instances (inputs, optional constraints, and target outputs) and with LLM-generated reformulations of that instruction.

  • Splits:
Split      Examples
'train'    66,010
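
A minimal loading sketch, assuming the `tensorflow_datasets` package is installed and that the installed version ships the `unnatural_instructions` builder. The names `EXPECTED_TRAIN_EXAMPLES` and `load_train` are illustrative and not part of the catalog.

```python
EXPECTED_TRAIN_EXAMPLES = 66_010  # size of the 'train' split listed above


def load_train():
    """Load the 'train' split and return (dataset, number of examples)."""
    # Imported lazily so that defining this helper does not require TFDS.
    import tensorflow_datasets as tfds

    ds, info = tfds.load("unnatural_instructions", split="train", with_info=True)
    return ds, info.splits["train"].num_examples
```

Calling `load_train()` downloads and prepares the data on first use. Comparing the returned count with `EXPECTED_TRAIN_EXAMPLES` is a cheap sanity check.
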
  • Feature structure:
FeaturesDict({
    'id': Text(shape=(), dtype=string),
    'instances': Sequence({
        'constraints': Text(shape=(), dtype=string),
        'input': Text(shape=(), dtype=string),
        'instruction_with_input': Text(shape=(), dtype=string),
        'output': Text(shape=(), dtype=string),
    }),
    'instruction': Text(shape=(), dtype=string),
    'reformulations': Sequence({
        'input': Text(shape=(), dtype=string),
        'instruction': Text(shape=(), dtype=string),
        'instruction_with_input': Text(shape=(), dtype=string),
        'output': Text(shape=(), dtype=string),
    }),
})
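
TFDS returns a `Sequence` of a feature dict as a dict of parallel arrays, one per sub-feature, and string values arrive as bytes when the dataset is iterated as NumPy. The sketch below transposes such a dict into one dict per item. The helper names and the sample record are hypothetical, and the record's values are invented for illustration.

```python
def _to_str(value):
    # String features are bytes under NumPy iteration; decode them as UTF-8.
    return value.decode("utf-8") if isinstance(value, bytes) else value


def sequence_to_rows(seq):
    """Turn {'a': [a0, a1], 'b': [b0, b1]} into [{'a': a0, 'b': b0}, ...]."""
    keys = list(seq)
    columns = [seq[k] for k in keys]
    return [
        {k: _to_str(v) for k, v in zip(keys, values)}
        for values in zip(*columns)
    ]


# Hypothetical record shaped like the feature structure above (values invented).
example = {
    "id": b"0",
    "instruction": b"Decide whether the sentence is a question.",
    "instances": {
        "constraints": [b"Answer 'yes' or 'no'."],
        "input": [b"Where is the station?"],
        "instruction_with_input": [
            b"Decide whether the sentence is a question.\nWhere is the station?"
        ],
        "output": [b"yes"],
    },
    "reformulations": {
        "input": [],
        "instruction": [],
        "instruction_with_input": [],
        "output": [],
    },
}

rows = sequence_to_rows(example["instances"])
print(rows[0]["output"])  # prints: yes
```
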
  • Feature documentation:
Feature                                Class         Shape  Dtype   Description
                                       FeaturesDict
id                                     Text                 string  Unique identifier for example.
instances                              Sequence
instances/constraints                  Text                 string  Task-specific constraints.
instances/input                        Text                 string  Input to be fed into placeholders for given instruction.
instances/instruction_with_input       Text                 string  Instructions with inputs supplied to placeholders.
instances/output                       Text                 string  Target output for given task.
instruction                            Text                 string  Instruction with placeholder for inputs.
reformulations                         Sequence
reformulations/input                   Text                 string  Input to be fed into placeholders for given instruction.
reformulations/instruction             Text                 string  Instruction with placeholder for inputs.
reformulations/instruction_with_input  Text                 string  Instructions with inputs supplied to placeholders.
reformulations/output                  Text                 string  Target output for given task.
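
The field descriptions suggest one possible use: treat `instruction_with_input` as the model prompt and `output` as the target, for both instances and reformulations. The sketch below collects such pairs from a single record whose strings are already decoded. `to_pairs` and the sample record are hypothetical, and pooling both groups into one list is an assumption about usage, not something the catalog prescribes.

```python
def to_pairs(example):
    """Collect (prompt, target) string pairs from one decoded example."""
    pairs = []
    for group in ("instances", "reformulations"):
        seq = example.get(group, {})
        prompts = seq.get("instruction_with_input", [])
        targets = seq.get("output", [])
        # The two lists are parallel: item i of each belongs to the same task.
        pairs.extend(zip(prompts, targets))
    return pairs


# Hypothetical decoded record (values invented for illustration).
example = {
    "id": "0",
    "instruction": "Translate the word to French.",
    "instances": {
        "constraints": ["None."],
        "input": ["cat"],
        "instruction_with_input": ["Translate the word to French.\ncat"],
        "output": ["chat"],
    },
    "reformulations": {
        "input": ["cat"],
        "instruction": ["What is the French word for the following?"],
        "instruction_with_input": ["What is the French word for the following?\ncat"],
        "output": ["chat"],
    },
}

pairs = to_pairs(example)
print(len(pairs))  # prints: 2
```
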
  • Citation:
@misc{honovich2022unnatural,
      title = {Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor},
      author = {Honovich, Or and Scialom, Thomas and Levy, Omer and Schick, Timo},
      url = {https://arxiv.org/abs/2212.09689},
      publisher = {arXiv},
      year = {2022}
}