TFDS תומך כעת בפורמט קרואסון 🥐 ! קרא את התיעוד כדי לדעת יותר.

דף זה תורגם על ידי Cloud Translation API.

rlu_rwrl

תיאור :

RL Unplugged היא חבילת אמות מידה ללימוד חיזוק לא מקוון. ה-RL Unplugged מתוכנן סביב השיקולים הבאים: כדי להקל על השימוש, אנו מספקים למערכי נתונים עם API מאוחד אשר מקל על המתרגל לעבוד עם כל הנתונים בחבילה לאחר הקמת צינור כללי.

מערכי הנתונים פועלים לפי פורמט RLDS כדי לייצג שלבים ופרקים.

דוגמאות במערך הנתונים מייצגות מעברי SAR המאוחסנים בעת הפעלת סוכן מאומן מקוון חלקית כמתואר ב- https://arxiv.org/abs/1904.12901 . אנו פועלים לפי פורמט מערך הנתונים של RLDS, כמפורט ב- https://github.com/google-research /rlds#dataset-format

אנו משחררים 40 מערכי נתונים על 8 משימות בסך הכל -- ללא אתגר משולב ואתגר משולב קל על עמוד העגלה, הליכון, מרובע ומשימות דמוי אדם. כל משימה מכילה 5 גדלים שונים של מערכי נתונים, 1%, 5%, 20%, 40% ו-100%. שים לב שלא מובטח שמערך הנתונים הקטן יותר הוא תת-קבוצה של הגדולים יותר. לפרטים על אופן יצירת מערך הנתונים, עיין במאמר.

דף הבית : https://github.com/deepmind/deepmind-research/tree/master/rl_unplugged
קוד מקור : tfds.rl_unplugged.rlu_rwrl.RluRwrl
גרסאות :
- 1.0.0 : שחרור ראשוני.
- 1.0.1 (ברירת מחדל): מתקן באג במערך הנתונים של RLU RWRL שבו יש מזהי פרקים משוכפלים באחד ממערכי הנתונים האנושיים.
גודל הורדה : Unknown size
מפתחות בפיקוח (ראה as_supervised doc ): None
איור ( tfds.show_examples ): לא נתמך.
ציטוט :

@misc{gulcehre2020rl,
    title={RL Unplugged: Benchmarks for Offline Reinforcement Learning},
    author={Caglar Gulcehre and Ziyu Wang and Alexander Novikov and Tom Le Paine
        and  Sergio Gómez Colmenarejo and Konrad Zolna and Rishabh Agarwal and
        Josh Merel and Daniel Mankowitz and Cosmin Paduraru and Gabriel
        Dulac-Arnold and Jerry Li and Mohammad Norouzi and Matt Hoffman and
        Ofir Nachum and George Tucker and Nicolas Heess and Nando deFreitas},
    year={2020},
    eprint={2006.13888},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

rlu_rwrl/cartpole_swingup_combined_challenge_none_1_percent (תצורת ברירת המחדל)

גודל מערך נתונים : 172.43 KiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	5

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/עמדה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(2,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_5_percent

גודל ערכת נתונים: 862.13 KiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	25

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/עמדה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(2,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_20_percent

גודל ערכת נתונים : 3.37 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	100

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/עמדה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(2,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_40_percent

גודל ערכת נתונים : 6.74 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	200

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/עמדה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(2,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_100_percent

גודל ערכת נתונים : 16.84 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	500

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/עמדה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(2,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_1_percent

גודל מערך נתונים : 1.77 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	5

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(12,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מצב_אגוצנטריים	מוֹתֵחַ	(44,)	לצוף32
צעדים/תצפית/כוח_מומנט	מוֹתֵחַ	(24,)	לצוף32
צעדים/תצפית/אימו	מוֹתֵחַ	(6,)	לצוף32
צעדים/התבוננות/טורסו_זקוף	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מהירות_גו	מוֹתֵחַ	(3,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_5_percent

גודל ערכת נתונים : 8.86 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	25

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(12,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מצב_אגוצנטריים	מוֹתֵחַ	(44,)	לצוף32
צעדים/תצפית/כוח_מומנט	מוֹתֵחַ	(24,)	לצוף32
צעדים/תצפית/אימו	מוֹתֵחַ	(6,)	לצוף32
צעדים/התבוננות/טורסו_זקוף	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מהירות_גו	מוֹתֵחַ	(3,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_20_percent

גודל מערך נתונים : 35.46 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	100

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(12,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מצב_אגוצנטריים	מוֹתֵחַ	(44,)	לצוף32
צעדים/תצפית/כוח_מומנט	מוֹתֵחַ	(24,)	לצוף32
צעדים/תצפית/אימו	מוֹתֵחַ	(6,)	לצוף32
צעדים/התבוננות/טורסו_זקוף	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מהירות_גו	מוֹתֵחַ	(3,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_40_percent

גודל ערכת נתונים: 70.92 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	200

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(12,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מצב_אגוצנטריים	מוֹתֵחַ	(44,)	לצוף32
צעדים/תצפית/כוח_מומנט	מוֹתֵחַ	(24,)	לצוף32
צעדים/תצפית/אימו	מוֹתֵחַ	(6,)	לצוף32
צעדים/התבוננות/טורסו_זקוף	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מהירות_גו	מוֹתֵחַ	(3,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_100_percent

גודל ערכת נתונים: 177.29 MiB
שמור אוטומטי במטמון ( תיעוד ): רק כאשר shuffle_files=False (רכבת)
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	500

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(12,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מצב_אגוצנטריים	מוֹתֵחַ	(44,)	לצוף32
צעדים/תצפית/כוח_מומנט	מוֹתֵחַ	(24,)	לצוף32
צעדים/תצפית/אימו	מוֹתֵחַ	(6,)	לצוף32
צעדים/התבוננות/טורסו_זקוף	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מהירות_גו	מוֹתֵחַ	(3,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_1_percent

גודל מערך נתונים : 6.27 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	50

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(6,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
מדרגות/תצפית/גובה	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/אוריינטציות	מוֹתֵחַ	(14,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(9,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_5_percent

גודל ערכת נתונים : 31.34 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	250

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(6,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
מדרגות/תצפית/גובה	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/אוריינטציות	מוֹתֵחַ	(14,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(9,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_20_percent

גודל מערך נתונים : 125.37 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	1,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(6,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
מדרגות/תצפית/גובה	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/אוריינטציות	מוֹתֵחַ	(14,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(9,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_40_percent

גודל מערך נתונים : 250.75 MiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	2,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(6,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
מדרגות/תצפית/גובה	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/אוריינטציות	מוֹתֵחַ	(14,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(9,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_100_percent

גודל ערכת נתונים: 626.86 MiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	5,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(6,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
מדרגות/תצפית/גובה	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/אוריינטציות	מוֹתֵחַ	(14,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(9,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_1_percent

גודל מערך נתונים : 69.40 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	200

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(21,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מהירות_תנועה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/הקצנות	מוֹתֵחַ	(12,)	לצוף32
צעדים/תצפית/גובה_ראש	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/זוויות_מפרק	מוֹתֵחַ	(21,)	לצוף32
צעדים/תצפית/טורסו_אנכי	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(27,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_5_percent

גודל ערכת נתונים: 346.98 MiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	1,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(21,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מהירות_תנועה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/הקצנות	מוֹתֵחַ	(12,)	לצוף32
צעדים/תצפית/גובה_ראש	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/זוויות_מפרק	מוֹתֵחַ	(21,)	לצוף32
צעדים/תצפית/טורסו_אנכי	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(27,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_20_percent

גודל ערכת נתונים : 1.36 GiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	4,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(21,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מהירות_תנועה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/הקצנות	מוֹתֵחַ	(12,)	לצוף32
צעדים/תצפית/גובה_ראש	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/זוויות_מפרק	מוֹתֵחַ	(21,)	לצוף32
צעדים/תצפית/טורסו_אנכי	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(27,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_40_percent

גודל ערכת נתונים : 2.71 GiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	8,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(21,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מהירות_תנועה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/הקצנות	מוֹתֵחַ	(12,)	לצוף32
צעדים/תצפית/גובה_ראש	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/זוויות_מפרק	מוֹתֵחַ	(21,)	לצוף32
צעדים/תצפית/טורסו_אנכי	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(27,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_100_percent

גודל מערך נתונים : 6.78 GiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	20,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(21,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מהירות_תנועה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/הקצנות	מוֹתֵחַ	(12,)	לצוף32
צעדים/תצפית/גובה_ראש	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/זוויות_מפרק	מוֹתֵחַ	(21,)	לצוף32
צעדים/תצפית/טורסו_אנכי	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(27,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_1_percent

גודל ערכת נתונים: 369.84 KiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	5

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/עמדה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(2,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_5_percent

גודל מערך נתונים : 1.81 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	25

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/עמדה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(2,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_20_percent

גודל מערך נתונים : 7.22 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	100

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/עמדה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(2,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_40_percent

גודל מערך נתונים : 14.45 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	200

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/עמדה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(2,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_100_percent

גודל מערך נתונים : 36.12 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	500

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/עמדה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(2,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_1_percent

גודל מערך נתונים : 1.97 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	5

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(12,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מצב_אגוצנטריים	מוֹתֵחַ	(44,)	לצוף32
צעדים/תצפית/כוח_מומנט	מוֹתֵחַ	(24,)	לצוף32
צעדים/תצפית/אימו	מוֹתֵחַ	(6,)	לצוף32
צעדים/התבוננות/טורסו_זקוף	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מהירות_גו	מוֹתֵחַ	(3,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_5_percent

גודל ערכת נתונים : 9.83 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	25

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(12,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מצב_אגוצנטריים	מוֹתֵחַ	(44,)	לצוף32
צעדים/תצפית/כוח_מומנט	מוֹתֵחַ	(24,)	לצוף32
צעדים/תצפית/אימו	מוֹתֵחַ	(6,)	לצוף32
צעדים/התבוננות/טורסו_זקוף	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מהירות_גו	מוֹתֵחַ	(3,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_20_percent

גודל ערכת נתונים: 39.31 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	100

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(12,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מצב_אגוצנטריים	מוֹתֵחַ	(44,)	לצוף32
צעדים/תצפית/כוח_מומנט	מוֹתֵחַ	(24,)	לצוף32
צעדים/תצפית/אימו	מוֹתֵחַ	(6,)	לצוף32
צעדים/התבוננות/טורסו_זקוף	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מהירות_גו	מוֹתֵחַ	(3,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_40_percent

גודל מערך נתונים : 78.63 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	200

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(12,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מצב_אגוצנטריים	מוֹתֵחַ	(44,)	לצוף32
צעדים/תצפית/כוח_מומנט	מוֹתֵחַ	(24,)	לצוף32
צעדים/תצפית/אימו	מוֹתֵחַ	(6,)	לצוף32
צעדים/התבוננות/טורסו_זקוף	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מהירות_גו	מוֹתֵחַ	(3,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_100_percent

גודל מערך נתונים : 196.57 MiB
שמור אוטומטי במטמון ( תיעוד ): רק כאשר shuffle_files=False (רכבת)
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	500

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(12,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מצב_אגוצנטריים	מוֹתֵחַ	(44,)	לצוף32
צעדים/תצפית/כוח_מומנט	מוֹתֵחַ	(24,)	לצוף32
צעדים/תצפית/אימו	מוֹתֵחַ	(6,)	לצוף32
צעדים/התבוננות/טורסו_זקוף	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/מהירות_גו	מוֹתֵחַ	(3,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_1_percent

גודל מערך נתונים : 8.20 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	50

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(6,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
מדרגות/תצפית/גובה	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/אוריינטציות	מוֹתֵחַ	(14,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(9,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_5_percent

גודל מערך נתונים : 40.98 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	250

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(6,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
מדרגות/תצפית/גובה	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/אוריינטציות	מוֹתֵחַ	(14,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(9,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_20_percent

גודל ערכת נתונים: 163.93 MiB
שמור אוטומטי במטמון ( תיעוד ): רק כאשר shuffle_files=False (רכבת)
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	1,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(6,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
מדרגות/תצפית/גובה	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/אוריינטציות	מוֹתֵחַ	(14,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(9,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_40_percent

גודל ערכת נתונים: 327.86 MiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	2,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(6,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
מדרגות/תצפית/גובה	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/אוריינטציות	מוֹתֵחַ	(14,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(9,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_100_percent

גודל ערכת נתונים: 819.65 MiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	5,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(6,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
מדרגות/תצפית/גובה	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/אוריינטציות	מוֹתֵחַ	(14,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(9,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_1_percent

גודל ערכת נתונים: 77.11 MiB
שמור אוטומטי במטמון ( תיעוד ): כן
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	200

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(21,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מהירות_תנועה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/הקצנות	מוֹתֵחַ	(12,)	לצוף32
צעדים/תצפית/גובה_ראש	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/זוויות_מפרק	מוֹתֵחַ	(21,)	לצוף32
צעדים/תצפית/טורסו_אנכי	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(27,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_5_percent

גודל ערכת נתונים: 385.54 MiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	1,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(21,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מהירות_תנועה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/הקצנות	מוֹתֵחַ	(12,)	לצוף32
צעדים/תצפית/גובה_ראש	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/זוויות_מפרק	מוֹתֵחַ	(21,)	לצוף32
צעדים/תצפית/טורסו_אנכי	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(27,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_20_percent

גודל מערך נתונים : 1.51 GiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	4,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(21,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מהירות_תנועה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/הקצנות	מוֹתֵחַ	(12,)	לצוף32
צעדים/תצפית/גובה_ראש	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/זוויות_מפרק	מוֹתֵחַ	(21,)	לצוף32
צעדים/תצפית/טורסו_אנכי	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(27,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_40_percent

גודל מערך נתונים : 3.01 GiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	8,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(21,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מהירות_תנועה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/הקצנות	מוֹתֵחַ	(12,)	לצוף32
צעדים/תצפית/גובה_ראש	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/זוויות_מפרק	מוֹתֵחַ	(21,)	לצוף32
צעדים/תצפית/טורסו_אנכי	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(27,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_100_percent

גודל מערך נתונים : 7.53 GiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	20,000

מבנה תכונה :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

תיעוד תכונה :

תכונה	מעמד	צוּרָה	Dtype
	FeaturesDict
episode_return	מוֹתֵחַ		לצוף32
צעדים	מערך נתונים
צעדים/פעולה	מוֹתֵחַ	(21,)	לצוף32
צעדים/הנחה	מוֹתֵחַ	(1,)	לצוף32
צעדים/הוא_ראשון	מוֹתֵחַ		bool
צעדים/הוא_אחרון	מוֹתֵחַ		bool
steps/is_terminal	מוֹתֵחַ		bool
צעדים/תצפית	FeaturesDict
צעדים/תצפית/מהירות_תנועה	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/דמה-0	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-1	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-2	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-3	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-4	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-5	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-6	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-7	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-8	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/דמה-9	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/הקצנות	מוֹתֵחַ	(12,)	לצוף32
צעדים/תצפית/גובה_ראש	מוֹתֵחַ	(1,)	לצוף32
צעדים/תצפית/זוויות_מפרק	מוֹתֵחַ	(21,)	לצוף32
צעדים/תצפית/טורסו_אנכי	מוֹתֵחַ	(3,)	לצוף32
צעדים/תצפית/מהירות	מוֹתֵחַ	(27,)	לצוף32
צעדים/פרס	מוֹתֵחַ	(1,)	לצוף32

דוגמאות ( tfds.as_dataframe ):

rlu_rwrl קל לארגן דפים בעזרת אוספים אפשר לשמור ולסווג תוכן על סמך ההעדפות שלך.