stanford_kuka_multimodal_dataset_converted_externally_to_rlds

説明：

フォースフィードバック付き Kuka iiwa ペグ挿入

ホームページ: https://sites.google.com/view/visionandtouch
ソースコード: tfds.robotics.rtx.StanfordKukaMultimodalDatasetConvertedExternallyToRlds
バージョン:
- 0.1.0 (デフォルト): 初期リリース。
ダウンロードサイズ: Unknown size
データセットのサイズ: 31.98 GiB
自動キャッシュ(ドキュメント): いいえ
分割:

スプリット	例
`'train'`	3,000

機能の構造:

FeaturesDict({
    'episode_metadata': FeaturesDict({
    }),
    'steps': Dataset({
        'action': Tensor(shape=(4,), dtype=float32),
        'discount': Scalar(shape=(), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'language_embedding': Tensor(shape=(512,), dtype=float32),
        'language_instruction': Text(shape=(), dtype=string),
        'observation': FeaturesDict({
            'contact': Tensor(shape=(50,), dtype=float32),
            'depth_image': Tensor(shape=(128, 128, 1), dtype=float32),
            'ee_forces_continuous': Tensor(shape=(50, 6), dtype=float32),
            'ee_orientation': Tensor(shape=(4,), dtype=float32),
            'ee_orientation_vel': Tensor(shape=(3,), dtype=float32),
            'ee_position': Tensor(shape=(3,), dtype=float32),
            'ee_vel': Tensor(shape=(3,), dtype=float32),
            'ee_yaw': Tensor(shape=(4,), dtype=float32),
            'ee_yaw_delta': Tensor(shape=(4,), dtype=float32),
            'image': Image(shape=(128, 128, 3), dtype=uint8),
            'joint_pos': Tensor(shape=(7,), dtype=float32),
            'joint_vel': Tensor(shape=(7,), dtype=float32),
            'optical_flow': Tensor(shape=(128, 128, 2), dtype=float32),
            'state': Tensor(shape=(8,), dtype=float32),
        }),
        'reward': Scalar(shape=(), dtype=float32),
    }),
})

機能ドキュメント:

特徴	クラス	形	Dタイプ	説明
	特徴辞書
エピソード_メタデータ	特徴辞書
ステップ	データセット
ステップ/アクション	テンソル	(4,)	float32	ロボットアクションは、[3x EEF 位置、1x グリッパー開閉] で構成されます。
歩数/割引	スカラー		float32	割引が指定されている場合、デフォルトは 1 です。
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/言語_埋め込み	テンソル	(512,)	float32	コナ言語の埋め込み。 https://tfhub.dev/google/universal-sentence-encoder-large/5を参照してください。
ステップ/言語説明	文章		弦	言語指導。
ステップ/観察	特徴辞書
歩数・観察・連絡	テンソル	(50,)	float32	ロボットの連絡先情報。
ステップ/観察/深さ_画像	テンソル	(128、128、1)	float32	メイン深度カメラ観察。
ステップ/観察/ee_forces_continuous	テンソル	(50,6)	float32	ロボットのエンドエフェクターの力。
歩数/観察/ee_orientation	テンソル	(4,)	float32	ロボットのエンドエフェクターの方向の四元数。
ステップ/観察/ee_orientation_vel	テンソル	(3,)	float32	ロボットのエンドエフェクターの方向速度。
歩数/観察/ee_position	テンソル	(3,)	float32	ロボットのエンドエフェクターの位置。
ステップ/観察/ee_vel	テンソル	(3,)	float32	ロボットのエンドエフェクターの速度。
歩数/観察/ee_yaw	テンソル	(4,)	float32	ロボットのエンドエフェクターのヨー。
歩数/観察/ee_yaw_delta	テンソル	(4,)	float32	ロボットのエンドエフェクターのヨーデルタ。
手順・観察・イメージ	画像	(128、128、3)	uint8	メインカメラRGB観察。
歩数/観察/joint_pos	テンソル	（7、）	float32	ロボットの関節の位置。
ステップ/観察/joint_vel	テンソル	（7、）	float32	ロボットの関節速度。
ステップ/観察/光学フロー	テンソル	(128、128、2)	float32	オプティカルフロー。
ステップ/観察/状態	テンソル	(8,)	float32	ロボットの固有受容情報、[7x ジョイント pos、1x グリッパー開閉]。
歩数/報酬	スカラー		float32	提供されている場合は報酬、デモの最終ステップで 1。

監視キー( as_supervised docを参照): None
図( tfds.show_examples ): サポートされていません。
例( tfds.as_dataframe ): 欠落しています。
引用：

@inproceedings{lee2019icra,
  title={Making sense of vision and touch: Self-supervised learning of multimodal representations for contact-rich tasks},
  author={Lee, Michelle A and Zhu, Yuke and Srinivasan, Krishnan and Shah, Parth and Savarese, Silvio and Fei-Fei, Li and  Garg, Animesh and Bohg, Jeannette},
  booktitle={2019 IEEE International Conference on Robotics and Automation (ICRA)},
  year={2019},
  url={https://arxiv.org/abs/1810.10191}
}