Thanks for tuning in to Google I/O. View all sessions on demandWatch on demand


  • Description:

AFLW2000-3D is a dataset of 2000 images that have been annotated with image-level 68-point 3D facial landmarks. This dataset is typically used for evaluation of 3D facial landmark detection models. The head poses are very diverse and often hard to be detected by a cnn-based face detector. The 2D landmarks are skipped in this dataset, since some of the data are not consistent to 21 points, as the original paper mentioned.

Split Examples
'train' 2,000
  • Feature structure:
    'image': Image(shape=(450, 450, 3), dtype=uint8),
    'landmarks_68_3d_xy_normalized': Tensor(shape=(68, 2), dtype=float32),
    'landmarks_68_3d_z': Tensor(shape=(68, 1), dtype=float32),
  • Feature documentation:
Feature Class Shape Dtype Description
image Image (450, 450, 3) uint8
landmarks_68_3d_xy_normalized Tensor (68, 2) float32
landmarks_68_3d_z Tensor (68, 1) float32


  • Citation:
  author    = {Xiangyu Zhu and
               Zhen Lei and
               Xiaoming Liu and
               Hailin Shi and
               Stan Z. Li},
  title     = {Face Alignment Across Large Poses: {A} 3D Solution},
  journal   = {CoRR},
  volume    = {abs/1511.07212},
  year      = {2015},
  url       = {},
  archivePrefix = {arXiv},
  eprint    = {1511.07212},
  timestamp = {Mon, 13 Aug 2018 16:48:23 +0200},
  biburl    = {},
  bibsource = {dblp computer science bibliography,}