Data
Drive&Act: Dataset for Action Recognition in Automated Vehicles
- 12 hours of video data in 29 long sequences
- Multi-modal videos: NIR, depth and color data
- 83 manually annotated hierarchical activity labels
[download-page]
[paper]
CLEVR Scene Graphs
- Scene graphs for the images in the CLEVR validation and test sets
- Object detections were generated in TensorFlow using SSD with a ResNet152 backbone
- These scene graphs were used for evaluating the softpaths architecture
[val-graphs]
[test-graphs]
[paper]
WiSe - Slide Segmentation in the Wild
- Annotations for 1300 slides captured during lectures
- Fine-grained pixel-wise labels
- 14 text, 6 image-based and 4 structural classes
- Highly overlapping segments, i.e., multiple labels per pixel
[download-page]
[paper]
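Because several labels can apply to the same pixel, a single class index per pixel is not enough to hold such annotations. The following is a minimal sketch of one possible in-memory representation: a bitmask per pixel. It only assumes the 14 + 6 + 4 = 24 classes listed above. The class indices, the function names `encode` and `decode`, and the tiny example grid are hypothetical and do not describe the dataset's actual file format.

```python
# Illustrative only: 14 text + 6 image-based + 4 structural = 24 classes.
# Class indices are placeholders, not the dataset's real label IDs.
NUM_TEXT, NUM_IMAGE, NUM_STRUCT = 14, 6, 4
NUM_CLASSES = NUM_TEXT + NUM_IMAGE + NUM_STRUCT


def encode(class_ids):
    """Pack a collection of class indices into one integer bitmask."""
    mask = 0
    for c in class_ids:
        if not 0 <= c < NUM_CLASSES:
            raise ValueError(f"class id {c} out of range")
        mask |= 1 << c
    return mask


def decode(mask):
    """Return the sorted class indices stored in a bitmask."""
    return [c for c in range(NUM_CLASSES) if mask & (1 << c)]


# A tiny 2x2 "annotation": each pixel stores every label that covers it.
annotation = [
    [encode([0]), encode([0, 20])],      # second pixel carries two labels
    [encode([]), encode([3, 15, 22])],   # unlabeled pixel, then three labels
]

pixel_labels = decode(annotation[1][1])
print(pixel_labels)
```

With 24 classes, each pixel's label set fits in a 32-bit integer, so overlapping segments need no extra channels in this sketch.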
SPaSe - Slide Page Segmentation
- The first benchmark dataset for slide-page segmentation
- Annotations for 2000 complex slides
- Fine-grained pixel-wise labels
- 14 text, 6 image-based and 4 structural classes
- Highly overlapping segments, i.e., multiple labels per pixel
[download-page]
[paper]
DriveAHead
- Head pose dataset captured in real driving scenarios
- Contains frame-by-frame head pose labels obtained from a motion capture system
- Includes 1 million depth and infrared images
[download-page]
[paper]