Action
- UCF Sports Action Dataset
- This dataset consists of a set of actions collected from various sports which are typically featured on broadcast television channels such as the BBC and ESPN. The video sequences were obtained from a wide range of stock footage websites including BBC Motion gallery, and GettyImages.
- UCF Aerial Action Dataset
- This dataset features video sequences that were obtained using a R/C-controlled blimp equipped with an HD camera mounted on a gimbal.The collection represents a diverse pool of actions featured at different heights and aerial viewpoints. Multiple instances of each action were recorded at different flying altitudes which ranged from 400-450 feet and were performed by different actors.
- UCF YouTube Action Dataset
- It contains 11 action categories collected from YouTube.
- Weizmann action recognition
- Walk, Run, Jump, Gallop sideways, Bend, One-hand wave, Two-hands wave, Jump in place, Jumping Jack, Skip.
- UCF50
- UCF50 is an action recognition dataset with 50 action categories, consisting of realistic videos taken from YouTube.
- ASLAN
- The Action Similarity Labeling (ASLAN) Challenge.
- MSR Action Recognition Datasets
- The dataset was captured by a Kinect device. There are 12 dynamic American Sign Language (ASL) gestures, and 10 people. Each person performs each gesture 2-3 times.
- KTH Recognition of human actions
- Contains six types of human actions (walking, jogging, running, boxing, hand waving and hand clapping) performed several times by 25 subjects in four different scenarios: outdoors, outdoors with scale variation, outdoors with different clothes and indoors.
- Hollywood-2 Human Actions and Scenes dataset
- Hollywood-2 datset contains 12 classes of human actions and 10 classes of scenes distributed over 3669 video clips and approximately 20.1 hours of video in total.
- Collective Activity Dataset
- This dataset contains 5 different collective activities : crossing, walking, waiting, talking, and queueing and 44 short video sequences some of which were recorded by consumer hand-held digital camera with varying view point.
- Olympic Sports Dataset
- The Olympic Sports Dataset contains YouTube videos of athletes practicing different sports.
- SDHA 2010
- Surveillance-type videos
- VIRAT Video Dataset
- The dataset is designed to be realistic, natural and challenging for video surveillance domains in terms of its resolution, background clutter, diversity in scenes, and human activity/event categories than existing action recognition datasets.
- HMDB: A Large Video Database for Human Motion Recognition
- Collected from various sources, mostly from movies, and a small proportion from public databases, YouTube and Google videos. The dataset contains 6849 clips divided into 51 action categories, each containing a minimum of 101 clips.
- Stanford 40 Actions Dataset
- Dataset of 9,532 images of humans performing 40 different actions, annotated with bounding-boxes.
- 50Salads dataset
- Fully annotated dataset of RGB-D video data and data from accelerometers attached to kitchen objects capturing 25 people preparing two mixed salads each (4.5h of annotated data). Annotated activities correspond to steps in the recipe and include phase (pre-/ core-/ post) and the ingredient acted upon.
Human pose/Expression
- AFEW (Acted Facial Expressions In The Wild)/SFEW (Static Facial Expressions In The Wild)
- Dynamic temporal facial expressions data corpus consisting of close to real world environment extracted from movies.
- ETHZ CALVIN Dataset
Action Databases
- 50 Salads - fully annotated 4.5 hour dataset of RGB-D video + accelerometer data, capturing 25 people preparing two mixed salads each (Dundee University, Sebastian Stein)
- ASLAN Action similarity labeling challenge database (Orit Kliper-Gross)
- Berkeley MHAD: A Comprehensive Multimodal Human Action Database (Ferda Ofli)
- BEHAVE Interacting Person Video Data with markup (Scott Blunsden, Bob Fisher, Aroosha Laghaee)
- CVBASE06: annotated sports videos (Janez Pers)
- G3D - synchronised video, depth and skeleton data for 20 gaming actions captured with Microsoft Kinect (Victoria Bloom)
- Hollywood 3D - 650 3D action recognition in the wild videos, 14 action classes (Simon Hadfield)
- Human Actions and Scenes Dataset (Marcin Marszalek, Ivan Laptev, Cordelia Schmid)
- HumanEva: Synchronized Video and Motion Capture Dataset for Evaluation of Articulated Human Motion (Brown University)
- i3DPost Multi-View Human Action Datasets (Hansung Kim)
- i-LIDS video event image dataset (Imagery library for intelligent detection systems) (Paul Hosner)
- INRIA Xmas Motion Acquisition Sequences (IXMAS) (INRIA)
- JPL First-Person Interaction dataset - 7 types of human activity videos taken from a first-person viewpoint (Michael S. Ryoo, JPL)
- KTH human action recognition database (KTH CVAP lab)
- LIRIS human activities dataset - 2 cameras, annotated, depth images (Christian Wolf, et al)
- MuHAVi - Multicamera Human Action Video Data (Hossein Ragheb)
- Oxford TV based human interactions (Oxford Visual Geometry Group)
- Rochester Activities of Daily Living Dataset (Ross Messing)
- SDHA Semantic Description of Human Activities 2010 contest - aerial views (Michael S. Ryoo, J. K. Aggarwal, Amit K. Roy-Chowdhury)
- SDHA Semantic Description of Human Activities 2010 contest - Human Interactions (Michael S. Ryoo, J. K. Aggarwal, Amit K. Roy-Chowdhury)
- TUM Kitchen Data Set of Everyday Manipulation Activities (Moritz Tenorth, Jan Bandouch)
- TV Human Interaction Dataset (Alonso Patron-Perez)
- Univ of Central Florida - Feature Films Action Dataset (Univ of Central Florida)
- Univ of Central Florida - YouTube Action Dataset (sports) (Univ of Central Florida)
- Univ of Central Florida - 50 Action Category Recognition in Realistic Videos (3 GB) (Kishore Reddy)
- UCF 101 action dataset 101 action classes, over 13k clips and 27 hours of video data (Univ of Central Florida)
- Univ of Central Florida - Sports Action Dataset (Univ of Central Florida)
- Univ of Central Florida - ARG Aerial camera, Rooftop camera and Ground camera (UCF Computer Vision Lab)
- UCR Videoweb Multi-camera Wide-Area Activities Dataset (Amit K. Roy-Chowdhury)
- Verona Social interaction dataset (Marco Cristani)
- Videoweb (multicamera) Activities Dataset (B. Bhanu, G. Denina, C. Ding, A. Ivers, A. Kamal, C. Ravishankar, A. Roy-Chowdhury, B. Varda)
- ViHASi: Virtual Human Action Silhouette Data (userID: VIHASI password: virtual$virtual) (Hossein Ragheb, Kingston University)
- WorkoutSU-10 Kinect dataset for exercise actions (Ceyhun Akgul)
- YouCook - 88 open-source YouTube cooking videos with annotations (Jason Corso)
- WVU Multi-view action recognition dataset (Univ. of West Virginia)
时间: 2024-10-29 19:07:37