Google Research

Features Extracted From YouTube Videos for Multiview Learning


This is a dataset of feature values and class labels for about 120,000 YouTube videos (instances). Each instance is described by up to 13 feature types, from 3 high level feature families: textual, visual, and auditory features. There are 31 class labels.

The dataset should be useful particularly for research on multiview (multimodal) learning, such as multiview clustering and/or supervised learning, co-training, early/late fusion, and ensemble techniques. See our related blog post for more information.

There is currently no code associated with this dataset.