Ego4D dataset repository: download the dataset, visualize it, extract features, and explore example usage.

Last update: Jan 07, 2023

Overview

Ego4D

Ego4D is the world's largest egocentric (first-person) video ML dataset and benchmark suite, with 3,600 hours (and counting) of densely narrated video and a wide range of annotations across five new benchmark tasks. It covers hundreds of daily-life scenarios (household, outdoor, workplace, leisure, etc.) captured in the wild by 926 unique camera wearers from 74 worldwide locations and 9 different countries. Portions of the video are accompanied by audio, 3D meshes of the environment, eye gaze, stereo, and/or synchronized videos from multiple egocentric cameras at the same event. The data collection approach was designed to uphold rigorous privacy and ethics standards, with consenting participants and robust de-identification procedures where relevant.

Public Documentation/Start Here: Ego4D Docs

For the CLI readme (to download/access): CLI README
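As a rough illustration of what a download invocation might look like, the sketch below only assembles a command line and prints it; nothing is downloaded. The executable name `ego4d`, the flags `--output_directory` and `--datasets`, and the dataset name `annotations` are assumptions made for this example and should be checked against the CLI README before use. The helper `build_download_command` is hypothetical and not part of the repository.

```python
import shlex
from pathlib import Path


def build_download_command(output_dir: str, datasets: list[str]) -> list[str]:
    """Assemble an argument list for the Ego4D CLI without running it.

    Assumption: the CLI is installed as an `ego4d` executable and accepts
    `--output_directory` and `--datasets`; confirm both in the CLI README.
    """
    if not datasets:
        raise ValueError("at least one dataset name is required")
    return [
        "ego4d",
        "--output_directory", str(Path(output_dir).expanduser()),
        "--datasets", *datasets,
    ]


if __name__ == "__main__":
    # "annotations" is an assumed dataset name, used here for illustration.
    cmd = build_download_command("~/ego4d_data", ["annotations"])
    # Print the command for review; pass `cmd` to subprocess.run to execute it.
    print(shlex.join(cmd))
```

Building the argument list separately from executing it makes it easy to review the exact command before starting a large download.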

For a demo notebook: Annotation Notebook

For the visualization engine: Viz README

For feature extraction: Feature README

License

Ego4D is released under the MIT License.


Owner

Meta Research
