ComPhy: Compositional Physical Reasoning ofObjects and Events from Videos

Last update: Dec 29, 2022

Overview

ComPhy

This repository holds the code for the paper.

ComPhy: Compositional Physical Reasoning ofObjects and Events from Videos, (Under review)

PDF

Project Website

Framework

Code Preparation

git clone https://github.com/comphyreasoning/compositional_physics_learner.git

Installation

pip install -r requirements

Data Preparation

Download videos, video annotation, questions from the project website.

Fast Evaluation

Download the regional proposals with attribute and physical property prediction from the anonymous Google drive
Download the dynamic predictions from the anonymous Google drive
Run executor for factual questions.

sh scripts/test_oe_release.sh

Run executor for multiple-choice questions.

sh scripts/test_mc_release.sh

Supporting sub-modules

Physical Property Learner and Dynamic predictor

Please refer to this repo for property learning and dynamics prediction.

Perception

This module uses the public NS-VQA's perception module object detection and visual attribute extraction.

Program parser

This module uses the public NS-VQA's program parser module to tranform language into executable programs.

ComPhy: Compositional Physical Reasoning ofObjects and Events from Videos

Related tags

Overview

ComPhy

Framework

Code Preparation

Installation

Data Preparation

Fast Evaluation

Supporting sub-modules

Physical Property Learner and Dynamic predictor

Perception

Program parser

Owner

A simple root calculater for python

Implementation of the paper Scalable Intervention Target Estimation in Linear Models (NeurIPS 2021), and the code to generate simulation results.

Implementation of the final project of the course DDA6309 Probabilistic Graphical Model

[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime

TFOD-MASKRCNN - Tensorflow MaskRCNN With Python

A model to classify a piece of news as REAL or FAKE

Variational Attention: Propagating Domain-Specific Knowledge for Multi-Domain Learning in Crowd Counting (ICCV, 2021)

Code for "Intra-hour Photovoltaic Generation Forecasting based on Multi-source Data and Deep Learning Methods."

SynNet - synthetic tree generation using neural networks

Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase

PyTorch implementation of the paper: "Preference-Adaptive Meta-Learning for Cold-Start Recommendation", IJCAI, 2021.

Yolov5-lite - Minimal PyTorch implementation of YOLOv5

Revisiting Self-Training for Few-Shot Learning of Language Model.

From Perceptron model to Deep Neural Network from scratch in Python.

Anomaly detection analysis and labeling tool, specifically for multiple time series (one time series per category)

EfficientNetv2 TensorRT int8

Newt - a Gaussian process library in JAX.

Code for our EMNLP 2021 paper "Learning Kernel-Smoothed Machine Translation with Retrieved Examples"

A testcase generation tool for Persistent Memory Programs.

PyTorch implementation of MoCo: Momentum Contrast for Unsupervised Visual Representation Learning