SRA's seminar on Introduction to Computer Vision Fundamentals

Last update: Dec 04, 2022

Overview

Introduction to Computer Vision

This repository covers the basics of:

Python
NumPy: a Python library
Git
Computer Vision

The aim of this repository is to provide:

A brief idea of the algorithms involved in Computer Vision.
An introduction to version control systems: Git and GitHub.
Computer Vision and Image Processing basics, with the algorithms involved implemented in NumPy rather than a dedicated image processing library such as OpenCV.
An introduction to a commonly used image processing library: OpenCV.
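
As a rough illustration of what "implemented in NumPy rather than OpenCV" can look like, here is a minimal sketch of grayscale conversion and thresholding. The function names `to_grayscale` and `threshold` are hypothetical and not taken from the repository, and the BT.601 luma weights are one common choice among several.

```python
import numpy as np

def to_grayscale(rgb):
    """Convert an H x W x 3 RGB array to an H x W float array (BT.601 luma weights)."""
    weights = np.array([0.299, 0.587, 0.114])  # assumed weighting, not from the repository
    return rgb.astype(np.float64) @ weights

def threshold(gray, cutoff):
    """Return a binary mask: 1 where the pixel is strictly above the cutoff, else 0."""
    return (gray > cutoff).astype(np.uint8)

# Tiny 2 x 2 test image: one white pixel, one pure red pixel, two black pixels.
rgb = np.zeros((2, 2, 3), dtype=np.uint8)
rgb[0, 0] = [255, 255, 255]
rgb[1, 1] = [255, 0, 0]

gray = to_grayscale(rgb)
mask = threshold(gray, 128)
```

Only the white pixel survives the threshold here, because pure red maps to a luma value well below the cutoff of 128.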

Comments

Add suboptimal 2D convolution

This pull request adds a suboptimal implementation of generic 2D convolution. It is meant to give FYs a rough idea of how to work with Python arrays, loops, and so on. FYs will be asked to improve this implementation and complete convolution-related tasks on top of it.
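
For readers who want a feel for what such a deliberately suboptimal implementation might look like, here is a minimal sketch using explicit loops. It is not the code from the pull request. The name `convolve2d_naive` is hypothetical, and the choice of "valid" output size (no padding, stride 1) is an assumption.

```python
import numpy as np

def convolve2d_naive(image, kernel):
    """Naive 'valid' 2D convolution with explicit loops (no padding, stride 1)."""
    kh, kw = kernel.shape
    ih, iw = image.shape
    flipped = kernel[::-1, ::-1]  # convolution flips the kernel; correlation would not
    out_h, out_w = ih - kh + 1, iw - kw + 1
    out = np.zeros((out_h, out_w), dtype=np.float64)
    for i in range(out_h):
        for j in range(out_w):
            total = 0.0
            # Multiply the window under the kernel element by element and sum.
            for m in range(kh):
                for n in range(kw):
                    total += image[i + m, j + n] * flipped[m, n]
            out[i, j] = total
    return out

image = np.arange(16, dtype=np.float64).reshape(4, 4)
box = np.ones((3, 3)) / 9.0  # 3 x 3 box (mean) filter
result = convolve2d_naive(image, box)
```

The four nested loops are the obvious thing to improve: the two inner loops can be replaced by a single `np.sum(window * flipped)`, which is the kind of optimisation the follow-up tasks could ask for.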

opened by meshtag 5
Morphology notes updated.

I have added images for dilation and erosion, replaced the previous GIFs of dilation and erosion with new ones, and added a few lines explaining morphology.
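
To make the two operations concrete, here is a minimal sketch of binary dilation and erosion in NumPy. It is not taken from the updated notes. The names `dilate`, `erode`, and `_windows` are hypothetical, and the sketch assumes a square, all-ones structuring element of odd size with zero padding at the borders.

```python
import numpy as np

def _windows(mask, size):
    """Yield (row, col, window) for every pixel, using zero padding at the borders."""
    pad = size // 2
    padded = np.pad(mask, pad, mode="constant", constant_values=0)
    h, w = mask.shape
    for i in range(h):
        for j in range(w):
            yield i, j, padded[i:i + size, j:j + size]

def dilate(mask, size=3):
    """A pixel becomes 1 if any pixel in its neighbourhood is 1."""
    out = np.zeros_like(mask)
    for i, j, win in _windows(mask, size):
        out[i, j] = win.max()
    return out

def erode(mask, size=3):
    """A pixel stays 1 only if every pixel in its neighbourhood is 1."""
    out = np.zeros_like(mask)
    for i, j, win in _windows(mask, size):
        out[i, j] = win.min()
    return out

mask = np.zeros((5, 5), dtype=np.uint8)
mask[2, 2] = 1
grown = dilate(mask)    # the single pixel grows into a 3 x 3 block
shrunk = erode(grown)   # eroding the block recovers the single pixel
```

Because of the zero padding, erosion also removes foreground pixels that touch the image border. Other border conventions are possible.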

opened by Aryaman22102002 2
Updated cv-basics/
Optimised code and flow as discussed in:

cv-basics/5_opencv_overview.ipynb

python-numpy-basics/7_classes_and_objects.ipynb

Added an image:

cv-basics/image/bcci.png
opened by dhairyashah1 1
Port to C++: Assignments related to PIXELS seminar
Is your feature request related to a problem? Please describe.

This feature request was created to keep a record of the porting, and potential addition, of seminar assignments in C++, as discussed in this thread.

Describe the solution you'd like

Create a separate main folder to contain all assignments. Assignments on a specific topic can be grouped together inside this parent folder.

You might choose to add reference links to individual questions, pointing to additional material on a related topic. These links are suggested solely to provide more (potentially real-world) information about the topic of the question and should not in any way lead to the solution.

enhancement
opened by meshtag 0
Add Content: Interpolations.
Is your feature request related to a problem? Please describe.

As discussed in the thread, interpolation concepts can also be added.

Describe the solution you'd like

Create implementations of interpolation from scratch, using the OpenCV C++ API where necessary.

Add a Makefile to compile and build executables.

Add a .md file to explain the theory of interpolations and instructions to build and run the executables.

Additional context

Reference: Ancient Secrets of Computer Vision.

Note: Content is not finalised and is open for discussion.
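
As a starting point for discussion, here is a minimal from-scratch sketch of one interpolation method, bilinear resizing. The issue asks for C++ with the OpenCV API. This sketch uses Python and NumPy instead, to match the existing release. The name `resize_bilinear` is hypothetical, and the corner-aligned coordinate mapping is an assumption that may differ from what a library resize function does.

```python
import numpy as np

def resize_bilinear(image, new_h, new_w):
    """Resize a 2D grayscale array using bilinear interpolation."""
    h, w = image.shape
    # Map output pixels onto the input grid so that the corner pixels line up.
    ys = np.linspace(0, h - 1, new_h)
    xs = np.linspace(0, w - 1, new_w)
    # Integer neighbours on each side of every sample position.
    y0 = np.floor(ys).astype(int)
    x0 = np.floor(xs).astype(int)
    y1 = np.minimum(y0 + 1, h - 1)
    x1 = np.minimum(x0 + 1, w - 1)
    # Fractional distances from the upper-left neighbour.
    wy = (ys - y0)[:, None]
    wx = (xs - x0)[None, :]
    img = image.astype(np.float64)
    # Interpolate along x on the two bracketing rows, then along y.
    top = img[y0][:, x0] * (1 - wx) + img[y0][:, x1] * wx
    bottom = img[y1][:, x0] * (1 - wx) + img[y1][:, x1] * wx
    return top * (1 - wy) + bottom * wy

small = np.array([[0.0, 10.0], [20.0, 30.0]])
large = resize_bilinear(small, 3, 3)
```

Each new pixel is a weighted average of its four nearest input pixels, with weights given by how close the sample position is to each of them.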
enhancement
opened by amanchhaparia 0
Add Content: Image Storing Formats.
Is your feature request related to a problem? Please describe.

As discussed in the thread, it is important to be familiar with how images are stored.

Describe the solution you'd like

Add the theory of basic image storage formats such as .bmp, .tiff, .jpg, .png, etc.

Implement a .cpp file showing how an image can be read from the BMP format.

Consider only 8-bit grayscale bitmap images, since they are easy to read and contain only 2D data.

Use the simple POSIX read() API to read the bitmap image file.

Storing the values of the image's attributes directly in a struct is suggested.

A similar example can be added to demonstrate how to edit/write a grayscale bitmap image.

Add a Makefile to compile and build the executable.

Add a .md file explaining the theory and instructions to build and run the executables.

Note: Content is not finalised and open for discussion.
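
To illustrate the byte layout the proposed example would have to handle, here is a minimal sketch that parses an uncompressed 8-bit BMP. The issue asks for C++ with POSIX read(). This sketch uses Python's `struct` module instead, purely to show the header fields. The names `read_gray_bmp` and `make_gray_bmp` are hypothetical, and the sketch assumes a 40-byte BITMAPINFOHEADER and a palette in which index equals intensity.

```python
import struct
import numpy as np

def read_gray_bmp(data):
    """Parse an uncompressed 8-bit BMP held in `data` (bytes) into a 2D uint8 array."""
    # 14-byte file header: signature, file size, two reserved fields, pixel-data offset.
    signature, _size, _r1, _r2, offset = struct.unpack_from("<2sIHHI", data, 0)
    if signature != b"BM":
        raise ValueError("not a BMP file")
    # First 20 bytes of the info header: header size, width, height, planes, bpp, compression.
    _hdr, width, height, _planes, bpp, compression = struct.unpack_from("<IiiHHI", data, 14)
    if bpp != 8 or compression != 0:
        raise ValueError("only uncompressed 8-bit BMPs are handled")
    row_size = (width + 3) // 4 * 4  # each row is padded to a multiple of 4 bytes
    rows = abs(height)
    pixels = np.frombuffer(data, dtype=np.uint8, count=row_size * rows, offset=offset)
    pixels = pixels.reshape(rows, row_size)[:, :width]
    # A positive height means the rows are stored bottom-up.
    return pixels[::-1].copy() if height > 0 else pixels.copy()

def make_gray_bmp(array):
    """Build a tiny bottom-up 8-bit BMP in memory, used here only to exercise the reader."""
    h, w = array.shape
    row_size = (w + 3) // 4 * 4
    palette = b"".join(bytes((i, i, i, 0)) for i in range(256))  # identity gray palette
    offset = 14 + 40 + len(palette)
    body = b"".join(bytes(row.tolist()) + b"\x00" * (row_size - w) for row in array[::-1])
    file_header = struct.pack("<2sIHHI", b"BM", offset + len(body), 0, 0, offset)
    info_header = struct.pack("<IiiHHIIiiII", 40, w, h, 1, 8, 0, len(body), 0, 0, 256, 0)
    return file_header + info_header + palette + body

original = np.array([[0, 50, 100], [150, 200, 250]], dtype=np.uint8)
decoded = read_gray_bmp(make_gray_bmp(original))
```

To read a file from disk, pass `open(path, "rb").read()` as `data`. The two `struct` format strings map directly onto the fields a C++ struct would hold, which is why the issue suggests storing the attributes in a struct.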
enhancement C++
opened by amanchhaparia 2
Add Content: Build Systems
Is your feature request related to a problem? Please describe.

As discussed in the thread, build system concepts should be added.

Describe the solution you'd like

Add content on manually creating and linking object files.

Explain the importance of build systems.

Add content on Makefiles.

Add content on CMake.

Additional context

For reference, see: Embedded Study Group Week 2.

Note: Content is not finalised and open for discussion.
enhancement Build-Systems
opened by amanchhaparia 0
Add Content: C++ basic concepts for seminar.
Is your feature request related to a problem? Please describe.

Since the seminar is being ported to C++ as discussed in this thread, it is important to teach some key C++ concepts.

Describe the solution you'd like

Some advanced C++ concepts, such as handling 2D arrays/vectors, pointers, etc.

Note: Content is not finalised and open for discussion.
enhancement C++
opened by amanchhaparia 1

Releases (v1.0)

v1.0 (Sep 7, 2022)
This release contains the first version of the PIXELS Seminar, conducted in 2021. Its content is implemented in Python and uses NumPy and the OpenCV Python API.

This release can be used as a reference for basic image processing in Python.

A tutorial on the necessary NumPy methods.

Tutorials on commonly used OpenCV functions in Python.

An implementation of blob detection, a very commonly used algorithm, in Python.
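
The release notes do not say which blob detection method is used. As one simple interpretation, and not necessarily the release's approach, the sketch below labels 4-connected foreground regions of a binary mask with a breadth-first flood fill. The name `label_blobs` is hypothetical.

```python
from collections import deque
import numpy as np

def label_blobs(mask):
    """Label 4-connected foreground regions of a binary mask; returns (labels, count)."""
    h, w = mask.shape
    labels = np.zeros((h, w), dtype=np.int32)  # 0 means background or not yet visited
    count = 0
    for si in range(h):
        for sj in range(w):
            if mask[si, sj] == 0 or labels[si, sj] != 0:
                continue
            # Found an unlabelled foreground pixel: start a new blob and flood it.
            count += 1
            labels[si, sj] = count
            queue = deque([(si, sj)])
            while queue:
                i, j = queue.popleft()
                for ni, nj in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1)):
                    if 0 <= ni < h and 0 <= nj < w and mask[ni, nj] and labels[ni, nj] == 0:
                        labels[ni, nj] = count
                        queue.append((ni, nj))
    return labels, count

mask = np.array([[1, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 1]], dtype=np.uint8)
labels, count = label_blobs(mask)
```

Once the pixels are labelled, per-blob properties such as area or centroid follow from simple NumPy reductions over `labels == k`.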


Owner

Society of Robotics and Automation

The Society of Robotics and Automation is a society for VJTI students. As the name suggests, we deal with Robotics, Machine Vision and Automation.
