SRA's seminar on Introduction to Computer Vision Fundamentals

Overview

Introduction to Computer Vision

This repository includes basics to :

  1. Python
  2. Numpy: A python library
  3. Git
  4. Computer Vision.

The aim of this repository is to provide:

  • A brief idea of algorithms involved in Computer Vision .
  • Introduction to Version Control System: Git and GitHub.
  • Computer Vision and Image Processing basics, idea of implementation of various algorithms involved using numpy (instead of any dedicated image processing library like OpenCV.)
  • Introduction to a commonly used Image Processing Library: OpenCV

Demonstration

Table Of Contents

Comments
  • Add suboptimal 2D convolution

    Add suboptimal 2D convolution

    This pull request intends to add a suboptimal implementation of generic 2D convolution. This is done for the purpose of giving a rough idea to Fys about how to work with python arrays/loops, etc. Fys will be asked to improve this implementation and complete tasks related to convolution on top of it.

    opened by meshtag 5
  • Morphology notes updated.

    Morphology notes updated.

    I have added images for dilation and erosion, replaced the previous gif of dilation and erosion with new ones and added a few lines explaining morphology.

    opened by Aryaman22102002 2
  • Updated cv-basics/

    Updated cv-basics/

    Optimised code and flow as discussed in:

    1. cv-basics/5_opencv_overview.ipynb
    2. python-numpy-basics/7_classes_and_objects.ipynb

    Added an image :

    1. cv-basics/image/bcci.png
    opened by dhairyashah1 1
  • Port to C++ : Assignments related to PIXELS seminar

    Port to C++ : Assignments related to PIXELS seminar

    Is your feature request related to a problem? Please describe. This feature request is created to keep a record of porting and potential addition of new assignments related to the seminar in C++ as discussed in this thread.

    Describe the solution you'd like

    • Create a separate main folder for containing all assignments. Individual assignments related to specific topics might be grouped together inside the main parent folder of assignments.
    • You might chose to add reference links in individual questions, which may provide additional material on a related topic for that question (this is suggested solely for the purpose of providing more (potentially real world) info related to the topic asked in original question and hence, should not in any way lead to the solution).
    enhancement 
    opened by meshtag 0
  • Add Content: Interpolations.

    Add Content: Interpolations.

    Is your feature request related to a problem? Please describe. As discussed in the thread, concepts of interpolation can also be added.

    Describe the solution you'd like

    • Create a implementations of interpolation from scratch using necessary OpenCV C++ API.
    • Add a Makefile to compile and build executables.
    • Add a .md file to explain the theory of interpolations and instructions to build and run the executables.

    Additional context Reference: Ancient Secrets of computer vision.

    Note: Content is not finalised and open for discussion

    enhancement 
    opened by amanchhaparia 0
  • Add Content: Image Storing Formats.

    Add Content: Image Storing Formats.

    Is your feature request related to a problem? Please describe. As discussed in the thread, It is important to have a familiarity of how images are store.

    Describe the solution you'd like

    • Add the theory of basic image storing formats such as .bmp, .tiff, .jpg, png etc.
    • Implement a .cpp file on how image can be read from the bmp format.
      • Consider only 8 bit grayscale BitMap image (Since they are easy to read and contains only 2D form of data).
      • Use simple posix read() api to read the image bitmap file.
      • Directly storing the values of various attributes of image in struct is suggested.
    • A similar example can be added to demonstrate how to edit/write a grayscale bitmap image.
    • Add a Makefile to compile and build the executable.
    • Add a .md file explaining the theory and instructions to build and run the executables.

    Note: Content is not finalised and open for discussion.

    enhancement C++ 
    opened by amanchhaparia 2
  • Add Content: Build Systems

    Add Content: Build Systems

    Is your feature request related to a problem? Please describe. As discussed in the thread, Concepts of Build System should be added.

    Describe the solution you'd like

    • Content should be added for manual creating and linking the object files.
    • Importance of build systems.
    • Add the contents for Makefile.
    • Add contents for Cmake.

    Additional context Can refer from here: Embedded Study Group Week 2.

    Note: Content is not finalised and open for discussion.

    enhancement Build-Systems 
    opened by amanchhaparia 0
  • Add Content: C++ basic concepts for seminar.

    Add Content: C++ basic concepts for seminar.

    Is your feature request related to a problem? Please describe. Since the seminar is being ported to C++ as discussed in this thread, it is important to teach some important C++ concepts.

    Describe the solution you'd like

    • Some advance concepts of C++ like handling 2D arrays/vector, pointer etc.

    Note: Content is not finalised and open for discussion.

    enhancement C++ 
    opened by amanchhaparia 1
Releases(v1.0)
  • v1.0(Sep 7, 2022)

    This release contains the 1st version of the PIXELS Seminar conducted in 2021. The content of this release is implemented in Python and uses numpy and OpenCV Python API.

    • This release can be used as a reference to basic Image Processing using Python.
    • Contains a tutorial for necessary numpy methods.
    • Tutorials on commonly used OpenCV functions in Python.
    • Implementation of blob detection a very commonly used algorithm in Python.
    Source code(tar.gz)
    Source code(zip)
Owner
Society of Robotics and Automation
The Society of Robotics and Automation is a society for VJTI students. As the name suggests, we deal with Robotics, Machine Vision and Automation .
Society of Robotics and Automation
Dataset and Code for ICCV 2021 paper "Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme"

Dataset and Code for RealVSR Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme Xi Yang, Wangmeng Xiang,

Xi Yang 91 Nov 22, 2022
Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector

CRAFT: Character-Region Awareness For Text detection Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector | Paper |

188 Dec 28, 2022
Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"

TableNet Unofficial implementation of ICDAR 2019 paper : TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from

Jainam Shah 243 Dec 30, 2022
Open Source Computer Vision Library

OpenCV: Open Source Computer Vision Library Resources Homepage: https://opencv.org Courses: https://opencv.org/courses Docs: https://docs.opencv.org/m

OpenCV 65.7k Jan 03, 2023
This can be use to convert text in a file to handwritten text.

TextToHandwriting This can be used to convert text to handwriting. Clone this project or download the code. Run TextToImage.py give the filename of th

Ashutosh Mahapatra 2 Feb 06, 2022
A Vietnamese personal card OCR website built with Django.

Django VietCardOCR Installation Creation of virtual environments is done by executing the command venv: python -m venv venv That will create a new fol

Truong Hoang Thuan 4 Sep 04, 2021
This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network.My blog:

PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network Introduction This is a tensorflow re-implementation of PSENet: Shape Robu

Michael liu 498 Dec 30, 2022
Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrieval.

Dual Encoding for Video Retrieval by Text Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding

81 Dec 01, 2022
Ddddocr - 通用验证码识别OCR pypi版

带带弟弟OCR通用验证码识别SDK免费开源版 今天ddddocr又更新啦! 当前版本为1.3.1 想必很多做验证码的新手,一定头疼碰到点选类型的图像,做样本费时

Sml2h3 4.4k Dec 31, 2022
PSENet - Shape Robust Text Detection with Progressive Scale Expansion Network.

News Python3 implementations of PSENet [1], PAN [2] and PAN++ [3] are released at https://github.com/whai362/pan_pp.pytorch. [1] W. Wang, E. Xie, X. L

1.1k Dec 24, 2022
Select range and every time the screen changes, OCR is activated.

ASOCR(Auto Screen OCR) Select range and every time you press Space key, OCR is activated. 範囲を選ぶと、あなたがスペースキーを押すたびに、画面が変わる度にOCRが起動します。 usage1: simple OC

1 Feb 13, 2022
Create single line SVG illustrations from your pictures

Create single line SVG illustrations from your pictures

Javier Bórquez 686 Dec 26, 2022
Awesome anomaly detection in medical images

A curated list of awesome anomaly detection works in medical imaging, inspired by the other awesome-* initiatives.

Kang Zhou 57 Dec 19, 2022
[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks This is an official PyTorch code repository of the paper "Cloud Transformers:

Visual Understanding Lab @ Samsung AI Center Moscow 27 Dec 15, 2022
chineseocr/table_line 表格线检测模型pytorch版

table_line_pytorch chineseocr/table_detct 表格线检测模型table_line pytorch版 原项目github: https://github.com/chineseocr/table-detect 1、模型转换 下载原项目table_detect模型文

1 Oct 21, 2021
Usando o Amazon Textract como OCR para Extração de Dados no DynamoDB

dio-live-textract2 Repositório de código para o live coding do dia 05/10/2021 sobre extração de dados estruturados e gravação em banco de dados a part

hugoportela 0 Jan 19, 2022
Amazing 3D explosion animation using Pygame module.

3D Explosion Animation 💣 💥 🔥 Amazing explosion animation with Pygame. 💣 Explosion physics An Explosion instance is made of a set of Particle objec

Dylan Tintenfich 12 Mar 11, 2022
A Python wrapper for the tesseract-ocr API

tesserocr A simple, Pillow-friendly, wrapper around the tesseract-ocr API for Optical Character Recognition (OCR). tesserocr integrates directly with

Fayez 1.7k Dec 31, 2022
scene-linear test images

Scene-Referred Image Collection A collection of OpenEXR Scene-Referred images, encoded as max 2048px width, DWAA 80 compression. All exrs are encoded

Gralk Klorggson 7 Aug 25, 2022
Python-based tools for document analysis and OCR

ocropy OCRopus is a collection of document analysis programs, not a turn-key OCR system. In order to apply it to your documents, you may need to do so

OCRopus 3.2k Dec 31, 2022