M-LSDを用いて四角形を検出し、射影変換を行うサンプルプログラム

Last update: Oct 14, 2022

Related tags

Computer Vision opencv tensorflow warpperspective m-lsd

Overview

M-LSD-warpPerspective-Example

M-LSDを用いて四角形を検出し、射影変換を行うサンプルプログラムです。

Requirements

OpenCV 3.4.2 or Later
tensorflow 2.4.1 or Later

Usage

実行方法は以下です。

python example.py

実行時には、以下のオプションが指定可能です。

--device
カメラデバイス番号の指定
デフォルト：0
--file
動画ファイルの指定 ※指定時にはカメラデバイスより優先
デフォルト：None
--width
カメラキャプチャ時の横幅
デフォルト：640
--height
カメラキャプチャ時の縦幅
デフォルト：480
--crop_width
射影変換後の画像の横幅
デフォルト：224
--crop_height
射影変換後の画像の縦幅
デフォルト：224
--model
モデルパス
デフォルト：mlsd/tflite_models/M-LSD_320_tiny_fp32.tflite
--model_shape
モデルの入力形状幅
デフォルト：320
--top_n
検出スコアの高い順にいくつ使用するか
デフォルト：1
--score
M_LSDパラメータ：score
デフォルト：0.1
--outside_ratio
M_LSDパラメータ：outside_ratio
デフォルト：0.1
--inside_ratio
M_LSDパラメータ：inside_ratio
デフォルト：0.5
--w_overlap
M_LSDパラメータ：w_overlap
デフォルト：0.0
--w_degree
M_LSDパラメータ：w_degree
デフォルト：1.14
--w_length
M_LSDパラメータ：w_length
デフォルト：0.03
--w_area
M_LSDパラメータ：w_area
デフォルト：1.84
--w_center
M_LSDパラメータ：w_center
デフォルト：1.46

Reference

推論用プログラム(mlsd/utils.py)、および学習済モデル(mlsd/tflite_models)は、
navervision/mlsdリポジトリのものを使用しています。

Author

高橋かずひと(https://twitter.com/KzhtTkhs)

License

M-LSD-warpPerspective-Example is under Apache v2 license.

Owner

KazuhitoTakahashi

KazuhitoTakahashi

GitHub Repository

document image degradation

ocrodeg The ocrodeg package is a small Python library implementing document image degradation for data augmentation for handwriting recognition and OC

134 Nov 18, 2022

The code for CVPR2022 paper "Likert Scoring with Grade Decoupling for Long-term Action Assessment".

Likert Scoring with Grade Decoupling for Long-term Action Assessment This is the code for CVPR2022 paper "Likert Scoring with Grade Decoupling for Lon

10 Oct 21, 2022

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

ocr-fileformat Validate and transform between OCR file formats (hOCR, ALTO, PAGE, FineReader) Installation Docker System-wide Usage CLI GUI API Transf

152 Dec 20, 2022

Image processing is one of the most common term in computer vision

Image processing is one of the most common term in computer vision. Computer vision is the process by which computers can understand images and videos, and how they are stored, manipulated, and retri

3 Feb 15, 2022

A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).

OCR Resources This repository contains a collection of resources (including the papers and datasets) of OCR (Optical Character Recognition). Contents

363 Jan 03, 2023

Recognizing cropped text in natural images.

ASTER: Attentional Scene Text Recognizer with Flexible Rectification ASTER is an accurate scene text recognizer with flexible rectification mechanism.

681 Jan 02, 2023

Source Code for AAAI 2022 paper "Graph Convolutional Networks with Dual Message Passing for Subgraph Isomorphism Counting and Matching"

Graph Convolutional Networks with Dual Message Passing for Subgraph Isomorphism Counting and Matching This repository is an official implementation of

13 Sep 08, 2022

3点クリックで円を指定し、極座標変換を行うサンプルプログラム

click-warpPolar 3点クリックで円を指定し、極座標変換を行うサンプルプログラムです。 Requirements OpenCV 3.4.2 or Later Usage 実行方法は以下です。起動後、マウスで3点をクリックし円を指定してください。 python click-warpPol

17 Dec 30, 2022

MeshToGeotiff - A fast Python algorithm to convert a 3D mesh into a GeoTIFF

MeshToGeotiff - A fast Python algorithm to convert a 3D mesh into a GeoTIFF Python class for converting (very fast) 3D Meshes/Surfaces to Raster DEMs

8 Sep 10, 2022

Use Convolutional Recurrent Neural Network to recognize the Handwritten line text image without pre segmentation into words or characters. Use CTC loss Function to train.

Handwritten Line Text Recognition using Deep Learning with Tensorflow Description Use Convolutional Recurrent Neural Network to recognize the Handwrit

224 Jan 07, 2023

CNN+LSTM+CTC based OCR implemented using tensorflow.

CNN_LSTM_CTC_Tensorflow CNN+LSTM+CTC based OCR(Optical Character Recognition) implemented using tensorflow. Note: there is No restriction on the numbe

356 Dec 08, 2022

The code for “Oriented RepPoints for Aerail Object Detection”

Oriented RepPoints for Aerial Object Detection The code for the implementation of “Oriented RepPoints”, Under review. (arXiv preprint) Introduction Or

207 Dec 24, 2022

The CIS OCR PostCorrectionTool

The CIS OCR Post Correction Tool PoCoTo Source code for the Java-based PoCoTo client enabling fast interactive batch corrections of complete OCR error

36 Dec 15, 2022

Image processing using OpenCv

Image processing using OpenCv Write a program that opens the webcam, and the user selects one of the following on the video: ✅ If the user presses the

4 Feb 18, 2022

Dataset and Code for ICCV 2021 paper "Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme"

Dataset and Code for RealVSR Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme Xi Yang, Wangmeng Xiang,

91 Nov 22, 2022

Detect textlines in document images

Textline Detection Detect textlines in document images Introduction This tool performs border, region and textline detection from document image data

70 Jun 30, 2022

Regions sanitàries (RS), Sectors Sanitàris (SS) i Àrees Bàsiques de Salut (ABS) de Catalunya

Regions sanitàries (RS), Sectors Sanitaris (SS), Àrees de Gestió Assistencial (AGA) i Àrees Bàsiques de Salut (ABS) de Catalunya Fitxers GeoJSON de le

2 Jan 23, 2022

📷 This repository is focused on having various feature implementation of OpenCV in Python.

📷 This repository is focused on having various feature implementation of OpenCV in Python. The aim is to have a minimal implementation of all OpenCV features together, under one roof.

128 Dec 04, 2022

a micro OCR network with 0.07mb params.

MicroOCR a micro OCR network with 0.07mb params. Layer (type) Output Shape Param # Conv2d-1 [-1, 64, 8,

29 Aug 06, 2022

Document blur detection based on Laplacian operator and text detection.

Document Blur Detection For general blurred image, using the variance of Laplacian operator is a good solution. But as for the blur detection of docum

5 Oct 20, 2022