FastOCR is a desktop application for OCR API.

Overview

FastOCR

GitHub top language GitHub

FastOCR is a desktop application for OCR API.

Installation

Arch Linux

fastocr-git @ AUR

Build from AUR or install with your favorite AUR helper.

yay -S fastocr-git  # Using yay
pikaur -S fastocr-git  # Using pikaur
# ...

PyPI

fastocr @ PyPI

pip install --user fastocr
# Then copy desktop file to ~/.local/share/applications

Manually

Proudly use DepHell to generate setup.py file.

python setup.py install
# Then copy desktop file to ~/.local/share/applications

Development

Proudly use Poetry for developing.

poetry install

Usage

fastocr

DBus

Service name: io.github.brucezhang1993.FastOCR

Object: /io/github/brucezhang1993/FastOCR

Methods

Actions Arguments Description
captureToClipboard {seconds} {no_copy} Capture and OCR to system clipboard or dbus signal
seconds: Delay capture in seconds
no_copy: If true, the result will not be saved in clipboard
quitApp -- Quit app

Signal

Signals Arguments Description
captured {text} OCR result will be sent to this signal
text: OCR result in plain text

Contributing

Pull requests are welcome.

For major changes, please open an issue first to discuss what you would like to change.

License

LGPL3

Comments
  • invalid token doesn't need cache, fix potential TypeError

    invalid token doesn't need cache, fix potential TypeError

    For Baidu OCR, token and expires_in in .cache/fastocr/baidu_token_data.json will be null if wrong API (or empty API) is given, where there's no need to save token in the cache file. (Otherwise it can raise a TypeError during timestamp + expires_in)

    opened by wsyxbcl 1
  • [BUG] 截图操作后操作区内容可能存在未更新的问题

    [BUG] 截图操作后操作区内容可能存在未更新的问题

    Describe the bug 截图操作后操作区内容可能存在未更新的问题

    To Reproduce 用户反馈待复现

    Expected behavior TODO

    Environment information 用户反馈待复现 [Strongly recommend to run fastocr diagnose in the console and paste the outputs here]

    Additional context Add any other context about the problem here.

    unconfirmed bug 
    opened by BruceZhang1993 0
  • [BUG] The token cache for BaiduOCR may block the API update in setting?

    [BUG] The token cache for BaiduOCR may block the API update in setting?

    As the property method of BaiduOcr would use the cache file directly if there is, the API change in setting won't work.

    Remove the cache file after Setting.save or redefine the "cache file behavior" would probably be nice?

    confirmed bug 
    opened by wsyxbcl 0
  • [Feature Request] Please support Mathpix API

    [Feature Request] Please support Mathpix API

    Describe the feature Could you please add support for mathpix API so that users can use this to copy and paste math formulars? Reference: https://mathpix.com/ocr

    feature request 
    opened by sukanka 2
  • Distribution

    Distribution

    Linux

    • [x] Arch Linux derivative AUR ~~Thanks to myself~~
    • [x] NixOS/Nix fastocr Thanks to Potato Hatsue
    • [ ] Debian/Ubuntu derivative
    • [ ] Gentoo derivative
    • [x] AppImage

    ...

    Windows

    • [x] PyInstaller (Exe)
    • [x] PyInstaller (Zip)

    macOS

    • [x] PyInstaller (App)

    Common

    help wanted 
    opened by BruceZhang1993 2
Owner
Bruce Zhang
Bruce Zhang
1st place solution for SIIM-FISABIO-RSNA COVID-19 Detection Challenge

SIIM-COVID19-Detection Source code of the 1st place solution for SIIM-FISABIO-RSNA COVID-19 Detection Challenge. 1.INSTALLATION Ubuntu 18.04.5 LTS CUD

Nguyen Ba Dung 170 Dec 21, 2022
Open Source Differentiable Computer Vision Library for PyTorch

Kornia is a differentiable computer vision library for PyTorch. It consists of a set of routines and differentiable modules to solve generic computer

kornia 7.6k Jan 04, 2023
Virtual Zoom Gesture using OpenCV

Virtual_Zoom_Gesture I have created a virtual zoom gesture where we can Zoom in and Zoom out any image and even we can move that image anywhere on the

Mudit Sinha 2 Dec 26, 2021
Code for the paper "Controllable Video Captioning with an Exemplar Sentence"

SMCG Code for the paper "Controllable Video Captioning with an Exemplar Sentence" Introduction We investigate a novel and challenging task, namely con

10 Dec 04, 2022
The open source extract transaction infomation by using OCR.

Transaction OCR Mã nguồn trích xuất thông tin transaction từ file scaned pdf, ở đây tôi lựa chọn tài liệu sao kê công khai của Thuy Tien. Mã nguồn có

Nguyen Xuan Hung 18 Jun 02, 2022
Ackermann Line Follower Robot Simulation.

Ackermann Line Follower Robot This is a simulation of a line follower robot that works with steering control based on Stanley: The Robot That Won the

Lucas Mazzetto 2 Apr 16, 2022
Maze generator and solver with python

Procedural-Maze-Generator-Algorithms Check out my youtube channel : Auctux Ressources Thanks to Jamis Buck Book : Mazes for programmers Requirements P

Joseph 19 Dec 07, 2022
OpenGait is a flexible and extensible gait recognition project

A flexible and extensible framework for gait recognition. You can focus on designing your own models and comparing with state-of-the-arts easily with the help of OpenGait.

Shiqi Yu 335 Dec 22, 2022
利用Paddle框架复现CRAFT

CRAFT-Paddle 利用Paddle框架复现CRAFT CRAFT 本项目基于paddlepaddle框架复现CRAFT,并参加百度第三届论文复现赛,将在2021年5月15日比赛完后提供AIStudio链接~敬请期待 参考项目: CRAFT: Character-Region Awarenes

QuanHao Guo 2 Mar 07, 2022
Forked from argman/EAST for the ICPR MTWI 2018 CHALLENGE

EAST_ICPR: EAST for ICPR MTWI 2018 CHALLENGE Introduction This is a repository forked from argman/EAST for the ICPR MTWI 2018 CHALLENGE. Origin Reposi

Haozheng Li 157 Aug 23, 2022
Detect textlines in document images

Textline Detection Detect textlines in document images Introduction This tool performs border, region and textline detection from document image data

QURATOR-SPK 70 Jun 30, 2022
Drowsiness Detection and Alert System

A countless number of people drive on the highway day and night. Taxi drivers, bus drivers, truck drivers, and people traveling long-distance suffer from lack of sleep.

Astitva Veer Garg 4 Aug 01, 2022
WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching

Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching Code based on our WACV 2022 Accepted Paper: https://arxiv.org/pdf/

Andres 13 Dec 17, 2022
End-to-end pipeline for real-time scene text detection and recognition.

Real-time-Scene-Text-Detection-and-Recognition-System End-to-end pipeline for real-time scene text detection and recognition. The detection model use

Fangneng Zhan 89 Aug 04, 2022
Deep Learning Chinese Word Segment

引用 本项目模型BiLSTM+CRF参考论文:http://www.aclweb.org/anthology/N16-1030 ,IDCNN+CRF参考论文:https://arxiv.org/abs/1702.02098 构建 安装好bazel代码构建工具,安装好tensorflow(目前本项目需

2.1k Dec 23, 2022
CNN+Attention+Seq2Seq

Attention_OCR CNN+Attention+Seq2Seq The model and its tensor transformation are shown in the figure below It is necessary ch_ train and ch_ test the p

Tsukinousag1 2 Jul 14, 2022
A curated list of papers and resources for scene text detection and recognition

Awesome Scene Text A curated list of papers and resources for scene text detection and recognition The year when a paper was first published, includin

Jan Zdenek 43 Mar 15, 2022
A Python wrapper for the tesseract-ocr API

tesserocr A simple, Pillow-friendly, wrapper around the tesseract-ocr API for Optical Character Recognition (OCR). tesserocr integrates directly with

Fayez 1.7k Dec 31, 2022
Python Computer Vision application that allows users to draw/erase on the screen using their webcam.

CV-Virtual-WhiteBoard The Virtual WhiteBoard is a project I made using the OpenCV and Mediapipe Python libraries. Using your index and middle finger y

Stephen Wang 1 Jan 07, 2022
Slice a single image into multiple pieces and create a dataset from them

OpenCV Image to Dataset Converter Slice a single image of Persian digits into mu

Meysam Parvizi 14 Dec 29, 2022