Skip to content

daltonnyx/sheet-parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Sheet Data Image/PDF-to-CSV Converter

Dependencies

pip install pdf2image
pip install opencv-python
pip install pytesseract

This tool also require Pillow library, this could install through conda:

conda install -c conda-forge pillow

Usage

python spreadsheet_parser.py [input-file] [output-file] [language]

input-file: Input file, must be pdf file or image file
output-file: Output path for csv output
language: content language default is eng (must be pre-install tessdata model for that language)

Example:

python spreadsheet_parser.py examples/test.png output.csv

About

Sheet Data Image/PDF-to-CSV Converter

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages