path2-ai/data-utils
data-utils

Use this repository to convert your source files (CSV, TXT, Excel, JSON, HTML) into record-oriented JSON files that can be uploaded to onetask.

Installation

Clone this repository and install the dependencies into an environment (e.g. Conda) by running pip install -r requirements.txt

How to use

This tool is CLI-based: open a terminal, change directory into this repository, and run python json_converter.py --filename <your_filename>

For instance, suppose you have a file my_input_file.csv with the following data:

running_id,text
1,hello world!
2,how are you doing?
3,if you have any questions about onetask - contact us anytime

Running python json_converter.py --filename my_input_file.csv will then output:

[
    {
        "running_id": 1,
        "text": "hello world!"
    },
    {
        "running_id": 2,
        "text": "how are you doing?"
    },
    {
        "running_id": 3,
        "text": "if you have any questions about onetask - contact us anytime"
    }
]
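To make the "record-oriented" shape concrete, here is a minimal, dependency-free sketch of the same transformation using only the Python standard library. It is not the code in json_converter.py, which appears to rely on pandas; the helper names `csv_to_records` and `_coerce` are hypothetical, and the number coercion is a simplification of the type inference pandas performs.

```python
import csv
import io
import json


def _coerce(value):
    """Turn numeric-looking strings into int or float; leave anything else as is."""
    for cast in (int, float):
        try:
            return cast(value)
        except (ValueError, TypeError):
            continue
    return value


def csv_to_records(text, sep=","):
    """Parse CSV text into a list of dicts, one dict per data row."""
    reader = csv.DictReader(io.StringIO(text), delimiter=sep)
    return [{key: _coerce(val) for key, val in row.items()} for row in reader]


sample = (
    "running_id,text\n"
    "1,hello world!\n"
    "2,how are you doing?\n"
)
records = csv_to_records(sample)

# Each row becomes one JSON object keyed by the header names.
print(json.dumps(records, indent=4))
```

With pandas, the same shape is typically produced by reading the file into a DataFrame and calling `to_json(orient="records")`.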

You can configure the conversion by passing additional arguments in the usual pandas style. For example, to set the delimiter of a CSV file, run python json_converter.py --filename my_input_file.csv --sep ';'
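One way such pass-through arguments could be collected is sketched below, assuming the script declares only `--filename` and forwards every other `--key value` pair to the reader as a keyword argument. The function `parse_cli` is hypothetical and is not taken from json_converter.py.

```python
import argparse


def parse_cli(argv):
    """Split argv into the filename and a dict of extra reader options."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--filename", required=True)

    # parse_known_args returns unrecognised tokens instead of raising an error.
    known, extras = parser.parse_known_args(argv)

    # Pair up the leftover "--key value" tokens into keyword arguments.
    options = {}
    for flag, value in zip(extras[0::2], extras[1::2]):
        options[flag.lstrip("-")] = value
    return known.filename, options


filename, options = parse_cli(["--filename", "my_input_file.csv", "--sep", ";"])
print(filename, options)
```

In this sketch every forwarded value stays a string, so options that expect numbers or booleans would need an explicit conversion before being handed to a reader.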

We currently support:

  • Excel (xls, xlsx, xlsm, xlsb, odf, ods and odt file extensions, read from a local filesystem or URL)
  • CSV (also works for .txt files)
  • HTML
  • JSON
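As an illustration of how the formats listed above could be routed to a reader, the following sketch maps file extensions to the names of pandas reader functions. The mapping and the helpers `pick_reader` and `load` are assumptions made for this example; the actual dispatch logic in json_converter.py may differ.

```python
from pathlib import Path

# Hypothetical mapping from file extension to the name of a pandas reader.
READERS = {
    ".csv": "read_csv",
    ".txt": "read_csv",
    ".html": "read_html",
    ".json": "read_json",
}
for ext in (".xls", ".xlsx", ".xlsm", ".xlsb", ".odf", ".ods", ".odt"):
    READERS[ext] = "read_excel"


def pick_reader(filename):
    """Return the pandas reader name for a filename, based on its extension."""
    suffix = Path(filename).suffix.lower()
    try:
        return READERS[suffix]
    except KeyError:
        raise ValueError(f"unsupported file type: {suffix!r}") from None


def load(filename, **options):
    """Load a file with the matching pandas reader, forwarding extra options."""
    import pandas as pd  # imported lazily; only needed when a file is loaded

    # Note: pandas.read_html returns a list of DataFrames, one per HTML table.
    return getattr(pd, pick_reader(filename))(filename, **options)


print(pick_reader("my_input_file.csv"))
```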

Support

If you have any questions or run into issues, feel free to contact us anytime.
