Datashredder is a simple data corruption engine written in Python. You can corrupt anything: text, images, and video.

Last update: Jul 22, 2022

Overview

Datashredder

You can choose the chance of corruption. For example, with a chance of 100, there is a 1 in 100 chance that the next piece of data will be corrupted. This lets you control how much corruption you want.

You can also choose between a fixed piece of corruption data and a random one. For example, with a corruption data of FF:

Not Corrupted: 30 32 35 53 f0 72

Corrupted: 30 FF 35 53 FF 72

A random corruption chooses a new random corruption data on each iteration.
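
The behaviour described above can be sketched in a few lines of Python. This is only an illustration of the idea, not Datashredder's actual code or API: the function name `corrupt` and its parameters `chance`, `corruption_byte`, and `seed` are hypothetical, and treating each byte as one iteration is an assumption.

```python
import random


def corrupt(data: bytes, chance: int, corruption_byte=None, seed=None) -> bytes:
    """Return a copy of data where each byte is replaced with probability 1/chance.

    Hypothetical helper for illustration; not part of the Datashredder package.
    """
    if chance < 1:
        raise ValueError("chance must be at least 1")
    rng = random.Random(seed)  # a seed makes a run repeatable
    out = bytearray(data)
    for i in range(len(out)):
        # randrange(chance) returns 0 with probability 1/chance
        if rng.randrange(chance) == 0:
            if corruption_byte is None:
                # random mode: pick a new corruption value on each iteration
                out[i] = rng.randrange(256)
            else:
                # fixed mode: always write the same corruption value
                out[i] = corruption_byte
    return bytes(out)


original = bytes.fromhex("30 32 35 53 f0 72")
print(corrupt(original, chance=3, corruption_byte=0xFF).hex(" "))
```

Passing `corruption_byte=None` switches the sketch to random mode, and a `chance` of 1 corrupts every byte.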

Examples

Cats

Each image uses a corruption data of 00.

There are 206824 iterations on this image.

Not corrupted image

Corrupted images

Image #	Chance	Corruptions
1	2000	39
2	1500	133
3	1000	200
4	500	432
5	200	1020
6	100	2069
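
As a rough cross-check, the expected number of corruptions can be estimated as iterations divided by chance. This assumes each iteration is corrupted independently with probability 1/chance, which follows the description above but is not confirmed against Datashredder's internals. The variable names are illustrative only.

```python
iterations = 206824  # iteration count stated for the cat image
chances = [2000, 1500, 1000, 500, 200, 100]  # chance values from the table

# Expected corruptions if every iteration has a 1-in-chance probability
expected = {chance: iterations / chance for chance in chances}

for chance, value in expected.items():
    print(f"chance {chance}: about {value:.0f} expected corruptions")
```

Because corruption is random, the counts from an actual run will differ from these estimates.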

Releases(0.2.17)

0.2.17(Nov 18, 2021)
Changes:

Bug patches 9433cbf501bf18b2871df117121e8dbaed9a46dd

Removed tqdm 9ad0d65c49226755f5d7dffad99a5698ada68d22

Install Command: pip install Datashredder==0.2.17

Full Changelog: https://github.com/awesomelewis2007/Datashredder/compare/0.2.15...0.2.17
Source code(tar.gz)
Source code(zip)
Datashredder-0.2.17-py3-none-any.whl(16.34 KB)
Datashredder-0.2.17.tar.gz(16.13 KB)
0.2.15(Nov 14, 2021)
Changes:

Added C installer

Added C help file

Added Makefile

Added pyproject.toml

Added setup.py

Improved Demo

Install Command: pip install Datashredder==0.2.15

Full Changelog: https://github.com/awesomelewis2007/Datashredder/compare/0.1.10...0.2.15
Source code(tar.gz)
Source code(zip)
Datashredder-0.2.15-py3-none-any.whl(14.97 KB)
Datashredder-0.2.15.tar.gz(15.82 KB)
0.1.10(Oct 31, 2021)

This is the first release of Datashredder.

This release is not on PyPI.

Full Changelog: https://github.com/awesomelewis2007/Datashredder/commits/0.1.10
Source code(tar.gz)
Source code(zip)
