PolyGlot, a fuzzing framework for language processors

Last update: Dec 27, 2022

Related tags

Deep Learning Polyglot

Overview

PolyGlot, a fuzzing framework for language processors

Build

We tested PolyGlot on Ubuntu 18.04.

Get the source code: git clone https://github.com/s3team/Polyglot && cd Polyglot
Install prerequisite: sudo apt install -y make python g++ bison flex clang-format clang
Modify the Makefile to choose the language you want to test
Build everything: make
The fuzzer is in AFL_replate_mutate/afl-fuzz
Use the afl-gcc/afl-g++/afl-clang/afl-clang++ in AFL_replace_mutate to compile the program you want to fuzz.

Config the semantic.json

Before we run the fuzzer, we need to set some values in semantic.json. Here are some important values that you should set:

InitFileDir: This should be an absolute path of your init seed file dir. It can be the same as/different from your path of input.
BuiltinObjFile: If you want to use the build-in functions/variables/class for semantic validation, set this path (not a single file). Refer to grammar/solidity_grammar/semantic.json for an example.

Run

To run the fuzzer, we just run it like normal afl-fuzz:

afl-fuzz -i path/to/inputs -o path/to/outputs -- prog [args @@]

You should collect your own seed inputs for the fuzzer.

Apply on a new language

To do

Video tutorial

Publication

One Engine to Fuzz ‘em All: Generic Language Processor Testing with Semantic Validation

Yongheng Chen, Rui Zhong(co-first author), Hong Hu, Hangfan Zhang, Yupeng Yang, Dinghao Wu and Wenke Lee.
In Proceedings of the 41st IEEE Symposium on Security and Privacy (Oakland 2021).

Contact

Yongheng Chen: [email protected]

Rui Zhong: [email protected]

Hong Hu: [email protected]

Hangfan Zhang: [email protected]

Yupeng Yang: [email protected]

Dinghao Wu: [email protected]

Wenke Lee: [email protected]

PolyGlot, a fuzzing framework for language processors

Related tags

Overview

PolyGlot, a fuzzing framework for language processors

Build

Config the semantic.json

Run

Apply on a new language

Video tutorial

Publication

Contact

Owner

Software Systems Security Team at Penn State University

This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure recognition.

JAX-based neural network library

D2Go is a toolkit for efficient deep learning

SIMULEVAL A General Evaluation Toolkit for Simultaneous Translation

auto-tuning momentum SGD optimizer

Codes for Causal Semantic Generative model (CSG), the model proposed in "Learning Causal Semantic Representation for Out-of-Distribution Prediction" (NeurIPS-21)

classify fashion-mnist dataset with pytorch

Unrestricted Facial Geometry Reconstruction Using Image-to-Image Translation

ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Python package for dynamic system estimation of time series

Sequence-tagging using deep learning

xitorch: differentiable scientific computing library

Image super-resolution (SR) is a fast-moving field with novel architectures attracting the spotlight

A tool for calculating distortion parameters in coordination complexes.

NumQMBasic - A mini-course offered to Undergrad physics students

Open source implementation of AceNAS: Learning to Rank Ace Neural Architectures with Weak Supervision of Weight Sharing

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

This Artificial Intelligence program can take a black and white/grayscale image and generate a realistic or plausible colorized version of the same picture.

Official implementation of "StyleCariGAN: Caricature Generation via StyleGAN Feature Map Modulation" (SIGGRAPH 2021)

A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling