Collection of scripts to pinpoint obfuscated code

Last update: Nov 26, 2022

Overview

Obfuscation Detection (v1.0)

Author: Tim Blazytko

Automatically detect control-flow flattening and other state machines

Description:

Scripts and sample binaries for automatically detecting control-flow flattening and other state machines in compiled programs.

The implementation is based on Binary Ninja. For more information, see the blog post "Automated Detection of Control-flow Flattening".
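This README does not spell out how a flattening score is computed. As a rough illustration only, the sketch below scores a control-flow graph with a dominance-based heuristic: for every block that is the target of a back edge from a block it dominates (a dispatcher-like loop head), it takes the fraction of all blocks that this block dominates, and reports the largest such fraction. The heuristic itself, the function names compute_dominators and flattening_score, and the dictionary-based graph are assumptions made for this example. They are not taken from the repository and do not use the Binary Ninja API.

```python
from typing import Dict, List, Set


def compute_dominators(cfg: Dict[str, List[str]], entry: str) -> Dict[str, Set[str]]:
    """Return, for each block, the set of blocks that dominate it.

    Assumes every block appears as a key in `cfg` and is reachable from `entry`.
    """
    nodes = set(cfg)
    preds: Dict[str, Set[str]] = {n: set() for n in nodes}
    for src, succs in cfg.items():
        for dst in succs:
            preds[dst].add(src)

    # Classic iterative data-flow formulation: start with "everything
    # dominates everything" and shrink the sets until nothing changes.
    dom = {n: set(nodes) for n in nodes}
    dom[entry] = {entry}
    changed = True
    while changed:
        changed = False
        for n in nodes - {entry}:
            if not preds[n]:
                continue  # unreachable block, outside this sketch's assumptions
            new = set.intersection(*(dom[p] for p in preds[n])) | {n}
            if new != dom[n]:
                dom[n] = new
                changed = True
    return dom


def flattening_score(cfg: Dict[str, List[str]], entry: str) -> float:
    """Hypothetical score in [0, 1]; higher means more dispatcher-like."""
    dom = compute_dominators(cfg, entry)
    best = 0.0
    for block in cfg:
        dominated = {n for n in cfg if block in dom[n]}
        # A back edge exists if some block dominated by `block` jumps to it.
        if any(block in cfg[src] for src in dominated):
            best = max(best, len(dominated) / len(cfg))
    return best


# Toy flattened function: a dispatcher fans out to handlers that jump back.
flat = {
    "entry": ["dispatcher"],
    "dispatcher": ["a", "b", "c", "exit"],
    "a": ["dispatcher"],
    "b": ["dispatcher"],
    "c": ["dispatcher"],
    "exit": [],
}
# Straight-line code has no back edges at all.
linear = {"entry": ["b1"], "b1": ["b2"], "b2": []}

flat_score = flattening_score(flat, "entry")
linear_score = flattening_score(linear, "entry")
```

In the toy graph the dispatcher dominates five of the six blocks, so the sketch reports 5/6, while the straight-line graph scores 0.0.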

Usage

$ ./detect_flattening.py samples/finspy 
Function 0x401602 has a flattening score of 0.9473684210526315.
Function 0x4017c0 has a flattening score of 0.9981378026070763.
Function 0x405150 has a flattening score of 0.9166666666666666.
Function 0x405270 has a flattening score of 0.9166666666666666.
Function 0x405370 has a flattening score of 0.9984544049459042.
Function 0x4097a0 has a flattening score of 0.9992378048780488.
Function 0x412c70 has a flattening score of 0.9629629629629629.
Function 0x412df0 has a flattening score of 0.9629629629629629.
Function 0x412f70 has a flattening score of 0.9927007299270073.
Function 0x4138e0 has a flattening score of 0.9629629629629629.
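When many functions are reported, it can help to post-process the output. The following is a minimal sketch that parses lines in the format shown above and keeps the functions at or above a chosen score. The helper names parse_scores and functions_above, and the 0.95 cut-off, are hypothetical and not part of the repository.

```python
import re
from typing import Dict, List

# Matches lines such as:
# "Function 0x401602 has a flattening score of 0.9473684210526315."
LINE_RE = re.compile(
    r"Function (0x[0-9a-fA-F]+) has a flattening score of (\d+(?:\.\d+)?)\.?\s*$"
)


def parse_scores(output: str) -> Dict[int, float]:
    """Map function start address -> flattening score; other lines are ignored."""
    scores: Dict[int, float] = {}
    for line in output.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            scores[int(match.group(1), 16)] = float(match.group(2))
    return scores


def functions_above(scores: Dict[int, float], threshold: float) -> List[int]:
    """Return addresses whose score is at least `threshold`, highest score first."""
    hits = [addr for addr, score in scores.items() if score >= threshold]
    return sorted(hits, key=lambda addr: scores[addr], reverse=True)


sample = """\
Function 0x4017c0 has a flattening score of 0.9981378026070763.
Function 0x405150 has a flattening score of 0.9166666666666666.
"""
scores = parse_scores(sample)
for addr in functions_above(scores, 0.95):
    print(hex(addr), scores[addr])
```

The output text could be captured by redirecting the script's standard output to a file and passing the file contents to parse_scores.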

Note

The password for the zipped malware samples is "infected". To unpack, use the following command line:

$ unzip -P infected samples.zip

Contact

For more information, contact @mr_phrazer.

Comments

plugin?

Are you interested in a PR to add a plugin.json, so that this could be used either in headless mode on the command line or via the UI inside BN itself? That would also make it installable via the plugin manager.

opened by psifertex 2

Replace Counter.total() for users with Python < 3.10

I'm running Binary Ninja on Windows 10, where it uses Python 3.9.2. Counter.total() was only added in Python 3.10, so the call in calc_uncommon_instruction_sequences_score() fails there. I've replaced it with sum(counter.values()), which should do the same thing.

opened by samrussell 1
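The replacement suggested in this comment can be sketched as follows. collections.Counter.total() exists only on Python 3.10 and later, whereas summing the values works on older interpreters too. The helper name counter_total and the sample data are hypothetical, and the snippet is not copied from the repository.

```python
from collections import Counter


def counter_total(counter: Counter) -> int:
    """Sum of all counts; equivalent to Counter.total() on Python >= 3.10."""
    return sum(counter.values())


# Made-up instruction mnemonics standing in for real disassembly.
mnemonics = Counter(["mov", "mov", "add", "jmp", "mov", "add"])
total = counter_total(mnemonics)
```

Using sum(counter.values()) everywhere avoids a version check, at the cost of not calling the built-in method where it is available.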

Releases (v1.4)

v1.4 (Feb 23, 2022)

v1.3 (Feb 14, 2022)

v1.2 (Aug 14, 2021)

v1.1 (Aug 10, 2021)

v1.0 (Mar 5, 2021)

Owner

Tim Blazytko
