CoderX

A proof-of-concept AI system by Graham Neubig (June 30, 2021).

About CoderX

CoderX is a retrieval-based code generation AI system reminiscent of Hayati et al. (2018) or Hashimoto et al. (2018). Unlike those methods, however, CoderX uses a highly sophisticated sequence-to-sequence model built on introspection-based learning to remove content that may harm the usability of the code in certain application scenarios, such as inconvenient licensing restrictions.

Usage

Using CoderX is simple: if you specify a GitHub repository, CoderX will generate code with very similar or identical functionality to that repository. However, because it was generated by an AI system, it fortunately belongs to you, is not encumbered by copyright, and can be used for any purpose you want. Use it as follows:

python coderx.py [address of github repo]

As a result, the retrieved repository will exist in the current directory, and the generated code will exist in a directory of the same name with _coderx appended to the end.
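To make the directory layout concrete, here is a minimal sketch of how the two directory names could be derived from a repository address. It is not taken from coderx.py: the function name `expected_dirs` and the example URL are hypothetical, and it assumes the retrieved directory is named the way `git clone` names it by default (the last path component, minus any `.git` suffix).

```python
from pathlib import PurePosixPath
from typing import Tuple
from urllib.parse import urlparse


def expected_dirs(repo_address: str) -> Tuple[str, str]:
    """Return (retrieved_dir, generated_dir) for a repository address.

    Hypothetical helper: assumes git-clone-style naming for the retrieved
    repository, and the "_coderx" suffix described in the Usage section.
    """
    # Take the last path component of the address, ignoring a trailing slash.
    path = urlparse(repo_address).path.rstrip("/")
    name = PurePosixPath(path).name
    # git clone drops a trailing ".git" when choosing the directory name.
    if name.endswith(".git"):
        name = name[: -len(".git")]
    if not name:
        raise ValueError(f"cannot derive a directory name from {repo_address!r}")
    return name, f"{name}_coderx"


if __name__ == "__main__":
    # Hypothetical repository address, used only for illustration.
    retrieved, generated = expected_dirs("https://github.com/example/project.git")
    print(retrieved, generated)
```

Under these assumptions, running CoderX on a repository named project would leave project and project_coderx side by side in the current directory.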

License

CoderX is licensed under the JSON license, which basically means you can use it for anything but evil. Unless you use CoderX on itself, of course, in which case you get a library with similar functionality that is not encumbered by this pesky "do no evil" clause.

