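Project description:

PyTorch baseline for the machine reading comprehension track of the Baidu 2021 Language and Intelligence Technology Competition
Competition link: https://aistudio.baidu.com/aistudio/competition/detail/66?isFromLuge=true

The official baseline is built on the PaddlePaddle framework; I rewrote it in PyTorch. Most of the code is carried over from the official release, with only the framework-specific parts modified. I also added optimizations such as early stopping and adversarial training. If you prefer PyTorch, you can build further improvements on top of this version.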
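The README mentions adversarial training without naming the method. As an illustration only, the sketch below assumes the Fast Gradient Method (FGM), a common choice for BERT-style models; it is not confirmed to be what this repository uses. The class name `FGM`, the `epsilon` value, the `emb_name` filter and the toy `TinyModel` are all hypothetical.

```python
import torch
import torch.nn as nn


class FGM:
    """Fast Gradient Method (assumed technique): perturb the embedding
    weights by epsilon along the normalized gradient direction."""

    def __init__(self, model, epsilon=1.0, emb_name="word_embeddings"):
        self.model = model
        self.epsilon = epsilon
        self.emb_name = emb_name  # substring identifying embedding parameters
        self.backup = {}

    def attack(self):
        # Call after loss.backward() so that param.grad is populated.
        for name, param in self.model.named_parameters():
            if self.emb_name in name and param.requires_grad and param.grad is not None:
                self.backup[name] = param.data.clone()
                norm = torch.norm(param.grad)
                if norm != 0 and not torch.isnan(norm):
                    param.data.add_(self.epsilon * param.grad / norm)

    def restore(self):
        # Put the original embedding weights back before optimizer.step().
        for name, param in self.model.named_parameters():
            if name in self.backup:
                param.data = self.backup[name]
        self.backup = {}


class TinyModel(nn.Module):
    """Toy stand-in for a pretrained encoder, for demonstration only."""

    def __init__(self):
        super().__init__()
        self.word_embeddings = nn.Embedding(10, 4)
        self.classifier = nn.Linear(4, 2)

    def forward(self, ids):
        return self.classifier(self.word_embeddings(ids).mean(dim=1))


def train_step(model, fgm, optimizer, ids, labels):
    loss_fn = nn.CrossEntropyLoss()
    optimizer.zero_grad()
    loss = loss_fn(model(ids), labels)
    loss.backward()                      # gradients on the clean input
    fgm.attack()                         # perturb the embeddings
    adv_loss = loss_fn(model(ids), labels)
    adv_loss.backward()                  # accumulate adversarial gradients
    fgm.restore()                        # undo the perturbation
    optimizer.step()
    return loss.item(), adv_loss.item()
```

In a real training loop, `train_step` would replace the plain forward/backward/step sequence, at the cost of one extra forward and backward pass per batch.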

Environment

python=3.6
torch=1.7
transformers=4.5.0

Training example

Training

python run.py \
  --max_len=256 \
  --model_name_or_path=/path/to/downloaded/pretrained/model \
  --per_gpu_train_batch_size=7 \
  --per_gpu_eval_batch_size=40 \
  --learning_rate=1e-5 \
  --linear_learning_rate=1e-4 \
  --num_train_epochs=100 \
  --output_dir="./output" \
  --weight_decay=0.01 \
  --early_stop=2
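
The `--early_stop=2` flag pairs with the large `--num_train_epochs=100`: training is expected to end long before epoch 100. The sketch below shows one plausible reading, assuming the value is a patience counter (stop after that many consecutive evaluations without improvement on the dev set). The actual logic in `run.py` may differ; `EarlyStopping` and `run_with_early_stopping` are hypothetical names.

```python
class EarlyStopping:
    """Patience-based early stopping (assumed semantics of --early_stop)."""

    def __init__(self, patience=2):
        self.patience = patience
        self.best_score = None
        self.bad_evals = 0  # consecutive evaluations without improvement

    def step(self, score):
        """Record a new dev score; return True when training should stop."""
        if self.best_score is None or score > self.best_score:
            self.best_score = score
            self.bad_evals = 0
            return False
        self.bad_evals += 1
        return self.bad_evals >= self.patience


def run_with_early_stopping(dev_scores, patience=2):
    """Simulate training: dev_scores holds one dev-set score per epoch.

    Returns (last epoch run, best score seen)."""
    stopper = EarlyStopping(patience)
    for epoch, score in enumerate(dev_scores, start=1):
        if stopper.step(score):
            return epoch, stopper.best_score
    return len(dev_scores), stopper.best_score


# The score peaks at epoch 2, then fails to improve twice, so training
# stops at epoch 4 and the fifth epoch is never run.
print(run_with_early_stopping([0.60, 0.65, 0.64, 0.63, 0.70], patience=2))
```

In practice the model checkpoint would be saved whenever `best_score` improves, so the weights from the best epoch, not the last one, are used for prediction.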

Prediction

python predict.py \
  --max_len=400 \
  --model_name_or_path=/path/to/downloaded/pretrained/model \
  --per_gpu_eval_batch_size=120 \
  --output_dir="./output" \
  --fine_tunning_model=/path/to/fine-tuned/model

Experimental results

The baseline model is the base version of MacBERT (see https://github.com/ymcui/MacBERT for details).

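Future optimization strategies

Data cleaning: according to the competition organizers, the accuracy of the training set is only guaranteed to be above 92%
More data
Finer-grained data augmentation
Improvements to the model architecture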

Owner

周俊贤
