GitHub - hwijeen/OffensEval2020: Codes for NLPDove at SemEval 2020 Task 6: OffensEval, COLING 2020

This is the code for our submission on Semeval 2020 task 12: Offensive Language Identification in Social Media. The code and description for data augmentation with transferable data selection, will be updated in before paper submission. Our results: 1st place in greek, 3rd in Danish, 6th in Turkish, 15th in English, and 20th in Arabic.

Data

OffensEval2019: Official OLID dataset page. Data for OffensEval2020: upon request.

python merge_test.py # OffensEval2019 - merge 3 test data into one file

Quick start

Below is the command to train a model with some potentially important arguments. For exhaustive list of arguments, read train.py.

NOTE=kor
python train.py \
--train_path ../data/olid/da/offenseval-da-training-v1-train.tsv \
--test_path ../data/olid/da/offenseval-da-training-v1-test.tsv \
--demojize --lower_hashtag --segment_hashtag --textify_emoji --add_cap_sign \
--mention_limit 3 --punc_limit 3 \
--model mbert --time_pooling max_avg --layer 11 \
--attention_probs_dropout_prob 0.1 --hidden_dropout_prob 0.3 \
--lr 0.00002 --weight_decay 0.0 --layer_decrease 1.0 --freeze_upto -1 --warmup_ratio 0.1 \
--batch_size 32 --train_step 700 --patience 20 --cuda 1 --note $NOTE 

python inference.py $NOTE

Name		Name	Last commit message	Last commit date
Latest commit History 130 Commits
data_utils		data_utils
scripts		scripts
.gitignore		.gitignore
README.md		README.md
dataloading.py		dataloading.py
inference.py		inference.py
model.py		model.py
optimizer.py		optimizer.py
preprocessing.py		preprocessing.py
train.py		train.py
trainer.py		trainer.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data

Quick start

Dependencies

About

Releases

Packages

Contributors 2

Languages

hwijeen/OffensEval2020

Folders and files

Latest commit

History

Repository files navigation

Data

Quick start

Dependencies

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages