M3TR

Pytorch implementation of M3TR: Multi-modal Multi-label Recognition with Transformer. ACM MM 2021

Prerequisites

Python 3.6+

Pytorch 1.7

CUDA 10.1

Tesla V100 × 4

Datasets

MS-COCO: train val annotations
VOC 2007: trainval test test_anno

Train

python main.py  --data COCO2014 --data_root_dir $DATA_PATH$ --save_dir $SAVE_PATH$ --i 448  --lr 3e-4 -b 64

Test

python main.py  --data COCO2014 --data_root_dir $DATA_PATH$ --save_dir $SAVE_PATH$ --i 448  --lr 3e-4 -b 64 -e --resume checkpoint/COCO2014/checkpoint_COCO.pth

Citation

If you find this work is helpful, please cite our paper

@inproceedings{Zhao2021M3TR,
author = {Zhao, Jiawei and Zhao, Yifan and Li, Jia},
title = {M3TR: Multi-Modal Multi-Label Recognition with Transformer},
year = {2021},
address = {New York, NY, USA},
booktitle = {Proceedings of the 29th ACM International Conference on Multimedia},
pages = {469–477},
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Bert		Bert
Glove		Glove
checkpoint/COCO2014		checkpoint/COCO2014
data		data
figs		figs
models		models
LICENSE		LICENSE
README.md		README.md
main.py		main.py
trainer.py		trainer.py
util.py		util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bert

Bert

Glove

Glove

checkpoint/COCO2014

checkpoint/COCO2014

data

data

figs

figs

models

models

LICENSE

LICENSE

README.md

README.md

main.py

main.py

trainer.py

trainer.py

util.py

util.py

Repository files navigation

M3TR

Prerequisites

Datasets

Train

Test

Citation

About

Releases

Packages

Languages

License

iCVTEAM/M3TR

Folders and files

Latest commit

History

Repository files navigation

M3TR

Prerequisites

Datasets

Train

Test

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Languages