Zero-shot capabilities on ImageNet #119

kimihailv · 2023-02-02T10:51:52Z

Hello, I evaluated ALBEF 14M on ImageNetV2 classification task and it showed relatively low accuracy: top1 – 32.9, top5 - 60.7.
How do you think what reasons of such results? Much smaller training dataset compared to CLIP?

LiJunnan1992 · 2023-02-03T00:25:19Z

Hi @kimihailv , we haven't evaluated this result, but yes the zero-shot performance is largely correlated with the training dataset size.

shyammarjit · 2024-03-15T15:55:41Z

Zero-shot capabilities on other datasets (such as dtd, food101, caltech101, sun397 & etc) is much lower as compared to CLIP, MetaCLIP and open_clip methods.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zero-shot capabilities on ImageNet #119

Zero-shot capabilities on ImageNet #119

kimihailv commented Feb 2, 2023

LiJunnan1992 commented Feb 3, 2023

shyammarjit commented Mar 15, 2024 •

edited

Zero-shot capabilities on ImageNet #119

Zero-shot capabilities on ImageNet #119

Comments

kimihailv commented Feb 2, 2023

LiJunnan1992 commented Feb 3, 2023

shyammarjit commented Mar 15, 2024 • edited

shyammarjit commented Mar 15, 2024 •

edited