1,000,000 sets of images and descriptions,the types of pictures include landscapes, animals, flowers and trees, people, cars, sports, industries, and buildings. Category and an aesthetic subset, each image has no less than two descriptions, each with one sentence; a small number of images have only one description, and the description languages are English and Chinese For more details, please refer to the link: https://www.nexdata.ai/datasets/1331?source=Github
1,000,000 sets of images and descriptions
covers landscapes, animals, flowers and trees, people, cars, sports, industry, and architecture, as well as an aesthetic subset
image format is .jpg, text format is .txt
Chinese, English
in principle, a single sentence should be 5-20 characters, and each picture should cover no less than two types of descriptions, each with one sentence; a few images have only one description
the main scene or some salient features in the image
the proportion of correctly labeled images is not less than 95%
Commercial License