The Massively Multilingual Image Dataset
2D Classification
License: Research Only

Overview

MMID is a large-scale, massively multilingual dataset of images paired with the words they represent, collected at the University of Pennsylvania. The dataset is doubly parallel: for each language, every word is stored alongside images that represent it and alongside its English translation (with that translation's corresponding images).
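
The doubly parallel structure can be pictured with a small Python sketch. This is only an illustration under assumptions: the class name `WordEntry`, the helper `english_images`, the field names, and the file paths are all hypothetical and do not reflect the dataset's actual on-disk layout or any official loader.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class WordEntry:
    """One foreign word with its images and English translations (hypothetical layout)."""
    word: str
    language: str
    image_paths: List[str] = field(default_factory=list)
    english_translations: List[str] = field(default_factory=list)


def english_images(entry: WordEntry, english_index: Dict[str, List[str]]) -> List[str]:
    """Collect the images attached to each English translation of the entry."""
    paths: List[str] = []
    for translation in entry.english_translations:
        paths.extend(english_index.get(translation, []))
    return paths


# Toy data invented for illustration; not taken from MMID.
english_index = {"cat": ["en/cat/01.jpg", "en/cat/02.jpg"]}
entry = WordEntry(
    word="gato",
    language="es",
    image_paths=["es/gato/01.jpg"],
    english_translations=["cat"],
)
```

Here `entry.image_paths` is the first parallel (word to images), and `english_images(entry, english_index)` follows the second parallel (word to English translation to that translation's images).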

By far the largest dataset of its kind, it covers 100 languages (including English), with up to 10,000 words per language and many more for English.

Data Summary
Data format: image
Data volume: 1000K
File size: --