graviti
产品服务
解决方案
知识库
公开数据集
关于我们
avatar
TextVQA
Image Captioning
|...
许可协议: CC-BY

Overview

TextVQA requires models to read and reason about text in images to answer questions about them. Specifically, models need to incorporate a new modality of text present in the images and reason over it to answer TextVQA questions.

Statistics

  • 28,408 images from OpenImages
  • 45,336 questions
  • 453,360 ground truth answers
数据概要
数据格式
image,
数据量
28K
文件大小
--
| 数据量 28K | 大小 --
TextVQA
Image Captioning
许可协议: CC-BY

Overview

TextVQA requires models to read and reason about text in images to answer questions about them. Specifically, models need to incorporate a new modality of text present in the images and reason over it to answer TextVQA questions.

Statistics

  • 28,408 images from OpenImages
  • 45,336 questions
  • 453,360 ground truth answers
0
立即开始构建AI
graviti
wechat-QR
长按保存识别二维码,关注Graviti公众号

Copyright@Graviti
沪ICP备19019574号
沪公网安备 31011002004865号