Cornell NLVR
Image Captioning
License: Unknown

Overview

Cornell Natural Language Visual Reasoning (NLVR) is a language grounding dataset. It contains 92,244 pairs, each a natural language statement grounded in a synthetic image. The task is to determine whether the statement is true or false about the image. The data was collected through crowdsourcing, and solving the task requires reasoning about sets of objects, quantities, comparisons, and spatial relations.
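
The true/false task described above can be framed as binary classification over statement-image pairs. The following is a minimal sketch of that framing, not the dataset's actual file schema: the `NLVRExample` class, its field names, the `accuracy` helper, the toy statements, and the image paths are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass(frozen=True)
class NLVRExample:
    """One statement-image pair. Field names are hypothetical, not the released schema."""
    sentence: str    # natural language statement about the image
    image_path: str  # path to the synthetic image (placeholder paths below)
    label: bool      # True if the statement holds for the image


def accuracy(examples: Iterable[NLVRExample],
             predict: Callable[[str, str], bool]) -> float:
    """Fraction of examples where the predicted truth value matches the label."""
    items = list(examples)
    if not items:
        raise ValueError("no examples to evaluate")
    correct = sum(predict(ex.sentence, ex.image_path) == ex.label for ex in items)
    return correct / len(items)


def always_true(sentence: str, image_path: str) -> bool:
    """Trivial baseline that ignores its inputs and always answers True."""
    return True


# Made-up examples for illustration; these are not taken from the dataset.
toy = [
    NLVRExample("There is a box with two yellow items.", "img_0.png", True),
    NLVRExample("There is exactly one black triangle.", "img_1.png", False),
    NLVRExample("A blue square touches the wall.", "img_2.png", True),
    NLVRExample("All boxes contain a circle.", "img_3.png", False),
]

baseline_accuracy = accuracy(toy, always_true)
```

A real model would replace `always_true` with a function that actually inspects the image and the statement. The constant baseline is included only because it shows how the metric behaves on a balanced set of labels.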

Data Summary
Data format: image
Data size: --
File size: --
Publisher: Alane Suhr