在计算视觉的领域中,Pascal VOC Challenge 就好比是数学中的哥德巴赫猜想一样。Pascal的全称是Pattern Analysis, Statical Modeling and Computational Learning。每年,该组织都会提供一系列类别的、带标签的图片,挑战者通过设计各种精妙的算法,仅根据分析图片内容来将其分类,最终通过准确率、召回率、效率来一决高下。
这项活动从2005年开始,每年的样本数据库都有所不同:
Year | Statistics | New developments | Notes |
2005 | Only 4 classes: bicycles, cars, motorbikes, people. Train/validation/test: 1578 images containing 2209 annotated objects. | Two competitions: classification and detection | Images were largely taken from exising public datasets, and were not as challenging as the flickr images subsequently used. This dataset is obsolete. |
2006 | 10 classes: bicycle, bus, car, cat, cow, dog, horse, motorbike, person, sheep. Train/validation/test: 2618 images containing 4754 annotated objects. | Images from flickr and from Microsoft Research Cambridge (MSRC) dataset | The MSRC images were easier than flickr as the photos often concentrated on the object of interest. This dataset is obsolete. |
2007 | 20 classes:Person: personAnimal: bird, cat, cow, dog, horse, sheepVehicle: aeroplane, bicycle, boat, bus, car, motorbike, trainIndoor: bottle, chair, dining table, potted plant, sofa, tv/monitorTrain/validation/test: 9,963 images containing 24,640 annotated objects. | Number of classes increased from 10 to 20Segmentation taster introducedPerson layout taster introducedTruncation flag added to annotationsEvaluation measure for the classification challenge changed to Average Precision. Previously it had been ROC-AUC. | This year established the 20 classes, and these have been fixed since then. This was the final year that annotation was released for the testing data. |
2008 | 20 classes. The data is split (as usual) around 50% train/val and 50% test. The train/val data has 4,340 images containing 10,363 annotated objects. | Occlusion flag added to annotationsTest data annotation no longer made public.The segmentation and person layout data sets include images from the corresponding VOC2007 sets. | |
2009 | 20 classes. The train/val data has 7,054 images containing 17,218 ROI annotated objects and 3,211 segmentations. | From now on the data for all tasks consists of the previous years' images augmented with new images. In earlier years an entirely new data set was released each year for the classification/detection tasks.Augmenting allows the number of images to grow each year, and means that test results can be compared on the previous years' images.Segmentation becomes a standard challenge (promoted from a taster) | No difficult flags were provided for the additional images (an omission).Test data annotation not made public. |
2010 | 20 classes. The train/val data has 10,103 images containing 23,374 ROI annotated objects and 4,203 segmentations. | Action Classification taster introduced.Associated challenge on large scale classification introduced based on ImageNet.Amazon Mechanical Turk used for early stages of the annotation. | Method of computing AP changed. Now uses all data points rather than TREC style sampling.Test data annotation not made public. |
以一张人物肖像为例,对应的Annotation描述为下:
<annotation>
<folder>VOC2007</folder>
<filename>000001.jpg</filename>
<source>
<database>The VOC2007 Database</database>
<annotation>PASCAL VOC2007</annotation>
<image>flickr</image>
<flickrid>341012865</flickrid>
</source>
<owner>
<flickrid>Fried Camels</flickrid>
<name>Jinky the Fruit Bat</name>
</owner>
<size>
<width>353</width>
<height>500</height>
<depth>3</depth>
</size>
<segmented>0</segmented>
<object>
<name>dog</name>
<pose>Left</pose>
<truncated>1</truncated>
<difficult>0</difficult>
<bndbox>
<xmin>48</xmin>
<ymin>240</ymin>
<xmax>195</xmax>
<ymax>371</ymax>
</bndbox>
</object>
<object>
<name>person</name>
<pose>Left</pose>
<truncated>1</truncated>
<difficult>0</difficult>
<bndbox>
<xmin>8</xmin>
<ymin>12</ymin>
<xmax>352</xmax>
<ymax>498</ymax>
</bndbox>
</object>
</annotation>
- 大小: 76.9 KB
分享到:
相关推荐
行人检测数据集——pascalvoc格式
介绍了Pascal VOC Challenge,讲述了一下pascal VOC Challenge的历史,同时讲述了VOC数据集的组织结构。
NWPU VHR-10的pascal voc格式NWPU VHR-10的pascal voc格式NWPU VHR-10的pascal voc格式NWPU VHR-10的pascal voc格式NWPU VHR-10的pascal voc格式NWPU VHR-10的pascal voc格式NWPU VHR-10的pascal voc格式NWPU VHR-10...
Pascal 函数集——Delphi 控件属性集
用于注释图像的免费在线网络工具,输出格式为 xml 文件列表(Pascal VOC xml 格式)。此图像标记应用程序将帮助您创建图像识别的学习基础。
pascal voc2012 train 和test 官网数据集下载真的很慢。这里提供百度网盘下载链接,保证可用!
Pascal VOC 2007数据集(用于物体检测),可用于检验 YOLO、Fast-RCNN 等算法
pascal voc 2012提取某一类的图片,例子中提取的牛这一类,新手上路,请多指教。
Pascal VOC数据集2005-2012的发展改进,官方文档中英文对照打印版
【Demo】对PASCAL VOC 数据集进行数据增强.zip
pascal语言描述,pdz文件需要使用超星阅读器。
提取VOC07或VOC12数据(07、12年的可以执行,其它年的应该也可以)中的某一类或几类,产生对应的Annotations、JPEGImages文件;python代码
PASCAL Visual Object Classes Challenge 2010年的图像数据集。PASCAL Visual Object Classes 是一个图像物体识别竞赛,用来从真实世界的图像中识别特定对象物体,共包括 4 大类 20 小类物体的识别。其类别信息如下...
著名的voc2007数据集,做目标检测的任务时常会用到,但官网下载有时比较慢~所以我把资源传到了我的百度网盘,要下载的朋友可以访问我的网盘地址~
使用Python写的一个用于标注数据集的软件,该软件可以将数据集标注成为VOC2007格式,适合计算机视觉应用。
PASCAL Visual Object Classes Challenge 2012年的图像数据集。PASCAL Visual Object Classes 是一个图像物体识别竞赛,用来从真实世界的图像中识别特定对象物体,共包括 4 大类 20 小类物体的识别。其类别信息如下...
Pascal voc2012数据集的info.json文件,用于对各类别的mIoU的计算
对应的模型文件, 直接去github链接里下载,有需要的从这里下载让我赚一个积分吧。 压缩包里包含文件: models_VGGNet_VOC0712_SSD_300x300.tar.gz models_VGGNet_VOC0712_SSD_512x512.tar.gz
数据集1000+张图片,包含五种水果,图片均已经通过ImageNet标定结束。
Pascal voc 2007 行人数据集