ML-datasets -- 物体识别

[toc]

开源数据集-物体识别：

Cifar10：go: ref:

http://www.cs.toronto.edu/~kriz/cifar-10-python.tar.gz

http://www.cs.toronto.edu/~kriz/cifar-10-matlab.tar.gz

http://www.cs.toronto.edu/~kriz/cifar-10-binary.tar.gz

该数据集文件包含data_batch1……data_batch5，和test_batch。他们都是由cPickle库产生的序列化后的对象（关于pickle,移步https://docs.python.org/3/library/pickle.html）。

def unpickle(file):
	import pickle
	with open(file, 'rb') as fo:
	    dict = pickle.load(fo, encoding='bytes')
    return dict

Cifar100 go:

Version	Size	md5sum
CIFAR-100 python version	161 MB	eb9058c3a382ffc7106e4002c42a8d85
CIFAR-100 Matlab version	175 MB	6a4bfa1dcd5c9453dda6bb54194911f4
CIFAR-100 binary version (suitable for C programs)	161 MB	03b5dce01913d631647c71ecec9e9cb8

VOC:

LSUN: go

LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop

国外的PASCAL
VOC和ImageNet ILSVRC比赛使用的数据集，数据领域包括卧室、冰箱、教师、厨房、起居室、酒店等多个主题。

推荐度：★★，推荐应用方向：图像识别

介绍和下载地址：http://lsun.cs.princeton.edu

Abstract

While there has been remarkable progress in the performance of visual recognition algorithms, the state-of-the-art models tend to be exceptionally data-hungry. Large labeled training datasets, expensive and tedious to produce, are required to optimize millions of parameters in deep network models. Lagging behind the growth in model capacity, the available datasets are quickly becoming outdated in terms of size and density. To circumvent this bottleneck, we propose to amplify human effort through a partially automated labeling scheme, leveraging deep learning with humans in the loop. Starting from a large set of candidate images for each category, we iteratively sample a subset, ask people to label them, classify the others with a trained model, split the set into positives, negatives, and unlabeled based on the classification confidence, and then iterate with the unlabeled set. To assess the effectiveness of this cascading procedure and enable further progress in visual recognition research, we construct a new image dataset, LSUN. It contains around one million labeled images for each of 10 scene categories and 20 object categories. We experiment with training popular convolutional networks and find that they achieve substantial performance gains when trained on this dataset.

Paper

Fisher Yu, Ari Seff, Yinda Zhang, Shuran Song, Thomas Funkhouser and Jianxiong Xiao
LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop
arXiv:1506.03365 [cs.CV], 10 Jun 2015

Data

10 scene categories for LSUN Scene Classification Challange: Downloading Code

20 object categories: Link List. Images for each category are stored in LMDB format and the database is then zipped. After downloading and decompressing the zip files, please to refer to LSUN utility code to visualize and export the images. MD5 sum for each zip file is also provided so that you can verify your downloads.

../
airplane.zip                                       06-Mar-2019 00:14     34G
airplane.zip.md5                                   19-Dec-2019 20:04      47
bicycle.zip                                        06-Mar-2019 00:44    129G
bicycle.zip.md5                                    19-Dec-2019 20:04      46
bird.zip                                           06-Mar-2019 00:57     65G
bird.zip.md5                                       19-Dec-2019 20:04      43
boat.zip                                           06-Mar-2019 01:12     86G
boat.zip.md5                                       19-Dec-2019 20:04      43
bottle.zip                                         06-Mar-2019 01:24     64G
bottle.zip.md5                                     19-Dec-2019 20:04      45
bus.zip                                            06-Mar-2019 01:29     24G
bus.zip.md5                                        19-Dec-2019 20:04      42
car.zip                                            06-Mar-2019 02:05    173G
car.zip.md5                                        19-Dec-2019 20:04      42
cat.zip                                            06-Mar-2019 02:12     42G
cat.zip.md5                                        19-Dec-2019 20:04      42
chair.zip                                          06-Mar-2019 02:31    116G
chair.zip.md5                                      19-Dec-2019 20:04      44
cow.zip                                            06-Mar-2019 02:34     15G
cow.zip.md5                                        19-Dec-2019 20:04      42
dining_table.zip                                   06-Mar-2019 02:50     48G
dining_table.zip.md5                               19-Dec-2019 20:04      51
dog.zip                                            06-Mar-2019 03:14    145G
dog.zip.md5                                        19-Dec-2019 20:04      42
horse.zip                                          06-Mar-2019 03:25     69G
horse.zip.md5                                      19-Dec-2019 20:04      44
motorbike.zip                                      06-Mar-2019 03:32     42G
motorbike.zip.md5                                  19-Dec-2019 20:04      48
person.zip                                         06-Mar-2019 04:47    477G
person.zip.md5                                     19-Dec-2019 20:04      45
potted_plant.zip                                   06-Mar-2019 04:54     43G
potted_plant.zip.md5                               19-Dec-2019 20:04      51
sheep.zip                                          06-Mar-2019 04:57     18G
sheep.zip.md5                                      19-Dec-2019 20:04      44
sofa.zip                                           06-Mar-2019 05:06     56G
sofa.zip.md5                                       19-Dec-2019 20:04      43
train.zip                                          06-Mar-2019 05:13     43G
train.zip.md5                                      19-Dec-2019 20:04      44
tv-monitor.zip                                     06-Mar-2019 05:21     46G
tv-monitor.zip.md5                                 19-Dec-2019 20:04      49

LSUN Challenge

In CVPR 2015 and 2016, a image classification challenge has been hosted in LSUN Challenge workshop to evaluate the progress of large-scale image understanding. More information can be found at the challenge webpage.

ImageNet数据集

ImageNet数据集是目前深度学习图像领域应用得非常多的一个领域，该数据集有1000多个图像，涵盖图像分类、定位、检测等应用方向。Imagenet数据集文档详细，有专门的团队维护，在计算机视觉领域研究论文中应用非常广，几乎成为了目前深度学习图像领域算法性能检验的“标准”数据集。很多大型科技公司都会参加ImageNet图像识别大赛，包括百度、谷歌、微软等。

推荐度：★★★，推荐应用方向：图像识别

介绍和下载地址：http://www.image-net.org/