深度学习、机器学习图像/人脸/字幕/自动驾驶数据集(Dataset)汇总

机器学习AI算法工程

发布于 2018-03-15 15:01:22

1.6K0

发布于 2018-03-15 15:01:22

文章被收录于专栏：机器学习AI算法工程

1. CIFAR-10 & CIFAR-100

CIFAR-10包含10个类别，50,000个训练图像，彩色图像大小：32x32，10,000个测试图像。

下载地址

https://www.cs.toronto.edu/~kriz/cifar.html

（类别：airplane，automobile, bird, cat, deer, dog, frog, horse, ship, truck）

（作者：Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton）

（数据格式：Python版本、Matlab版本、二进制版本<for C程序>）

CIFAR-100

下载地址

https://www.cs.toronto.edu/~kriz/cifar.html

与CIFAR-10类似，包含100个类，每类有600张图片，其中500张用于训练，100张用于测试；这100个类分组成20个超类。每个图像有一个"find" label和一个"coarse"label。

2. 图像分类结果及对应的论文

下载地址

http://rodrigob.github.io/are_we_there_yet/build/classification_datasets_results.html

包含数据集：MNIST、CIFAR-10、CIFAR-100、STL-10、SVHN、ILSVRC2012 task 1

ILSVRC： ImageNet Large Scale Visual Recognition Challenge

下载地址

http://www.image-net.org/challenges/LSVRC/2014/

3. ImageNet

下载地址

http://www.image-net.org/

ImageNet相关信息如下：

1）Total number of non-empty synsets: 21841 2）Total number of images: 14,197,122 3）Number of images with bounding box annotations: 1,034,908 4）Number of synsets with SIFT features: 1000 5）Number of images with SIFT features: 1.2 million

4. COCO

下载地址

http://mscoco.org/

COCO(Common Objects in Context)是一个新的图像识别、分割、和字幕数据集，它有如下特点：

1）Object segmentation

2）Recognition in Context 3）Multiple objects per image 4）More than 300,000 images 5）More than 2 Million instances 6）80 object categories 7）5 captions per image 8）Keypoints on 100,000 people

COCO 2016 Detection Challenge(2016.6.1-2016.9.9)

http://mscoco.org/dataset/#detections-challenge2016

和COCO 2016 Keypoint Challenge(2016.6.1-2016.9.9)

http://mscoco.org/dataset/#keypoints-challenge2016

已经由Microsoft发起由ECCV 2016(ECCV：European Conference On Computer Vision )。

4. 3D数据

1）RGB-D People Dataset

http://www2.informatik.uni-freiburg.de/~spinello/RGBD-dataset.html

2）NYU Hand Pose Dataset

http://cims.nyu.edu/~tompson/NYU_Hand_Pose_Dataset.htm

code

https://github.com/jonathantompson/ModelFit

3）Human3.6M (3D Human Pose Dataset)

http://vision.imar.ro/human3.6m/description.php

《Iterated Second-Order Label Sensitive Pooling for 3D Human Pose Estimation》

5. 人脸Dataset

1）LFW (Labeled Faces in the Wild)

http://vis-www.cs.umass.edu/lfw/index.html

6. Stereo Datasets

1）Middlebury Stereo Datasets

http://vision.middlebury.edu/stereo/data/

2）KITTI Vision Benchmark Suite

http://www.cvlibs.net/datasets/kitti/eval_scene_flow.php?benchmark=stereo

7. 普林斯顿大学人工智能自动驾驶汽车项目

1）Deep Drive

http://deepdrive.io/

2）Source Code and Data

http://deepdriving.cs.princeton.edu/

本文参与腾讯云自媒体同步曝光计划，分享自微信公众号。

原始发表：2018-01-17，如有侵权请联系 cloudcommunity@tencent.com 删除

其他

本文分享自大数据挖掘DT数据分析微信公众号，前往查看

如有侵权，请联系 cloudcommunity@tencent.com 删除。

本文参与腾讯云自媒体同步曝光计划，欢迎热爱写作的你一起参与！

其他

登录后参与评论

0 条评论

热度