【论文推荐】最新5篇行人重识别( Person Re-ID)相关论文—样本生成、超越人类、实践指南、姿态归一化、图像生成

【导读】专知内容组整理了最近五篇行人重识别( Person Re-Identification)相关文章,为大家进行介绍,欢迎查看!

1. Multi-pseudo Regularized Label for Generated Samples in Person Re-Identification(行人重识别:基于多伪正则化标签的样本生成方法)



作者:Yan Huang,Jinsong Xu,Qiang Wu,Zhedong Zheng,Zhaoxiang Zhang,Jian Zhang

摘要:Sufficient training data is normally required to train deeply learned models. However, the number of pedestrian images per ID in person re-identification (re-ID) datasets is usually limited, since manually annotations are required for multiple camera views. To produce more data for training deeply learned models, generative adversarial network (GAN) can be leveraged to generate samples for person re-ID. However, the samples generated by vanilla GAN usually do not have labels. So in this paper, we propose a virtual label called Multi-pseudo Regularized Label (MpRL) and assign it to the generated images. With MpRL, the generated samples will be used as supplementary of real training data to train a deep model in a semi-supervised learning fashion. Considering data bias between generated and real samples, MpRL utilizes different contributions from predefined training classes. The contribution-based virtual labels are automatically assigned to generated samples to reduce ambiguous prediction in training. Meanwhile, MpRL only relies on predefined training classes without using extra classes. Furthermore, to reduce over-fitting, a regularized manner is applied to MpRL to regularize the learning process. To verify the effectiveness of MpRL, two state-of-the-art convolutional neural networks (CNNs) are adopted in our experiments. Experiments demonstrate that by assigning MpRL to generated samples, we can further improve the person re-ID performance on three datasets i.e., Market-1501, DukeMTMCreID, and CUHK03. The proposed method obtains +6.29%, +6.30% and +5.58% improvements in rank-1 accuracy over a strong CNN baseline respectively, and outperforms the state-of-the- art methods.

x

期刊:arXiv, 2018年1月29日

网址

http://www.zhuanzhi.ai/document/735fe58ab843f2fb02adb71bd0dcbbb7

2. AlignedReID: Surpassing Human-Level Performance in Person Re-Identification(AlignedReID:在行人重识别中超越了人类水平)



作者:Xuan Zhang,Hao Luo,Xing Fan,Weilai Xiang,Yixiao Sun,Qiqi Xiao,Wei Jiang,Chi Zhang,Jian Sun

摘要:In this paper, we propose a novel method called AlignedReID that extracts a global feature which is jointly learned with local features. Global feature learning benefits greatly from local feature learning, which performs an alignment/matching by calculating the shortest path between two sets of local features, without requiring extra supervision. After the joint learning, we only keep the global feature to compute the similarities between images. Our method achieves rank-1 accuracy of 94.4% on Market1501 and 97.8% on CUHK03, outperforming state-of-the-art methods by a large margin. We also evaluate human-level performance and demonstrate that our method is the first to surpass human-level performance on Market1501 and CUHK03, two widely used Person ReID datasets.

期刊:arXiv, 2018年1月31日

网址

http://www.zhuanzhi.ai/document/bc360742187b5572c5e07cb0a2284fe7

3. Re-ID done right: towards good practices for person re-identification(Re-ID:行人重识别中实践指南)



作者:Jon Almazan,Bojana Gajic,Naila Murray,Diane Larlus

摘要:Training a deep architecture using a ranking loss has become standard for the person re-identification task. Increasingly, these deep architectures include additional components that leverage part detections, attribute predictions, pose estimators and other auxiliary information, in order to more effectively localize and align discriminative image regions. In this paper we adopt a different approach and carefully design each component of a simple deep architecture and, critically, the strategy for training it effectively for person re-identification. We extensively evaluate each design choice, leading to a list of good practices for person re-identification. By following these practices, our approach outperforms the state of the art, including more complex methods with auxiliary components, by large margins on four benchmark datasets. We also provide a qualitative analysis of our trained representation which indicates that, while compact, it is able to capture information from localized and discriminative regions, in a manner akin to an implicit attention mechanism.

期刊:arXiv, 2018年1月17日

网址

http://www.zhuanzhi.ai/document/074aefb3ce8c22258d68c3e721e21e8a

4. Pose-Normalized Image Generation for Person Re-identification(基于姿态归一化图像生成的行人重识别方法)



作者:Xuelin Qian,Yanwei Fu,Wenxuan Wang,Tao Xiang,Yang Wu,Yu-Gang Jiang,Xiangyang Xue

摘要:Person Re-identification (re-id) faces two major challenges: the lack of cross-view paired training data and learning discriminative identity-sensitive and view-invariant features in the presence of large pose variations. In this work, we address both problems by proposing a novel deep person image generation model for synthesizing realistic person images conditional on pose. The model is based on a generative adversarial network (GAN) and used specifically for pose normalization in re-id, thus termed pose-normalization GAN (PN-GAN). With the synthesized images, we can learn a new type of deep re-id feature free of the influence of pose variations. We show that this feature is strong on its own and highly complementary to features learned with the original images. Importantly, we now have a model that generalizes to any new re-id dataset without the need for collecting any training data for model fine-tuning, thus making a deep re-id model truly scalable. Extensive experiments on five benchmarks show that our model outperforms the state-of-the-art models, often significantly. In particular, the features learned on Market-1501 can achieve a Rank-1 accuracy of 68.67% on VIPeR without any model fine-tuning, beating almost all existing models fine-tuned on the dataset.

期刊:arXiv, 2018年1月18日

网址

http://www.zhuanzhi.ai/document/7ef1e354b55bce36833394c4270ca649

5. Disentangled Person Image Generation(分解行人图像生成方法)



作者:Liqian Ma,Qianru Sun,Stamatios Georgoulis,Luc Van Gool,Bernt Schiele,Mario Fritz

摘要:Generating novel, yet realistic, images of persons is a challenging task due to the complex interplay between the different image factors, such as the foreground, background and pose information. In this work, we aim at generating such images based on a novel, two-stage reconstruction pipeline that learns a disentangled representation of the aforementioned image factors and generates novel person images at the same time. First, a multi-branched reconstruction network is proposed to disentangle and encode the three factors into embedding features, which are then combined to re-compose the input image itself. Second, three corresponding mapping functions are learned in an adversarial manner in order to map Gaussian noise to the learned embedding feature space, for each factor respectively. Using the proposed framework, we can manipulate the foreground, background and pose of the input image, and also sample new embedding features to generate such targeted manipulations, that provide more control over the generation process. Experiments on Market-1501 and Deepfashion datasets show that our model does not only generate realistic person images with new foregrounds, backgrounds and poses, but also manipulates the generated factors and interpolates the in-between states. Another set of experiments on Market-1501 shows that our model can also be beneficial for the person re-identification task.

期刊:arXiv, 2018年1月22日

网址

http://www.zhuanzhi.ai/document/85812c4cdf8ae54ce0e29c7ff251c2b5

原文发布于微信公众号 - 专知(Quan_Zhuanzhi)

原文发表时间:2018-02-14

本文参与腾讯云自媒体分享计划,欢迎正在阅读的你也加入,一起分享。

发表于

我来说两句

0 条评论
登录 后参与评论

相关文章

来自专栏技术沉淀

01 The Learning Problem

也就是要依次回答:何时可以用机器学习?为何可以机器学习?怎样机器学习?怎样更好地机器学习?构建一幅大Picture!

12420
来自专栏GAN&CV

GAN原理,优缺点、应用总结

版权声明:本文为博主原创文章,未经博主允许不得转载。 https://blog.csdn.net/qq_25737169/article/d...

42420
来自专栏目标检测和深度学习

全球最全计算机视觉资料(1:入门学习|课程|综述|图书|期刊会议)

35620
来自专栏AI科技大本营的专栏

读了那么多GANs的原理,还是不懂怎么用!两个案例教教你

编译|AI科技大本营(rgznai100) 参与 | 尚岩奇、周翔 生成式对抗网络(GANs)是一类用于解决无监督学习问题的神经网络,它们可以完成各种任务,例如...

33980
来自专栏新智元

赋予人工智能记忆的人,带你梳理深度学习核心算法

作者介绍:Jürgen Schmidhuber 被称为是赋予人工智能记忆的人,递归神经网络之父,2004 年到 2009 年,担任慕尼黑大学认知与机器人领域的教...

37470
来自专栏PPV课数据科学社区

以撩妹为例,5分钟让你秒懂深度学习!

爱在七夕 七夕,农历七月初七, 人们说它是中国的情人节, 可最初它是中国少女的乞巧节, 而现在,这些都不重要, 重要的是, 它是属于所有心中有“爱”之人的节日...

33040
来自专栏Petrichor的专栏

深度学习: ILSVRC竞赛

Large Scale Visual Recognition Challenge (ILSVRC):

63530
来自专栏量化投资与机器学习

【动态时间规整算法】之股指期货交易策略(一)

前言 Dynamic Time Warping(DTW),动态时间规整算法诞生有一定的历史了(日本学者Itakura提出),它出现的目的也比较单纯,是一种衡量两...

38270
来自专栏AI2ML人工智能to机器学习

Hinton和Jordan理解的EM算法

在“Hinton是如何理解PCA?”里面,我们体会到Hinton高人一等的见解。 Hinton, 这个深度学习的缔造者( 参考 攒说 Geoff Hinton ...

16630
来自专栏AI研习社

神经网络有什么理论支持?

三秒钟理解本文主旨: 问:神经网络有什么理论支持? 答:目前为止(2017 年)没有什么特别靠谱的。 下面是正文。 [本文主要介绍与神经网络相关的理论工作。 个...

50160

扫码关注云+社区

领取腾讯云代金券