当时尚遇上AI！港中文MMLab开源MMFashion工具箱

代码医生工作室

发布于 2019-11-12 12:46:00

1.1K0

文章被收录于专栏：相约机器人相约机器人

我们今天在OpenMMLab下正式开源了MMFashion这个项目的代码和预训练模型。它是一个针对视觉时尚分析（Visual Fashion Analysis) 的工具箱。

开源地址：

https://github.com/open-mmlab/mmfashion

我们在v0.1版本中实现了服饰领域的三大主流任务：

1）服饰属性预测；

2）给定服饰找同款；

3）服饰关键点定位。

我们将在后续版本中增加服饰语义分割和服饰推荐这两个任务。

早在2016年，中国香港中文大学多媒体实验室（Multimedia Lab）就开始了将深度学习应用于时尚分析领域的研究。

我们向研究社区贡献了DeepFashion [1] 这个大型数据集：

http://mmlab.ie.cuhk.edu.hk/projects/DeepFashion.html

同时提出并定义了服饰领域的五大任务：attribute prediction，in-shop clothes retrieval，consumer-to-ship clothes retrieval，fashion landmark detection和fashion systhesis。

DeepFashion已经成为了时尚分析领域公认的通用数据集。实验室也基于此不断深入人工智能在时尚领域的研究，比如后续的fashion landmark detection [2][3], fashion synthesis [4]和fashion recommendation。

为了整理实验室在服饰分析领域的技术积累并进一步推进相关领域的发展，我们启动了MMFashion的开源项目。MMFashion旨在给服饰分析领域提供一个通用性广，易于拓展，功能性强的toolbox，它具有以下三个特征：

1）通用性好：目前收集了基于VGG16和ResNet50的网络结构作为backbone, 用户还可以根据需要在config文件中轻松替换backbone。

2）拓展性好：代码框架延续了open-mmlab的特征，比如mmdetection, mmaction等，用户可以像搭积木一样拓展功能结构，比如加loss function，pooling method。

3）功能性强：0.1版本实现了三个任务，我们将在后续的版本中增加fashion parsing/segmentaion和fashion compatibility/recommendation这两个任务，基本可满足fashion领域的所有功能需求。

人工智能在时尚领域的应用越来越受到关注，而业界/学界目前还没有这样一个易于理解，易于使用的代码平台。

我们希望MMFashion可以为领域内外的研究者和工程师们提供这样一个全面且易用的codebase。

在此基础上，我们也希望视觉时尚分析领域能够蓬勃发展，产生更多具有重大影响力的研究成果。

[1] Liu, Z., Luo, P., Qiu, S., Wang, X., & Tang, X. (2016). Deepfashion: Powering robust clothes recognition and retrieval with rich annotations. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1096-1104).

[2] Liu, Z., Yan, S., Luo, P., Wang, X., & Tang, X. (2016). Fashion landmark detection in the wild. In European Conference on Computer Vision (pp. 229-245). Springer, Cham.

[3] Yan, S., Liu, Z., Luo, P., Qiu, S., Wang, X., & Tang, X. (2017). Unconstrained fashion landmark detection via hierarchical recurrent transformer networks. In Proceedings of the 25th ACM international conference on Multimedia (pp. 172-180). ACM.

[4] Zhu, S., Urtasun, R., Fidler, S., Lin, D., & Loy, C. (2017). Be your own prada: Fashion synthesis with structural coherence. In Proceedings of the IEEE International Conference on Computer Vision (pp. 1680-1688).

本文参与腾讯云自媒体同步曝光计划，分享自微信公众号。

原始发表：2019-11-07，如有侵权请联系 cloudcommunity@tencent.com 删除

backbone