【每周CV论文推荐】初学细粒度分类值得阅读的文章

用户1508658

发布于 2022-11-07 21:46:46

3320

发布于 2022-11-07 21:46:46

文章被收录于专栏：有三AI有三AI

欢迎来到《每周CV论文推荐》。在这个专栏里，还是本着有三AI一贯的原则，专注于让大家能够系统性完成学习，所以我们推荐的文章也必定是同一主题的。

图像分类是计算机视觉中最基础的技术，细粒度分类是图像分类任务中比较复杂的问题，是解决现实生活中很多图像分类任务必须掌握的方法，本次我们来简单给大家推荐一些工作。

作者&编辑 | 言有三

1 基于定位的模型

最早期的细粒度分类方法是以Part-based R-CNN为代表的强监督模型，它们的核心思想就是基于细粒度分类需要对局部的细节进行识别，因为先对这些语义区域进行定位，然后将提取的特征进行融合。

文章引用量：1000+

推荐指数：✦✦✦✦✧

[1] Zhang N, Donahue J, Girshick R, et al. Part-based R-CNNs for fine-grained category detection[C]//European conference on computer vision. Springer, Cham, 2014: 834-849.

[2] Branson S, Van Horn G, Belongie S, et al. Bird species categorization using pose normalized deep convolutional nets[J]. arXiv preprint arXiv:1406.2952, 2014.

2 高阶特征模型

对于细粒度分类任务来说，需要提取比粗粒度分类更加具有代表性的特征才能获得更好的性能，其中以双线性模型（Bilinear CNN）为代表的高阶特征模型就取得了非常好的精度。

文章引用量：3000+

推荐指数：✦✦✦✦✦

[3] Lin T Y, RoyChowdhury A, Maji S. Bilinear CNN models for fine-grained visual recognition[C]//Proceedings of the IEEE international conference on computer vision. 2015: 1449-1457.

[4] Gao Y, Beijbom O, Zhang N, et al. Compact bilinear pooling[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 317-326.

3 注意力模型

很显然，注意力机制是细粒度分类任务的关键，基于目标检测的模型是通过检测实现了显式的注意力，前述的双线性模型则是一种自注意力机制，而通用的注意力机制模型众多，在细粒度分类任务中应用广泛，比较典型的是MA-CNN模型，它是基于聚类的注意力模型。

文章引用量：2000+

推荐指数：✦✦✦✦✦

[5] Zheng H, Fu J, Mei T, et al. Learning multi-attention convolutional neural network for fine-grained image recognition[C]//Proceedings of the IEEE international conference on computer vision. 2017: 5209-5217.

[6] Fu J, Zheng H, Mei T. Look closer to see better: Recurrent attention convolutional neural network for fine-grained image recognition[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 4438-4446.

4 优化目标设计

细粒度分类任务与人脸识别算法都属于非常精细的分类任务，在人脸识别领域，针对softmax等优化目标的改进是一个非常重要的研究方向，可以提升模型的精度和泛化能力，这在细粒度分类领域也有相关研究。

文章引用量：100+

推荐指数：✦✦✦✦✧