首页
学习
活动
专区
工具
TVP
发布
社区首页 >专栏 >语义分割--Loss Max-Pooling for Semantic Image Segmentation

语义分割--Loss Max-Pooling for Semantic Image Segmentation

作者头像
用户1148525
发布2019-05-26 12:10:53
7920
发布2019-05-26 12:10:53
举报

Loss Max-Pooling for Semantic Image Segmentation CVPR2017 https://arxiv.org/abs/1704.02966

本文采用偏数学语言的角度来描述语义分割样本类别分布不均匀问题,提出一个方法 Pixel-Loss Max-Pooling

本文主要解决什么问题了? 针对语义分割训练样本类别分布不均匀问题 handle imbalanced (or skewed) class distributions, as often encountered in semantic segmentation datasets, within deep neural network training so far.

With imbalanced, we refer to datasets having dominant portions of their data assigned to (few) majority classes while the rest belongs to minority classes, forming comparably under-represented categories. As (mostly undesired) consequence, it can be observed that classifiers trained without correction mechanisms tend to be biased towards the majority classes during inference

这里所说的不均匀,就是我们的训练数据库有少数类别的样本很多( (few) majority classes),剩下的类别的样本都比较少。 如果在训练的时候没有修正这个不均匀,那么学习到的分类器就会偏向于 那些少数样本很多的类。

解决这个问题目前有几个思路: 1)就是在建立数据库的时候就注意到样本的均匀分布问题,例如 ImageNet, Caltech101/256 or CIFAR10/100

2)通过对样本少的类别 over-sampling 或 对样本多的类别 under-sampling over-sampling of minority classes or under-sampling from the majority classes when compiling the actual training data

3)通过引入样本类别分布的权值来改变算法行为 cost-sensitive learning changes the algorithmic behavior by introducing class-specific weights, often derived from the original data statistics

首先使用一个数学公式描述语义分割问题

这里写图片描述
这里写图片描述

其中 L 是 loss function penalizing wrong image labelings, R 是一个 regularizer

The loss function L commonly decomposes into a sum of pixel-specific losses as follows

这里写图片描述
这里写图片描述

The loss function defined in (2) weights uniformly the contribution of each pixel within the image. 上面定义的损失函数默认图像中每个像素的权重是均匀分布的

The effect of this choice is a bias of the learner towards elements that are dominant within the image (e.g. sky, building, road) to the detriment of elements occupying smaller portions of the image 这使得学习器偏向于图像中占比大的区域,相对忽略占比小的区域

In order to alleviate this issue, we propose to adaptively reweigh the contribution of each pixel based on the actual loss we observe

Our goal is to shift the focus on image parts where the loss is higher, while re-taining a theoretical link to the loss in (2). The solution we propose is an upper bound to L, which is constructed by relaxing the pixel weighting scheme

这里主要是给每个像素的权值引入一个上限 L,就是占比大的像素权值有一个上限,这样防止学习到的分类器有偏向性。

如果从样本类别分布不均匀的角度来说,我们对每个样本类型乘以一个权重系数,达到归一化的目的。

Cityscapes validation set

这里写图片描述
这里写图片描述

Evolution of semantic segmentation images during training

这里写图片描述
这里写图片描述

Cityscapes Pascal VOC 2012 validation

这里写图片描述
这里写图片描述

ResNet based results on Pascal VOC 2012 segmentation validation data

这里写图片描述
这里写图片描述
本文参与 腾讯云自媒体分享计划,分享自作者个人站点/博客。
原始发表:2017年06月09日,如有侵权请联系 cloudcommunity@tencent.com 删除

本文分享自 作者个人站点/博客 前往查看

如有侵权,请联系 cloudcommunity@tencent.com 删除。

本文参与 腾讯云自媒体分享计划  ,欢迎热爱写作的你一起参与!

评论
登录后参与评论
0 条评论
热度
最新
推荐阅读
领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档