[计算机视觉论文速递] 2018-06-13 图像分割专场

Amusi

发布于 2018-07-24 11:16:16

6540

发布于 2018-07-24 11:16:16

文章被收录于专栏：CVer

导言

这篇文章有4篇论文速递，都是图像分割（image segmentation）方向，其实3篇是对U-Net网络进行了改进。

编辑: Amusi

校稿: Amusi

前戏

Amusi 将日常整理的论文都会同步发布到 daily-paper-computer-vision 上。名字有点露骨，还请见谅。喜欢的童鞋，欢迎star、fork和pull。

直接点击“阅读全文”即可访问daily-paper-computer-vision。

link: https://github.com/amusi/daily-paper-computer-vision

图像分割（Image Segmentation）

《dhSegment: A generic deep-learning approach for document segmentation》

arXiv 2018

Network architecture of dhSegment

Abstract：In recent years there have been multiple successful attempts tackling document processing problems separately by designing task specific hand-tuned strategies. We argue that the diversity of historical document processing tasks prohibits to solve them one at a time and shows a need for designing generic approaches in order to handle the variability of historical series. In this paper, we address multiple tasks simultaneously such as page extraction, baseline extraction, layout analysis or multiple typologies of illustrations and photograph extraction. We propose an open-source implementation of a CNN-based pixel-wise predictor coupled with task dependent post-processing blocks. We show that a single CNN-architecture can be used across tasks with competitive results. Moreover most of the task-specific post-precessing steps can be decomposed in a small number of simple and standard reusable operations, adding to the flexibility of our approach.

arXiv：https://arxiv.org/abs/1804.10371

注：为什么不翻译，因为......原文很稳

《Automatic Pixelwise Object Labeling for Aerial Imagery Using Stacked U-Nets》

arXiv 2018

Our model labels (red) for buildings with different shapes at Innsbruck

Abstract：航空影像中物体标记的自动化是一项具有许多实际应用的计算机视觉任务。像能源勘探这样的领域需要一种自动化方法来每天处理连续的图像流（stream）。在本文中，我们提出了一个 pipeline来解决这个问题，使用一堆（a stack of）端到端的卷积神经网络（U-Net架构）。每个网络都作为后一个处理器工作。我们的模型在两个不同的数据集上胜过当前的最新技术：Inria Aerial Image Labeling数据集和Massachusetts Buildings数据集，每个数据集都具有不同的特征，如空间分辨率，物体形状和尺度。此外，我们通过处理子采样图像并稍后向上采样按像素标记来实验验证计算时间节省。节省的这些资源对分割质量的影响可以忽略不计。虽然本文进行的实验仅涵盖航空影像，但所呈现的技术是通用的并且可以处理其他类型的影像。

arXiv：https://arxiv.org/abs/1803.04953

《Stacked U-Nets: A No-Frills Approach to Natural Image Segmentation》

arXiv 2018

A typical u-net module with outer residual connection

SUNet architectures for ImageNet

Visualization of the segmentation output on PASCAL VOC 2012 test set

Abstract：许多成像任务需要有关图像中所有像素的全局信息。传统的自下而上分类网络通过降低分辨率来globalize information; 特征被池化并下采样为单个输出。但是对于语义分割和对象检测任务，网络必须提供更高分辨率的像素级输出。为了在保持解决方案的同时globalize information，许多研究人员提出了包含复杂的辅助模块，但这些代价是网络规模和计算成本大幅增加的代价。本文提出堆叠式网络（SUNets，stacked u-nets ），它在保持分辨率的同时迭代地组合不同分辨率尺度的特征。SUNets在能够处理自然图像复杂性的深层网络架构中充分利用了U-net的information globalization信息全球化能力。使用少量参数，SUNets在语义分割任务上表现出色。

arXiv：https://arxiv.org/abs/1804.10343

github：https://github.com/shahsohil/sunets

《Stack-U-Net: Refinement Network for Image Segmentation on the Example of Optic Disc and Cup》

arXiv 2018

Res-U-Net architecture | a basic block of the Stack-U-Net model

Stack-U-Net | the main proposed model

Abstract：在这项工作中，我们提出了一个特殊的级联网络图像分割，它是基于U-Net网络作为构建模块和迭代改进的思想。该模型主要用于获得更高的识别质量，用于寻找 borders of the optic disc and cup。与单个U-Net和最新的方法相比，无需增加数据集的数量即可实现非常高的分割质量。我们的实验包括与公共数据库DRIONS-DB，RIM-ONE v.3，DRISHTI-GS上最著名的方法的比较，以及与加利福尼亚大学旧金山医学院合作收集的私人数据集的评估。提出了对体系结构细节的分析，并且认为该模型可以用于广泛范围的类似性质的图像分割问题。

arXiv：https://arxiv.org/abs/1804.11294

Amusi 总结：U-Net在图像分割领域（特别是医学领域）真的可以为所欲为啊！啊！啊！

本文参与腾讯云自媒体同步曝光计划，分享自微信公众号。

原始发表：2018-06-13，如有侵权请联系 cloudcommunity@tencent.com 删除

其他

本文分享自 CVer 微信公众号，前往查看

如有侵权，请联系 cloudcommunity@tencent.com 删除。

本文参与腾讯云自媒体同步曝光计划，欢迎热爱写作的你一起参与！

其他

登录后参与评论

0 条评论

热度

[计算机视觉论文速递] 2018-06-13 图像分割专场

[计算机视觉论文速递] 2018-06-13 图像分割专场

社区

活动

资源

关于

腾讯云开发者

热门产品

热门推荐

更多推荐