图像语义分割基础

History

TextonForest -> Random Forest -> FCN -> Encoder-decoder structure or CRF

Papers

FCN -> SegNet -> Dilated Convolutions -> DeepLab (v1 & v2) -> RefineNet -> PSPNet -> Large Kernel Matters ->DeepLab v3

Contributions of these papers

FCN：（ SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation）

将端到端的卷积网络推广到语义分割中；重新将与训练好的Imagenet网络用于分割问题中；使用反卷积层进行上采样（deconvolution）；提出了跳跃连接来盖上上采样的粗糙程度（skip layer）。

Keywords：编码器为ResNet-101模块，空洞卷积计算成本高

FCN vs SegNet 在上采样方法上的不同
SegNet：（ SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation）

将最大池化指数转移至解码器中，改善了分割分辨率。

Keywords：encoder-decoder模型
空洞卷积：（ Multi-Scale Context Aggregation by Dilated Convolutions

使用了空洞卷积，可用于密集预测的卷积层；提出了在多尺度聚集条件下使用空洞卷积的“背景模块”。下图为空洞卷积，又称多孔卷积（atrous convolution）。

Keywords：背景模型，空洞卷积
DeepLab（v1 & v2）：（ Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs && DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs）

使用了空洞卷积；提出了在空间维度上实现金字塔型的空洞池化（ASPP）；使用了全连接条件随机场。

Keywords：ASPP，多尺度处理
RefineNet：（RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation）

精心设计解码器模块；所有组件遵循残差连接的设计方式。

Keywords：编码器为ResNet-101模块，空洞卷积计算成本高
PSPNet： （Pyramid Scene Parsing Network）

提出了金字塔池化模块来聚合背景信息；使用了附加损失（auxiliary loss）。

Keywords：ResNet，金字塔池化模块，intermediate supervision
Large Kernel Matters：（Large Kernel Matters - Improve Semantic Segmentation by Global Convolutional Network）

提出了一种带有大维度卷积核的编码器-解码器结构。

Keywords：有效感受野，ResNet，Boundary Refinement
DeepLab v3：（Rethinking Atrous Convolution for Semantic Image Segmentation）

改进了ASPP（金字塔空洞池化方法）；级联了多个空洞卷积结构。

Keywords：ResNet，级联模块，批量归一化层

Methods：

semantic-slam-1

"图像语义分割发展过程"

History

Papers

Contributions of these papers

Methods：

CATALOG

FEATURED TAGS

FRIENDS