CASIA OpenIR  > 毕业生  > 硕士学位论文
Thesis Advisor王飞跃
Degree Grantor中国科学院研究生院
Place of Conferral北京
Keyword语义分割 视差计算 平行驾驶
Other Abstract 


另一方面,语义分割作为视觉理解的一个重要的方面也发挥着重要的作用。由于大规模数据集的不断出现,尤其是在深度学习的背景下,物体检测的精确度在不断提高。语义分割的目标是为图像中的每个像素点分配一个类别,这在自动驾驶中也扮演着重要的角色。深度卷积神经网络的成功使得像素级的语义分割在精度上取得巨大的提高,这一切要归功于丰富的层级特征和端对端训练的框架。监督式语义分割网络通常专注于将比较深层次的全卷积网络应用到其中。尤其是101层的ResNet-101网络在PASCAL VOC2012数据集上将平均交叉区域(mIoU)提高到新的层级。因为越深的网络通常可以提取更加具有判别里的特征,因而可以更好地区分不同的种类。





;  Binocular Stereo Vision has been studied for many years. The depth information of a picture is very important in practice. It provides an important basis for numerous higher level challenges such as advanced driver assistance and autonomous systems. Given a left-right image pair, depth can be estimated by matching corresponding pixels. As depth is inversely proportional to disparity, a stereo matching system is targeted to produce an accurate dense disparity instead.
On the other hand, Semantic segmentation act as one of the aspect of Visual Understanding has shown its importance in Scene Understanding. Object detection has benefited enormously from large-scale datasets, especially in the context of deep learning. Semantic segmentation aims to assign a categorical label to every pixel in an image, which plays an important role in self-driving system. The recent success of deep convolutional neural network models has enabled remarkable progress in pixel-wise semantic segmentation tasks due to rich hierarchical features and an end-to-end trainable framework. Improvements on fully-supervised semantic segmentation systems are generally focus on applying deeper FCN models. The introduction of a 101-layer ResNet-101 achieved a significant gain in mean Intersection over Union (mIoU) scores on PASCAL VOC2012 datasets. Since deeper networks generally can model more complex representations and learn more discriminative features that better distinguish among categories.
 Although great achievement has been made in the area of Semantic Segmentation and Disparity estimation respectively. However, they have never been combined to do research which is necessary just as we human do when recognize the outside world. So we propose a new architecture calling Seg-Disp which compute Disparity estimation and Semantic Segmentation jointly using deep convolutional neutral network:
  1. Design the Disparity estimation part of Seg-Disp: We use the hour-glass structure DispNet as our baseline to compute disparity and concatenate it with the segmentation branch calling ResNet-101. After that, the extracted disparity feature will be fused with features extracted from the semantic branch which has the same size using concatenate or crop and up-sampling, and we use End-Point-Error as our Loss to update our net. The pretrained caffemodel which is achieved from the monkaa dataset will be used to initialize the disparity branch and will not be updated during training. We input the semantic part of Cityscapes dataset to update the disparity branch until convergence.
  2. Design the semantic segmentation part of the Seg-Disp: Using the same structure in the front-end as mentioned above, and fuse inter feature the same way, what is different is the Loss function. We use Softmax-Loss as our Loss function to update the network. This time we use the updated caffemodel to initialize the disparity branch and update weighs in the semantic part of Cityscapes dataset. Doing the cycle until the joint network achieves convergence.
Introduce the conception of Parallel Driving and elaborate that Parallel Driving can be an efficient way for unmanned vehicles to drive safely on real world way, then we introduce the meaning of our SegDisp in parallel driving which can be reflected in parallel testing.
Document Type学位论文
First Author AffilicationInstitute of Automation, Chinese Academy of Sciences
Recommended Citation
GB/T 7714
沈宇. 面向平行驾驶的语义视差深度学习计算方法研究[D]. 北京. 中国科学院研究生院,2018.
Files in This Item:
File Name/Size DocType Version Access License
面向平行驾驶的语义视差深度学习计算方法研(3908KB)学位论文 暂不开放CC BY-NC-SAApplication Full Text
Related Services
Recommend this item
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[沈宇]'s Articles
Baidu academic
Similar articles in Baidu academic
[沈宇]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[沈宇]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.