Semantic-spatial fusion network for human parsing
Zhang, Xiaomei1,2; Chen, Yingying1,2; Zhu, Bingke1,2; Wang, Jinqiao1,2; Tang, Ming1,2
发表期刊NEUROCOMPUTING
ISSN0925-2312
2020-08-18
卷号91期号:402页码:375-383
摘要

Recently, many methods have united low-level and high-level features to generate the desired accurate high-resolution prediction for human parsing. Nevertheless, there exists a semantic-spatial gap between low-level and high-level features in some methods, i.e., high-level features represent more semantics and less spatial details, while low-level ones have less semantics and more spatial details. In this paper, we propose a Semantic-Spatial Fusion Network (SSFNet) for human parsing to shrink the gap, which generates the accurate high-resolution prediction by aggregating multi-resolution features. SSFNet includes two models, a semantic modulation model and a resolution-aware model. The semantic modulation model guides spatial details with semantics and then effectively facilitates the feature fusion, narrowing the gap. The resolution-aware model sufficiently boosts the feature fusion and obtains multi-receptive-fields, which generates reliable and fine-grained high-resolution features for each branch, in bottom-up and top-down processes. Extensive experiments on three public datasets, PASCAL-Person-Part, LIP and PPSS, show that SSFNet achieves significant improvements over state-of-the-art methods. (C) 2020 Elsevier B.V. All rights reserved.

关键词SSFNet Semantic modulation model Resolution-aware model Human parsing
DOI10.1016/j.neucom.2020.03.096
关键词[WOS]SEGMENTATION ; MODELS
收录类别SCI
语种英语
资助项目National Natural Science Foundation of China[61976210] ; National Natural Science Foundation of China[61772527]
项目资助者National Natural Science Foundation of China
WOS研究方向Computer Science
WOS类目Computer Science, Artificial Intelligence
WOS记录号WOS:000538815500005
出版者ELSEVIER
七大方向——子方向分类图像视频处理与分析
引用统计
被引频次:11[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/39811
专题紫东太初大模型研究中心_图像与视频分析
通讯作者Chen, Yingying
作者单位1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
第一作者单位模式识别国家重点实验室
通讯作者单位模式识别国家重点实验室
推荐引用方式
GB/T 7714
Zhang, Xiaomei,Chen, Yingying,Zhu, Bingke,et al. Semantic-spatial fusion network for human parsing[J]. NEUROCOMPUTING,2020,91(402):375-383.
APA Zhang, Xiaomei,Chen, Yingying,Zhu, Bingke,Wang, Jinqiao,&Tang, Ming.(2020).Semantic-spatial fusion network for human parsing.NEUROCOMPUTING,91(402),375-383.
MLA Zhang, Xiaomei,et al."Semantic-spatial fusion network for human parsing".NEUROCOMPUTING 91.402(2020):375-383.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Neurocomputing_2020 (2060KB)期刊论文作者接受稿开放获取CC BY-NC-SA浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Zhang, Xiaomei]的文章
[Chen, Yingying]的文章
[Zhu, Bingke]的文章
百度学术
百度学术中相似的文章
[Zhang, Xiaomei]的文章
[Chen, Yingying]的文章
[Zhu, Bingke]的文章
必应学术
必应学术中相似的文章
[Zhang, Xiaomei]的文章
[Chen, Yingying]的文章
[Zhu, Bingke]的文章
相关权益政策
暂无数据
收藏/分享
文件名: Neurocomputing_2020 Semantic-spatial fusion network for human parsing.pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。