SurgiNet: Pyramid Attention Aggregation and Class-wise Self-Distillation for Surgical Instrument Segmentation | |
Ni, Zhen-Liang1,2![]() ![]() ![]() ![]() ![]() | |
Source Publication | MEDICAL IMAGE ANALYSIS
![]() |
ISSN | 1361-8415 |
2022-02-01 | |
Volume | 76Pages:102310 |
Abstract | Surgical instrument segmentation plays a promising role in robot-assisted surgery. However, illumination issues often appear in surgical scenes, altering the color and texture of surgical instruments. Changes in visual features make surgical instrument segmentation difficult. To address illumination issues, the SurgiNet is proposed to learn pyramid attention features. The double attention module is designed to capture the semantic dependencies between locations and channels. Based on semantic dependencies, the semantic features in the disturbed area can be inferred for addressing illumination issues. Pyramid attention is aggregated to capture multi-scale features and make predictions more accurate. To perform model compression, class-wise self-distillation is proposed to enhance the representation learning of the network, which performs feature distillation within the class to eliminate interference from other classes. Top-down and multi-stage knowledge distillation is designed to distill class probability maps. By inter layer supervision, high-level probability maps are applied to calibrate the probability distribution of lowlevel probability maps. Since class-wise distillation enhances the self-learning of the network, the network can get excellent performance with a lightweight backbone. The proposed network achieves the state-of-the-art performance of 89.14% mIoU on CataIS with only 1.66 GFlops and 2.05 M parameters. It also takes first place on EndoVis 2017 with 66.30% mIoU. |
Keyword | Surgical Insturment Segmentation Class-wise Self-Distillation Pyramid Attention |
DOI | 10.1016/j.media.2021.102310 |
Indexed By | SCI |
Language | 英语 |
Funding Project | National Natural Science Foundation of China[62027813] ; National Natural Science Foundation of China[62003343] ; National Natural Science Foundation of China[U1913210] ; National Natural Science Foundation of China[U1913601] ; National Natural Science Foundation of China[U1713220] ; Beijing Science and Technology Star[Z19110 0 0 01119046] ; Youth Innovation Promotion Association of the Chinese Academy of Sciences[2018165] ; Youth Innovation Promotion Association of the Chinese Academy of Sciences[2020140] |
Funding Organization | National Natural Science Foundation of China ; Beijing Science and Technology Star ; Youth Innovation Promotion Association of the Chinese Academy of Sciences |
WOS Research Area | Computer Science ; Engineering ; Radiology, Nuclear Medicine & Medical Imaging |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Interdisciplinary Applications ; Engineering, Biomedical ; Radiology, Nuclear Medicine & Medical Imaging |
WOS ID | WOS:000742838900002 |
Publisher | ELSEVIER |
Sub direction classification | 多模态智能 |
Citation statistics | |
Document Type | 期刊论文 |
Identifier | http://ir.ia.ac.cn/handle/173211/47043 |
Collection | 复杂系统管理与控制国家重点实验室_先进机器人 |
Corresponding Author | Bian, Gui-Bin; Hou, Zeng-Guang |
Affiliation | 1.Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China 2.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China 3.Macau Univ Sci & Technol, Inst Syst Engn, Joint Lab Intelligence Sci & Technol, Macau, Peoples R China |
First Author Affilication | Institute of Automation, Chinese Academy of Sciences |
Corresponding Author Affilication | Institute of Automation, Chinese Academy of Sciences |
Recommended Citation GB/T 7714 | Ni, Zhen-Liang,Zhou, Xiao-Hu,Wang, Guan-An,et al. SurgiNet: Pyramid Attention Aggregation and Class-wise Self-Distillation for Surgical Instrument Segmentation[J]. MEDICAL IMAGE ANALYSIS,2022,76:102310. |
APA | Ni, Zhen-Liang.,Zhou, Xiao-Hu.,Wang, Guan-An.,Yue, Wen-Qian.,Li, Zhen.,...&Hou, Zeng-Guang.(2022).SurgiNet: Pyramid Attention Aggregation and Class-wise Self-Distillation for Surgical Instrument Segmentation.MEDICAL IMAGE ANALYSIS,76,102310. |
MLA | Ni, Zhen-Liang,et al."SurgiNet: Pyramid Attention Aggregation and Class-wise Self-Distillation for Surgical Instrument Segmentation".MEDICAL IMAGE ANALYSIS 76(2022):102310. |
Files in This Item: | Download All | |||||
File Name/Size | DocType | Version | Access | License | ||
MIA.pdf(1944KB) | 期刊论文 | 作者接受稿 | 开放获取 | CC BY-NC-SA | View Download |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment