SignParser: An End-to-End Framework for Traffic Sign Understanding

doi:10.1007/s11263-023-01912-9

	SignParser: An End-to-End Framework for Traffic Sign Understanding
	Guo, Yunfei1,2 ; Feng, Wei1,2 ; Yin, Fei1,2 ; Liu, Cheng-Lin1,2
发表期刊	INTERNATIONAL JOURNAL OF COMPUTER VISION
ISSN	0920-5691
	2023-10-17
卷号	132 期号:2 页码:805-821
摘要	In intelligent transportation systems, parsing traffic signs and transmitting traffic information to humans is an urgent need. However, despite the success achieved in the detection and recognition of low-level circular or triangular traffic signs, parsing the more complex and informative rectangular traffic signs remains unexplored and challenging. Our work is devoted to the topic called "Traffic Sign Understanding (TSU)", which is aimed to parse various traffic signs and generate semantic descriptions for them. To achieve this goal, we propose an end-to-end framework that integrates component detection, content reasoning, and semantic description generation. The component detection module first detects initial components in the sign image. Then the content reasoning module acquires the detailed content of the sign, including final components, their relations, and layout category, which provide local and global information for the subsequent module. In the end, the semantic description generation module mines relational attributes and text semantic attributes from the preceding results, embeds them with the layout categories, and transforms them into semantic descriptions through a dynamic prediction transformer. The three modules are trained jointly in an end-to-end manner for optimizing the overall performance. This method achieves state-of-the-art performance not only in the final semantic description generation stage but also on multiple subtasks of the CASIA-Tencent CTSU Dataset. Abundant ablation experiments are provided to prove the effectiveness of this method.
关键词	Traffic sign understanding Content reasoning Semantic description generation
DOI	10.1007/s11263-023-01912-9
关键词[WOS]	NEURAL-NETWORK
收录类别	SCI
语种	英语
WOS研究方向	Computer Science
WOS类目	Computer Science, Artificial Intelligence
WOS记录号	WOS:001084915000001
出版者	SPRINGER
是否为代表性论文	是
七大方向——子方向分类	人工智能+交通
国重实验室规划方向分类	视觉信息处理
是否有论文关联数据集需要存交	否
引用统计
文献类型	期刊论文
条目标识符	http://ir.ia.ac.cn/handle/173211/54326
专题	多模态人工智能系统全国重点实验室多模态人工智能系统全国重点实验室_模式分析与学习
通讯作者	Guo, Yunfei
作者单位	1.Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China 2.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
第一作者单位	中国科学院自动化研究所
通讯作者单位	中国科学院自动化研究所
推荐引用方式 GB/T 7714	Guo, Yunfei,Feng, Wei,Yin, Fei,et al. SignParser: An End-to-End Framework for Traffic Sign Understanding[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION,2023,132(2):805-821.
APA	Guo, Yunfei,Feng, Wei,Yin, Fei,&Liu, Cheng-Lin.(2023).SignParser: An End-to-End Framework for Traffic Sign Understanding.INTERNATIONAL JOURNAL OF COMPUTER VISION,132(2),805-821.
MLA	Guo, Yunfei,et al."SignParser: An End-to-End Framework for Traffic Sign Understanding".INTERNATIONAL JOURNAL OF COMPUTER VISION 132.2(2023):805-821.

条目包含的文件		下载所有文件
文件名称/大小	文献类型	版本类型	开放类型	使用许可
dd073158-75c1-4130-9（7011KB）	期刊论文	作者接受稿	开放获取	CC BY-NC-SA	浏览下载