CASIA OpenIR  > 多模态人工智能系统全国重点实验室
Plane Geometry Diagram Parsing
Zhang Ming-Liang1,2; Yin Fei1,2; Hao Yi-Han3; Liu Cheng-Lin1,2
2022-07
Conference NameProceedings of the 31st International Joint Conference on Artificial Intelligence
Pages1636-1643
Conference Date2022-7-24
Conference Place奥地利 维也纳
Abstract

Geometry diagram parsing plays a key role in geometry problem solving, wherein the primitive extraction and relation parsing remain challenging due to the complex layout and between-primitive relationship. In this paper, we propose a powerful diagram parser based on deep learning and graph reasoning. Specifically, a modified instance segmentation method is proposed to extract geometric primitives, and the graph neural network (GNN) is leveraged to realize relation parsing and primitive classification incorporating geometric features and prior knowledge. All the modules are integrated into an end-to-end model called PGDPNet to perform all the sub-tasks simultaneously. In addition, we build a new large-scale geometry diagram dataset named PGDP5K with primitive level annotations. Experiments on PGDP5K and an existing dataset IMP-Geometry3K show that our model outperforms state-of-the-art methods in four sub-tasks remarkably.

Indexed ByEI
Language英语
IS Representative Paper
Sub direction classification文字识别与文档分析
planning direction of the national heavy laboratory认知决策知识体系
Paper associated data
Document Type会议论文
Identifierhttp://ir.ia.ac.cn/handle/173211/55698
Collection多模态人工智能系统全国重点实验室
Corresponding AuthorZhang Ming-Liang
Affiliation1.National Laboratory of Pattern Recognition, Institute of Automation of Chinese Academy of Sciences
2.School of Artificial Intelligence, University of Chinese Academy of Sciences
3.School of Electronic Information Engineering, Beijing Jiaotong University
First Author AffilicationChinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
Corresponding Author AffilicationChinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
Recommended Citation
GB/T 7714
Zhang Ming-Liang,Yin Fei,Hao Yi-Han,et al. Plane Geometry Diagram Parsing[C],2022:1636-1643.
Files in This Item:
File Name/Size DocType Version Access License
0228.pdf(1024KB)会议论文 开放获取CC BY-NC-SAView
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Zhang Ming-Liang]'s Articles
[Yin Fei]'s Articles
[Hao Yi-Han]'s Articles
Baidu academic
Similar articles in Baidu academic
[Zhang Ming-Liang]'s Articles
[Yin Fei]'s Articles
[Hao Yi-Han]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Zhang Ming-Liang]'s Articles
[Yin Fei]'s Articles
[Hao Yi-Han]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: 0228.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.