CASIA OpenIR  > 多模态人工智能系统全国重点实验室
A Multi-Modal Neural Geometric Solver with Textual Clauses Parsed from Diagram
Zhang Ming-Liang1,2; Yin Fei1,2; Liu Cheng-Lin1,2
2023-07
Conference NameProceedings of the 32nd International Joint Conference on Artificial Intelligence
Pages3374-3382
Conference Date2023-7-19
Conference Place中国 澳门
Abstract

Geometry problem solving (GPS) is a high-level mathematical reasoning requiring the capacities of multi-modal fusion and geometric knowledge application. Recently, neural solvers have shown great potential in GPS but still be short in diagram presentation and modal fusion. In this work, we convert diagrams into basic textual clauses to describe diagram features effectively, and propose a new neural solver called PGPSNet to fuse multimodal information efficiently. Combining structural and semantic pre-training, data augmentation and self-limited decoding, PGPSNet is endowed with rich knowledge of geometry theorems and geometric representation, and therefore promotes geometric understanding and reasoning. In addition, to facilitate the research of GPS, we build a new large-scale and fine-annotated GPS dataset named PGPS9K, labeled with both fine-grained diagram annotation and interpretable solution program. Experiments on PGPS9K and an existing dataset Geometry3K validate the superiority of our method over the state-of-the-art neural solvers.

Indexed ByEI
Language英语
IS Representative Paper
Sub direction classification知识表示与推理
planning direction of the national heavy laboratory认知决策知识体系
Paper associated data
Document Type会议论文
Identifierhttp://ir.ia.ac.cn/handle/173211/55697
Collection多模态人工智能系统全国重点实验室
Corresponding AuthorLiu Cheng-Lin
Affiliation1.MAIS, Institute of Automation of Chinese Academy of Sciences
2.School of Artificial Intelligence, University of Chinese Academy of Sciences
Recommended Citation
GB/T 7714
Zhang Ming-Liang,Yin Fei,Liu Cheng-Lin. A Multi-Modal Neural Geometric Solver with Textual Clauses Parsed from Diagram[C],2023:3374-3382.
Files in This Item: Download All
File Name/Size DocType Version Access License
0376.pdf(1110KB)会议论文 开放获取CC BY-NC-SAView Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Zhang Ming-Liang]'s Articles
[Yin Fei]'s Articles
[Liu Cheng-Lin]'s Articles
Baidu academic
Similar articles in Baidu academic
[Zhang Ming-Liang]'s Articles
[Yin Fei]'s Articles
[Liu Cheng-Lin]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Zhang Ming-Liang]'s Articles
[Yin Fei]'s Articles
[Liu Cheng-Lin]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: 0376.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.