CASIA OpenIR  > 毕业生  > 硕士学位论文
Alternative TitleBilingual Multi-modal Dialogue System Application Server Based on SIP
Thesis Advisor徐波
Degree Grantor中国科学院研究生院
Place of Conferral中国科学院自动化研究所
Degree Discipline软件工程
Keyword多模态 Sip Xml 应用服务器 Multi-modal Sip Xml Application Server
Abstract在全球化的背景下,基于双语翻译的多模态系统正随着互联网技术以及多媒体技术的飞速发展,成为了人们最热衷的应用服务之一,有着不可估量的前景。双语多模态系统主要由两部分组成,即媒体服务器和应用服务器。媒体服务器为多模态系统提供多种媒体引擎,包含了机器翻译,识别,语音合成,会话管理等功能;而应用服务器是系统的核心,更是起到了举足轻重的作用,他为整个系统提供了控制命令以及业务逻辑。 本文以此为背景展开,分析了国内外多模态系统的发展现状,介绍了基于双语多模态系统的特定需求,并基于IMS语音服务器的架构给出了具体的网络实现方案。同时SIP(Session Initiation Protocol,会话初始协议)作为IETF制定的多媒体会话控制信令协议,其独立于底层协议,简洁、灵活、可扩展性强等特点也非常适合用于多模态系统应用服务器的开发。整个系统采用分布式体系结构,信令和媒体流完全分离,具有高度的灵活性和扩展性。 文中重点介绍了SIP应用服务器实现的部分。为了控制机器翻译、识别、语音合成、会话管理等多媒体引擎,采用了MSCML,STML等XML扩展标记语言,设计和实现了SDML标记语言,并为不同XML语言编写了解析器。 系统的SIP协议栈基于开源项目Asterisk来实现。系统遵循SIP标准协议(RFC3261)和相关草案,并根据系统的需要对SIP协议进行了适当扩展。这也使得该系统具有良好的通用性。 【关键字】 多模态, sip, xml, 应用服务器
Other AbstractUnder the background of globalization, along with the rapid development of internet technology as well as multimedia technology, bilingual multi-modal dialogue system has become one of the most popular services, which has inestimable prospect. Multilingual multi-modal dialogue system is mainly composed of media server and application server. Media server provides kinds of multimedia engines such as machine translation, speech recognition, speech synthesis, and session management. Application server is the core of the system and also plays a pivotal role that provids control commands and service logic for the system. This article takes this as the background. It analyses the present status of multi-modal system, introduces the specific demand for multilingual speech, and gives a specific network scheme based on IMS server construction. Meanwhile, as signal control protocol for multimedia session developed by IETF, SIP (Session Initiation Protocol) gets more preponderant and it is simple, flexible and scalable. So the system uses SIP protocol. The system adopts a distributed architecture with a high degree of flexibility and scalability. This article focuses on the achievement of the sip application server, using the MSCML, STML and the other xml extensible markup languages in order to control the machine translation, speech recognition and synthesis and session management engines. Designed and implemented SDML markup language and written XML parser for different XML languages. SIP stack achieved based on Asterisk. This system follows SIP protocol (RFC3261) and its associated drafts, makes the suitable expansion according to the needs and with good versatility. 【keywords】 multi-modal, sip, xml, application server
Other Identifier200828009029043
Document Type学位论文
Recommended Citation
GB/T 7714
高鹏飞. 基于SIP的双语多模态系统应用服务器[D]. 中国科学院自动化研究所. 中国科学院研究生院,2011.
Files in This Item:
File Name/Size DocType Version Access License
CASIA_20082800902904(1537KB) 暂不开放CC BY-NC-SAApplication Full Text
Related Services
Recommend this item
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[高鹏飞]'s Articles
Baidu academic
Similar articles in Baidu academic
[高鹏飞]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[高鹏飞]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.