面向自动驾驶的平行视觉关键问题研究 | |
王建功 | |
2023-06 | |
页数 | 160 |
学位类型 | 博士 |
中文摘要 | 智能车对周边环境的精准感知理解是后续规划与控制的基础,也决定了自动驾驶系统的能力上限。而视觉感知作为人类以及众多智能系统获取环境信息的主要来源之一,在自动驾驶感知理解的过程中发挥着关键作用。目前,基于深度学习的计算机视觉技术在自动驾驶感知任务中应用广泛,总体取得了十分不错的成绩。然而,由于深度学习需要借助大规模的训练数据来提升模型感知精度,同时自动驾驶任务对系统的安全与可靠性也有着严格的要求,而已有的视觉感知方法往往面临着训练数据量不足、对长尾场景的识别能力不够等众多亟需解决的难题。基于平行系统提出的平行视觉是一个能够填补数据鸿沟、提升感知可靠性的新型视觉理论框架。 本文以自动驾驶为应用场景,研究平行视觉中的多项关键问题。从数据生成、算法优化、数据与算法闭环、系统优化训练四个角度构建了一个有效且可靠的自动驾驶平行视觉系统。最终缓解了交通场景训练数据不足与长尾场景感知能力欠佳的问题、提高了视觉系统的准确度与稳定性。其中,可控的图像数据生成既有助于解决训练数据量不足的问题同时又是构建数据算法闭环的基础。算法的优化学习作为视觉感知系统的关键任务,具有提高系统感知能力的作用。数据与算法的闭环则是构建完整平行视觉系统的核心,在实现数据生成与算法优化的协同趋优的同时能够提高系统鲁棒性与可靠性。最后,针对完整的平行视觉系统,科学高效的平行训练方法可以进一步提高系统感知精度与稳定性。本文的主要工作如下:
本文针对自动驾驶中平行视觉系统的多个关键问题,提出了多项具体的关键方法,对于平行视觉在自动驾驶中的实际应用具有重要意义。 |
英文摘要 | The intelligent vehicle's accurate perception and understanding of the surrounding environment is the basis for subsequent planning and control. As a result, it is crucial in determining the maximum capabilities of an autonomous driving system. As one of the main sources of environmental information for humans and many intelligent systems, visual perception plays a critical role in the process of autonomous driving perception and understanding. Currently, computer vision technology based on deep learning is widely used in autonomous driving perception tasks and has generally achieved good results. However, due to the need for large-scale training data to improve the accuracy and the strict requirements of system safety and reliability, the existing visual perception methods often face many challenges that urgently need to be addressed, such as insufficient training data and inadequate recognition capabilities for long-tail scenarios. Parallel vision based on parallel systems is a new visual theoretical framework that can fill the data gap and improve the reliability of perception. This thesis focuses on multiple key issues related to parallel vision in the application scenario of autonomous driving. An effective and reliable parallel vision system for autonomous driving is constructed from four perspectives: data generation, algorithm optimization, data-algorithm closed loop, and system training. This eventually alleviates the problems of insufficient training data in traffic scenarios and inadequate perception capabilities in long-tail scenarios, improving the accuracy and stability of the visual system. Among these, controllable image data generation helps solve the problem of insufficient training data while also serving as the basis for constructing a closed-loop between data and algorithm. The optimization learning of models is the key task of the visual perception system and contributes to improving the system's perception capabilities. The closed-loop between data and algorithms is the core of constructing a complete parallel vision system, which can enhance the system's robustness and reliability while achieving the synergistic optimization of data generation and algorithm optimization. Finally, for a complete parallel vision system, scientific and efficient parallel training methods can further improve the system's perception accuracy and stability. The main work and contributions of this thesis include the following aspects:
This thesis addresses several key issues of parallel vision systems in autonomous driving and proposes several specific key approaches that are important for the practical application of parallel vision in autonomous driving. |
关键词 | 平行视觉 自动驾驶 人工系统 计算实验 平行执行 平行训练 |
语种 | 中文 |
七大方向——子方向分类 | 图像视频处理与分析 |
国重实验室规划方向分类 | 视觉信息处理 |
是否有论文关联数据集需要存交 | 是 |
文献类型 | 学位论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/52102 |
专题 | 毕业生_博士学位论文 |
推荐引用方式 GB/T 7714 | 王建功. 面向自动驾驶的平行视觉关键问题研究[D],2023. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
面向自动驾驶的平行视觉关键问题研究.pd(19546KB) | 学位论文 | 限制开放 | CC BY-NC-SA |
个性服务 |
推荐该条目 |
保存到收藏夹 |
查看访问统计 |
导出为Endnote文件 |
谷歌学术 |
谷歌学术中相似的文章 |
[王建功]的文章 |
百度学术 |
百度学术中相似的文章 |
[王建功]的文章 |
必应学术 |
必应学术中相似的文章 |
[王建功]的文章 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论