“机器智能、系统优化与最优决策”专刊前言

CASIA OpenIR > 多模态人工智能系统全国重点实验室 > 复杂系统智能机理与平行控制团队

	“机器智能、系统优化与最优决策”专刊前言
	王成红; 孙长银; 苏剑波; 周彤; 赵东斌; 胡跃明
发表期刊	控制理论与应用
	2016
卷号	33 期号:12 页码:1553-1554
其他摘要	2016年3月, 谷歌公司开发的计算机程序AlphaGo(初弈号)在韩国首都首尔挑战当今世界顶级棋手 —–韩国职业九段李世石(Lee Sedol), 并最终取得4胜1负的令世界震惊战绩. 这标志着人工智能方法已经能够在复杂的棋类博弈游戏中达到匹敌、甚至超越人类的水平. 其基本原理是将具有“感知”能力的深度学习(deep learning)和具有“决策”能力的强化学习(reinforcement learning)紧密结合, 构成深度强化学习(deep reinforcement learning)算法, 并与蒙特卡罗树搜索结合. 它极大地减少了目标优化过程的计算量, 提升了对棋局估计的准确度. ; In March 2016, AlphaGo, a computer program developed by Google Corporation, challenged Lee Sedol, Korea's top nine player in the Korean capital, to challenge the world record of 4-1 in Korea's capital Seoul This signifies that artificial intelligence methods have been able to match or even surpass humankind in complex chess game games whose basic principle is to combine deep learning with "perceived" abilities and "decision-making" abilities Reinforcement learning is tightly coupled to form a deep reinforcement learning algorithm that is combined with Monte Carlo tree search, which greatly reduces the computational complexity of the goal optimization process and improves the accuracy of the game estimate.
关键词	机器智能系统优化最优决策
文献类型	期刊论文
条目标识符	http://ir.ia.ac.cn/handle/173211/19324
专题	多模态人工智能系统全国重点实验室_复杂系统智能机理与平行控制团队
推荐引用方式 GB/T 7714	王成红,孙长银,苏剑波,等. “机器智能、系统优化与最优决策”专刊前言[J]. 控制理论与应用,2016,33(12):1553-1554.
APA	王成红,孙长银,苏剑波,周彤,赵东斌,&胡跃明.(2016).“机器智能、系统优化与最优决策”专刊前言.控制理论与应用,33(12),1553-1554.
MLA	王成红,et al."“机器智能、系统优化与最优决策”专刊前言".控制理论与应用 33.12(2016):1553-1554.

条目包含的文件		下载所有文件
文件名称/大小	文献类型	版本类型	开放类型	使用许可
“机器智能、系统优化与最优决策”专刊前言（164KB）	期刊论文	作者接受稿	开放获取	CC BY-NC-SA	浏览下载