CASIA OpenIR

Browse/Search Results:  1-10 of 149

Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation [Journal article]
IEEE Transactions on Multimedia, 2023, Pages: 1-13
Authors:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  View/Download:8/0  |  Submit date:2023/05/03
Text-guided sounding-video generation  Video-audio representation  Contrastive learning  Transformer
Parallel Learning: Overview and Perspective for Computational Learning Across Syn2Real and Sim2Real [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 3, Pages: 603-631
Authors:  Qinghai Miao;  Yisheng Lv;  Min Huang;  Xiao Wang;  Fei-Yue Wang
Adobe PDF(11937Kb)  |  View/Download:35/4  |  Submit date:2023/03/02
Machine learning  parallel learning  parallel systems  sim-to-real  syn-to-real  virtual-to-real  
DAO to Hanoi via DeSci: AI Paradigm Shift from AlphaGo to ChatGPT [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 4, Pages: 877-897
Authors:  Miao, Qinghai (proxy) (contact);  Zheng, Wenbo;  吕, 宜生;  Huang, Min;  Ding, Wenwen;  Wang, Fei-Yue
Adobe PDF(4968Kb)  |  View/Download:31/4  |  Submit date:2023/03/22
ChatGPT, decentralized science (DeSci)  decentralized autonomous organization (DAO)  machine learning  paradigm shift  
BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-term Pose Forecasting [Journal article]
IEEE Transactions on Multimedia, 2023, Volume: Early Access, Issue: Early Access, Pages: Early Access
Authors:  Shentong Mo;  Xin M(辛淼)
Adobe PDF(2209Kb)  |  View/Download:34/0  |  Submit date:2023/04/25
long-term forecasting  spatial-temporal graph transformer  Bayesian transformer  uncertainty estimation  
Pre-training in Medical Data: A Survey [Journal article]
Machine Intelligence Research, 2023, Volume: 20, Issue: 2, Pages: 147-149
Authors:  Yixuan Qiu
Adobe PDF(3559Kb)  |  View/Download:19/1  |  Submit date:2023/04/03
Medical data  pre-training  transfer learning  self-supervised learning  medical image data  electrocardiograms (ECG) data  
Dual-stream Representation Fusion Learning for Accurate Medical Image Segmentation [Journal article]
Engineering Applications of Artificial Intelligence, 2023, Volume: 123, Pages: 106402
Authors:  Xu RT(许镕涛);  Wang CW(王常维);  Xu SB(徐士彪);  Meng WL(孟维亮);  Zhang XP(张晓鹏)
Adobe PDF(1893Kb)  |  View/Download:25/1  |  Submit date:2023/05/18
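Research on Robust Spoken Language Translation Methods (鲁棒的口语翻译方法研究) [Thesis]
2022
Authors:  王世宁
Adobe PDF(3004Kb)  |  View/Download:138/7  |  Submit date:2022/12/13
Spoken language translation  robust neural machine translation  disfluency  contrastive learning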
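Layout Analysis and Text Recognition for Historical Documents (历史文档版面分析与文字识别) [Thesis]
Doctor of Engineering, Institute of Automation, Chinese Academy of Sciences: Institute of Automation, Chinese Academy of Sciences, 2022
Authors:  徐玥
Adobe PDF(34832Kb)  |  View/Download:183/6  |  Submit date:2022/09/20
Layout analysis  text recognition  class-incremental learning  document database  historical documents  fully convolutional neural network  convolutional prototype network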
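Research on Key Technologies for Prediction and Control in Parallel Transportation Systems (平行交通系统中的预测与控制关键技术研究) [Thesis]
Doctor of Engineering, Institute of Automation, Chinese Academy of Sciences: Institute of Automation, Chinese Academy of Sciences, 2022
Authors:  戴星原
Adobe PDF(14868Kb)  |  View/Download:160/8  |  Submit date:2022/10/09
Parallel transportation systems  traffic prediction  traffic control  deep learning  reinforcement learning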
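Research on Intelligent Speech Processing Technology for Meeting Scenarios (会议场景智能语音处理技术研究) [Thesis]
Doctor of Engineering, Institute of Automation, Chinese Academy of Sciences: Institute of Automation, Chinese Academy of Sciences, 2022
Authors:  范志赟
Adobe PDF(3323Kb)  |  View/Download:117/6  |  Submit date:2022/09/15
Meeting scenarios  speech recognition  speaker change point detection  speaker adaptation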