| Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation  |  Journal article  |  IEEE Transactions on Multimedia, 2023, Pages: 1-13  |  Authors: Liu, Jiawei; Wang, Weining; Chen, Sihan; Zhu, Xinxin; Liu, Jing
Adobe PDF(7741Kb)  |  View/Download:8/0  |  Submit date:2023/05/03  |  Keywords: Text-guided sounding-video generation; Video-audio representation; Contrastive learning; Transformer |
| Parallel Learning: Overview and Perspective for Computational Learning Across Syn2Real and Sim2Real  |  Journal article  |  IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 3, Pages: 603-631  |  Authors: Qinghai Miao; Yisheng Lv; Min Huang; Xiao Wang; Fei-Yue Wang
Adobe PDF(11937Kb)  |  View/Download:35/4  |  Submit date:2023/03/02  |  Keywords: Machine learning; parallel learning; parallel systems; sim-to-real; syn-to-real; virtual-to-real |
| DAO to Hanoi via DeSci: AI Paradigm Shift from AlphaGo to ChatGPT  |  Journal article  |  IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 4, Pages: 877-897  |  Authors: Miao, Qinghai (proxy) (contact); Zheng, Wenbo; 吕, 宜生; Huang, Min; Ding, Wenwen; Wang, Fei-Yue
Adobe PDF(4968Kb)  |  View/Download:31/4  |  Submit date:2023/03/22  |  Keywords: ChatGPT; decentralized science (DeSci); decentralized autonomous organization (DAO); machine learning; paradigm shift |
| BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-term Pose Forecasting  |  Journal article  |  IEEE Transactions on Multimedia, 2023, Volume: Early Access, Issue: Early Access, Pages: Early Access  |  Authors: Shentong Mo; Xin M(辛淼)
Adobe PDF(2209Kb)  |  View/Download:34/0  |  Submit date:2023/04/25  |  Keywords: long-term forecasting; spatial-temporal graph transformer; Bayesian transformer; uncertainty estimation |
| Pre-training in Medical Data: A Survey  |  Journal article  |  Machine Intelligence Research, 2023, Volume: 20, Issue: 2, Pages: 147-149  |  Authors: Yixuan Qiu
Adobe PDF(3559Kb)  |  View/Download:19/1  |  Submit date:2023/04/03  |  Keywords: Medical data; pre-training; transfer learning; self-supervised learning; medical image data; electrocardiograms (ECG) data |
| Dual-stream Representation Fusion Learning for accurate medical image segmentation  |  Journal article  |  Engineering Applications of Artificial Intelligence, 2023, Volume: 123, Pages: 106402  |  Authors: Xu RT(许镕涛); Wang CW(王常维); Xu SB(徐士彪); Meng WL(孟维亮); Zhang XP(张晓鹏)
Adobe PDF(1893Kb)  |  View/Download:25/1  |  Submit date:2023/05/18 |
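| Research on Robust Spoken Language Translation Methods (鲁棒的口语翻译方法研究)  |  Thesis  |  2022  |  Authors: 王世宁
Adobe PDF(3004Kb)  |  View/Download:138/7  |  Submit date:2022/12/13  |  Keywords: spoken language translation; robust neural machine translation; disfluency; contrastive learning |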
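| Layout Analysis and Text Recognition for Historical Documents (历史文档版面分析与文字识别)  |  Thesis  |  Doctor of Engineering, Institute of Automation, Chinese Academy of Sciences, 2022  |  Authors: 徐玥
Adobe PDF(34832Kb)  |  View/Download:183/6  |  Submit date:2022/09/20  |  Keywords: layout analysis; text recognition; class-incremental learning; document database; historical documents; fully convolutional neural network; convolutional prototype network |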
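| Research on Key Technologies for Prediction and Control in Parallel Transportation Systems (平行交通系统中的预测与控制关键技术研究)  |  Thesis  |  Doctor of Engineering, Institute of Automation, Chinese Academy of Sciences, 2022  |  Authors: 戴星原
Adobe PDF(14868Kb)  |  View/Download:160/8  |  Submit date:2022/10/09  |  Keywords: parallel transportation systems; traffic prediction; traffic control; deep learning; reinforcement learning |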
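| Research on Intelligent Speech Processing Technology for Meeting Scenarios (会议场景智能语音处理技术研究)  |  Thesis  |  Doctor of Engineering, Institute of Automation, Chinese Academy of Sciences, 2022  |  Authors: 范志赟
Adobe PDF(3323Kb)  |  View/Download:117/6  |  Submit date:2022/09/15  |  Keywords: meeting scenarios; speech recognition; speaker change detection; speaker adaptation |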