CASIA OpenIR

浏览/检索结果: 共54条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition 会议论文
, Washington D.C., USA, 2023-2-9
作者:  Qingyu Wang;  Tielin Zhang;  Minglun Han;  Yi Wang;  Duzhen Zhang;  Bo Xu
Adobe PDF(1714Kb)  |  收藏  |  浏览/下载:124/41  |  提交时间:2023/06/20
Knowledge Transfer from Pre-Trained Language Models to CIF-Based Speech Recognizers via Hierarchical Distillation 会议论文
, Dublin, Ireland, 2023-8-20
作者:  Minglun Han;  Feilong Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(563Kb)  |  收藏  |  浏览/下载:132/50  |  提交时间:2023/06/20
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:111/27  |  提交时间:2023/06/21
Skeleton-aware Implicit Function for Single-view Human Reconstruction 期刊论文
CAAI Transactions on Intelligence Technology, 2023, 页码: 379-389
作者:  Pengpeng Liu;  Guixuan Zhang;  Shuwu Zhang;  Yuanhao Li;  Zhi Zeng
Adobe PDF(1470Kb)  |  收藏  |  浏览/下载:94/26  |  提交时间:2024/01/12
Body Pose, Human Reconstruction, Implicit Function, Parametric Body Model, Single-view  
Relative Pose Estimation for RGB-D Human Input Scans via Implicit Function Reconstruction 期刊论文
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 卷号: 2022, 页码: 9
作者:  Liu, Pengpeng;  Yu, Tao;  Zeng, Zhi;  Liu, Yebin;  Zhang, Guixuan;  Song, Zhen
Adobe PDF(1262Kb)  |  收藏  |  浏览/下载:194/21  |  提交时间:2022/06/06
Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog 会议论文
, Lisboa, Portugal, October 10–14, 2022
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:227/143  |  提交时间:2023/06/05
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection 会议论文
, Singapore, Singapore, 2022.05
作者:  Minglun Han;  Linhao Dong;  Zhenlin Liang;  Meng Cai;  Shiyu Zhou;  Zejun Ma;  Bo Xu
Adobe PDF(463Kb)  |  收藏  |  浏览/下载:146/43  |  提交时间:2023/05/29
Automatic Speech Recognition  Context Biasing  Speech Recognition Customization  Continuous Integrate-and-Fire Mechanism  
IMPROVING CROSS-MODAL UNDERSTANDING IN VISUAL DIALOG VIA CONTRASTIVE LEARNING 会议论文
, Singapore, 2022.5
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:174/82  |  提交时间:2023/06/07
DFAN: Dual Feature Aggregation Network for Lightweight Image Super-Resolution 期刊论文
Wireless Communications and Mobile Computing, 2022, 期号: 2022, 页码: 8116846
作者:  Li, Shang;  Zhang, Guixuan;  Luo, Zhengxiong;  Liu, Jie
Adobe PDF(2507Kb)  |  收藏  |  浏览/下载:176/29  |  提交时间:2022/04/06
Super-resolution  Lightweight  Feature aggregation  
Enhancing Feature Point-Based Video Watermarking against Geometric Attacks with Template 期刊论文
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 期号: 2021, 页码: 9
作者:  Lv, Zhongze;  Guan, Hu;  Huang, Ying;  Zhang, Shuwu;  Zheng, Yang
Adobe PDF(889Kb)  |  收藏  |  浏览/下载:239/46  |  提交时间:2022/01/27
geometric attacks, template, feature point, video watermarking