CASIA OpenIR

浏览/检索结果: 共19条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:145/29  |  提交时间:2023/06/21
IMPROVING CROSS-MODAL UNDERSTANDING IN VISUAL DIALOG VIA CONTRASTIVE LEARNING 会议论文
, Singapore, 2022.5
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:212/91  |  提交时间:2023/06/07
Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog 会议论文
, Lisboa, Portugal, October 10–14, 2022
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:243/147  |  提交时间:2023/06/05
Automatic Curriculum Learning for Large-Scale Cooperative Multiagent Systems 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2023, 卷号: 7, 期号: 3, 页码: 912-930
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(4728Kb)  |  收藏  |  浏览/下载:294/85  |  提交时间:2023/06/02
Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 10, 页码: 6728-6740
作者:  Liu, Jierui;  Cao, Zhiqiang;  Tang, Yingbo;  Liu, Xilong;  Tan, Min
Adobe PDF(22124Kb)  |  收藏  |  浏览/下载:250/2  |  提交时间:2022/11/14
Shape  Three-dimensional displays  Cognition  Pose estimation  Feature extraction  Decoding  Solid modeling  Category-level  6D object pose estimation  structure encoder  reasoning attention  
SPEAKER-AWARE SPEECH-TRANSFORMER 会议论文
, 新加坡, 2019-12-14
作者:  Fan ZY(范志赟);  Li J(李杰);  Zhou SY(周世玉);  Xu B(徐波)
Adobe PDF(361Kb)  |  收藏  |  浏览/下载:161/53  |  提交时间:2022/09/17
Speech-Transformer, speaker adaptation, end-to-end speech recognition, speaker aware training, i-vector  
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training 会议论文
0, 线上会议, 2021-7-18
作者:  Zhang Peng;  Xu Jiaming;  Shi Jing;  Hao Yunzhe;  Qin Lei;  Xu Bo
Adobe PDF(1900Kb)  |  收藏  |  浏览/下载:225/58  |  提交时间:2021/06/21
audio-visual speech separation  robust  adversarial training method  time-domain approach  
行人再识别的特征表达研究 学位论文
工学博士, 中国科学院自动化研究所: 中国科学院自动化研究所, 2021
作者:  杨文杰
Adobe PDF(33650Kb)  |  收藏  |  浏览/下载:224/4  |  提交时间:2021/06/21
行人再识别  表达学习  行人遮挡  行人检测  
Towards Rich Feature Discovery with Class Activation Maps Augmentation for Person Re-Identification 会议论文
, Long Beach, United States, June 16-20
作者:  Yang, Wenjie;  Huang, Houjing;  Zhang, Zhang;  Chen, Xiaotang;  Huang, Kaiqi
Adobe PDF(1481Kb)  |  收藏  |  浏览/下载:171/43  |  提交时间:2021/06/21
Spatial Preserved Graph Convolution Networks for Person Re-identification 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 卷号: 16, 期号: 1, 页码: 14
作者:  Li, Zhaoju;  Zhou, Zongwei;  Jiang, Nan;  Han, Zhenjun;  Xing, Junliang;  Jiao, Jianbin
Adobe PDF(937Kb)  |  收藏  |  浏览/下载:220/39  |  提交时间:2021/01/06
Person re-identification  graph convolution  feature embedding