CASIA OpenIR

Browse/Search results: 712 records in total, showing 1-10

TextFormer: A Query-based End-to-end Text Spotter with Mixed Supervision [Journal article]
Machine Intelligence Research, 2024, Volume: 21, Issue: 4, Pages: 704-717
Authors: Yukun Zhai; Xiaoqiang Zhang; Xiameng Qin; Sanyuan Zhao; Xingping Dong; Jianbing Shen
Adobe PDF (2312 KB)  |  Views/Downloads: 26/8  |  Submitted: 2024/07/18
End-to-end text spotting  arbitrarily-shaped texts  transformer  mixed supervision  multitask modeling  
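Research on Cross-modal Pre-training and Matching Methods for Vision-Language (面向视觉-语言的跨模态预训练与匹配方法研究) [Thesis]
2024
Authors: Chen Yuxin
Adobe PDF (46981 KB)  |  Views/Downloads: 35/2  |  Submitted: 2024/07/11
Vision-language matching  Image-text pre-training  Knowledge distillation  Bidirectional matching evaluation  Token merging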
NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases [Journal article]
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, Pages: 1913-1931
Authors: Zhang Xi (张熙); Feifei Zhang; Changsheng Xu
Adobe PDF (4719 KB)  |  Views/Downloads: 41/10  |  Submitted: 2024/07/08
Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning [Conference paper]
Chengdu, China, 2021-10
Authors: Zhang X (张熙); Feifei Zhang; Changsheng Xu
Adobe PDF (5740 KB)  |  Views/Downloads: 40/9  |  Submitted: 2024/07/08
CGNN: A Compatibility-aware Graph Neural Network for Social Media Bot Detection [Journal article]
IEEE Transactions on Computational Social Systems, 2024, Pages: Early Access
Authors: Huang, Haitao; Tian, Hu; Zheng, Xiaolong; Zhang, Xingwei; Zeng, Dajun; Wang, Feiyue
Adobe PDF (2267 KB)  |  Views/Downloads: 36/14  |  Submitted: 2024/07/08
graph neural network  heterogeneous compatibility  social media bot detection  
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition [Conference paper]
China, 2023.06.08
Authors: Jinzhi Zheng; Ruyi Ji; Libo Zhang; Yanjun Wu; Chen Zhao
Adobe PDF (1516 KB)  |  Views/Downloads: 35/12  |  Submitted: 2024/07/08
Emotion selectable end-to-end text-based speech editing [Journal article]
ARTIFICIAL INTELLIGENCE, 2024, Volume: 329, Pages: 16
Authors: Wang, Tao; Yi, Jiangyan; Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi; Zhang, Chu Yuan
Views/Downloads: 20/0  |  Submitted: 2024/07/03
Emotion selectable  Text-based speech editing  Emotion decoupling  Mask prediction  Few-shot learning  Text-to-speech  
Disentangled Text Representation Learning With Information-Theoretic Perspective for Adversarial Robustness [Journal article]
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, Volume: 32, Pages: 1237-1247
Authors: Zhao, Jiahao; Mao, Wenji; Zeng, Daniel Dajun
Views/Downloads: 16/0  |  Submitted: 2024/07/03
Adversarial robustness  variation of information  disentangled text representation learning  
SELF-SUPERVISED MATCHING NETWORK BASED ON FREQUENCY DOMAIN INFORMATION GUIDANCE FOR REMOTE SENSING IMAGE REGISTRATION [Conference paper]
Athens, Greece, Jul 7, 2024 - Jul 12, 2024
Authors: Zhou YX (周雨欣); Wan L (万玲); Ma L (马雷)
Adobe PDF (11699 KB)  |  Views/Downloads: 22/9  |  Submitted: 2024/07/01
MapGuide: A Simple yet Effective Method to Reconstruct Continuous Language from Brain Activities [Conference paper]
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Mexico City, Mexico, 2024-6
Authors: Xinpei, Zhao; Jingyuan, Sun; Shaonan, Wang; Jing, Ye; Xiaohan, Zhang; Chengqing, Zong
Adobe PDF (843 KB)  |  Views/Downloads: 39/12  |  Submitted: 2024/06/27
neural decoding