CASIA OpenIR

浏览/检索结果: 共139条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:35/18  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms 会议论文
, Taiyuan, Shanxi, China, 2024-07-27
作者:  Zhang, Chu Yuan;  Yi, Jiangyan;  Tao, Jianhua;  Wang, Chenglong;  Yan, Xinrui
Adobe PDF(2254Kb)  |  收藏  |  浏览/下载:28/12  |  提交时间:2024/06/26
The survey on multi-source data fusion in cyber-physical-social systems: Foundational infrastructure for industrial metaverses and industries 5.0 期刊论文
Information Fusion, 2024, 卷号: 107, 页码: 1-16
作者:  Xiao Wang;  Yutong Wang;  Jing Yang;  Xiaofeng Jia;  Lijun Li;  Weiping Ding;  Fei-Yue Wang
Adobe PDF(4446Kb)  |  收藏  |  浏览/下载:49/8  |  提交时间:2024/06/06
Multi-source data fusion  CPSS  Industrial metaverses  Parallel manufacturing  Social manufacturing  
GraphMLLM: A Graph-based Multi-level Layout Language-independent Model for Document Understanding 会议论文
, 希腊雅典, 2024-09
作者:  He-Sen Dai;  Xiao-Hui Li;  Fei Yin;  Xudong Yan;  Shuqi Mei;  Cheng-Lin Liu
Adobe PDF(967Kb)  |  收藏  |  浏览/下载:55/13  |  提交时间:2024/06/05
Visual information extraction  Self-supervised pre-training  Multi-level page layouts  
Health and Senior Care Video Moment Localization With Procedure Knowledge Distillation 会议论文
, Istanbul, Turkiye, Dec 5-8
作者:  Chaochen Wu;  Meiyun Zuo;  Guan Luo;  Yuna Jiang
Adobe PDF(3140Kb)  |  收藏  |  浏览/下载:47/18  |  提交时间:2024/06/05
Prompting Large Language Models for Automatic Question Tagging 期刊论文
Machine Intelligence Research, 2024, 页码: 0
作者:  Nuojia Xu;  Dizhan Xue;  Shengsheng Qian;  Quan Fang;  Jun Hu
Adobe PDF(1493Kb)  |  收藏  |  浏览/下载:47/20  |  提交时间:2024/06/04
Community Question Answering  Machine Learning  Large Language Model  Prompt Learning  Question Tagging  
Tri-relational multi-faceted graph neural networks for automatic question tagging 期刊论文
Neurocomputing, 2024, 卷号: 576, 页码: 127250
作者:  Nuojia Xu;  Jun Hu;  Quan Fang;  Dizhan Xue;  Yongxi Li;  Shengsheng Qian
Adobe PDF(2105Kb)  |  收藏  |  浏览/下载:49/21  |  提交时间:2024/06/04
Graph Neural Networks  Community Question Answering  Question Tagging  
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 4, 页码: 2041-2055
作者:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  收藏  |  浏览/下载:48/1  |  提交时间:2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
Hierarchical Attention Network for Open-Set Fine-Grained Recognition 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 页码: 1-14
作者:  Jiayin, Sun;  Hong, Wang;  Qiulei, Dong
Adobe PDF(2596Kb)  |  收藏  |  浏览/下载:60/18  |  提交时间:2024/05/28
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 页码: 1-15
作者:  Sun, Jianxin;  Deng, Qiyao;  Li, Qi;  Sun, Muyi;  Liu, Yunfan;  Sun, Zhenan
Adobe PDF(16839Kb)  |  收藏  |  浏览/下载:93/14  |  提交时间:2024/02/23