CASIA OpenIR

Browse/Search results: 52 items in total, showing items 1-10

Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors: Ma, Cong; Han, Xu; Wu, Linghui; Zhang, Yaping; Zhao, Yang; Zhou, Yu; Zong, Chengqing
Adobe PDF(6551Kb)  |  Views/Downloads: 20/9  |  Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (Journal article)
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF(2371Kb)  |  Views/Downloads: 51/17  |  Submitted: 2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
A New Lightweight Script Independent Scene Text Style Transfer Network (Journal article)
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, Pages: 29
Authors: Shivakumara, Palaiahnakote; Roy, Ayush; Nandanwar, Lokesh; Pal, Umapada; Lu, Yue; Liu, Cheng-Lin
Views/Downloads: 48/0  |  Submitted: 2024/02/22
Text detection  style transfer  CNN models  multi-lingual transfer  
Coarse Mask Guided Interactive Object Segmentation (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 5808-5822
Authors: Li, Jing; Fan, Junsong; Wang, Yuxi; Yang, Yuran; Zhang, Zhaoxiang
Adobe PDF(4323Kb)  |  Views/Downloads: 55/3  |  Submitted: 2024/02/22
Segmentation  interactive  transformer  annotation tool  
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors: Liu, Yunfan; Li, Qi; Deng, Qiyao; Sun, Zhenan; Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  Views/Downloads: 91/23  |  Submitted: 2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Physics-Based Modeling and Fluttering Dynamic Process Simulation for Catkins (Journal article)
FORESTS, 2023, Volume: 14, Issue: 12, Pages: 28
Authors: Zhang, Jiaxiu; Yang, Meng; Xi, Benye; Duan, Jie; Huang, Qingqing; Meng, Weiliang
Adobe PDF(10070Kb)  |  Views/Downloads: 74/10  |  Submitted: 2024/02/22
clustering  catkin modeling  wind field simulation  catkin control  
Robust Video-Text Retrieval Via Noisy Pair Calibration (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 8632-8645
Authors: Zhang, Huaiwen; Yang, Yang; Qi, Fan; Qian, Shengsheng; Xu, Changsheng
Views/Downloads: 54/0  |  Submitted: 2024/02/22
Noise calibration  uncertainty  video text retrieval  
A Graded Assessment System for Parkinson's Upper-Limb Bradykinesia Based on a Temporal Convolutional Network Model (Journal article)
IEEE SENSORS JOURNAL, 2023, Volume: 23, Issue: 23, Pages: 29283-29292
Authors: Tong, Lina; Liu, Dai-Song; Peng, Liang; Hao, Hong-Lin; Wang, Chen
Adobe PDF(9425Kb)  |  Views/Downloads: 68/8  |  Submitted: 2024/02/21
Bradykinesia grade  inertial sensors  Parkinson's disease (PD)  temporal convolutional network (TCN)  wearable device  
Emotion-Aware Music Driven Movie Montage (Journal article)
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, Volume: 38, Issue: 3, Pages: 540-553
Authors: Liu, Wu-Qin; Lin, Min-Xuan; Huang, Hai-Bin; Ma, Chong-Yang; Song, Yu; Dong, Wei-Ming; Xu, Chang-Sheng
Views/Downloads: 133/0  |  Submitted: 2023/12/21
movie montage  emotion analysis  audio-visual modality  contrastive learning  
Artificial intelligence for automatic surgical phase recognition of laparoscopic gastrectomy in gastric cancer (Journal article)
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, Pages: 9
Authors: Zhai, Yuhao; Chen, Zhen; Zheng, Zhi; Wang, Xi; Yan, Xiaosheng; Liu, Xiaoye; Yin, Jie; Wang, Jinqiao; Zhang, Jun
Views/Downloads: 117/0  |  Submitted: 2023/12/21
Artificial intelligence  Gastric cancer  Surgical phase