CASIA OpenIR

浏览/检索结果: 共99条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:32/17  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
A robust transformer-based pipeline of 3D cell alignment, denoise and instance segmentation on electron microscopy sequence images 期刊论文
Journal of Plant Physiology, 2024, 页码: 154236
作者:  Jiazheng, Liu;  Yafeng, Zheng;  Limei, Lin;  Jingyue, Guo;  Yanan, Lv;  Jingbin, Yuan;  Hao, Zhai;  Xi, Chen;  Lijun, Shen;  LinLin, Li;  Shunong, Bai;  Hua, Han
Adobe PDF(15549Kb)  |  收藏  |  浏览/下载:36/10  |  提交时间:2024/06/11
Health and Senior Care Video Moment Localization With Procedure Knowledge Distillation 会议论文
, Istanbul, Turkiye, Dec 5-8
作者:  Chaochen Wu;  Meiyun Zuo;  Guan Luo;  Yuna Jiang
Adobe PDF(3140Kb)  |  收藏  |  浏览/下载:43/18  |  提交时间:2024/06/05
Dual-Path Transformer for 3D Human Pose Estimation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 5, 页码: 3260-3270
作者:  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF(2410Kb)  |  收藏  |  浏览/下载:47/20  |  提交时间:2024/06/03
Transformers  Three-dimensional displays  Pose estimation  Task analysis  Solid modeling  Feature extraction  Benchmark testing  3D human pose estimation  transformer  motion  distillation  
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 4, 页码: 2041-2055
作者:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  收藏  |  浏览/下载:45/1  |  提交时间:2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
Cross-Modal Prototype Learning for Zero-Shot Handwritten Character Recognition 期刊论文
Pattern Recognition, 2022, 卷号: 131, 页码: 108859
作者:  Ao, Xiang;  Zhang, Xu-Yao;  Liu, Cheng-Lin
Adobe PDF(3111Kb)  |  收藏  |  浏览/下载:58/25  |  提交时间:2024/05/30
Hierarchical Attention Network for Open-Set Fine-Grained Recognition 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 页码: 1-14
作者:  Jiayin, Sun;  Hong, Wang;  Qiulei, Dong
Adobe PDF(2596Kb)  |  收藏  |  浏览/下载:59/18  |  提交时间:2024/05/28
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:102/28  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:179/27  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
SignParser: An End-to-End Framework for Traffic Sign Understanding 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 卷号: 132, 期号: 2, 页码: 805-821
作者:  Guo, Yunfei;  Feng, Wei;  Yin, Fei;  Liu, Cheng-Lin
Adobe PDF(7011Kb)  |  收藏  |  浏览/下载:133/7  |  提交时间:2023/12/21
Traffic sign understanding  Content reasoning  Semantic description generation