CASIA OpenIR

浏览/检索结果: 共15条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:35/18  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Personalized Graph Generation for Monocular 3D Human Pose and Shape Estimation 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 卷号: 34, 期号: 4, 页码: 2399-2413
作者:  Junxing Hu;  Hongwen Zhang;  Yunlong Wang;  Min Ren;  Zhenan Sun
Adobe PDF(4963Kb)  |  收藏  |  浏览/下载:58/17  |  提交时间:2024/05/30
3D Human Pose and Shape Estimation  Personalized Graph Generation  Body-Oriented Adjacency Matrix  
Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 卷号: 33, 期号: 3, 页码: 1335 - 1348
作者:  Fu Yujie;  Zhang Pengju;  Liu Bingxi;  Rong Zheng;  Wu Yihong
Adobe PDF(4879Kb)  |  收藏  |  浏览/下载:64/27  |  提交时间:2024/05/28
Image Matching  Large Scale Changes  Scale Difference Reduction  Scale Ratio Estimation  Covisibility-attention-reinforced Matching Module  
Transformer-based Spiking Neural Networks for Multimodal Audio-Visual Classification 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2023, 页码: DOI 10.1109/TCDS.2023.3327081
作者:  Guo LY(郭凌月);  Zeyu Gao;  Jinye Qu;  Suiwu Zheng;  Runhao Jiang;  Yanfeng Lu;  Hong Qiao
Adobe PDF(3922Kb)  |  收藏  |  浏览/下载:49/16  |  提交时间:2024/05/28
Towards Prior Gap and Representation Gap for Long-tailed Recognition, Pattern Recognition 期刊论文
Pattern Recognition, 2023, 卷号: 133, 期号: 109012, 页码: 109012
作者:  Zhang Ming-Liang;  Zhang Xu-Yao;  Wang Chang;  Liu Cheng-Lin
Adobe PDF(2258Kb)  |  收藏  |  浏览/下载:130/33  |  提交时间:2024/04/03
Long-tailed learning  Prior gap  Representation gap  Image recognition  
Exploring Explicitly Disentangled Features for Domain Generalization 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 11, 页码: 6360-6373
作者:  Li, Jingwei;  Li, Yuan;  Wang, Huanjie;  Liu, Chengbao;  Tan, Jie
Adobe PDF(2432Kb)  |  收藏  |  浏览/下载:134/14  |  提交时间:2023/12/21
Domain generalization  feature disentanglement  Fourier transform  data augmentation  
SignParser: An End-to-End Framework for Traffic Sign Understanding 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 卷号: 132, 期号: 2, 页码: 805-821
作者:  Guo, Yunfei;  Feng, Wei;  Yin, Fei;  Liu, Cheng-Lin
Adobe PDF(7011Kb)  |  收藏  |  浏览/下载:136/8  |  提交时间:2023/12/21
Traffic sign understanding  Content reasoning  Semantic description generation  
ScoreMix: A Scalable Augmentation Strategy for Training GANs With Limited Data 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 7, 页码: 8920-8935
作者:  Cao, Jie;  Luo, Mandi;  Yu, Junchi;  Yang, Ming-Hsuan;  He, Ran
Adobe PDF(1823Kb)  |  收藏  |  浏览/下载:132/9  |  提交时间:2023/11/17
Generative adversarial networks  image synthesis  data augmentation  few-shot image-to-image translation  
DyGAT: Dynamic stroke classification of online handwritten documents and sketches 期刊论文
PATTERN RECOGNITION, 2023, 卷号: 141, 页码: 12
作者:  Yang, Yu-Ting;  Zhang, Yan-Ming;  Yun, Xiao-Long;  Yin, Fei;  Liu, Cheng-Lin
Adobe PDF(3180Kb)  |  收藏  |  浏览/下载:162/3  |  提交时间:2023/11/17
Stroke classification  Sketch semantic segmentation  Document layout analysis  Diagram recognition  Streaming recognition  
BViT: Broad Attention-Based Vision Transformer 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2023, 页码: 1 - 12
作者:  Nannan Li;  Yaran Chen;  Weifan Li;  Zixiang Ding;  Dongbin Zhao;  Shuai Nie
Adobe PDF(2171Kb)  |  收藏  |  浏览/下载:236/64  |  提交时间:2023/06/27
Broad attention  broad connection  image classification  parameter-free attention  vision transformer