CASIA OpenIR

Browse/Search results: 284 items in total, showing items 1-10

An end-to-end model for multi-view scene text recognition (Journal article)
PATTERN RECOGNITION, 2024, Volume: 149, Pages: 17
Authors: Banerjee, Ayan; Shivakumara, Palaiahnakote; Bhattacharya, Saumik; Pal, Umapada; Liu, Cheng-Lin
Views/Downloads: 7/0 | Submitted: 2024/07/03
Text detection  Scene text recognition  Siamese network  Natural language model  Genetic algorithm  Multi-view text detection  
Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors: Ma, Cong; Han, Xu; Wu, Linghui; Zhang, Yaping; Zhao, Yang; Zhou, Yu; Zong, Chengqing
Adobe PDF(6551Kb) | Views/Downloads: 24/12 | Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
GFFNet: Global Feature Fusion Network for Semantic Segmentation of Large-Scale Remote Sensing Images (Journal article)
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024, Volume: 17, Issue: 2024, Pages: 4222-4234
Authors: Cao, Yong; Huo, Chunlei; Xiang, Shiming; Pan, Chunhong
Adobe PDF(4340Kb) | Views/Downloads: 20/4 | Submitted: 2024/06/25
Cross feature fusion (CFF)  global context learning  group transformer  semantic segmentation  
An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition (Journal article)
Pattern Recognition, 2024, Pages: 110373
Authors: MingMing Yu(于明明); Zhang H(张恒); Fei Yin(殷飞); Cheng-Lin Liu(刘成林)
Adobe PDF(5849Kb) | Views/Downloads: 34/11 | Submitted: 2024/06/24
BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision (Journal article)
International Journal of Computer Vision, 2024, Volume: 132, Pages: 1659-1684
Authors: Xin Zhao; Shiyu Hu; Yipei Wang; Zhang Jing; Yimin Hu; Rongshuai Liu; Haibin Ling; Yin Li; Renshu Li; Kun Liu; Jiadong Li
Adobe PDF(9076Kb) | Views/Downloads: 24/7 | Submitted: 2024/06/21
Learnable Graph Matching: A Practical Paradigm for Data Association (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, Volume: 46, Issue: 7, Pages: 4880-4895
Authors: He, Jiawei; Huang, Zehao; Wang, Naiyan; Zhang, Zhaoxiang
Adobe PDF(3520Kb) | Views/Downloads: 36/9 | Submitted: 2024/06/18
Graph matching  data association  multiple object tracking  image matching  
RC-Net: Row and Column Net with Text Feature for Deep Parsing Floor Plan Images (Journal article)
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, Pages: 526-539
Authors: Wang T(王腾); Meng WL(孟维亮); Lu ZD(卢政达); Guo JW(郭建伟); Xiao J(肖俊); Zhang XP(张晓鹏)
Adobe PDF(2370Kb) | Views/Downloads: 22/6 | Submitted: 2024/06/11
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition (Journal article)
Information Fusion, 2024, Volume: 108, Pages: 1-20
Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF(2281Kb) | Views/Downloads: 47/11 | Submitted: 2024/05/31
Audio-Visual Emotion Recognition  Self-supervised learning  Masked autoencoder  Contrastive learning  
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 4, Pages: 2041-2055
Authors: Chen, Yuxin; Zhang, Ziqi; Qi, Zhongang; Yuan, Chunfeng; Wang, Jie; Shan, Ying; Li, Bing; Hu, Weiming; Qie, Xiaohu; Wu, Jianping
Adobe PDF(13765Kb) | Views/Downloads: 40/1 | Submitted: 2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
PmcaNet: Pyramid multiscale channel attention network for electron microscopy image segmentation (Journal article)
Journal of Intelligent & Fuzzy Systems, 2024, Volume: 46, Issue: 2, Pages: 4895-4907
Authors: Gao, Kaihan; Ju, Yiwei; Li, Shuai; Yang, Xuebing; Zhang, Wensheng; Li, Guoqing
Adobe PDF(1371Kb) | Views/Downloads: 41/10 | Submitted: 2024/05/28
Electron microscopy  Image segmentation  Convolutional neural network  Multiscale feature pyramid