CASIA OpenIR

浏览/检索结果: 共141条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
IterDepth: Iterative Residual Refinement for Outdoor Self-Supervised Multi-Frame Monocular Depth Estimation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 329-341
作者:  Feng, Cheng;  Chen, Zhen;  Zhang, Congxuan;  Hu, Weiming;  Li, Bing;  Lu, Feng
收藏  |  浏览/下载:28/0  |  提交时间:2024/03/26
Estimation  Iterative methods  Cameras  Task analysis  Feature extraction  Decoding  Training  Monocular depth estimation  iterative refinement  self-supervised learning  deep learning  
Quality-Aware Network for Human Parsing 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 7128-7138
作者:  Yang, Lu;  Song, Qing;  Wang, Zhihui;  Liu, Zhiwei;  Xu, Songcen;  Li, Zhihao
收藏  |  浏览/下载:22/0  |  提交时间:2024/02/22
Computer vision  image segmentation  multi-media computing  
Multi-modal spatial relational attention networks for visual question answering 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 13
作者:  Yao, Haibo;  Wang, Lipeng;  Cai, Chengtao;  Sun, Yuxin;  Zhang, Zhi;  Luo, Yongkang
收藏  |  浏览/下载:64/0  |  提交时间:2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre -training strategy  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:43/13  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 19
作者:  You, Sisi;  Zuo, Yukun;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:70/0  |  提交时间:2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 12
作者:  Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
收藏  |  浏览/下载:86/0  |  提交时间:2023/12/21
Graph attention  graph reasoning  multimodal graph  self-attention  text-based visual question answering  
PSAQ-ViT V2: Toward Accurate and General Data-Free Quantization for Vision Transformers 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 12
作者:  Li, Zhikai;  Chen, Mengjuan;  Xiao, Junrui;  Gu, Qingyi
收藏  |  浏览/下载:53/0  |  提交时间:2023/11/17
Data-free quantization  model compression  patch similarity  quantized vision transformers (ViTs)  
Hierarchical Curriculum Learning for No-Reference Image Quality Assessment 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 页码: 20
作者:  Wang, Juan;  Chen, Zewen;  Yuan, Chunfeng;  Li, Bing;  Ma, Wentao;  Hu, Weiming
收藏  |  浏览/下载:95/0  |  提交时间:2023/11/17
No-reference image quality assessment  Hierarchical curriculum learning  Prior knowledge  Cross-dataset quality assessment correlation  
Contour Primitive of Interest Extraction Network Based on Dual-Metric One-Shot Learning for Vision Measurement 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 卷号: 19, 期号: 4, 页码: 5839-5848
作者:  Qin, Fangbo;  Lin, Shan;  Xu, De
收藏  |  浏览/下载:99/0  |  提交时间:2023/11/17
Feature extraction  Measurement  Task analysis  Imaging  Image segmentation  Prototypes  Training  Contour extraction  deep learning  metric learning  one-shot learning  vision measurement  
Human Parsing With Part-Aware Relation Modeling 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 25, 页码: 2601-2612
作者:  Zhang, Xiaomei;  Chen, Yingying;  Tang, Ming;  Wang, Jinqiao;  Zhu, Xiangyu;  Lei, Zhen
Adobe PDF(6053Kb)  |  收藏  |  浏览/下载:118/10  |  提交时间:2023/11/17
Human parsing  modeling  part-aware relation