CASIA OpenIR

Browse/Search results: 155 records in total, showing records 1-10

Calibration & Reconstruction: Deep Integrated Language for Referring Image Segmentation (Conference paper)
Proceedings of the 2024 International Conference on Multimedia Retrieval, Phuket, Thailand, 2024/03/08
Authors:  Yichen Yan;  Xingjian He;  Sihan Chen;  Jing Liu
Adobe PDF(2868Kb)  |  Views/Downloads: 28/11  |  Submitted: 2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
Fuse & Calibrate: A bi-directional Vision-Language Guided Framework for Referring Image Segmentation (Conference paper)
Tianjin, China, 2024/08/05
Authors:  Yichen Yan;  Xingjian He;  Sihan Chen;  Shichen Lu;  Jing Liu
Adobe PDF(1978Kb)  |  Views/Downloads: 27/10  |  Submitted: 2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
BFRFormer: Transformer-based generator for Real-World Blind Face Restoration (Conference paper)
Seoul, Korea, 2024-4-14 to 2024-4-19
Authors:  Guojing Ge;  Qi Song;  Guibo Zhu;  Yuting Zhang;  Jinglu Chen;  Miao Xin;  Ming Tang;  Jinqiao Wang
Adobe PDF(6872Kb)  |  Views/Downloads: 54/15  |  Submitted: 2024/06/06
Progressive Direction-Aware Pose Grammar for Human Pose Estimation (Journal article)
IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2023, Volume: 5, Issue: 4, Pages: 593-605
Authors:  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF(3192Kb)  |  Views/Downloads: 59/27  |  Submitted: 2024/06/03
The Devil is in Details: Delving Into Lite FFN Design for Vision Transformers (Conference paper)
Seoul, Korea, 2024-4-14
Authors:  Chen, Zhiyang;  Zhu, Yousong;  Li, Zhaowen;  Yang, Fan;  Zhao, Chaoyang;  Wang, Jinqiao;  Tang, Ming
Adobe PDF(407Kb)  |  Views/Downloads: 69/20  |  Submitted: 2024/05/28
Vision Transformer  Light-Weight Structure  Feed-Forward Networks  
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (Conference paper)
New Orleans, Louisiana & Online, 2022-11-28
Authors:  Chen, Zhiyang;  Zhu, Yousong;  Li, Zhaowen;  Yang, Fan;  Li, Wei;  Wang, Haixin;  Zhao, Chaoyang;  Wu, Liwei;  Zhao, Rui;  Wang, Jinqiao;  Tang, Ming
Adobe PDF(1289Kb)  |  Views/Downloads: 49/12  |  Submitted: 2024/05/28
transformer  general visual framework  sequence prediction  multi-task  
PIDray: A Large-Scale X-ray Benchmark for Real-World Prohibited Item Detection (Aug, 10.1007/s11263-023-01855-1, 2023) (Journal article)
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, Volume: 131, Pages: 3170–3192
Authors:  Zhang, Libo;  Jiang, Lutao;  Ji, Ruyi;  Fan, Heng
Adobe PDF(3227Kb)  |  Views/Downloads: 58/5  |  Submitted: 2023/11/17
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 5009-5021
Authors:  Ji, Ruyi;  Li, Jiaying;  Zhang, Libo;  Liu, Jing;  Wu, Yanjun
Adobe PDF(4636Kb)  |  Views/Downloads: 187/20  |  Submitted: 2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
WL-MSR: Watch and Listen for Multimodal Subtitle Recognition (Conference paper)
Greece, 2023-6-4
Authors:  Liu, Jiawei;  Wang, Hao;  Wang, Weining;  He, Xingjian;  Liu, Jing
Adobe PDF(1673Kb)  |  Views/Downloads: 188/42  |  Submitted: 2023/07/06
Research on Text-Guided Video Generation Methods (文本指导的视频生成方法研究) (Thesis)
2023
Author:  刘佳伟
Adobe PDF(15246Kb)  |  Views/Downloads: 167/6  |  Submitted: 2023/06/08
AI-generated content  Multimodal  Video generation