CASIA OpenIR

浏览/检索结果: 共46条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Calibration & Reconstruction: Deep Integrated Language for Referring Image Segmentation 会议论文
Proceedings of the 2024 International Conference on Multimedia Retrieval, Phuket, Thailand, 2024/03/08
作者:  Yichen Yan;  Xingjian He;  Sihan Chen;  Jing Liu
Adobe PDF(2868Kb)  |  收藏  |  浏览/下载:22/8  |  提交时间:2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
Global Patch Cross-Attention for Point Cloud Analysis 会议论文
5th Chinese Conference, PRCV 2022, Shenzhen, China, November 4–7, 2022, Proceedings, Part III, 深圳, 2022.11.4-2022.11.7
作者:  Tao ML(陶满礼);  Zhao CY(赵朝阳);  Wang JQ(王金桥);  Tang M(唐明)
Adobe PDF(6422Kb)  |  收藏  |  浏览/下载:47/12  |  提交时间:2024/05/30
Global patch · Cross-attention · Contextual description · Point cloud analysis  
ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates 期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2024, 卷号: 31, 页码: 241-245
作者:  Tao, Manli;  Zhao, Chaoyang;  Wang, Jinqiao;  Tang, Ming
Adobe PDF(3506Kb)  |  收藏  |  浏览/下载:78/7  |  提交时间:2024/03/26
Three-dimensional displays  Proposals  Object detection  Feature extraction  Point cloud compression  Aggregates  Sun  3D object detection  image candidates  pseudo 3D proposal  target missing  
Temporal Action Proposal Generation With Action Frequency Adaptive Network 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 2340 - 2353
作者:  Yepeng Tang;  Weining Wang;  Chunjie Zhang;  Jing Liu;  Yao Zhao
Adobe PDF(10095Kb)  |  收藏  |  浏览/下载:79/25  |  提交时间:2024/03/26
Proposals  Task analysis  Data models  Time-frequency analysis  Representation learning  Predictive models  Information science  Temporal action proposal generation  expert learning  fine-gained detection  action frequency  
Objformer: Boosting 3D object detection via instance-wise interaction 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 9
作者:  Tao, Manli;  Zhao, Chaoyang;  Tang, Ming;  Wang, Jinqiao
Adobe PDF(3261Kb)  |  收藏  |  浏览/下载:158/11  |  提交时间:2024/02/22
3D object detection  Point clouds  Incompletion and occlusion  Instance-wise interaction  
Quality-Aware Network for Human Parsing 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 7128-7138
作者:  Yang, Lu;  Song, Qing;  Wang, Zhihui;  Liu, Zhiwei;  Xu, Songcen;  Li, Zhihao
收藏  |  浏览/下载:45/0  |  提交时间:2024/02/22
Computer vision  image segmentation  multi-media computing  
PIDray: A Large-Scale X-ray Benchmark for Real-World Prohibited Item Detection (Aug, 10.1007/s11263-023-01855-1, 2023) 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 卷号: 131, 页码: 3170–3192
作者:  Zhang, Libo;  Jiang, Lutao;  Ji, Ruyi;  Fan, Heng
Adobe PDF(3227Kb)  |  收藏  |  浏览/下载:52/3  |  提交时间:2023/11/17
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 5009-5021
作者:  Ji, Ruyi;  Li, Jiaying;  Zhang, Libo;  Liu, Jing;  Wu, Yanjun
Adobe PDF(4636Kb)  |  收藏  |  浏览/下载:175/17  |  提交时间:2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
Semi-supervised Temporal Action Proposal Generation via Exploiting 2-d Proposal Map 期刊论文
IEEE Transactions on Multimedia, 2021, 页码: 3624 - 3635
作者:  Wang, Weining;  Lin, Tianwei;  He, Dongliang;  Li, Fu;  Wen, Shilei;  Wang, Liang;  Liu, Jing
Adobe PDF(4851Kb)  |  收藏  |  浏览/下载:165/27  |  提交时间:2023/05/03
Semi-supervised learning  proposal map oriented mean-teacher  pseudo label  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:170/36  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer