CASIA OpenIR

浏览/检索结果: 共64条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates 期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2024, 卷号: 31, 页码: 241-245
作者:  Tao, Manli;  Zhao, Chaoyang;  Wang, Jinqiao;  Tang, Ming
Adobe PDF(3506Kb)  |  收藏  |  浏览/下载:35/0  |  提交时间:2024/03/26
Three-dimensional displays  Proposals  Object detection  Feature extraction  Point cloud compression  Aggregates  Sun  3D object detection  image candidates  pseudo 3D proposal  target missing  
Objformer: Boosting 3D object detection via instance-wise interaction 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 9
作者:  Tao, Manli;  Zhao, Chaoyang;  Tang, Ming;  Wang, Jinqiao
Adobe PDF(3261Kb)  |  收藏  |  浏览/下载:109/1  |  提交时间:2024/02/22
3D object detection  Point clouds  Incompletion and occlusion  Instance-wise interaction  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:129/16  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Learning Semantics-Consistent Stripes With Self-Refinement for Person Re-Identification 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-12
作者:  Zhu Kuan;  Guo Haiyun;  Liu Songyan;  Wang Jinqiao;  Tang Ming
Adobe PDF(4384Kb)  |  收藏  |  浏览/下载:123/33  |  提交时间:2023/06/08
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:136/24  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Enhanced Bounding Box Estimation with Distribution Calibration for Visual Tracking 期刊论文
SENSORS, 2021, 卷号: 21, 期号: 23, 页码: 14
作者:  Yu, Bin;  Tang, Ming;  Zhu, Guibo;  Wang, Jinqiao;  Lu, Hanqing
Adobe PDF(10825Kb)  |  收藏  |  浏览/下载:394/67  |  提交时间:2022/02/16
visual tracking  bounding box estimation  overlap maximization  distribution calibration  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:328/80  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3518-3529
作者:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  收藏  |  浏览/下载:295/61  |  提交时间:2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
Skeleton-Based Action Recognition with Directed Graph Neural Networks 会议论文
, Long Beach, CA, United states, June 16, 2019 - June 20, 2019
作者:  Shi L(史磊);  Zhang YF(张一帆);  Cheng J(程健);  Lu HQ(卢汉清)
Adobe PDF(554Kb)  |  收藏  |  浏览/下载:163/50  |  提交时间:2021/05/31
Show, Tell, and Polish: Ruminant Decoding for Image Captioning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 8, 页码: 2149-2162
作者:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  收藏  |  浏览/下载:209/32  |  提交时间:2020/08/31
Image captioning  Multi-pass decoding  Rumination