CASIA OpenIR

浏览/检索结果: 共26条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Calibration & Reconstruction: Deep Integrated Language for Referring Image Segmentation 会议论文
Proceedings of the 2024 International Conference on Multimedia Retrieval, Phuket, Thailand, 2024/03/08
作者:  Yichen Yan;  Xingjian He;  Sihan Chen;  Jing Liu
Adobe PDF(2868Kb)  |  收藏  |  浏览/下载:19/7  |  提交时间:2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
Learning Semantics-Consistent Stripes With Self-Refinement for Person Re-Identification 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-12
作者:  Zhu Kuan;  Guo Haiyun;  Liu Songyan;  Wang Jinqiao;  Tang Ming
Adobe PDF(4384Kb)  |  收藏  |  浏览/下载:160/44  |  提交时间:2023/06/08
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:169/36  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering 会议论文
, 线上, 2021-10
作者:  Liu, Fei;  Liu, Jing;  Wang, Weining;  Lu, Hanqing
Adobe PDF(1174Kb)  |  收藏  |  浏览/下载:216/48  |  提交时间:2022/06/15
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:368/93  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
Antidecay LSTM for Siamese Tracking With Adversarial Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 卷号: 32, 期号: 10, 页码: 4475-4489
作者:  Zhao, Fei;  Zhang, Ting;  Wu, Yi;  Tang, Ming;  Wang, Jinqiao
收藏  |  浏览/下载:262/0  |  提交时间:2021/12/28
Target tracking  Feature extraction  Computer architecture  Visualization  Training  Task analysis  Adversarial learning  deep learning  long short-term memory (LSTM)  visual tracking  
Blended Grammar Network for Human Parsing 会议论文
, 线上会议, 2020
作者:  Xiaomei Zhang;  Yingying Chen;  Bingke Zhu;  Jinqiao Wang;  Ming Tang
Adobe PDF(1602Kb)  |  收藏  |  浏览/下载:175/65  |  提交时间:2021/06/21
TREE HIERARCHICAL CNNS FOR OBJECT PARSING 会议论文
, 希腊雅典, 2020
作者:  Xiaomei Zhang;  Yingying Chen;  Bingke Zhu;  Jinqiao Wang;  Ming Tang;  Hanqing Lu
Adobe PDF(1722Kb)  |  收藏  |  浏览/下载:219/58  |  提交时间:2021/06/21
Gesture recognition based on deep deformable 3D convolutional neural networks 期刊论文
PATTERN RECOGNITION, 2020, 期号: 107, 页码: 12
作者:  Zhang, Yifan;  Shi, Lei;  Wu, Yi;  Cheng, Ke;  Cheng, Jian;  Lu, Hanqing
浏览  |  Adobe PDF(1310Kb)  |  收藏  |  浏览/下载:503/143  |  提交时间:2020/08/31
Gesture recognition  Spatiotemporal deformable convolution  Spatiotemporal convolutional neural network  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 8, 页码: 2149-2162
作者:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  收藏  |  浏览/下载:240/38  |  提交时间:2020/08/31
Image captioning  Multi-pass decoding  Rumination