CASIA OpenIR

Browse/Search results: 50 items in total, showing items 1-10

Calibration & Reconstruction: Deep Integrated Language for Referring Image Segmentation (Conference paper)
Proceedings of the 2024 International Conference on Multimedia Retrieval, Phuket, Thailand, 2024/03/08
Authors: Yichen Yan; Xingjian He; Sihan Chen; Jing Liu
Adobe PDF (2868 KB)  |  Views/Downloads: 12/5  |  Submitted: 2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation (Journal article)
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors: Liu, Jiawei; Wang, Weining; Chen, Sihan; Zhu, Xinxin; Liu, Jing
Adobe PDF (7741 KB)  |  Views/Downloads: 162/32  |  Submitted: 2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Macro-micro mutual learning inside compositional model for human pose estimation (Journal article)
Neurocomputing, 2021, Volume: 449, Issue: 449, Pages: 176-188
Authors: Zhou Lu; Chen Yingying; Cao Congqi; Chu Yakui; Wang Jinqiao; Lu Hanqing
Adobe PDF (3325 KB)  |  Views/Downloads: 243/55  |  Submitted: 2021/06/21
Mutual learning  Macro-micro  BMMSE  Human pose estimation  
Rethinking the PID optimizer for stochastic optimization of deep networks (Conference paper)
London, United Kingdom, July 6, 2020 - July 10, 2020
Authors: Shi, Lei; Zhang, Yifan; Wang, Wanguo; Cheng, Jian; Lu, Hanqing
Adobe PDF (325 KB)  |  Views/Downloads: 254/63  |  Submitted: 2021/01/27
Gesture recognition based on deep deformable 3D convolutional neural networks (Journal article)
PATTERN RECOGNITION, 2020, Issue: 107, Pages: 12
Authors: Zhang, Yifan; Shi, Lei; Wu, Yi; Cheng, Ke; Cheng, Jian; Lu, Hanqing
Adobe PDF (1310 KB)  |  Views/Downloads: 496/140  |  Submitted: 2020/08/31
Gesture recognition  Spatiotemporal deformable convolution  Spatiotemporal convolutional neural network  
Semantic-spatial fusion network for human parsing (Journal article)
NEUROCOMPUTING, 2020, Volume: 91, Issue: 402, Pages: 375-383
Authors: Zhang, Xiaomei; Chen, Yingying; Zhu, Bingke; Wang, Jinqiao; Tang, Ming
Adobe PDF (2060 KB)  |  Views/Downloads: 336/51  |  Submitted: 2020/07/20
SSFNet  Semantic modulation model  Resolution-aware model  Human parsing  
A Novel Data Augmentation Scheme for Pedestrian Detection with Attribute Preserving GAN (Journal article)
Neurocomputing, 2020, Volume: 401, Issue: 11, Pages: 123-132
Authors: Liu, Songyan; Guo, Haiyun; Hu, Jian-Guo; Zhao, Xu; Zhao, Chaoyang; Wang, Tong; Zhu, Yousong; Wang, Jinqiao; Tang, Ming
Adobe PDF (2691 KB)  |  Views/Downloads: 404/88  |  Submitted: 2020/06/10
Generative Adversarial Networks  Pedestrian detection  Data augmentation  
Contextual deconvolution network for semantic segmentation (Journal article)
PATTERN RECOGNITION, 2020, Volume: 101, Pages: 11
Authors: Fu, Jun; Liu, Jing; Li, Yong; Bao, Yongjun; Yan, Weipeng; Fang, Zhiwei; Lu, Hanqing
Adobe PDF (3400 KB)  |  Views/Downloads: 438/104  |  Submitted: 2020/06/02
Semantic segmentation  Deconvolution network  Channel contextual module  Spatial contextual module  
Food det: Detecting foods in refrigerator with supervised transformer network (Journal article)
NEUROCOMPUTING, 2020, Volume: 379, Issue: 28, Pages: 162-171
Authors: Zhu, Yousong; Zhao, Xu; Zhao, Chaoyang; Wang, Jinqiao; Lu, Hanqing
Adobe PDF (2790 KB)  |  Views/Downloads: 506/80  |  Submitted: 2020/03/30
Food detection  Spatial transformer  Object detection  
Image Captioning with Word Gate and Adaptive Self-Critical Learning (Journal article)
APPLIED SCIENCES-BASEL, 2018, Volume: 8, Issue: 6, Pages: 13
Authors: Zhu, Xinxin; Li, Lixiang; Liu, Jing; Guo, Longteng; Fang, Zhiwei; Peng, Haipeng; Niu, Xinxin
Adobe PDF (3312 KB)  |  Views/Downloads: 411/65  |  Submitted: 2019/12/16
image caption  image understanding  deep learning  computer vision