CASIA OpenIR

浏览/检索结果: 共44条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 338-351
作者:  Zheng, Aihua;  Hu, Menglan;  Jiang, Bo;  Huang, Yan;  Yan, Yan;  Luo, Bin
收藏  |  浏览/下载:224/0  |  提交时间:2022/03/17
Visualization  Task analysis  Measurement  Speech recognition  Videos  Location awareness  Image recognition  Adversarial learning  audio-visual matching  cross-modal learning  metric learning  
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2273-2286
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyun;  Xu, Changsheng
Adobe PDF(2409Kb)  |  收藏  |  浏览/下载:305/61  |  提交时间:2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition  
Weakly-Supervised Facial Expression Recognition in the Wild With Noisy Data 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 1800-1814
作者:  Zhang, Feifei;  Xu, Mingliang;  Xu, Changsheng
收藏  |  浏览/下载:214/0  |  提交时间:2022/06/10
Noise measurement  Face recognition  Data models  Task analysis  Training data  Training  Annotations  Facial expression recognition  noisy labeled data  clean labels  end-to-end  pose modeling  noise modeling  
Human Parsing With Part-Aware Relation Modeling 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 25, 页码: 2601-2612
作者:  Zhang, Xiaomei;  Chen, Yingying;  Tang, Ming;  Wang, Jinqiao;  Zhu, Xiangyu;  Lei, Zhen
Adobe PDF(6053Kb)  |  收藏  |  浏览/下载:101/5  |  提交时间:2023/11/17
Human parsing  modeling  part-aware relation  
The Model May Fit You: User-Generalized Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 24, 页码: 2998-3012
作者:  Ma, Xinhong;  Yang, Xiaoshan;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(6549Kb)  |  收藏  |  浏览/下载:238/46  |  提交时间:2022/06/17
cross-modal retrieval  domain generalization  meta-learning  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:306/42  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Single-Image Specular Highlight Removal via Real-World Dataset Construction 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 页码: 12
作者:  Wu ZQ(吴仲琦);  Zhuang CQ(庄传青);  Shi J(石剑);  Guo JW(郭建伟);  Xiao J(肖俊);  Zhang XP(张晓鹏);  Yan DM(严冬明)
Adobe PDF(49307Kb)  |  收藏  |  浏览/下载:241/94  |  提交时间:2022/06/15
Specular highlight removal, PSD-Dataset, Deep learning  
Joint Learning in the Spatio-Temporal and Frequency Domains for Skeleton-Based Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 9, 页码: 2207-2220
作者:  Guyue, Hu;  Bo, Cui;  Shan, Yu
Adobe PDF(4803Kb)  |  收藏  |  浏览/下载:274/54  |  提交时间:2020/09/28
Skeleton-based Action Recognition  Frequency Attention  Synchronous Local and Non-local Learning  Soft-margin Focal Loss  Pesudo Multi-task Learning  
Realistic Facial Expression Reconstruction for VR HMD Users 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 3, 页码: 730-743
作者:  Lou, Jianwen;  Wang, Yiming;  Nduka, Charles;  Hamedi, Mahyar;  Mavridou, Ifigeneia;  Wang, Fei-Yue;  Yu, Hui
收藏  |  浏览/下载:173/0  |  提交时间:2020/06/02
Face  Sensors  Three-dimensional displays  Image reconstruction  Resists  Electromyography  Cameras  Facial expression reconstruction  head-mounted display  electromyogram  3D face reconstruction  facial action unit  
WiderPerson: A Diverse Dataset for Dense Pedestrian Detection in the Wild 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 2, 页码: 380-393
作者:  Zhang, Shifeng;  Xie, Yiliang;  Wan, Jun;  Xia, Hansheng;  Li, Stan Z.;  Guo, Guodong
浏览  |  Adobe PDF(6651Kb)  |  收藏  |  浏览/下载:295/50  |  提交时间:2020/04/07
Benchmark testing  Detectors  Training  Urban areas  Cameras  Task analysis  Deep learning  Pedestrian detection  dataset  rich diversity  high density