CASIA OpenIR

浏览/检索结果: 共102条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Unsupervised Dialogue State Tracking for End-to-End Task-Oriented Dialogue with a Multi-Span Prediction Network 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 卷号: 38, 期号: 4, 页码: 834-852
作者:  Liu, Qing-Bin;  He, Shi-Zhu;  Liu, Cao;  Liu, Kang;  Zhao, Jun
收藏  |  浏览/下载:27/0  |  提交时间:2024/02/22
end-to-end task-oriented dialogue  dialogue state tracking (DST)  unsupervised learning  reinforcement learning  
Multi-modal spatial relational attention networks for visual question answering 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 13
作者:  Yao, Haibo;  Wang, Lipeng;  Cai, Chengtao;  Sun, Yuxin;  Zhang, Zhi;  Luo, Yongkang
收藏  |  浏览/下载:59/0  |  提交时间:2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre -training strategy  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:42/13  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Emotion-Aware Music Driven Movie Montage 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 卷号: 38, 期号: 3, 页码: 540-553
作者:  Liu, Wu-Qin;  Lin, Min-Xuan;  Huang, Hai-Bin;  Ma, Chong-Yang;  Song, Yu;  Dong, Wei-Ming;  Xu, Chang-Sheng
收藏  |  浏览/下载:106/0  |  提交时间:2023/12/21
movie montage  emotion analysis  audio-visual modality  contrastive learning  
Exploring Explicitly Disentangled Features for Domain Generalization 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 11, 页码: 6360-6373
作者:  Li, Jingwei;  Li, Yuan;  Wang, Huanjie;  Liu, Chengbao;  Tan, Jie
收藏  |  浏览/下载:72/0  |  提交时间:2023/12/21
Domain generalization  feature disentanglement  Fourier transform  data augmentation  
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 19
作者:  You, Sisi;  Zuo, Yukun;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:66/0  |  提交时间:2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
Jointing Recurrent Across-Channel and Spatial Attention for Multi-Object Tracking With Block-Erasing Data Augmentation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 8, 页码: 4054-4069
作者:  Deng, Keyu;  Zhang, Congxuan;  Chen, Zhen;  Hu, Weiming;  Li, Bing;  Lu, Feng
收藏  |  浏览/下载:95/0  |  提交时间:2023/11/17
Multi-object tracking  one shot  multiattention feature learning  block erasing strategy  object occlusions  
Zero-Shot Predicate Prediction for Scene Graph Parsing 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 3140-3153
作者:  Li, Yiming;  Yang, Xiaoshan;  Huang, Xuhui;  Ma, Zhe;  Xu, Changsheng
收藏  |  浏览/下载:130/0  |  提交时间:2023/11/17
Deep learning  zero-shot  scene graph  
Omnidirectional Depth Estimation With Hierarchical Deep Network for Multi-Fisheye Navigation Systems 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 页码: 12
作者:  Su, Xiaojie;  Liu, Shimin;  Li, Rui
收藏  |  浏览/下载:75/0  |  提交时间:2023/11/17
Feature extraction  Cameras  Estimation  Task analysis  Navigation  Costs  Semantics  Omnidirectional depth estimation  hierarchical deep network  multi-fisheye navigation system  
Human Parsing With Part-Aware Relation Modeling 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 25, 页码: 2601-2612
作者:  Zhang, Xiaomei;  Chen, Yingying;  Tang, Ming;  Wang, Jinqiao;  Zhu, Xiangyu;  Lei, Zhen
Adobe PDF(6053Kb)  |  收藏  |  浏览/下载:111/8  |  提交时间:2023/11/17
Human parsing  modeling  part-aware relation