CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Pixel-Wise Grasp Detection via Twin Deconvolution and Multi-Dimensional Attention 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 卷号: 33, 期号: 8, 页码: 4002-4010
作者:  Ren, Guangli;  Geng, Wenjie;  Guan, Peiyu;  Cao, Zhiqiang;  Yu, Junzhi
Adobe PDF(4013Kb)  |  收藏  |  浏览/下载:7/1  |  提交时间:2024/05/28
Social Vision for Intelligent Vehicles: From Computer Vision to Foundation Vision 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 卷号: 8, 期号: 11, 页码: 4474-4476
作者:  Yu, Hui;  Wang, Yutong;  Tian, Yonglin;  Zhang, Hui;  Zheng, Wenbo;  Wang, Fei-Yue
Adobe PDF(135Kb)  |  收藏  |  浏览/下载:42/1  |  提交时间:2024/03/27
Social Vision  Parallel Vision  Knowledge Vision  Foundation Vision  intelligent vehicles  social interaction  sustainability  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:52/15  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
收藏  |  浏览/下载:58/0  |  提交时间:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
Attention Weighted Local Descriptors 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 9, 页码: 10632-10649
作者:  Wang, Changwei;  Xu, Rongtao;  Lu, Ke;  Xu, Shibiao;  Meng, Weiliang;  Zhang, Yuyang;  Fan, Bin;  Zhang, Xiaopeng
Adobe PDF(8075Kb)  |  收藏  |  浏览/下载:136/2  |  提交时间:2023/11/17
Local features detection and description  consistent attention mechanism  context augmentation  lightweight local descriptors  knowledge distillation  
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 7, 页码: 8419-8432
作者:  Lian, Zheng;  Chen, Lan;  Sun, Licai;  Liu, Bin;  Tao, Jianhua
Adobe PDF(3959Kb)  |  收藏  |  浏览/下载:145/2  |  提交时间:2023/11/17
Oral communication  Correlation  Data models  Task analysis  Feature extraction  Tensors  Benchmark testing  Conversational data  graph complete network (GCNet)  incomplete multimodal learning  speaker-sensitive modeling  temporal-sensitive modeling  
SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition 期刊论文
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 卷号: 14, 期号: 3, 页码: 2415-2429
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2103Kb)  |  收藏  |  浏览/下载:116/3  |  提交时间:2023/11/15
Emotion recognition  Feature extraction  Training  Acoustics  Semisupervised learning  Benchmark testing  Hidden Markov models  Semi-supervised multi-modal interaction network (SMIN)  conversational emotion recognition  semi-supervised learning  intra-modal interaction  cross-modal interaction  
Graph-Enhanced Emotion Neural Decoding 期刊论文
IEEE Transactions on Medical Imaging, 2023, 卷号: 42, 期号: 8, 页码: 2262 - 2273
作者:  Huang Zhongyu(黄中昱);  Du Changde(杜长德);  Wang Yingheng;  Fu Kaicheng(付铠城);  He Huiguang(何晖光)
Adobe PDF(6049Kb)  |  收藏  |  浏览/下载:275/58  |  提交时间:2023/05/05
Brain region  emotion  graph neural networks  neural decoding  representation  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:139/26  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-term Pose Forecasting 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: Early Access, 期号: Early Access, 页码: Early Access
作者:  Shentong Mo;  Xin M(辛淼)
Adobe PDF(2209Kb)  |  收藏  |  浏览/下载:98/16  |  提交时间:2023/04/25
long-term forecasting  spatial-temporal graph transformer  Bayesian transformer  uncertainty estimation