CASIA OpenIR

Browse/Search results: 26 items in total, showing items 1-10

HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition (Journal article)
Information Fusion, 2024, Volume: 108, Pages: 1-20
Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF(2281Kb) | Views/Downloads: 44/10 | Submitted: 2024/05/31
Audio-Visual Emotion Recognition  Self-supervised learning  Masked autoencoder  Contrastive learning  
Context feature fusion and enhanced non-maximum suppression for pedestrian detection in crowded scenes (Journal article)
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, Pages: 21
Authors: Shao, Yu; Hu, Jianhua; Hu, Lihua; Zhang, Jifu; Wang, Xinbo
Views/Downloads: 18/0 | Submitted: 2024/05/30
Densely populated  Pedestrian detection  Occlusion  Contextual information  
Subband fusion of complex spectrogram for fake speech detection (Journal article)
SPEECH COMMUNICATION, 2023, Volume: 155, Pages: 8
Authors: Fan, Cunhang; Xue, Jun; Dong, Shunbo; Ding, Mingming; Yi, Jiangyan; Li, Jinpeng; Lv, Zhao
Views/Downloads: 49/0 | Submitted: 2024/03/26
Automatic speaker verification  Complex spectrogram  Fake speech detection  Phase information  Subband  
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis (Journal article)
KNOWLEDGE-BASED SYSTEMS, 2024, Volume: 283, Pages: 9
Authors: Yi, Guofeng; Fan, Cunhang; Zhu, Kang; Lv, Zhao; Liang, Shan; Wen, Zhengqi; Pei, Guanxiong; Li, Taihao; Tao, Jianhua
Views/Downloads: 103/0 | Submitted: 2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion  
Transformer-based stroke relation encoding for online handwriting and sketches (Journal article)
PATTERN RECOGNITION, 2024, Volume: 148, Pages: 13
Authors: Liu, Jing-Yu; Zhang, Yan-Ming; Yin, Fei; Liu, Cheng-Lin
Adobe PDF(1999Kb) | Views/Downloads: 93/4 | Submitted: 2024/02/22
Online stroke classification  Handwritten document analysis  Diagram recognition  Sketch semantic segmentation  Position encoding in transformer  
NIR-II fluorescence-guided liver cancer surgery by a small molecular HDAC6 targeting probe (Journal article)
EBIOMEDICINE, 2023, Volume: 98, Pages: 18
Authors: Wang, Bo; Tang, Chu; Lin, En; Jia, Xiaohua; Xie, Ganyuan; Li, Peiping; Li, Decheng; Yang, Qiyue; Guo, Xiaoyong; Cao, Caiguang; Shi, Xiaojing; Zou, Baojia; Cai, Chaonong; Tian, Jie; Hu, Zhenhua; Li, Jian
Adobe PDF(4013Kb) | Views/Downloads: 150/4 | Submitted: 2024/02/22
Hepatocellular carcinoma  Second near-infrared window  Molecular imaging  HDAC6  Fluorescence guided surgery  
Delivery of pollen to forsythia flower pistils autonomously and precisely using a robot arm (Journal article)
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, Volume: 214, Pages: 13
Authors: Yang, Minghao; Lyu, Hongchang; Zhao, Yongjia; Sun, Yangchang; Pan, Hang; Sun, Qi; Chen, Jinlong; Qiang, Baohua; Yang, Hongbo
Adobe PDF(10694Kb) | Views/Downloads: 155/3 | Submitted: 2023/12/21
Pollination robot  Flower detection  Pistil identification  Convolutional neural network (CNN)  
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 7, Pages: 8419-8432
Authors: Lian, Zheng; Chen, Lan; Sun, Licai; Liu, Bin; Tao, Jianhua
Adobe PDF(3959Kb) | Views/Downloads: 173/6 | Submitted: 2023/11/17
Oral communication  Correlation  Data models  Task analysis  Feature extraction  Tensors  Benchmark testing  Conversational data  graph complete network (GCNet)  incomplete multimodal learning  speaker-sensitive modeling  temporal-sensitive modeling  
Dynamic High-Resolution Network for Semantic Segmentation in Remote-Sensing Images (Journal article)
REMOTE SENSING, 2023, Volume: 15, Issue: 9, Pages: 28
Authors: Guo, Shichen; Yang, Qi; Xiang, Shiming; Wang, Pengfei; Wang, Xuezhi
Views/Downloads: 55/0 | Submitted: 2023/11/17
semantic segmentation  remote-sensing image  neural architecture search  sparse regularization  HRNet  
Two-stage deep spectrum fusion for noise-robust end-to-end speech recognition (Journal article)
APPLIED ACOUSTICS, 2023, Volume: 212, Pages: 10
Authors: Fan, Cunhang; Ding, Mingming; Yi, Jiangyan; Li, Jinpeng; Lv, Zhao
Views/Downloads: 51/0 | Submitted: 2023/11/16
Robust end-to-end ASR  Speech enhancement  Masking and mapping  Speech distortion  Deep spectrum fusion