CASIA OpenIR

Browse/Search Results: 14 items in total, showing items 1-10

VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis [Journal article]
KNOWLEDGE-BASED SYSTEMS, 2024, Volume: 283, Pages: 9
Authors: Yi, Guofeng; Fan, Cunhang; Zhu, Kang; Lv, Zhao; Liang, Shan; Wen, Zhengqi; Pei, Guanxiong; Li, Taihao; Tao, Jianhua
Views/Downloads: 80/0  |  Submitted: 2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion  
RC-Net: Row and Column Network with Text Feature for Parsing Floor Plan Images [Journal article]
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, Volume: 38, Issue: 3, Pages: 526-539
Authors: Wang, Teng; Meng, Wei-Liang; Lu, Zheng-Da; Guo, Jian-Wei; Xiao, Jun; Zhang, Xiao-Peng
Adobe PDF (2370Kb)  |  Views/Downloads: 126/0  |  Submitted: 2023/12/21
floor plan understanding  text feature  Row and Column (RC) constraint module  Row and Column network (RC-Net)  
Color-Unrelated Head-Shoulder Networks for Fine-Grained Person Re-identification [Journal article]
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, Volume: 19, Issue: 6, Pages: 21
Authors: Xu, Boqiang; Liang, Jian; He, Lingxiao; Wu, Jinlin; Fan, Chao; Sun, Zhenan
Views/Downloads: 85/0  |  Submitted: 2023/11/17
Person re-identification  fine-grained matching  visual surveillance  
ProxyMix: Proxy-based Mixup training with label refinery for source-free domain adaptation [Journal article]
NEURAL NETWORKS, 2023, Volume: 167, Pages: 92-103
Authors: Ding, Yuhe; Sheng, Lijun; Liang, Jian; Zheng, Aihua; He, Ran
Views/Downloads: 62/0  |  Submitted: 2023/11/16
Source-free unsupervised domain adaptation  Pseudo labeling  
Data-driven floor plan understanding in rural residential buildings via deep recognition [Journal article]
INFORMATION SCIENCES, 2021, Volume: 567, Pages: 58-74
Authors: Lu, Zhengda; Wang, Teng; Guo, Jianwei; Meng, Weiliang; Xiao, Jun; Zhang, Wei; Zhang, Xiaopeng
Adobe PDF (3227Kb)  |  Views/Downloads: 351/60  |  Submitted: 2021/08/15
Floor plan understanding  Rural residence  Neural networks  
Hand Pose Understanding With Large-Scale Photo-Realistic Rendering Dataset [Journal article]
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, Volume: 30, Pages: 4275-4290
Authors: Deng, Xiaoming; Zhang, Yinda; Shi, Jian; Zhu, Yuying; Cheng, Dachuan; Zuo, Dexin; Cui, Zhaopeng; Tan, Ping; Chang, Liang; Wang, Hongan
Views/Downloads: 254/0  |  Submitted: 2021/06/07
Three-dimensional displays  Annotations  Pose estimation  Task analysis  Color  Image color analysis  Rendering (computer graphics)  Hand pose estimation  photo-realistic synthetic dataset  physical-based rendering  multi-task CNN  
Exploiting the directional coherence function for multichannel source extraction [Journal article]
SPEECH COMMUNICATION, 2021, Volume: 128, Pages: 1-14
Authors: Liang, Shan; Li, Guanjun; Nie, Shuai; Yang, ZhanLei; Liu, WenJu; Tao, Jianhua
Views/Downloads: 214/0  |  Submitted: 2021/05/06
Directional coherence function  Coherent-to-Diffuse Ratio  General sidelobe canceller  Desired Speech Presence Probability  
Deep Learning Based Speech Separation via NMF-style Reconstructions [Journal article]
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2018, Volume: 26, Issue: 11, Pages: 2043-2055
Authors: Shuai Nie; Shan Liang; Wenju Liu; Xueliang Zhang; Jianhua Tao
Adobe PDF (2922Kb)  |  Views/Downloads: 213/81  |  Submitted: 2020/10/22
Speech separation  deep neural network (DNN)  nonnegative matrix factorization (NMF)  spectro-temporal structures  
Deep Noise Tracking Network: A Hybrid Signal Processing/Deep Learning Approach to Speech Enhancement [Conference paper]
Hyderabad, India, 2018-9-2~2018-9-6
Authors: Nie S(聂帅); Shan Liang; Bin Liu; Yaping Zhang; Wenju Liu; Jianhua Tao
Adobe PDF (925Kb)  |  Views/Downloads: 87/30  |  Submitted: 2020/10/22
OS-LFFD: a light and fast face detector with Ommateum structure [Journal article]
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, Pages: 20
Authors: Xu, Dezhong; Wu, Lifang; He, Yonghao; Zhao, Qing; Jian, Meng; Yan, Junchi; Zhao, Liang
Views/Downloads: 183/0  |  Submitted: 2020/08/21
Edge devices  Face detector  Effective receptive field  Ommateum block