CASIA OpenIR

浏览/检索结果: 共49条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Ensemble Quadratic Assignment Network for Graph Matching 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 页码: 23
作者:  Tan, Haoru;  Wang, Chuang;  Wu, Sitong;  Zhang, Xu-Yao;  Yin, Fei;  Liu, Cheng-Lin
收藏  |  浏览/下载:7/0  |  提交时间:2024/07/03
Graph matching  Combinatorial optimization  Graph neural network  Ensemble learning  
Emotion selectable end-to-end text-based speech editing 期刊论文
ARTIFICIAL INTELLIGENCE, 2024, 卷号: 329, 页码: 16
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zhang, Chu Yuan
收藏  |  浏览/下载:7/0  |  提交时间:2024/07/03
Emotion selectable  Text-based speech editing  Emotion decoupling  Mask prediction  Few-shot learning  Text-to-speech  
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition 期刊论文
Information Fusion, 2024, 卷号: 108, 页码: 1-20
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2281Kb)  |  收藏  |  浏览/下载:44/10  |  提交时间:2024/05/31
Audio-Visual Emotion Recognition  Self-supervised learning  Masked autoencoder  Contrastive learning  
Context feature fusion and enhanced non-maximum suppression for pedestrian detection in crowded scenes 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 页码: 21
作者:  Shao, Yu;  Hu, Jianhua;  Hu, Lihua;  Zhang, Jifu;  Wang, Xinbo
收藏  |  浏览/下载:18/0  |  提交时间:2024/05/30
Densely populated  Pedestrian detection  Occlusion  Contextual information  
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:  Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan;  Wen, Zhengqi;  Pei, Guanxiong;  Li, Taihao;  Tao, Jianhua
收藏  |  浏览/下载:103/0  |  提交时间:2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion  
Multi-task safe reinforcement learning for navigating intersections in dense traffic 期刊论文
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 卷号: 360, 期号: 17, 页码: 13737-13760
作者:  Liu, Yuqi;  Gao, Yinfeng;  Zhang, Qichao;  Ding, Dawei;  Zhao, Dongbin
Adobe PDF(3095Kb)  |  收藏  |  浏览/下载:74/13  |  提交时间:2024/02/22
Transformer-based stroke relation encoding for online handwriting and sketches 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 148, 页码: 13
作者:  Liu, Jing-Yu;  Zhang, Yan-Ming;  Yin, Fei;  Liu, Cheng-Lin
Adobe PDF(1999Kb)  |  收藏  |  浏览/下载:93/4  |  提交时间:2024/02/22
Online stroke classification  Handwritten document analysis  Diagram recognition  Sketch semantic segmentation  Position encoding in transformer  
NIR-II fluorescence-guided liver cancer surgery by a small molecular HDAC6 targeting probe 期刊论文
EBIOMEDICINE, 2023, 卷号: 98, 页码: 18
作者:  Wang, Bo;  Tang, Chu;  Lin, En;  Jia, Xiaohua;  Xie, Ganyuan;  Li, Peiping;  Li, Decheng;  Yang, Qiyue;  Guo, Xiaoyong;  Cao, Caiguang;  Shi, Xiaojing;  Zou, Baojia;  Cai, Chaonong;  Tian, Jie;  Hu, Zhenhua;  Li, Jian
Adobe PDF(4013Kb)  |  收藏  |  浏览/下载:150/4  |  提交时间:2024/02/22
Hepatocellular carcinoma  Second near-infrared window  Molecular imaging  HDAC6  Fluorescence guided surgery  
Delivery of pollen to forsythia flower pistils autonomously and precisely using a robot arm 期刊论文
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 卷号: 214, 页码: 13
作者:  Yang, Minghao;  Lyu, Hongchang;  Zhao, Yongjia;  Sun, Yangchang;  Pan, Hang;  Sun, Qi;  Chen, Jinlong;  Qiang, Baohua;  Yang, Hongbo
Adobe PDF(10694Kb)  |  收藏  |  浏览/下载:156/3  |  提交时间:2023/12/21
Pollination robot  Flower detection  Pistil identification  Convolutional neural network (CNN)  
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
收藏  |  浏览/下载:74/0  |  提交时间:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings