CASIA OpenIR

浏览/检索结果: 共668条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
BiMNet: A Multimodal Data Fusion Network for continuous circular capsulorhexis Action Segmentation 期刊论文
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 卷号: 238, 页码: 10
作者:  Bian, Gui-Bin;  Zheng, Jia-Ying;  Li, Zhen;  Wang, Jie;  Fu, Pan;  Xin, Chen;  da Silva, Daniel Santos;  Wu, Wan-Qing;  De Albuquerque, Victor Hugo C.
收藏  |  浏览/下载:73/0  |  提交时间:2023/12/21
Cataract surgery  Continuous circumferential capsulotomy  Continuous action segmentation  Multimodal data fusion  Imbalanced data  
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 页码: 1-15
作者:  Sun, Jianxin;  Deng, Qiyao;  Li, Qi;  Sun, Muyi;  Liu, Yunfan;  Sun, Zhenan
Adobe PDF(16839Kb)  |  收藏  |  浏览/下载:39/7  |  提交时间:2024/02/23
Involving Distinguished Temporal Graph Convolutional Networks for Skeleton-Based Temporal Action Segmentation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 647-660
作者:  Li, Yun-Heng;  Liu, Kai-Yuan;  Liu, Sheng-Lan;  Feng, Lin;  Qiao, Hong
收藏  |  浏览/下载:7/0  |  提交时间:2024/03/26
Feature extraction  Motion segmentation  Correlation  Convolution  Topology  Convolutional neural networks  Solid modeling  Skeleton-based temporal action segmentation  enhanced spatial graph structure  segmented encoding  
Attentional Composition Networks for Long-Tailed Human Action Recognition 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 1, 页码: 18
作者:  Wang, Haoran;  Wang, Yajie;  Yu, Baosheng;  Zhan, Yibing;  Yuan, Chunfeng;  Yang, Wankou
收藏  |  浏览/下载:73/0  |  提交时间:2023/11/15
Compositional learning  long tail  few-shot  zero-shot  action recognition  
Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 14
作者:  Qi, Xingqun;  Sun, Muyi;  Wang, Zijian;  Liu, Jiaming;  Li, Qi;  Zhao, Fang;  Zhang, Shanghang;  Shan, Caifeng
Adobe PDF(6718Kb)  |  收藏  |  浏览/下载:70/28  |  提交时间:2024/02/22
Face photo-sketch synthesis  generative adversarial network  graph representation learning  intraclass and interclass  iterative cycle training (ICT)  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:31/9  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
VQAPT: A New visual question answering model for personality traits in social media images 期刊论文
PATTERN RECOGNITION LETTERS, 2023, 卷号: 175, 页码: 66-73
作者:  Biswas, Kunal;  Shivakumara, Palaiahnakote;  Pal, Umapada;  Liu, Cheng-Lin;  Lu, Yue
收藏  |  浏览/下载:31/0  |  提交时间:2024/02/22
Personality trait images  Multimodal concept  Text recognition  Social media images  Natural language processing  Visual question answering  
SOTVerse: A User-Defined Task Space of Single Object Tracking 期刊论文
International Journal of Computer Vision, 2023, 页码: 1-59
作者:  Shiyu, Hu;  Xin, Zhao;  Kaiqi Huang
Adobe PDF(53048Kb)  |  收藏  |  浏览/下载:36/4  |  提交时间:2024/01/22
Single object tracking  Experimental environment  Evaluation system  Performance analysis  
EventMix: An efficient data augmentation strategy for event-based learning 期刊论文
INFORMATION SCIENCES, 2023, 卷号: 644, 页码: 11
作者:  Shen, Guobin;  Zhao, Dongcheng;  Zeng, Yi
收藏  |  浏览/下载:54/0  |  提交时间:2023/11/17
Event based data augmentation  Neuromorphic data  Spiking neural networks  Reasonable label assignment  Gaussian mixture model  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:82/11  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection