多域学习及其在检索、聚类和分类中的应用研究 | |
梁坚 | |
2018-12-05 | |
页数 | 170 |
学位类型 | 博士 |
中文摘要 | 随着信息化社会的快速发展以及移动互联网等平台的快速普及,全球范围内的数据呈现爆炸式的增长,我们迎来了大数据时代。 1. 有监督跨模态检索试图学习一个优异的异质度量去衡量不同模态特征表达之间的相似度,使得语义标签相似性比较大(小)的异质模态特征间的相似度比较大 2. 不同于有监督算法,无监督跨模态算法掌握的监督信息仅仅来自于模态间的匹配关系,这就增大了基于语义的跨模态检索的难度。 3. 多视角学习方法试图在特征层或得分层融合不同视角的观察信息,学习一个统一的表达或分类器去执行聚类任务。为了清除多视角数据中语义无关和视角间冗余的信息,本文提出了一种基于双层判别性降维的多视角聚类方法。该方法首先利用视角之间的相关性去除一些视角间差异过大且与聚类无关的特征。其次利用费歇尔判别准则,通过第二次降维进一步消除前一层降维后的多视角数据中存在的冗余信息,并学习新的聚类指示变量,将之返回给第一层重新进行降维学习。为了验证降维后的统一表达是否有效,本文进一步分析了该方法在分类学习下的效果。实验结果证明该方法不仅在多视角聚类上取得了不错的效果,同时还可以获得良好的分类表达能力。 4. 域自适应学习试图减少目标域数据标注的高昂成本,转而利用源域的监督信息和无标注的目标域数据,学习到有效的目标域分类器。本文首先设计了一种域无关的聚类目标作为学习域不变投影的准则,事实上,这一准则可以看做是域内聚类和衡量域间差异性的最大均值距离(Maximum Mean Discrepancy, MMD)的整合。随后针对目标域伪标签的不确定性设计了一种更为准确的域间差异性衡量标准,并提出了一种渐进式的自适应学习方法,在学习的过程中逐渐加入伪标签确定性比较大的目标域样本,学习到最终的投影函数。最后,为了解决前面方法引入了高维的MMD矩阵所带来的时间成本,本文基于类均值近邻分类器还提出了一种快速简单的域自适应学习基准。多个标准跨域数据库上的结果证实了这些方法的有效性。 |
英文摘要 | With the rapid development of the information society and the mobile Internet, the data around the world is exploding, we usher in the era of big data. 1. Supervised cross-modal retrieval attempts to learn a metric to measure the similarity between different modalities, making the similarity between heterogeneous modalities of large (small) semantic similarity large (small). 2. Unlike supervised methods, supervisory information in unsupervised ones only comes from the matching relationship between modalities, which increases the difficulty of semantic-based cross-modal retrieval. To address the missing semantic tags, a group-invariant structure is proposed in this paper. It considers an additional group variable based on canonical correlation analysis (CCA), so that the two modalities and the latent variable are consistent at the same time. On the other hand, due to the differences in the degree of difficulty in learning hidden variables, we propose a learning strategy from `easy' to `difficult', and learn the potential semantic tags and regression functions. 3. Multi-view learning attempts to integrate the information from different views at the feature or score levels, learning a unified feature or classifier to perform a clustering task. To eliminate the semantic-independent features and view redundancy in multi-view data, a coupled discriminative dimensionality reduction is proposed in this paper. It first uses the correlation between different views to remove the noisy features that are considered not related to clustering characteristics. Secondly, the second dimensionality reduction further eliminates the redundant information after the previous layer dimensionality reduction via the fisher discriminant criterion, and learns the new cluster indicator variable, and returns it to the first layer to guide the dimension learning. To verify the validity of the learned unified feature, the classification accuracies of this method are further analyzed. 4. Domain adaptation attempts to reduce the high cost of labeling the target domain data, instead using the source domain's supervisory information to learn an effective target domain classifier. In this paper, we first design a domain-independent clustering objective that can be regarded as the integration of the maximum mean discrepancy (MMD) and intra-domain clusterings. Then, to address the uncertainty of target pseudo-labeling, a progressive learning method is put forward where the definite target samples are firstly added. Finally, in order to solve the time cost brought by the high-dimensional MMD matrices, |
关键词 | 多域学习 跨模态检索 子空间学习 多视角聚类 域自适应学习 |
语种 | 中文 |
文献类型 | 学位论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/23802 |
专题 | 毕业生_博士学位论文 |
通讯作者 | 梁坚 |
推荐引用方式 GB/T 7714 | 梁坚. 多域学习及其在检索、聚类和分类中的应用研究[D]. 中国科学院自动化研究所. 中国科学院自动化研究所,2018. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
Thesis.pdf(10072KB) | 学位论文 | 限制开放 | CC BY-NC-SA |
个性服务 |
推荐该条目 |
保存到收藏夹 |
查看访问统计 |
导出为Endnote文件 |
谷歌学术 |
谷歌学术中相似的文章 |
[梁坚]的文章 |
百度学术 |
百度学术中相似的文章 |
[梁坚]的文章 |
必应学术 |
必应学术中相似的文章 |
[梁坚]的文章 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论