首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种通用的跨模态遥感信息关联学习方法
引用本文:吕亚飞,熊伟,张筱晗.一种通用的跨模态遥感信息关联学习方法[J].武汉大学学报(信息科学版),2022,47(11):1887-1895.
作者姓名:吕亚飞  熊伟  张筱晗
作者单位:1.海军航空大学信息融合研究所, 山东 烟台, 264000
基金项目:国家自然科学基金61790550国家自然科学基金61790554国家自然科学基金91538201
摘    要:针对“异质鸿沟”问题导致的不同模态遥感信息间相似性难以度量的问题,构建并公开了一个包含4种模态信息的跨模态遥感数据集,并基于不同模态信息间潜在的语义一致性,提出了一种通用的跨模态遥感信息关联学习方法。利用深度神经网络的表征能力,分别对图像类信息和序列类信息设计各模态信息的特征学习网络,实现对不同模态高层语义信息的准确表示;设计了一个新的关联学习损失函数对模态内的语义一致性和模态间的互补性进行限制,利用知识蒸馏的思想,以先融合后迁移各模态间信息的方式增强模态间的语义相关性。在构建的数据集上进行实验,结果表明,所提方法平均精度均值达到70%,超过基准方法。

关 键 词:跨模态检索    关联分析    深度学习    遥感图像    特征表示
收稿时间:2020-06-16

A General Cross-Modal Correlation Learning Method for Remote Sensing
Institution:1.Information Fusion Institute, Naval Aviation University, Yantai 264000, China2.Naval Research Institute, Beijing 100086, China3.Beijing Institute of Remote Sensing Information, Beijing 100086, China
Abstract:  Objectives  Aiming at the problem of inconsistent data distribution between cross-modal remote sensing information caused by "heterogeneity gap", a new cross-modal remote sensing dataset is constructed and released for public.  Methods  To solve the problem of "heterogeneity gap", a general cross-modal correlation learning method (CCLM) is proposed for remote sensing. Based on the latent semantic consistency between different modality information, CCLM includes two stages: The learning of feature representation and the construction of common feature space. Firstly, deep neural networks are adopted to extract the feature representation of image and sequence information. To construct a common feature space, a new loss function is designed for correlation learning, by exploring the semantic consistency within intra-modality and complementary information contained in inter-modality. Secondly, knowledge distillation is used to enhance the semantic relevance to achieve the semantic consistency of common space.  Results  The experiments are performed on our dataset. The experimental results show that the mean average precision (mAP) of our CCLM on cross-modal retrieval tasks exceeds 70%.  Conclusions  The results outperform other baseline methods, and verify effectiveness of the proposed dataset and method.
Keywords:
点击此处可从《武汉大学学报(信息科学版)》浏览原始摘要信息
点击此处可从《武汉大学学报(信息科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号