Mutual attention inception network developed for remote sensing visual question answering

Mutual attention inception network developed for remote sensing visual question answering
Overview of the proposed method. Credit: XIOPM

A research team led by Prof. Lu Xiaoqiang from the Xi'an Institute of Optics and Precision Mechanics (XIOPM) of the Chinese Academy of Sciences proposed a novel mutual attention inception network (MAIN) and a dataset named RSIVQA for remote sensing visual question answering. The results were published in IEEE Transactions on Geoscience and Remote Sensing

Remote sensing visual question answering (VQA) mainly aims at making semantic understanding of images (RSIs) objective and interactive. Specifically, given an RSI, an intelligent agent will answer a question about the remote sensing scene.

Most of the existing methods ignore the spatial information of RSIs and word-level semantic information of questions, which restricts their applications in many complex scenes.

Accordingly, in this study, the proposed MAIN was made up of two parts, including the representation and the fusion module. The representation module was devised to obtain the features of image and question which can provide better representations.

As for the fusion module, it enhanced the discriminative ability of representations which can acquire by reinforcing the representations of image and question.

According to the experiments results, the proposed method can capture the alignments between images and questions under different evaluation metrics. This study provides a new perspective for the remote sensing visual question answering.

More information: Xiangtao Zheng et al, Mutual Attention Inception Network for Remote Sensing Visual Question Answering, IEEE Transactions on Geoscience and Remote Sensing (2021). DOI: 10.1109/TGRS.2021.3079918

Citation: Mutual attention inception network developed for remote sensing visual question answering (2021, August 30) retrieved 28 March 2024 from https://techxplore.com/news/2021-08-mutual-attention-inception-network-remote.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Cross-resolution difference learning for change detection between multitemporal images

73 shares

Feedback to editors