# Multi-modal remote sensory learning for multi-objects over autonomous devices

**Authors:** Aysha Naseer, Naif Almudawi, Hanan Aljuaid, Abdulwahab Alazeb, Yahay AlQahtani, Asaad Algarni, Ahmad Jalal, Hui Liu

PMC · DOI: 10.3389/fbioe.2025.1430222 · Frontiers in Bioengineering and Biotechnology · 2025-05-20

## TL;DR

This paper introduces a new method for identifying objects in remote sensing images using a combination of MRF and Alex Net, achieving high accuracy on benchmark datasets.

## Contribution

A novel multi-modal approach combining MRF and Alex Net for improved object recognition in remote sensing imagery.

## Key findings

- The proposed method achieves 97.90% recognition accuracy on the AID dataset.
- It reaches 98.90% recognition accuracy on the UC Merced Land Use dataset.
- The method outperforms existing approaches in classification accuracy and generalization.

## Abstract

There has been an increasing focus on object segmentation within remote sensing images in recent years due to advancements in remote sensing technology and the growing significance of these images in both military and civilian realms. In these situations, it is critical to accurately and quickly identify a wide variety of objects. In many computer vision applications, scene recognition in aerial-based remote sensing imagery presents a common issue.

However, several challenging elements make this work especially difficult: (i) Different objects have different pixel densities; (ii) objects are not evenly distributed in remote sensing images; (iii) objects can appear differently depending on viewing angle and lighting conditions; and (iv) there are fluctuations in the number of objects, even the same type, in remote sensing images. Using a synergistic combination of Markov Random Field (MRF) for accurate labeling and Alex Net model for robust scene recognition, this work presents a novel method for the identification of remote sensing objects. During the labeling step, the use of MRF guarantees precise spatial contextual modeling, which improves comprehension of intricate interactions between nearby aerial objects. By simultaneously using deep learning model, the incorporation of Alex Net in the following classification phase enhances the model’s capacity to identify complex patterns in aerial images and adapt to a variety of object attributes.

Experiments show that our method performs better than others in terms of classification accuracy and generalization, indicating its efficacy analysis on benchmark datasets such as UC Merced Land Use and AID.

Several performance measures were calculated to assess the efficacy of the suggested technique, including accuracy, precision, recall, error, and F1-Score. The assessment findings show a remarkable recognition rate of around 97.90% and 98.90%, on the AID and the UC Merced Land datasets, respectively.

## Full-text entities

- **Genes:** AICDA (activation induced cytidine deaminase) [NCBI Gene 57379] {aka AID, ARP2, CDA2, HEL-S-284, HIGM2}
- **Diseases:** HL (MESH:C538324)
- **Chemicals:** water (MESH:D014867), Alex (-)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12130010/full.md

## Figures

17 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12130010/full.md

## References

69 references — full list in the complete paper: https://tomesphere.com/paper/PMC12130010/full.md

---
Source: https://tomesphere.com/paper/PMC12130010