IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v18y2026i2p77-d1854285.html

Aerial Image Analysis: When LLMs Assist (And When Not)

Author

Listed:
  • Salvatore Calcagno

    (Dipartimento di Matematica e Informatica, University of Catania, 95125 Catania, Italy)

  • Erika Scaletta

    (Dipartimento di Matematica e Informatica, University of Catania, 95125 Catania, Italy)

  • Emiliano Tramontana

    (Dipartimento di Matematica e Informatica, University of Catania, 95125 Catania, Italy)

  • Gabriella Verga

    (Dipartimento di Matematica e Informatica, University of Catania, 95125 Catania, Italy)

Abstract

Large language models (LLMs) have shown remarkable results when tasked with the analysis and production of texts or images and for captioning images. Aerial images differ from other images since they exhibit many natural objects that have a highly variable color range and no clear contours. This paper reports to what extent an LLM, i.e., Llama-4, can be tasked with the identification and captioning in aerial images of natural objects, such as tree categories, uncultivated land, and some man-made objects, such as roads. This valuable automation is needed to scan large areas and detect the parts for which a sudden maintenance or an emergency intervention is due. Tests on the chosen LLM were performed against a custom image dataset built to overcome the limited availability of such a domain-specific aerial image set. To evaluate the identification and captioning results, the accuracy, precision and recall metrics were computed. The results given by a cutting-edge variant of Llama-4, namely Maverick, reveal its strengths and weaknesses in this context. Although it is remarkable that an out-of-the-box tool can give assistance in such a complex observation and detection task, substantial progress is needed for such a model to improve accuracy and constitute a reliable support, as accuracy is at most 58.6% and recall is at most 56.1%.

Suggested Citation

  • Salvatore Calcagno & Erika Scaletta & Emiliano Tramontana & Gabriella Verga, 2026. "Aerial Image Analysis: When LLMs Assist (And When Not)," Future Internet, MDPI, vol. 18(2), pages 1-19, February.
  • Handle: RePEc:gam:jftint:v:18:y:2026:i:2:p:77-:d:1854285
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/18/2/77/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/18/2/77/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:18:y:2026:i:2:p:77-:d:1854285. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.