IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v10y2025i6p84-d1668909.html
   My bibliography  Save this article

Macao-ebird : A Curated Dataset for Artificial-Intelligence-Powered Bird Surveillance and Conservation in Macao

Author

Listed:
  • Xiaoyuan Huang

    (Faculty of Applied Sciences, Macao Polytechnic University, Macao SAR 999078, China
    School of Electrical Engineering and Intelligentization, Dongguan University of Technology, Dongguan 523000, China)

  • Silvia Mirri

    (Department of Computer Science and Engineering, University of Bologna, 40128 Bologna, Italy)

  • Su-Kit Tang

    (Faculty of Applied Sciences, Macao Polytechnic University, Macao SAR 999078, China)

Abstract

Artificial intelligence (AI) currently exhibits considerable potential within the realm of biodiversity conservation. However, high-quality regionally customized datasets remain scarce, particularly within urban environments. The existing large-scale bird image datasets often lack a dedicated focus on endangered species endemic to specific geographic regions, as well as a nuanced consideration of the complex interplay between urban and natural environmental contexts. Therefore, this paper introduces Macao-ebird , a novel dataset designed to advance AI-driven recognition and conservation of endangered bird species in Macao. The dataset comprises two subsets: (1) Macao-ebird-cls , a classification dataset with 7341 images covering 24 bird species, emphasizing endangered and vulnerable species native to Macao; and (2) Macao-ebird-det , an object detection dataset generated through AI-agent-assisted labeling using grounding DETR with improved denoising anchor boxes (DINO), significantly reducing manual annotation effort while maintaining high-quality bounding-box annotations. We validate the dataset’s utility through baseline experiments with the You Only Look Once (YOLO) v8–v12 series, achieving a mean average precision (mAP50) of up to 0.984. Macao-ebird addresses critical gaps in the existing datasets by focusing on region-specific endangered species and complex urban–natural environments, providing a benchmark for AI applications in avian conservation.

Suggested Citation

  • Xiaoyuan Huang & Silvia Mirri & Su-Kit Tang, 2025. "Macao-ebird : A Curated Dataset for Artificial-Intelligence-Powered Bird Surveillance and Conservation in Macao," Data, MDPI, vol. 10(6), pages 1-15, May.
  • Handle: RePEc:gam:jdataj:v:10:y:2025:i:6:p:84-:d:1668909
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/10/6/84/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/10/6/84/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:10:y:2025:i:6:p:84-:d:1668909. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.