Author
Listed:
- Amani Homoud
- Saptarshi Das
Abstract
This paper provides an exploratory analysis of underwater video analysis techniques to enhance image quality and facilitate accurate classification of different marine species. Our methodology progresses through several steps, beginning with the quality of underwater images that might be reduced by variables such as decreased light intensity, color modification, and limited visibility. These attributes pose significant challenges to develop accurate object detection methods. This paper outlines the processing pipeline employed to enhance the quality of images from underwater videos and facilitate precise object detection. First, we use the Gray World (GW) algorithm for image enhancement, effectively mitigating the challenges of aquatic environment, such as color distortion and low contrast. Subsequently, we compare the traditional Histogram Equalization (HE) and the Contrast Limited Adaptive Histogram Equalization (CLAHE) algorithms to assess their efficacy in enhancing underwater image quality. Next, Canny Edge Detection is utilized to identify the prominent features in the enhanced images, aiding in subsequent classification tasks. Next, three state-of-the-art deep learning models, Visual Geometry Group 16-layer network (VGG16), 50-layer Residual Network (ResNet50), and 121-layer Densely Connected Convolutional Network (DenseNet121), are leveraged through transfer learning to classify underwater species, including fish, coral reefs, and sea turtles. Finally, by enhancing the visual quality of underwater images, our research contributes to better understanding of the underwater ecosystem and supports conservation efforts. Enhanced Super-Resolution GAN (ESRGAN) is a superior Generative Adversarial Network (GAN) technique to improve the quality of noisy images. This paper contributes to advancing the field of underwater image and video analysis, offering valuable insights for applications in marine biology, environmental monitoring, underwater robotics, and autonomous navigation.
Suggested Citation
Amani Homoud & Saptarshi Das, 2026.
"GAN-based underwater image enhancement and scene classification using transfer learning,"
PLOS ONE, Public Library of Science, vol. 21(3), pages 1-27, March.
Handle:
RePEc:plo:pone00:0345593
DOI: 10.1371/journal.pone.0345593
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0345593. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.