Authors listed:
- Antonio Cedillo-Hernandez
(Tecnologico de Monterrey, Escuela de Ingenieria y Ciencias, Av. Eugenio Garza Sada 2501, Colonia Tecnologico, Monterrey, Nuevo Leon CP 64700, Mexico)
- Lydia Velazquez-Garcia
(Instituto Politecnico Nacional, Centro de Investigaciones Economicas, Administrativas y Sociales, Lauro Aguirre 120, Colonia Agricultura, Ciudad de Mexico CP 11360, Mexico)
- Francisco Javier Garcia-Ugalde
(Facultad de Ingenieria, Universidad Nacional Autonoma de Mexico, Av. Universidad 3000, Ciudad Universitaria, Coyoacan, Ciudad de Mexico CP 04510, Mexico)
- Manuel Cedillo-Hernandez
(Instituto Politecnico Nacional, Escuela Superior de Ingenieria Mecanica y Electrica, Unidad Culhuacan, Av. Santa Ana 1000, Colonia Culhuacan CTM V, Coyoacan, Ciudad de Mexico CP 04440, Mexico)
Abstract
This paper introduces a deep learning-based framework for video watermarking that achieves robust, imperceptible, and fast embedding under a wide range of visual and temporal conditions. The proposed method is organized into seven modules that collaboratively perform frame encoding, semantic region analysis, block selection, watermark transformation, and spatiotemporal injection, followed by decoding and multi-objective optimization. A key component of the framework is its ability to learn a visual importance map, which guides a saliency-based block selection strategy. This allows the model to embed the watermark in perceptually redundant regions while minimizing distortion. To enhance resilience, the watermark is distributed across multiple frames, leveraging temporal redundancy to improve recovery under frame loss, insertion, and reordering. Experimental evaluations conducted on a large-scale video dataset demonstrate that the proposed method achieves high fidelity while preserving low decoding error rates under compression, noise, and temporal distortions. The proposed method processes 38 video frames per second on a standard GPU. Additional ablation studies confirm the contribution of each module to the system’s robustness. This framework offers a promising solution for watermarking in streaming, surveillance, and content verification applications.
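The two ideas the abstract highlights, importance-guided block selection and temporal redundancy, can be illustrated with a minimal sketch. This is not the authors' implementation: the learned importance map is replaced by a hand-made array, the neural embedding is omitted, and redundancy is modelled as plain repetition of the payload with a majority vote. The function names `select_blocks` and `majority_decode`, the block size, and the value of `k` are all hypothetical choices made for illustration.

```python
import numpy as np

def select_blocks(importance, block=8, k=4):
    """Return (row, col) indices of the k blocks with the lowest mean importance.

    `importance` stands in for a learned visual importance map (assumption);
    low values mark regions treated here as perceptually redundant.
    """
    h, w = importance.shape
    rows, cols = h // block, w // block
    # Crop any ragged border, then average the map inside each block.
    cropped = importance[:rows * block, :cols * block]
    means = cropped.reshape(rows, block, cols, block).mean(axis=(1, 3))
    # Stable sort so ties resolve in raster order.
    order = np.argsort(means, axis=None, kind="stable")[:k]
    return [(int(i // cols), int(i % cols)) for i in order]

def majority_decode(copies):
    """Recover a bit string from per-frame copies by per-bit majority vote.

    Each surviving frame is assumed to carry the full payload, so dropped or
    reordered frames only change how many votes are available.
    """
    stacked = np.stack(copies)
    return (stacked.sum(axis=0) * 2 > len(copies)).astype(np.uint8)

# Toy importance map: two of the four 8x8 blocks are marked low-importance.
importance = np.ones((16, 16))
importance[0:8, 8:16] = 0.1
importance[8:16, 0:8] = 0.2
chosen = select_blocks(importance, block=8, k=2)

# Three surviving frames, one of them fully corrupted.
bits = np.array([1, 0, 1, 1], dtype=np.uint8)
recovered = majority_decode([bits, bits, 1 - bits])
```

In this toy setting the two low-importance blocks are chosen, and the payload survives one corrupted copy out of three. A real system would embed only part of the payload per frame and would need synchronization information to handle frame insertion and reordering.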
Suggested Citation
Antonio Cedillo-Hernandez & Lydia Velazquez-Garcia & Francisco Javier Garcia-Ugalde & Manuel Cedillo-Hernandez, 2026.
"Deep Learning-Based Video Watermarking: A Robust Framework for Spatial–Temporal Embedding and Retrieval,"
Future Internet, MDPI, vol. 18(2), pages 1-28, February.
Handle:
RePEc:gam:jftint:v:18:y:2026:i:2:p:104-:d:1866162