Author
Listed:
- Azib Farooq,Nirban Bhowmick,Muhammad Zahid*,Muhammad Adeel Asghar and Yasar Amin
(Department of Computer Science and Engineering, Miami University, Oxford, OH 45056, USA.Department of Electrical & Computer Engineering, University of Central Florida, 32816, Orlando, Florida.Department of Telecommunication Engineering, University of Engineering and Technology, Taxila 47050, Pakistan.Department of Electrical and Computer Engineering, Riphah International University, I-14 Campus, Islamabad)
Abstract
The effective deployment of generative AI models in real-time applications has been impeded by the computational and memory requirements of Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs), and transformer-based models. We investigate the deployment of generative models on low-resource hardware (edge devices, mobile devices) using optimization techniques such as pruning, quantization, and knowledge distillation. In this study, we defined a detailed experimental framework to measure the performance of the studied methods against standard benchmarks, including CIFAR-10, CelebA, and OpenWebText, across heterogeneous hardware platforms ranging from the Raspberry Pi 4 and Jetson Nano to the Google Pixel 6. The results demonstrate that applying pruning techniques reduces model parameters by approximately 50 percent without statistically significant degradation in output quality. In contrast, quantization significantly decreases both inference latency and power consumption by 70.3 ± 3.2% and 61.7 ± 4.1%, respectively. Additionally, knowledge distillation methods compress transformer architectures while maintaining acceptable perplexity values. Collectively, these optimizations reduce inference time by up to 70 percent and energy consumption by more than 60 percent, supporting the feasibility of deploying generative artificial intelligence on devices with constrained processing and energy resources. Practically, these findings have implications for the successful deployment of useful, privacy-preserving, and portable AI across a wide range of application domains such as health, communications, and education.
Suggested Citation
Azib Farooq,Nirban Bhowmick,Muhammad Zahid*,Muhammad Adeel Asghar and Yasar Amin, 2026.
"Low-Resource Generative AI: Model Optimization for Edge and Mobile Devices,"
International Journal of Innovations in Science & Technology, 50sea, vol. 8(2), pages 760-772, May.
Handle:
RePEc:abq:ijist1:v:8:y:2026:i:2:p:760-772
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:abq:ijist1:v:8:y:2026:i:2:p:760-772. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Iqra Nazeer (email available below). General contact details of provider: .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.