IDEAS home Printed from https://ideas.repec.org/a/spr/joinma/v36y2025i8d10.1007_s10845-024-02491-3.html

Unsupervised reward engineering for reinforcement learning controlled manufacturing

Author

Listed:
  • Thomas Hirtz

    (Tsinghua University)

  • He Tian

    (Tsinghua University)

  • Yi Yang

    (Tsinghua University)

  • Tian-Ling Ren

    (Tsinghua University)

Abstract

Reward engineering is a key challenge in reinforcement learning (RL) that can significantly affect the performance and applicability of RL algorithms. In the field of manufacturing, shaping the reward function for RL algorithms can be particularly difficult due to the complex and multi-objective nature of the manufacturing process. To address these challenges, we propose unsupervised reward engineering method based on a variational autoencoder (VAE) that uses the latent representation of the product for computing the environment’s reward. Our approach optimizes the underlying distribution of the fabricated product directly by leveraging the latent space distance or divergence between the manufactured and ideal products. This strategy circumvents issues commonly associated with conventional reward engineering, such as misaligned and hacked rewards. Our technique enables convenient multi-objective optimization and reward value bounding. Through a $$\beta $$ β -VAE architecture, we can adjust the weight of the Kullback–Leibler divergence term, prioritizing ideal characteristics or latent distribution based on the desired outcome. Applying our approach to semiconductor manufacturing, we demonstrate its benefits, including effective multi-objective optimization, stable reward, and meaningful data representations. Our method shows promise for optimizing complex manufacturing processes with RL and can be extended to various manufacturing-related fields. It can enhance product quality and offers opportunities for cross-facility manufacturing matching.

Suggested Citation

  • Thomas Hirtz & He Tian & Yi Yang & Tian-Ling Ren, 2025. "Unsupervised reward engineering for reinforcement learning controlled manufacturing," Journal of Intelligent Manufacturing, Springer, vol. 36(8), pages 5875-5888, December.
  • Handle: RePEc:spr:joinma:v:36:y:2025:i:8:d:10.1007_s10845-024-02491-3
    DOI: 10.1007/s10845-024-02491-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10845-024-02491-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10845-024-02491-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Julian Senoner & Torbjørn Netland & Stefan Feuerriegel, 2022. "Using Explainable Artificial Intelligence to Improve Process Quality: Evidence from Semiconductor Manufacturing," Management Science, INFORMS, vol. 68(8), pages 5704-5723, August.
    2. Ercan Oztemel & Samet Gursev, 2020. "Literature review of Industry 4.0 and related technologies," Journal of Intelligent Manufacturing, Springer, vol. 31(1), pages 127-182, January.
    3. Ahmad Barari & Marcos Sales Guerra Tsuzuki & Yuval Cohen & Marco Macchi, 2021. "Editorial: intelligent manufacturing systems towards industry 4.0 era," Journal of Intelligent Manufacturing, Springer, vol. 32(7), pages 1793-1796, October.
    4. Hasan Tercan & Tobias Meisen, 2022. "Machine learning and deep learning based predictive quality in manufacturing: a systematic review," Journal of Intelligent Manufacturing, Springer, vol. 33(7), pages 1879-1905, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bianca Maria Colosimo & Luca Pagani & Marco Grasso, 2024. "Modeling spatial point processes in video-imaging via Ripley’s K-function: an application to spatter analysis in additive manufacturing," Journal of Intelligent Manufacturing, Springer, vol. 35(1), pages 429-447, January.
    2. Sangkyoung Lee & Zhuoxiao Chen & Yadan Luo & Xuliang Li & Mingyuan Lu & Zi Helen Huang & Han Huang, 2025. "Enhanced prediction accuracy in high-speed grinding of brittle materials using advanced machine learning techniques," Journal of Intelligent Manufacturing, Springer, vol. 36(8), pages 5415-5459, December.
    3. Chenxi Yuan & Guoyan Li & Sagar Kamarthi & Xiaoning Jin & Mohsen Moghaddam, 2022. "Trends in intelligent manufacturing research: a keyword co-occurrence network based review," Journal of Intelligent Manufacturing, Springer, vol. 33(2), pages 425-439, February.
    4. Spyros Theodoropoulos & Dimitrios Dardanis & Georgios Makridis & Patrik Zajec & Jože M. Rožanec & Dimosthenis Kyriazis & Panayiotis Tsanakas, 2025. "Enhancing robustness to novel visual defects through StyleGAN latent space navigation: a manufacturing use case," Journal of Intelligent Manufacturing, Springer, vol. 36(5), pages 3527-3541, June.
    5. Dinko Herman BOIKANYO, 2024. "Utilization Of 4ir Technologies To Enhance Strategic Intelligence And Dynamic Capabilities For A Sustainable Competitive Advantage," Business Excellence and Management, Faculty of Management, Academy of Economic Studies, Bucharest, Romania, vol. 14(1), pages 5-24, March.
    6. Shugui Wang & Yunxian Cui & Yuxin Song & Chenggang Ding & Wanyu Ding & Junwei Yin, 2024. "A novel surface temperature sensor and random forest-based welding quality prediction model," Journal of Intelligent Manufacturing, Springer, vol. 35(7), pages 3291-3314, October.
    7. Cao, Cejun & He, Yufan & Liu, Yang & Huang, Hao & Zhang, Fanshun, 2025. "Blockchain technology adoption mechanism for semiconductor chip supply chains considering information disclosure under cost-sharing contract," International Journal of Production Economics, Elsevier, vol. 282(C).
    8. Govindan, Kannan & Kannan, Devika & Jørgensen, Thomas Ballegård & Nielsen, Tim Straarup, 2022. "Supply Chain 4.0 performance measurement: A systematic literature review, framework development, and empirical evidence," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 164(C).
    9. Tiago Afonso & Anabela C. Alves & Paula Carneiro, 2021. "Lean Thinking, Logistic and Ergonomics: Synergetic Triad to Prepare Shop Floor Work Systems to Face Pandemic Situations," International Journal of Global Business and Competitiveness, Springer, vol. 16(1), pages 62-76, December.
    10. Shuting Wang & Jie Meng & Yuanlong Xie & Liquan Jiang & Han Ding & Xinyu Shao, 2023. "Reference training system for intelligent manufacturing talent education: platform construction and curriculum development," Journal of Intelligent Manufacturing, Springer, vol. 34(3), pages 1125-1164, March.
    11. Vincenzo Varriale & Antonello Cammarano & Francesca Michelino & Mauro Caputo, 2025. "Critical analysis of the impact of artificial intelligence integration with cutting-edge technologies for production systems," Journal of Intelligent Manufacturing, Springer, vol. 36(1), pages 61-93, January.
    12. Borgonovo, Emanuele & Plischke, Elmar & Rabitti, Giovanni, 2024. "The many Shapley values for explainable artificial intelligence: A sensitivity analysis perspective," European Journal of Operational Research, Elsevier, vol. 318(3), pages 911-926.
    13. Xiaoyu Zhan & Delia Mioara Popescu & Valentin Radu, 2020. "Challenges for Romanian Entrepreneurs in Managing Remote Workers," Book chapters-LUMEN Proceedings, in: Marcin Waldemar STANIEWSKI & Valentina VASILE & Adriana Grigorescu (ed.), International Conference Innovative Business Management & Global Entrepreneurship (IBMAGE 2020), edition 1, volume 14, chapter 49, pages 670-687, Editura Lumen.
    14. Christoph March & Ina Schieferdecker, 2021. "Technological Sovereignty as Ability, Not Autarky," CESifo Working Paper Series 9139, CESifo.
    15. Song, Yanwu & Niu, Niu & Song, Xinyi & Zhang, Bin, 2024. "Decoding the influence of servitization on green transformation in manufacturing firms: The moderating effect of artificial intelligence," Energy Economics, Elsevier, vol. 139(C).
    16. Rui Wang & Xiangyu Guo & Shisheng Zhong & Gaolei Peng & Lin Wang, 2022. "Decision rule mining for machining method chains based on rough set theory," Journal of Intelligent Manufacturing, Springer, vol. 33(3), pages 799-807, March.
    17. Tushar D. Bhoite & Rajesh B. Buktar & Parikshit N. Mahalle & Mohan P. Khond & Ganesh S. Pise & Yogeshrao Y. More, 2025. "Productivity Enhancement in the Indian Auto Component Manufacturing Supply Chain Through IoT, Digital Twins with Generative AI, and Stacked Encoder-Enhanced Neural Networks," SN Operations Research Forum, Springer, vol. 6(4), pages 1-27, December.
    18. Esther Calderon-Monge & Domingo Ribeiro-Soriano, 2024. "The role of digitalization in business and management: a systematic literature review," Review of Managerial Science, Springer, vol. 18(2), pages 449-491, February.
    19. Pompeu Casanovas & Louis de Koker & Mustafa Hashmi, 2022. "Law, Socio-Legal Governance, the Internet of Things, and Industry 4.0: A Middle-Out/Inside-Out Approach," J, MDPI, vol. 5(1), pages 1-28, January.
    20. Anna Kwiotkowska & Radosław Wolniak & Bożena Gajdzik & Magdalena Gębczyńska, 2022. "Configurational Paths of Leadership Competency Shortages and 4.0 Leadership Effectiveness: An fs/QCA Study," Sustainability, MDPI, vol. 14(5), pages 1-21, February.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joinma:v:36:y:2025:i:8:d:10.1007_s10845-024-02491-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.