Author
Listed:
- Giuseppe Trimigno
(Department of Engineering and Architecture, University of Parma, 43124 Parma, Italy)
- Gianfranco Lombardo
(Department of Engineering and Architecture, University of Parma, 43124 Parma, Italy)
- Michele Tomaiuolo
(Department of Engineering and Architecture, University of Parma, 43124 Parma, Italy)
- Stefano Cagnoni
(Department of Engineering and Architecture, University of Parma, 43124 Parma, Italy)
- Agostino Poggi
(Department of Engineering and Architecture, University of Parma, 43124 Parma, Italy)
Abstract
Retrieval-augmented generation (RAG) enriches prompts with external knowledge, but it often relies on additional infrastructure that may be impractical in resource-constrained or offline settings. In addition, updating the internal knowledge of a language model through retraining is costly and inflexible. To address these limitations, we propose an explainable and structured prompt augmentation pipeline that enhances inputs using pre-trained models and rule-based extractors, without requiring external sources. We describe this approach as an orchestrated LLM workflow: a structured sequence in which lightweight LLM modules assume specialized roles. Specifically, (1) an extractor module identifies factual triples from input prompts by combining dependency parsing with a rule-based extraction algorithm; (2) a scorer module, based on a generic lightweight LLM, evaluates the importance of each triple via its self-attention patterns, leveraging internal beliefs to promote explainability and trustworthy cooperation with the downstream model; (3) a performer module processes the augmented prompt for downstream tasks in supervised fine-tuning or zero-shot settings. Much like in a theater staging, each module operates transparently behind the scenes to support and elevate the performer’s final output. We evaluate this approach across multiple performer architectures (encoder-only, encoder-decoder, and decoder-only) and NLP tasks (multiple-choice QA, open-book QA, and summarization). Our results show that this structured augmentation with scored facts yields consistent improvements compared to baseline prompting: up to a 28.78 % accuracy improvement for multiple-choice QA, up to a 9.42 % BLEURT improvement for open-book QA, and up to a 18.14 % ROUGE-L improvement for summarization. By decoupling knowledge scoring from task execution, our method provides a practical, interpretable, and low-cost alternative to RAG in static or knowledge-limited environments.
Suggested Citation
Giuseppe Trimigno & Gianfranco Lombardo & Michele Tomaiuolo & Stefano Cagnoni & Agostino Poggi, 2025.
"LLMs in Staging: An Orchestrated LLM Workflow for Structured Augmentation with Fact Scoring,"
Future Internet, MDPI, vol. 17(12), pages 1-19, November.
Handle:
RePEc:gam:jftint:v:17:y:2025:i:12:p:535-:d:1801340
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:17:y:2025:i:12:p:535-:d:1801340. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.