Author
Listed:
- Hakho Kim
(Department of Artificial Intelligence, Kyung Hee University, Yongin 17104, Republic of Korea)
- Teh-Jen Sun
(Department of Artificial Intelligence, Kyung Hee University, Yongin 17104, Republic of Korea)
- Eui-Nam Huh
(Department of Computer Engineering, Kyung Hee University, Yongin 17104, Republic of Korea)
Abstract
Content delivery networks (CDNs) face steadily rising, uneven demand, straining heuristic cache replacement. Reinforcement learning (RL) is promising, but most work assumes a fully observable Markov Decision Process (MDP), unrealistic under delayed, partial, and noisy signals. We model cache replacement as a Partially Observable MDP (POMDP) and present the Miss-Triggered Cache Transformer (MTCT), a Transformer-decoder Q-learning agent that encodes recent histories with self-attention. MTCT invokes its policy only on cache misses to align compute with informative events and uses a delayed-hit reward to propagate information from hits. A compact, rank-based action set (12 actions by default) captures popularity–recency trade-offs with complexity independent of cache capacity. We evaluate MTCT on a real trace (MovieLens) and two synthetic workloads (Mandelbrot–Zipf, Pareto) against Adaptive Replacement Cache (ARC), Windowed TinyLFU (W-TinyLFU), classical heuristics, and Double Deep Q-Network (DDQN). MTCT achieves the best or statistically comparable cache-hit rates on most cache sizes; e.g., on MovieLens at M = 600 , it reaches 0.4703 (DDQN 0.4436 , ARC 0.4513 ). Miss-triggered inference also lowers mean wall-clock time per episode; Transformer inference is well suited to modern hardware acceleration. Ablations support C L = 50 and show that finer action grids improve stability and final accuracy.
Suggested Citation
Hakho Kim & Teh-Jen Sun & Eui-Nam Huh, 2025.
"Miss-Triggered Content Cache Replacement Under Partial Observability: Transformer-Decoder Q-Learning,"
Mathematics, MDPI, vol. 13(19), pages 1-27, October.
Handle:
RePEc:gam:jmathe:v:13:y:2025:i:19:p:3217-:d:1766066
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:19:p:3217-:d:1766066. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.