[2303.12957] Reinforcement Learning with Exogenous States and Rewards

January 15, 2026

[Submitted on 22 Mar 2023 (v1), last revised 14 Jan 2026 (this version, v2)]

By George Trimponias and Thomas G. Dietterich

Abstract: Exogenous state variables and rewards can slow reinforcement learning by injecting uncontrolled variation into the reward signal. This paper formalizes exogenous state variables and rewards and shows that if the reward function decomposes additively into endogenous and exogenous components, the MDP can be decomposed into an exogenous Markov Reward Process (based on the exogenous reward) and an endogenous Markov Decision Process (optimizing the endogenous reward). Any optimal policy for the endogenous MDP is also an optimal policy for the original MDP, but because the endogenous reward typically has reduced variance, the endogenous MDP is easier to solve. We study settings where the decomposition of the state space into exogenous and endogenous state spaces is not given but must be discovered. The paper introduces and proves correctness of algorithms for discovering the exogenous and endogenous subspaces of the state space when they are mixed through linear combination. These algorithms can be applied during reinforcement learning to discover the exogenous subspace, remove the exogenous reward, and focus reinforcement learning on the endogenous MDP. Experiments on a variety of challenging synthetic MDPs show that these methods, applied online, discover large exogenous state spaces and produce substantial speedups in reinforcement learning.
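To make the additive decomposition concrete, here is a minimal sketch in Python. It is not the paper's algorithm: the toy dynamics, the coefficients, and the names `rollout`, `x`, and `e` are all assumptions chosen for illustration, and the exogenous variable is taken as already known rather than discovered. The sketch simulates one exogenous variable (unaffected by the action) and one endogenous variable, then regresses the observed reward on the exogenous variable and subtracts the fit, leaving a much lower-variance signal that tracks the endogenous reward.

```python
import numpy as np

def rollout(n_steps=5000, seed=0):
    """Simulate a toy process under a uniformly random policy (hypothetical dynamics)."""
    rng = np.random.default_rng(seed)
    x, e = 0.0, 0.0  # x: exogenous state, e: endogenous state
    xs, r_total, r_endo = [], [], []
    for _ in range(n_steps):
        a = rng.choice([-1.0, 1.0])           # random action
        r_x = 5.0 * x                         # exogenous reward: depends only on x
        r_e = -(e ** 2)                       # endogenous reward: depends only on e
        xs.append(x)
        r_total.append(r_x + r_e)             # the agent observes only the sum
        r_endo.append(r_e)
        x = 0.9 * x + rng.normal()            # exogenous transition ignores a and e
        e = 0.5 * e + a + 0.1 * rng.normal()  # endogenous transition depends on a
    return np.array(xs), np.array(r_total), np.array(r_endo)

xs, r_total, r_endo = rollout()

# Estimate the exogenous reward by least squares on the exogenous state,
# then subtract it from the observed reward.
slope, intercept = np.polyfit(xs, r_total, 1)
r_residual = r_total - slope * xs

print("variance of observed reward:", r_total.var())
print("variance after removing exogenous part:", r_residual.var())
```

Because the exogenous reward here does not depend on the action, subtracting it changes every action's return by the same amount, so the ranking of actions is preserved while the learning signal becomes less noisy.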

Submission history

From: Thomas Dietterich
[v1] Wed, 22 Mar 2023 23:37:28 UTC (1,582 KB)
[v2] Wed, 14 Jan 2026 05:15:05 UTC (893 KB)
