Surriani, Atikah and Maghfiroh, Hari and Wahyunggoro, Oyas and Cahyadi, Adha Imam and Fajrin, Hanifah Rahmi (2025) Discount Factor Parametrization for Deep Reinforcement Learning for Inverted Pendulum Swing-up Control. Buletin Ilmiah Sarjana Teknik Elektro, 7 (1). pp. 56-67.
10268-Article Text-54648-1-10-20250412.pdf - Published Version
Download (1MB)
Abstract
This study explores the application of deep reinforcement learning (DRL) to solve the control problem of a single swing-up inverted pendulum. The primary focus is on investigating the impact of discount factor parameterization within the DRL framework. Specifically, the Deep Deterministic Policy Gradient (DDPG) algorithm is employed due to its effectiveness in handling continuous action spaces. A range of discount factor values is tested to evaluate their influence on training performance and stability. The results indicate that a discount factor of 0.99 yields the best overall performance, enabling the DDPG agent to successfully learn a stable swing-up strategy and maximize cumulative rewards. These findings highlight the critical role of the discount factor in DRL-based control systems and offer insights for optimizing learning performance in similar nonlinear control problems.
| Item Type: | Article |
|---|---|
| Subjects: | T Technology > TK Electrical engineering. Electronics Nuclear engineering |
| Depositing User: | BISTE UAD |
| Date Deposited: | 18 May 2026 14:14 |
| Last Modified: | 18 May 2026 14:14 |
| URI: | https://alxiv.org/id/eprint/888 |
