Discount Factor Parametrization for Deep Reinforcement Learning for Inverted Pendulum Swing-up Control

Surriani, Atikah and Maghfiroh, Hari and Wahyunggoro, Oyas and Cahyadi, Adha Imam and Fajrin, Hanifah Rahmi (2025) Discount Factor Parametrization for Deep Reinforcement Learning for Inverted Pendulum Swing-up Control. Buletin Ilmiah Sarjana Teknik Elektro, 7 (1). pp. 56-67.

[thumbnail of 10268-Article Text-54648-1-10-20250412.pdf] Text
10268-Article Text-54648-1-10-20250412.pdf - Published Version

Download (1MB)

Abstract

This study explores the application of deep reinforcement learning (DRL) to solve the control problem of a single swing-up inverted pendulum. The primary focus is on investigating the impact of discount factor parameterization within the DRL framework. Specifically, the Deep Deterministic Policy Gradient (DDPG) algorithm is employed due to its effectiveness in handling continuous action spaces. A range of discount factor values is tested to evaluate their influence on training performance and stability. The results indicate that a discount factor of 0.99 yields the best overall performance, enabling the DDPG agent to successfully learn a stable swing-up strategy and maximize cumulative rewards. These findings highlight the critical role of the discount factor in DRL-based control systems and offer insights for optimizing learning performance in similar nonlinear control problems.

Item Type: Article
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering
Depositing User: BISTE UAD
Date Deposited: 18 May 2026 14:14
Last Modified: 18 May 2026 14:14
URI: https://alxiv.org/id/eprint/888

Actions (login required)

View Item
View Item