Haddad, Maryam Allawi and Khudher, Dhayaa Raissan (2025) Dual-Memory Architecture for Robust UAV: Navigation Integrating LSTM and Transformer within a PPO Framework. International Journal of Robotics and Control Systems, 5 (5). pp. 2522-2545.
2208-7822-1-PB.pdf - Published Version
Download (2MB)
Abstract
Autonomous UAV navigation typically suffers from partial observability (POMDP), where noisy and limited sensing degrades the reliability of decisions. We introduce a dual-memory PPO that augments an LSTM for short-horizon responsiveness with a Transformer for long-horizon context, fused by a learnable gate that adaptively weights both streams end-to-end. Unlike Dual-Transformer PPO and other attention-only variants our model retains recurrent memory and learns the fusion rather than prespecifying it (e.g., concatenation or sum). The observation vector merges normalized proprioceptive and range data the reward balances progress collision penalties and trajectory smoothness with tuned coefficients to avoid dominance. In simulated corridor worlds (with a dynamic variant) the hybrid policy completes 96.5% of episodes 9.7 pp over PPO-LSTM and 17.1 pp over PPO-Transformer while reducing final collisions to 2 per episode, reductions of 37.5% vs PPO-LSTM 64.3% vs PPO-Transformer: 85.7% vs PPO. It converges in 20k episodes (vs 25–29k for baselines), with shorter episodes (150 steps), and greater path efficiency (0.85) than either baseline. Findings are presented as the average plus or minus the standard deviation for all five seeds when p < 0.05. Limitations include a simulation-only study and limited environment diversity further, larger-scale environments and fusion and reward design ablations are pending. Overall learnable gating of complementary short- and long-term memories improves reliability under partial observability without compromising on practical training efficiency.
| Item Type: | Article |
|---|---|
| Subjects: | T Technology > TK Electrical engineering. Electronics Nuclear engineering |
| Depositing User: | IJRCS ASCEE |
| Date Deposited: | 30 Apr 2026 01:53 |
| Last Modified: | 30 Apr 2026 01:53 |
| URI: | https://alxiv.org/id/eprint/263 |
