RL Optimization PPO Algorithm

Rethinking Robotics Reinforcement Learning: A Practical Humanoid Training Workflow

A complete pipeline that can run on a single workstation to train a humanoid robot to walk over rough terrain.

Adaptive Path Tracking Using a Dynamic PID Controller Enhanced by Proximal Policy Optimization

Abstract: This paper introduces a novel adaptive path tracking controller that integrates the Proximal Policy Optimization (PPO) algorithm with a Proportional-Integral-Derivative (PID) control ...

IEEE

Leaky PPO: A Simple and Efficient RL Algorithm for Autonomous Vehicles

Abstract: Interest in applying Reinforcement Learning (RL) to Autonomous Vehicles (AVs) is experiencing a rapid and substantial expansion. Proximal Policy Optimization (PPO), a well-known RL algorithm ...

GitHub

Reinforcement learning in portfolio management

Motivated by "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" by Jiang et. al. 2017 [1]. In this project: Implement three state-of-art continous deep ...

GitHub

mohamednoorulnaseem/airfoil-rl-optimizer

airfoil-rl-optimizer/ │ ├── 📊 app.py # Interactive Dash web interface ├── 🚂 train_rl.py # RL training script with CLI args ├── ⚙️ setup.py # Package installation config │ ├── 📁 src/ # Core source ...

marktechpost

Microsoft Releases Agent Lightning: A New AI Framework that Enables Reinforcement Learning (RL)-based Training of LLMs for Any AI Agent

How do you convert real agent traces into reinforcement learning RL transitions to improve policy LLMs without changing your existing agent stack? Microsoft AI team releases Agent Lightning to help ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results