Reinforcement Learning for an Inverted Pendulum with Image Data
Use Reinforcement Learning Toolbox™ and the DQN algorithm to invert a simple pendulum using image observations. The workflow consists of the following steps: 1) create the environment, 2) specify the policy representation, 3) create the agent, 4) train the agent, and 5) verify the trained policy.
The provided pendulum environment has predefined observations, actions, and reward. The actions are five possible torque values, the observations are a 50x50 grayscale image together with the angular rate of the pendulum, and the reward is based on the distance from the desired upright position. Learn how to use the Deep Network Designer app to construct a neural network representation of the Q-function, which the DQN agent uses to approximate the expected long-term reward.
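As a rough illustration of how a DQN agent uses the Q-function over a small discrete action set, the sketch below shows epsilon-greedy action selection and the one-step training target in plain Python. It is not the toolbox's implementation: the torque values, the discount factor, and the function names are assumptions made for this example, since the description above only states that there are five torque values.

```python
import random

# Hypothetical torque levels in N*m; the description only says there are five.
TORQUES = [-2.0, -1.0, 0.0, 1.0, 2.0]


def epsilon_greedy(q_values, epsilon, rng=random):
    """Return an action index: random with probability epsilon, else argmax Q."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda i: q_values[i])


def dqn_target(reward, next_q_values, done, gamma=0.99):
    """One-step target: r + gamma * max over next-state Q-values (r if terminal)."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)


# Example: Q-value estimates for the five torques in some state (made-up numbers).
q_estimates = [0.1, 0.5, 0.2, 0.0, -1.0]
chosen = epsilon_greedy(q_estimates, epsilon=0.0)  # fully greedy selection
chosen_torque = TORQUES[chosen]
```

In a real agent the Q-value estimates would come from the neural network evaluated on the image and angular-rate observations, and the target would be used as the regression label for the action that was taken.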
See how you can visualize the pendulum behavior during training and monitor training progress. After training is complete, verify the policy in simulation to decide whether further training is necessary.
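One common way to monitor progress is to track a moving average of the per-episode reward and compare it with a target value. The sketch below illustrates that idea only; the window length, the target, and the function names are hypothetical and are not taken from the toolbox or the video.

```python
def moving_average(values, window):
    """Mean of the last `window` values (or of all values if there are fewer)."""
    if not values:
        raise ValueError("values must not be empty")
    recent = values[-window:]
    return sum(recent) / len(recent)


def needs_more_training(episode_rewards, window, target):
    """True while the recent average episode reward is still below the target."""
    return moving_average(episode_rewards, window) < target
```

For instance, with a window of 2 and a target of -1.0, episode rewards of -10.0, -4.0, -2.0 average to -3.0 over the last two episodes, so this check would suggest continuing training.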