Gymnasium

```{project-logo} _static/img/gymnasium-text.png
:alt: Gymnasium Logo
```

```{project-heading}
An API standard for reinforcement learning with a diverse collection of reference environments
```

```{figure} _static/videos/box2d/lunar_lander.gif
:alt: Lunar Lander
:width: 500
```

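**Gymnasium is a maintained fork of OpenAI's Gym library.** The Gymnasium interface is simple, Pythonic, and capable of representing general reinforcement learning problems, and a [migration guide](introduction/migration_guide) is available for old Gym environments: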

```{code-block} python
import gymnasium as gym

# Initialise the environment
env = gym.make("LunarLander-v3", render_mode="human")

# Reset the environment to generate the first observation
observation, info = env.reset(seed=42)
for _ in range(1000):
    # this is where you would insert your policy
    action = env.action_space.sample()

    # step (transition) through the environment with the action,
    # receiving the next observation, the reward, and whether the episode has terminated or been truncated
    observation, reward, terminated, truncated, info = env.step(action)

    # If the episode has ended then we can reset to start a new episode
    if terminated or truncated:
        observation, info = env.reset()

env.close()
```
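
The comment "this is where you would insert your policy" can be made concrete by factoring the loop into a function that takes the policy as an argument. The following is a minimal sketch, not part of Gymnasium: `run_episode` and `CountdownEnv` are hypothetical names introduced here, and `CountdownEnv` is a toy stand-in that only mimics the `reset`/`step` signatures used above so the example runs without any installation. It does not subclass `gymnasium.Env`.

```python
class CountdownEnv:
    """Hypothetical stand-in that mimics the reset/step signatures shown above.

    It is NOT a gymnasium.Env subclass; it exists only so this sketch runs
    without installing anything.
    """

    def __init__(self, start=3):
        self.start = start
        self.remaining = start

    def reset(self, seed=None):
        # seed is accepted for signature compatibility but unused in this toy
        self.remaining = self.start
        return self.remaining, {}

    def step(self, action):
        # action 1 decrements the counter; any other action does nothing
        if action == 1:
            self.remaining -= 1
        reward = 1.0 if action == 1 else 0.0
        terminated = self.remaining == 0
        truncated = False  # no time limit is modelled in this toy
        return self.remaining, reward, terminated, truncated, {}


def run_episode(env, policy, max_steps=1000, seed=None):
    """Run a single episode and return (total_reward, steps_taken)."""
    observation, info = env.reset(seed=seed)
    total_reward = 0.0
    for step in range(1, max_steps + 1):
        # the policy maps the current observation to an action
        action = policy(observation)
        observation, reward, terminated, truncated, info = env.step(action)
        total_reward += reward
        # stop as soon as the episode ends for either reason
        if terminated or truncated:
            return total_reward, step
    # the step budget ran out before the episode ended
    return total_reward, max_steps


total, steps = run_episode(CountdownEnv(start=3), policy=lambda obs: 1)
print(total, steps)
```

Because `run_episode` relies only on `reset` returning `(observation, info)` and `step` returning the five-tuple, it should also accept an environment created with `gym.make`, for example with `policy=lambda obs: env.action_space.sample()`.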

```{toctree}
:hidden:
:caption: Introduction

introduction/basic_usage
introduction/train_agent
introduction/create_custom_env
introduction/record_agent
introduction/speed_up_env
introduction/migration_guide
```

```{toctree}
:hidden:
:caption: API

api/env
api/registry
api/spaces
api/wrappers
api/vector
api/utils
api/functional
```

```{toctree}
:hidden:
:caption: Environments

environments/classic_control
environments/box2d
environments/toy_text
environments/mujoco
environments/atari
environments/third_party_environments
```

```{toctree}
:hidden:
:glob:
:caption: Tutorials

tutorials/**/index
tutorials/third-party-tutorials
```

```{toctree}
:hidden:
:caption: Development

Github <https://github.com/Farama-Foundation/Gymnasium>
Paper <https://arxiv.org/abs/2407.17032>
gymnasium_release_notes/index
gym_release_notes/index
Contribute to the Docs <https://github.com/Farama-Foundation/Gymnasium/blob/main/docs/README.md>
```
