Skip to content

chore: bump stable-baselines3 from 1.6.2 to 1.8.0#76

Open
dependabot[bot] wants to merge 1 commit intomasterfrom
dependabot/pip/stable-baselines3-1.8.0
Open

chore: bump stable-baselines3 from 1.6.2 to 1.8.0#76
dependabot[bot] wants to merge 1 commit intomasterfrom
dependabot/pip/stable-baselines3-1.8.0

Conversation

@dependabot
Copy link
Copy Markdown
Contributor

@dependabot dependabot bot commented on behalf of github Apr 10, 2023

Bumps stable-baselines3 from 1.6.2 to 1.8.0.

Release notes

Sourced from stable-baselines3's releases.

Stable-Baselines3 v1.8.0: Multi-env HerReplayBuffer, Open RL Benchmark, Improved env checker

Warning Stable-Baselines3 (SB3) v1.8.0 will be the last one to use Gym as a backend. Starting with v2.0.0, Gymnasium will be the default backend (though SB3 will have compatibility layers for Gym envs). You can find a migration guide here. If you want to try the SB3 v2.0 alpha version, you can take a look at [PR #1327](DLR-RM/stable-baselines3#1327).

SB3 Contrib (more algorithms): https://github.com/Stable-Baselines-Team/stable-baselines3-contrib RL Zoo3 (training framework): https://github.com/DLR-RM/rl-baselines3-zoo

To upgrade:

pip install stable_baselines3 sb3_contrib rl_zoo3 --upgrade

or simply (rl zoo depends on SB3 and SB3 contrib):

pip install rl_zoo3 --upgrade

Breaking Changes:

  • Removed shared layers in mlp_extractor (@​AlexPasqua)
  • Refactored StackedObservations (it now handles dict obs, StackedDictObservations was removed)
  • You must now explicitely pass a features_extractor parameter when calling extract_features()
  • Dropped offline sampling for HerReplayBuffer
  • As HerReplayBuffer was refactored to support multiprocessing, previous replay buffer are incompatible with this new version
  • HerReplayBuffer doesn't require a max_episode_length anymore

New Features:

  • Added repeat_action_probability argument in AtariWrapper.
  • Only use NoopResetEnv and MaxAndSkipEnv when needed in AtariWrapper
  • Added support for dict/tuple observations spaces for VecCheckNan, the check is now active in the env_checker() (@​DavyMorgan)
  • Added multiprocessing support for HerReplayBuffer
  • HerReplayBuffer now supports all datatypes supported by ReplayBuffer
  • Provide more helpful failure messages when validating the observation_space of custom gym environments using check_env (@​FieteO)
  • Added stats_window_size argument to control smoothing in rollout logging (@​jonasreiher)

SB3-Contrib

  • Added warning about potential crashes caused by check_env in the MaskablePPO docs (@​AlexPasqua)
  • Fixed sb3_contrib/qrdqn/*.py type hints
  • Removed shared layers in mlp_extractor (@​AlexPasqua)

RL Zoo

  • Open RL Benchmark
  • Upgraded to new HerReplayBuffer implementation that supports multiple envs
  • Removed TimeFeatureWrapper for Panda and Fetch envs, as the new replay buffer should handle timeout.

... (truncated)

Commits

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [stable-baselines3](https://github.com/DLR-RM/stable-baselines3) from 1.6.2 to 1.8.0.
- [Release notes](https://github.com/DLR-RM/stable-baselines3/releases)
- [Commits](DLR-RM/stable-baselines3@v1.6.2...v1.8.0)

---
updated-dependencies:
- dependency-name: stable-baselines3
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
@dependabot dependabot bot added the dependencies Pull requests that update a dependency file label Apr 10, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dependencies Pull requests that update a dependency file

Projects

None yet

Development

Successfully merging this pull request may close these issues.

0 participants