Skip to content

Commit bf51a62

Browse files
authored
Release 2.7.0 (#2161)
1 parent 2dce430 commit bf51a62

File tree

2 files changed

+14
-2
lines changed

2 files changed

+14
-2
lines changed

docs/misc/changelog.rst

Lines changed: 13 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,9 +3,11 @@
33
Changelog
44
==========
55

6-
Release 2.7.0a1 (WIP)
6+
Release 2.7.0 (2025-07-25)
77
--------------------------
88

9+
**n-step returns for all off-policy algorithms**
10+
911
Breaking Changes:
1012
^^^^^^^^^^^^^^^^^
1113

@@ -22,12 +24,22 @@ Bug Fixes:
2224

2325
`SB3-Contrib`_
2426
^^^^^^^^^^^^^^
27+
- Added support for n-step returns for off-policy algorithms via the `n_steps` parameter
28+
- Use the ``FloatSchedule`` and ``LinearSchedule`` classes instead of lambdas in the ARS, PPO, and QRDQN implementations to improve model portability across different operating systems
2529

2630
`RL Zoo`_
2731
^^^^^^^^^
32+
- `linear_schedule` now returns a `SimpleLinearSchedule` object for better portability
33+
- Renamed `LunarLander-v2` to `LunarLander-v3` in hyperparameters
34+
- Renamed `CarRacing-v2` to `CarRacing-v3` in hyperparameters
35+
- Docker GPU images are now working again
36+
- Use `ConstantSchedule`, and `SimpleLinearSchedule` instead of `constant_fn` and `linear_schedule`
37+
- Fixed `CarRacing-v3` hyperparameters for newer Gymnasium version
2838

2939
`SBX`_ (SB3 + Jax)
3040
^^^^^^^^^^^^^^^^^^
41+
- Added support for n-step returns for off-policy algorithms via the `n_steps` parameter
42+
- Added KL Adaptive LR for PPO and LR schedule for SAC/TQC
3143

3244
Deprecations:
3345
^^^^^^^^^^^^^

stable_baselines3/version.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
2.7.0a1
1+
2.7.0

0 commit comments

Comments
 (0)