[Enhancement]: Wrong gains for weight initialization

### Enhancement

The recommended gains for the weight init depend on the used activation function, see [torch docs](https://pytorch.org/docs/stable/nn.init.html). However, as for now the used gains are statically implemented and always the same in ActorCriticPolicies. See [here](https://github.com/DLR-RM/stable-baselines3/blob/ffe26ccf95d7e3b37067bd81025d3b4b45825038/stable_baselines3/common/policies.py#L589).

I recommend making the gains dependent on the activation function used(, i.e. probably mainly ReLU and tanh).

If you agree with this, I would like to implement it myself and PR.

Thanks and a good day!

### To Reproduce

--

### Relevant log output / Error message

```shell
--
```


### System Info

--

### Checklist

- [X] I have checked that there is no similar [issue](https://github.com/DLR-RM/stable-baselines3/issues) in the repo
- [X] I have read the [documentation](https://stable-baselines3.readthedocs.io/en/master/)
- [X] I have provided a minimal working example to reproduce the bug
- [X] I've used the [markdown code blocks](https://help.github.com/en/articles/creating-and-highlighting-code-blocks) for both code and stack traces.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Enhancement]: Wrong gains for weight initialization #1559

Enhancement

To Reproduce

Relevant log output / Error message

System Info

Checklist

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Enhancement]: Wrong gains for weight initialization #1559

Description

Enhancement

To Reproduce

Relevant log output / Error message

System Info

Checklist

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions