|
"TODO(hartikainen): implement for discrete spaces.") |
Are you planning to implement this?
What would be a good value for a MultiDiscrete([3 3 2 3]) action space?
Depending on how I calculate I get -4, -11 or -54 but I'm a bit clueless to what would be a good value.
softlearning/softlearning/algorithms/sac.py
Line 42 in 46f1443
Are you planning to implement this?
What would be a good value for a MultiDiscrete([3 3 2 3]) action space?
Depending on how I calculate I get -4, -11 or -54 but I'm a bit clueless to what would be a good value.