Change callback for `AdversarialTrainer` by gunnxx · Pull Request #626 · HumanCompatibleAI/imitation

gunnxx · 2022-11-15T09:35:41Z

This changes the callback mechanism of AdversarialTrainer so that an sb3.EvalCallback can be inserted. See #607.

feynmanix · 2022-11-28T19:59:29Z

src/imitation/algorithms/adversarial/common.py

+        callback: Optional[List[BaseCallback]] = None
    ) -> None:
        """Alternates between training the generator and discriminator.



The last part of the docstring, "and finally a call to callback(round)", is probably misleading now.

feynmanix · 2022-11-28T20:06:08Z

src/imitation/algorithms/adversarial/common.py

+            if self.gen_callback is None:
+                self.gen_callback = callback
+            else:
+                self.gen_callback = callback + [self.gen_callback]


Can someone abuse the API by calling train() multiple times? If so, self.gen_callback would end up containing a nested list, which is not correct. More generally, gen_callback is currently typed as Optional[BaseCallback], and we shouldn't change its type to a list at runtime.

Perhaps it would be better to add an optional callback argument to train_gen(), merge callbacks there, and avoid the stateful change here?

Also, can the learn_kwargs argument of train_gen() be removed, as discussed in the original issue #607?
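To make the nesting concern concrete, here is a minimal sketch. It uses a stand-in `BaseCallback` class rather than the real stable-baselines3 one, and both `stateful_merge` and `merge_callbacks` are hypothetical helpers written for illustration, not existing imitation API. The first mimics the merge logic in the diff; the second shows one possible stateless alternative.

```python
from typing import List, Optional


class BaseCallback:
    """Stand-in for stable_baselines3's BaseCallback (assumption, illustration only)."""

    def __init__(self, name: str) -> None:
        self.name = name


def stateful_merge(gen_callback, callback):
    """Mimics the diff: returns the value self.gen_callback would be overwritten with."""
    if gen_callback is None:
        return callback
    return callback + [gen_callback]


def merge_callbacks(
    gen_callback: Optional[BaseCallback],
    extra: Optional[List[BaseCallback]],
) -> List[BaseCallback]:
    """Hypothetical stateless merge: builds a fresh, flat list on every call."""
    merged = list(extra) if extra else []
    if gen_callback is not None:
        merged.append(gen_callback)
    return merged


internal = BaseCallback("internal")
user = [BaseCallback("eval")]

# Stateful version: a second train() call wraps the list stored by the first.
state = stateful_merge(internal, user)  # [eval, internal]
state = stateful_merge(state, user)     # [eval, [eval, internal]]
nested = any(isinstance(item, list) for item in state)

# Stateless version: the stored callback is never overwritten, so repeated
# calls produce the same flat list.
first = merge_callbacks(internal, user)
second = merge_callbacks(internal, user)
```

With the stateless helper, the attribute keeps its Optional[BaseCallback] type and the merged list only lives for the duration of one call.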

feynmanix · 2022-11-28T20:17:08Z

src/imitation/algorithms/adversarial/common.py

        self,
        total_timesteps: int,
-        callback: Optional[Callable[[int], None]] = None,
+        callback: Optional[List[BaseCallback]] = None


Do we want to change the semantics of the argument here, or should we rather deprecate the feature (and introduce a different parameter for additional gen_callback)?

I think the suggestion in the original issue was to add a new gen_callback argument. (Btw, stable-baselines supports both CallbackList and a plain list of callbacks, if we wanted to be fancy.)
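A rough sketch of the second option, assuming we keep the old `callback` parameter but deprecate it, and add a separate `gen_callback` that accepts a single callback, a list, or a `CallbackList`. The classes below are stand-ins for the stable-baselines3 ones, and `as_callback_list` and this `train` signature are hypothetical, shown only to illustrate the shape of the API:

```python
import warnings
from typing import Callable, List, Optional, Union


class BaseCallback:
    """Stand-in for stable_baselines3's BaseCallback (assumption)."""


class CallbackList(BaseCallback):
    """Stand-in for stable_baselines3's CallbackList (assumption)."""

    def __init__(self, callbacks: List[BaseCallback]) -> None:
        self.callbacks = callbacks


GenCallback = Union[BaseCallback, List[BaseCallback], None]


def as_callback_list(gen_callback: GenCallback) -> List[BaseCallback]:
    """Hypothetical helper: normalize any accepted form to a flat list."""
    if gen_callback is None:
        return []
    if isinstance(gen_callback, CallbackList):
        return list(gen_callback.callbacks)
    if isinstance(gen_callback, BaseCallback):
        return [gen_callback]
    return list(gen_callback)


def train(
    total_timesteps: int,
    callback: Optional[Callable[[int], None]] = None,
    gen_callback: GenCallback = None,
) -> List[BaseCallback]:
    """Hypothetical signature: old per-round `callback` keeps its semantics."""
    if callback is not None:
        warnings.warn(
            "`callback` is deprecated; consider `gen_callback` instead.",
            DeprecationWarning,
            stacklevel=2,
        )
    # A real implementation would run the training rounds here; this sketch
    # only returns the normalized generator callbacks.
    return as_callback_list(gen_callback)
```

This way existing callers passing a Callable[[int], None] keep working, and the new argument does not silently change the meaning of the old one.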

feynmanix · 2022-11-28T20:30:01Z

src/imitation/algorithms/adversarial/common.py

@@ -421,7 +422,7 @@ def train_gen(
    def train(


One more thing: if you change the arguments, training_adversarial.py will also need to be updated.

gunnxx added 2 commits November 15, 2022 18:25

change callback mechanism

fa9b2f4

Merge branch 'master' of https://github.com/gunnxx/imitation

7b3ed75

feynmanix suggested changes Nov 28, 2022

smanolloff mentioned this pull request Sep 14, 2023

Add support for SB3 callbacks in adversarial training #786

Open

gunnxx wants to merge 2 commits into HumanCompatibleAI:master from gunnxx:master
