Consistency checks for weight initialization

The model weights are currently initialised by matching regexes against the FQNs.
For each regex, we define a routine specifying how the weights are supposed to be initialised. 

This can be prone to errors if FQNs change, as is the case for torch.compile wich introduces a `_orig_mod.` to FQNs.

Since side-effects like these can easily go unnoticed we should introduce some consistency checks:

* Each FQN must match with exactly one regex. 
* If a FQN is not supposed to be initialized via in a initialisation approach (e.g., some norms), the regex should call a "no-op" instead of an init routine.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consistency checks for weight initialization #438

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Consistency checks for weight initialization #438

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions