Hi there!
Recently I've done a deep dive into MarianMT models, specifically to get OpusMT and Firefox translation models working on Android under ONNX. Getting the OpusMT/Firefox teacher models to work is relatively straightforward, as HuggingFace's Transformers package supports exporting Marian models.
However, the Firefox student models (which are a lot more efficient) are in TransformerRNN format. Would it be possible to train student models with a standard Transformer architecture instead? Would they still be comparably efficient?
For more context: in my ONNX implementation on Android, I have addressed the mentioned shortcoming (missing beam search) by implementing a custom beam search algorithm.
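For illustration, the core of such a custom beam search can be sketched framework-independently. This is a minimal sketch, not my actual Android implementation: `step_log_probs` is a hypothetical stand-in for one decoder step (in practice, an ONNX Runtime session call returning next-token log-probabilities), and the token ids and scores below are toy values.

```python
import math

def beam_search(step_log_probs, beam_size=2, max_len=4, eos=0):
    """Minimal beam search over a per-step scoring function.

    step_log_probs(prefix) -> dict of {next_token_id: log_probability}.
    Returns the highest-scoring sequence ending in `eos` (or the best
    unfinished beam if nothing terminates within max_len steps).
    """
    beams = [((), 0.0)]  # (token prefix, cumulative log-prob)
    finished = []
    for _ in range(max_len):
        candidates = []
        for prefix, score in beams:
            for tok, lp in step_log_probs(prefix).items():
                seq = prefix + (tok,)
                if tok == eos:
                    finished.append((seq, score + lp))
                else:
                    candidates.append((seq, score + lp))
        if not candidates:
            break
        # Keep only the beam_size best partial hypotheses.
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_size]
    finished.extend(beams)
    return max(finished, key=lambda c: c[1])[0]

# Toy distribution: token 1 is likely for two steps, then EOS (token 0).
def toy_step(prefix):
    if len(prefix) < 2:
        return {1: math.log(0.6), 2: math.log(0.4)}
    return {0: math.log(0.9), 1: math.log(0.1)}

print(beam_search(toy_step))  # -> (1, 1, 0)
```

In the real implementation the step function would also carry the decoder's key/value cache per beam, which is where most of the ONNX-side complexity lives.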