LTX-2 minor fixes by prishajain1 · Pull Request #373 · AI-Hypercomputer/maxdiffusion

prishajain1 · 2026-04-08T03:23:56Z

This PR introduces the following changes for the LTX2 pipeline:

Added Context parallelism replication for audio stream in transformer (this resolved some audio related issues occurring in particular sharding configurations on ironwood and seems to reduce the latency for other cases. Since audio stream is only 126 tokens, this shouldn't be an issue even if replicated)
Replicated VAE params and latents before call to decode method. With this change, single video latency dropped by 3x (observed no OOM in different configs caused by this change, tested on trillium and ironwood 8 chips)
Added flash_min_seq_length parameter to config file, now the choice of attention is controlled by this parameter. Experimentally observed that having dot attention for audio to audio self attn and video to audio cross attn and flash for audio to video provided more speed up. We had to hardcode "flash" as the attention choice for audio to video, because the way the attention mechanism is currently decided bases on flash min seq length compares the seq length for all q, k/v and flash is chosen only if all have seq length > flash min seq length. Experimentally we observed more speed ups if audio to video was kept in flash instead of dot.
Added JAX annotations for profiling
jitted the step() function in scheduling_flow_match_flax.py
Converted the timesteps ‘t’ from static Python integer/float to a dynamic JAX tensor (to prevent compilation of a program for each and every timestep)

Results

We performed the experiments below for dp = 2, cp = 2, tp = 2, fsdp = 1, per device batch size = 0.125

Hardware	Status	Time (s)	Video Link
v7x-8	Current main	105.14	View Video
	After fix	28.63	View Video
v6e-8	Current main	138.24	View Video
	After fix	58.74	View Video

github-actions · 2026-04-08T03:26:00Z

e2e testgrid: https://8bcf50593faf4ea38060e236169827e5-dot-us-central1.composer.googleusercontent.com/dags/maxdiffusion_tpu_e2e/grid

src/maxdiffusion/models/ltx2/text_encoders/text_encoders_ltx2.py

src/maxdiffusion/models/ltx2/transformer_ltx2.py

src/maxdiffusion/schedulers/scheduling_flow_match_flax.py

src/maxdiffusion/pipelines/ltx2/ltx2_pipeline.py

Perseus14

Overall looks good. Few minor comments.

Please add few results comparing baseline and this branch to get a better idea of the improvements.

prishajain1 requested a review from entrpn as a code owner April 8, 2026 03:23

prishajain1 mentioned this pull request Apr 8, 2026

Add LoRA Inference Support for LTX2 Model #372

Open

prishajain1 force-pushed the prisha/ltx2_fixes branch from 0a48d93 to 7b86470 Compare April 8, 2026 05:00

prishajain1 requested review from Perseus14 and mbohlool April 8, 2026 05:05

Perseus14 reviewed Apr 8, 2026

View reviewed changes

src/maxdiffusion/models/ltx2/text_encoders/text_encoders_ltx2.py Show resolved Hide resolved

src/maxdiffusion/models/ltx2/transformer_ltx2.py Outdated Show resolved Hide resolved

src/maxdiffusion/schedulers/scheduling_flow_match_flax.py Show resolved Hide resolved

Perseus14 reviewed Apr 8, 2026

View reviewed changes

src/maxdiffusion/pipelines/ltx2/ltx2_pipeline.py Outdated Show resolved Hide resolved

Perseus14 reviewed Apr 8, 2026

View reviewed changes

src/maxdiffusion/pipelines/ltx2/ltx2_pipeline.py Show resolved Hide resolved

Perseus14 reviewed Apr 8, 2026

View reviewed changes

LTX-2 Minor Fixes

c5bb862

prishajain1 force-pushed the prisha/ltx2_fixes branch from 4ae0dff to c5bb862 Compare April 8, 2026 12:42

prishajain1 requested a review from Perseus14 April 8, 2026 12:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LTX-2 minor fixes#373

LTX-2 minor fixes#373
prishajain1 wants to merge 1 commit intomainfrom
prisha/ltx2_fixes

prishajain1 commented Apr 8, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Apr 8, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Perseus14 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

prishajain1 commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Results

Uh oh!

github-actions bot commented Apr 8, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Perseus14 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

prishajain1 commented Apr 8, 2026 •

edited

Loading