
Conversation


@Kosinkadink Kosinkadink commented Aug 30, 2025

This PR adds support for choosing the attention implementation used during sampling via optimized_attention_override in transformer_options for all natively supported models. This opens the door to attention tricks, attention scheduling, and using different attention in different blocks, without resorting to massive hacks or monkey patches.
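
For illustration, here is a minimal sketch of how an override might be wired in. The exact signature of the override and the way it is attached are assumptions for this example (the real PR may expose a dedicated helper); it only assumes the override is a callable stored under the optimized_attention_override key of transformer_options:

```python
import torch
from comfy.ldm.modules.attention import optimized_attention  # ComfyUI's standard attention helper

# Hypothetical override: mirrors the usual attention call, with **kwargs to
# absorb any extra context passed through transformer_options.
def my_attention_override(q, k, v, heads, mask=None, **kwargs):
    # A real override could pick a different backend per block, log statistics,
    # or rescale attention; here we simply defer to the default implementation.
    return optimized_attention(q, k, v, heads, mask=mask)

# Attach the override on a cloned ModelPatcher-style model, assuming
# transformer_options lives inside model_options as in current ComfyUI.
model = model.clone()
model.model_options.setdefault("transformer_options", {})
model.model_options["transformer_options"]["optimized_attention_override"] = my_attention_override
```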

All available attention types are also registered so they can be tracked and chosen easily in the future. Flash attention or sage attention being unavailable only causes an immediate exit if the corresponding --use-flash-attention or --use-sage-attention argument was passed; otherwise, those attention types are still registered but only used when explicitly requested in code.
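
As a rough illustration of that registry idea (the names and structure below are assumptions, not the PR's actual API): every backend is registered by name with an availability flag, and an unavailable backend only becomes an error when something actually asks for it.

```python
# Hypothetical attention-backend registry; the real registration API may differ.
REGISTERED_ATTENTION_FUNCTIONS = {}

def register_attention_function(name, func, available=True):
    # Register every known backend, even if its package failed to import,
    # so the name can still be listed; 'available' gates actual use.
    REGISTERED_ATTENTION_FUNCTIONS[name] = {"func": func, "available": available}

def get_attention_function(name):
    entry = REGISTERED_ATTENTION_FUNCTIONS.get(name)
    if entry is None or not entry["available"]:
        raise ValueError(f"Attention backend '{name}' is not available.")
    return entry["func"]

# Example: sage attention is registered either way, but only usable if its
# optional dependency imported; startup would only hard-fail when
# --use-sage-attention was explicitly passed.
try:
    from sageattention import sageattn  # optional dependency
    register_attention_function("sage", sageattn, available=True)
except ImportError:
    register_attention_function("sage", None, available=False)
```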

More changes are coming in the future regarding models tracking the current block index in transformer_options.

Changes went through two rounds of QA + my personal testing. No observable slowdowns were noticed.

…down all the code paths where transformer_options would need to be added
…o load SageAttention and FlashAttention if not enabled so that they can be marked as available or not, create registry for available attention
…ve a dropdown with available attention (this is a test node only)
@Kosinkadink Kosinkadink added the Core (Core team dependency) label on Aug 30, 2025