Allocated Attention ======================= .. currentmodule:: nki_samples.reference.allocated_attention This file hosts the high-performance reference implementation for the attention blocks that are used in `Stable Diffusion `_ models. This implementation uses the `direct allocation API ` to achieve better performance. .. autosummary:: :toctree: generated allocated_fused_self_attn_for_SD_small_head_size