@happierpig I have a small question about the implementation. I micro benched in the real case of 2d or 2d choice in rmsnorm, 3d seems to not advance better than the 2d. For head_dim <= 256 (qwen3 ...
Abstract: This paper introduces a novel type of sequences called C4-sequences. C4-sequences share similar optimal autocorrelation properties with Zadoff-Chu sequences. However, C4-sequences offer the ...
Abstract: Ultrasound (US)-guided needle insertion is widely employed in percutaneous interventions. However, providing feedback on the needle tip position via US imaging presents challenges due to ...