Materials
Paper | Link |
---|---|
FlashAttention 1, 2, 3 | PDF, PDF, PDF |
PagedAttention (vLLM) | |
SGLang | |
FlexAttention | |
FlashInfer | |
SpargeAttention | |
SageAttention 1,2 | PDF, PDF |
Paper | Link |
---|---|
Streaming LLM & DuoAttention | PDF, PDF |
MInference | |
H2O | |
TOVA/KIVI | PDF, PDF |
Speculative Decoding | PDF, PDF |
Multi-token prediction: Deepseek-v3 |
Paper | Link |
---|---|
Tuning-Free Multi-Event Long Video Generation | |
Long Context Tuning for Video Generation | |
One-Minute Video Generation with Test-Time Training | |
SKYREELS-V2: INFINITE-LENGTH FILM GENERATIVE MODEL |