Back to Glossary
Compressed Sparse Attention (CSA)
What is compressed sparse attention (CSA)?
A component of the hybrid attention architecture in DeepSeek-V4 that drastically compresses the KV cache.