news
Jun 2025 | [🤹 New blog post] I wrote a blog about Erwin where I describe the motivation behind it as well as where it lands in the current landscape of sub-quadratic architectures for irregular data. |
---|---|
Jun 2025 | [🚨 Workshop paper] We introduce Ball Sparse Attention: a novel sub-quadratic attention mechanism for irregular data that merges Erwin and Native Sparse Attention. Fantastic effort from my students. |
May 2025 |
Erwin was accepted to ICML 2025! See you all in Vancouver! Blog post soon 👀
|