A new state-space hybrid matches softmax attention quality with O(n) memory at long context.
Researchers at Google DeepMind have introduced 'Hawk-2', a hybrid recurrent-attention architecture that matches dense transformers on perplexity and reasoning benchmarks at 32K context. The paper claims a 6x throughput improvement at long context with no quality regression, and code plus 8B-parameter weights have been released on GitHub.
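To make the "hybrid recurrent-attention" idea concrete, below is a minimal sketch of one such block, assuming the general recipe of interleaving a gated linear recurrence with local (sliding-window) attention. The class names, dimensions, gating form, and window size are illustrative assumptions, not details from the Hawk-2 paper.

```python
# Sketch of a hybrid recurrent-attention block (assumed design, not the
# published Hawk-2 implementation). The recurrent layer carries a fixed-size
# state; the attention layer is restricted to a sliding window, so neither
# component's memory grows quadratically with sequence length.
import torch
import torch.nn as nn


class GatedLinearRecurrence(nn.Module):
    """Diagonal linear recurrence h_t = a_t * h_{t-1} + (1 - a_t) * x_t
    with input-dependent gates a_t. The state h_t has a fixed size, so
    decode-time memory for this layer is constant in sequence length."""

    def __init__(self, dim: int):
        super().__init__()
        self.gate = nn.Linear(dim, dim)
        self.proj_in = nn.Linear(dim, dim)
        self.proj_out = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (batch, seq, dim)
        a = torch.sigmoid(self.gate(x))      # per-step decay in (0, 1)
        v = self.proj_in(x)
        h = torch.zeros_like(x[:, 0])        # fixed-size recurrent state
        outs = []
        for t in range(x.shape[1]):          # sequential loop for clarity;
            h = a[:, t] * h + (1 - a[:, t]) * v[:, t]  # real kernels use a parallel scan
            outs.append(h)
        return self.proj_out(torch.stack(outs, dim=1))


class HybridBlock(nn.Module):
    """Recurrent mixing followed by causal sliding-window attention."""

    def __init__(self, dim: int, heads: int = 4, window: int = 256):
        super().__init__()
        self.window = window
        self.recurrent = GatedLinearRecurrence(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm1 = nn.LayerNorm(dim)
        self.norm2 = nn.LayerNorm(dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = x + self.recurrent(self.norm1(x))
        # Boolean mask: True = blocked. Blocks future positions and anything
        # farther back than `window`, keeping attention cost linear in length.
        n = x.shape[1]
        i = torch.arange(n, device=x.device)
        mask = (i[None, :] > i[:, None]) | (i[:, None] - i[None, :] >= self.window)
        h = self.norm2(x)
        attn_out, _ = self.attn(h, h, h, attn_mask=mask)
        return x + attn_out
```

Under these assumptions, the memory advantage comes from the split of responsibilities: the recurrence summarizes arbitrarily long history in a constant-size state, while the windowed attention handles precise local retrieval without materializing a full n-by-n attention pattern.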