Skip to content

Power Attention

Overview Overview
Table of contents
- Getting Started
Getting Started
Getting Started
Performance
Performance
- Benchmarking
Contributing
Contributing
- Releasing

Power Attention

A CUDA implementation of symmetric power attention, achieving transformer-level performance with linear-cost RNN computation.

Getting Started

Installation: Build configuration and requirements
Benchmarking: Performance evaluation methodology