Hacker News

Y

Hacker News

new | ask | show | jobs

wolfgangK 4 days ago [ - ]

Indeed, recent Flash Attention is a pain point for non CUDA.