🤖 AI Dev Tools

3866 Tokens/Second: Asthenosphere Unleashes AMD NPU's Full Fury

Picture this: an AMD Ryzen NPU churning out AI responses at 3866 effective tokens per second, no CPU or GPU in sight. Asthenosphere just turned your laptop into a speculative decoding beast.

Asthenosphere performance logs showing 3866 tok/s on AMD Phoenix NPU tiles

⚡ Key Takeaways

  • Asthenosphere achieves 3866 effective tok/s on AMD NPU with zero CPU/GPU usage.
  • Full 12-tile transformer pipeline enables speculative decoding at 91.8% acceptance.
  • Edge AI shift incoming: NPUs like Phoenix XDNA redefine on-device inference.
Published by

DevTools Feed

Ship faster. Build smarter.

Worth sharing?

Get the best Developer Tools stories of the week in your inbox — no noise, no spam.

Originally reported by dev.to

Stay in the loop

The week's most important stories from DevTools Feed, delivered once a week.