Efficient Streaming Language Models with Attention Sinks

arxiv.org

5 points by jxmorris12 17 hours ago