N
Hacker Next
new
show
ask
jobs
submit
login
Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference
cerebras.ai
426 points by
benchmarkist
3 days ago
|
155 comments
add comment