CUDA-l2: Surpassing cuBLAS performance for matrix multiplication through RL (github.com)
132 points by dzign 4 days ago | 15 comments