FP8 runs ~100 TFLOPS faster when the kernel name has "cutlass" in it

Status
Not open for further replies.