Skip to content
View enp1s0's full-sized avatar
🤯
Computing
🤯
Computing

Organizations

@FDPS @rioyokotalab @mori-lab @rapidsai @wmmae @hpc-wakate

Block or report enp1s0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. wmmae/wmma_extension wmmae/wmma_extension Public

    An extension library of WMMA API (Tensor Core API)

    Cuda 87 14

  2. ozIMMU ozIMMU Public

    FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme

    Cuda 49 2

  3. cutf cutf Public

    CUDA Template Functions

    C++ 19 1

  4. CULiP CULiP Public

    Library for profiling the execution time of CUDA official library functions

    Cuda 7

  5. cuMpSGEMM cuMpSGEMM Public

    Fast SGEMM emulation on Tensor Cores

    Cuda 7

  6. shgemm shgemm Public

    Fast multiplication of single-precision and half-precision matrices on Tensor Cores

    Cuda 7