ProjectPhysX / PTXprofiler (Stars: 35)
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.
Topics: hpc, profiler, gpu, opencl, cuda, nvidia, gpu-acceleration, gpu-computing, sycl, nvidia-cuda, nvidia-gpu, ptx, gpu-programming, roofline-model, ptx-utils
Updated Dec 31, 2023. Language: C++

ekondis / gpuroofperf-toolkit (Stars: 15)
A GPU performance prediction toolkit for CUDA programs.
Topics: benchmark, cuda, performance-analysis, gpu-computing, roofline-model
Updated Mar 25, 2019. Language: Cuda

giopaglia / rooflini (Stars: 4)
A Python script for plotting roofline analyses. Intel Advisor style.
Topics: matplotlib, roofline-model, intel-advisor, intel-advisor-style, roofline-plot
Updated Aug 15, 2019. Language: Python

emilioj / cs-roofline-toolkit (Stars: 3)
Fork of the CS Roofline Toolkit from Berkeley Lab.
Topics: performance-analysis, roofline-model, performance-characterization
Updated Jun 11, 2018. Language: Java

ebt-hpc / cca-ebt (Stars: 3)
Code Comprehension Assistance for Evidence-Based performance Tuning.
Topics: fortran, optimization, roofline-model, code-comprehension, kernel-classification, loop-kernel, operational-intensity
Updated Oct 30, 2023. Language: Python

cissieAB / ifarm-gpus (Stars: 0)
JLab ifarm GPU specifications.
Topics: gpu, neural-networks, roofline-model
Updated Feb 6, 2023. Language: Python