Increase the inference speed of the model
Trying to write a mini Triton backend in Rust.
Deploy KoGPT with Triton Inference Server
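Deploying a model such as KoGPT on Triton Inference Server starts from a model configuration file in the model repository. The snippet below is a minimal, hypothetical `config.pbtxt` sketch; the model name, backend, and tensor shapes are illustrative assumptions, not taken from the repository above.

```
name: "kogpt"
backend: "python"
max_batch_size: 8
input [
  {
    name: "input_ids"
    data_type: TYPE_INT64
    dims: [ -1 ]
  }
]
output [
  {
    name: "logits"
    data_type: TYPE_FP32
    dims: [ -1, -1 ]
  }
]
instance_group [ { kind: KIND_GPU, count: 1 } ]
```

Triton reads this file from `<model_repository>/<model_name>/config.pbtxt` and uses it to validate request and response tensors.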
A library for interfacing with Triton.
This bootcamp is designed to give NLP researchers an end-to-end overview of the fundamentals of the NVIDIA NeMo framework, a complete solution for building large language models. It also includes hands-on exercises complemented by tutorials, code snippets, and presentations to help researchers get started with the NeMo LLM Service and Guardrails.
Learnings and experimentation with GPU programming
WPF application for editing XML based configuration files
Run CI jobs in Manta when triggered by pull requests.
Package for running NVIDIA Triton within Python tests, with features like a Dockerfile DSL and building images on the fly.
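A Dockerfile DSL of the kind described above can be sketched as a small builder that assembles Dockerfile text from chained Python calls. The class and method names below are hypothetical illustrations, not the package's actual API.

```python
class Dockerfile:
    """Minimal, hypothetical Dockerfile DSL: collects instructions as text."""

    def __init__(self, base_image: str):
        # Every Dockerfile starts with a FROM instruction.
        self._lines = [f"FROM {base_image}"]

    def run(self, command: str) -> "Dockerfile":
        self._lines.append(f"RUN {command}")
        return self  # return self to allow chaining

    def copy(self, src: str, dst: str) -> "Dockerfile":
        self._lines.append(f"COPY {src} {dst}")
        return self

    def render(self) -> str:
        # Produce the final Dockerfile text, one instruction per line.
        return "\n".join(self._lines) + "\n"


# Example: assemble a Dockerfile for a Triton server image on the fly.
dockerfile = (
    Dockerfile("nvcr.io/nvidia/tritonserver:23.10-py3")
    .run("pip install numpy")
    .copy("model_repository/", "/models")
    .render()
)
print(dockerfile)
```

The rendered string could then be handed to a container build step (e.g. via the Docker SDK) without ever writing a Dockerfile to disk by hand.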
This repository contains everything regarding the bachelor thesis: NLPiP (NLP in Production).
Framework, Model & Kernel Optimizations for Distributed Deep Learning - Data Hack Summit
Manta adapter for Spine models running in NodeJS
Adds some extra features to transformers.
The benchmark for OpenAI Triton.
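Kernel benchmarks like the one above generally time a callable over repeated runs after a warm-up phase and report a robust statistic such as the median. OpenAI Triton ships its own helper for this (`triton.testing.do_bench`, which additionally handles GPU synchronization); the plain-Python sketch below only illustrates the basic idea and is not the benchmark's actual code.

```python
import statistics
import time


def bench(fn, warmup=5, reps=20):
    """Time fn() over `reps` runs after `warmup` runs; return median milliseconds."""
    for _ in range(warmup):
        fn()  # warm caches / JIT before measuring
    times = []
    for _ in range(reps):
        start = time.perf_counter()
        fn()
        times.append((time.perf_counter() - start) * 1e3)  # seconds -> ms
    return statistics.median(times)


# Example: benchmark a small pure-Python workload.
ms = bench(lambda: sum(i * i for i in range(10_000)))
```

The median is preferred over the mean here because one-off scheduler hiccups would otherwise skew the result.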