Skip to content
View mgoin's full-sized avatar
🤠
🤠

Sponsoring

@vllm-project

Organizations

@neuralmagic @vllm-project

Block or report mgoin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. vllm-project/vllm vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 32.6k 5k

  2. vllm-project/llm-compressor vllm-project/llm-compressor Public

    Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

    Python 814 67

  3. neuralmagic/deepsparse neuralmagic/deepsparse Public

    Sparsity-aware deep learning inference runtime for CPUs

    Python 3.1k 176

  4. neuralmagic/sparseml neuralmagic/sparseml Public

    Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

    Python 2.1k 149

  5. advos advos Public

    RISC-V OS in Rust with hardware support for SiFive's HiFive1 board

    Rust

  6. torch_bitmask torch_bitmask Public

    Implementations for fast bitmask compression for weight sparsity in PyTorch

    Python 3