Hasso Plattner Institute (HPI), Potsdam, Germany
Stars
Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs
A generative speech model for daily dialogue.
Repository and hands-on workshop on how to develop applications with local LLMs
A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.
A toolkit that enhances PyTorch with specialized functions for low-bit quantized neural networks.
Run GreenBitAI's Quantized LLMs on Apple Devices with MLX
Analyze the inference of Large Language Models (LLMs): computation, storage, transmission, and the hardware roofline model, all in a user-friendly interface.
Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive data leaving your network.
A fast inference library for running LLMs locally on modern consumer-class GPUs
Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs
MobiLlama: Small Language Model tailored for edge devices
Strong and Open Vision Language Assistant for Mobile Devices
[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
Official implementation of Half-Quadratic Quantization (HQQ)
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".
PB-LLM: Partially Binarized Large Language Models
[ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
A collection of phenomena observed during the scaling of big foundation models, which may develop into consensus, principles, or laws in the future.
AlpinDale / QuIP-for-Llama
Forked from Cornell-RelaxML/QuIP. Code for the paper "QuIP: 2-Bit Quantization of Large Language Models With Guarantees", adapted for Llama models.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Awesome LLM compression research papers and tools.
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization