deepspeed

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.

fine-tuning pipeline-parallelism pretraining model-parallel deepspeed mllm multimodal-large-language-models qwen video-large-language-models video-language-model

Updated Sep 24, 2024
Jupyter Notebook

LambdaLabsML / distributed-training-guide

Star

Best practices & guides on how to write distributed pytorch training code

gpu cluster mpi cuda slurm pytorch sharding kuberentes distributed-training nccl gpu-cluster deepspeed fsdp lambdalabs

Updated Oct 31, 2024
Python

stanleylsx / llms_tool

Star

一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。

bloom pytorch falcon llama moss mistral aquila baichuan deepspeed chatglm chatglm2 internlm llama2 qwen xverse baichuan2 aquila2 chatglm3

Updated Dec 8, 2023
Python

OpenCSGs / llm-inference

Star

llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource management, monitoring, and more.

transformer ray deepspeed llama-cpp vllm llm-inference

Updated May 17, 2024
Python

git-cloner / llama2-lora-fine-tuning

Star

llama2 finetuning with deepspeed and lora

lora finetuning deepspeed llama2

Updated Jul 28, 2023
Python

xyjigsaw / LLM-Pretrain-SFT

Star

Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)

llama lora mistral deepspeed large-language-models baichuan2

Updated Jan 30, 2024
Python

pszemraj / ai-msgbot

Star

Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.

Updated Sep 6, 2022
Jupyter Notebook

jackaduma / ChatGLM-LoRA-RLHF-PyTorch

Star

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM

pytorch llama gpt lora finetune ppo peft deepspeed llm chatgpt rlhf reward-models chatglm chatglm-6b

Updated Apr 28, 2023
Python

CoinCheung / gdGPT

Star

Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.

nlp bloom pipeline pytorch deepspeed llm full-finetune model-parallization flash-attention llama2 baichuan2-7b chatglm3-6b mixtral-8x7b

Updated Feb 5, 2024
Python

billvsme / train_law_llm

Star

✏️0成本LLM微调上手项目，⚡️一步一步使用colab训练法律LLM，基于microsoft/phi-1_5、chatglm3，包含lora微调，全参微调

python law ai lora deepspeed llm llama2

Updated Dec 27, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the deepspeed topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the deepspeed topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

deepspeed

Here are 71 public repositories matching this topic...

InternLM / lmdeploy

OpenRLHF / OpenRLHF

zjunlp / KnowLM

PKU-Alignment / safe-rlhf

Xirider / finetune-gpt2xl

OpenMOSS / CoLLiE

alibaba / Megatron-LLaMA

sunzeyeah / RLHF

intelligent-machine-learning / glake

shm007g / LLaMA-Cult-and-More

Coobiw / MPP-LLaVA

LambdaLabsML / distributed-training-guide

stanleylsx / llms_tool

OpenCSGs / llm-inference

git-cloner / llama2-lora-fine-tuning

xyjigsaw / LLM-Pretrain-SFT

pszemraj / ai-msgbot

jackaduma / ChatGLM-LoRA-RLHF-PyTorch

CoinCheung / gdGPT

billvsme / train_law_llm

Improve this page

Add this topic to your repo