LLM Finetuning using HuggingFace + Determined

In this demo, we finetune the TinyLlama-1.1B-Chat model on a text-to-SQL dataset. We ran this demo on two 80 GB A100 GPUs.

To get started, first install Determined on your local machine:

pip install determined
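
The det e create command below submits the experiment to a running Determined cluster. If you don't already have one, a simple option (assuming Docker is available on your machine) is to bring up a local single-node cluster:

det deploy local cluster-up

By default the CLI then talks to the master at http://localhost:8080, which also serves the Determined web UI.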

Then finetune:

det e create distributed.yaml . 

Change configuration options in distributed.yaml; a sketch of the file follows the list below. Some important options are:

  • slots_per_trial: the number of GPUs to use.
  • dataset_subset: the difficulty subset to train on.
  • per_device_train_batch_size: the batch size per GPU.
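
For orientation, a minimal distributed.yaml might be shaped like the sketch below. This is only a sketch: the exact field layout (for example, nesting per_device_train_batch_size under a training_args block, the searcher metric name, and the entrypoint) is an assumption, so treat the distributed.yaml shipped in this repo as the source of truth.

name: tinyllama-1.1b-chat-text2sql
resources:
  slots_per_trial: 2                   # number of GPUs to use
searcher:
  name: single
  metric: eval_loss                    # assumed metric name
  max_length:
    batches: 5000
hyperparameters:
  dataset_subset: easy                 # "easy", "medium", or "hard"
  training_args:
    per_device_train_batch_size: 1     # batch size per GPU
entrypoint: python3 finetune.py        # hypothetical entrypoint script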

Test your model's generation capabilities:

python test_model.py --exp_id <exp_id> --dataset_subset <dataset_subset>

Where

  • <exp_id> is the ID of your finetuning experiment, as shown in the Determined UI.
  • <dataset_subset> is one of "easy", "medium", or "hard".
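
For example, if your finetuning experiment appears in the UI as experiment 42 (a placeholder ID), the call would look like:

python test_model.py --exp_id 42 --dataset_subset medium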

To test the pretrained model (not finetuned), leave out --exp_id. For example:

python test_model.py --dataset_subset easy
