Add functions for input-masked loss calculation and batching #825

chimezie · 2024-06-07T16:37:02Z

Adds support for completion-only finetuning via functions for iterating over batching that also calculates input masks and a loss function using the masks and updates in (D/L)oRA tuner to use it w/ --mask-inputs (addressing/adding: #484, see: #1086)

-- Updated to keep up with mlx(_lm) changes, etc.

llms/mlx_lm/tuner/trainer.py

… an updated attempt to better sync with iterate_batches logic

…_only

…iterate_batches) by default.

Renamed the batch iteration function (iterate_delineated_batches -> iterate_completion_batches).

…_only

Simplify by removing unnecessary imports and the unused chat templating method. Adjust tokens and batch handling to properly manage sequence lengths and masking using latest, related mlx_lm bits (ml-explore/mlx-examples#825).

chimezie · 2024-11-13T12:09:26Z

Superseded by #1103

chimezie added 5 commits June 7, 2024 12:35

Add input_masked loss calculation and batching w/ padding

59e937c

Merge branch 'ml-explore:main' into completion_only

8c1d33d

Merge branch 'ml-explore:main' into completion_only

0a3ec90

Merge branch 'ml-explore:main' into completion_only

1929f53

Merge branch 'ml-explore:main' into completion_only

95fb224

awni reviewed Nov 4, 2024

View reviewed changes

llms/mlx_lm/tuner/trainer.py Outdated Show resolved Hide resolved

awni reviewed Nov 4, 2024

View reviewed changes

llms/mlx_lm/tuner/trainer.py Outdated Show resolved Hide resolved

chimezie added 3 commits November 4, 2024 22:00

Merge branch 'ml-explore:main' into completion_only

a1fbc52

Replace iterate_input_masked_batches with iterate_delineated_batches,…

b7b3332

… an updated attempt to better sync with iterate_batches logic

Merge branch 'ml-explore:main' into completion_only

603dab5

chimezie changed the title ~~Add functions for input-masked loss calculation and padded batching~~ Add functions for input-masked loss calculation and batching Nov 5, 2024

chimezie added 11 commits November 5, 2024 15:25

Minor documentation update

5579b48

Merge remote-tracking branch 'origin/completion_only' into completion…

e0d66f5

…_only

Updates CL lora tuner with input masking that uses default_loss (and …

4b88c33

…iterate_batches) by default.

Fix variable reference

3c76a25

Update sublist search and calculation of input id length

960ed79

Fix

bfa6c29

Merge branch 'ml-explore:main' into completion_only

7f89ace

Merge branch 'ml-explore:main' into completion_only

3080102

Add input masking for fine-tuning in documentation

01e330d

Renamed the batch iteration function (iterate_delineated_batches -> iterate_completion_batches).

Merge remote-tracking branch 'origin/completion_only' into completion…

791727f

…_only

Update documentation

4ddbb98

chimezie mentioned this pull request Nov 10, 2024

Completion only fine-tuning of instruction models with collections of HF datasets #1103

Open

chimezie closed this Nov 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add functions for input-masked loss calculation and batching #825

Add functions for input-masked loss calculation and batching #825

chimezie commented Jun 7, 2024 •

edited

Loading

chimezie commented Nov 13, 2024

Add functions for input-masked loss calculation and batching #825

Add functions for input-masked loss calculation and batching #825

Conversation

chimezie commented Jun 7, 2024 • edited Loading

chimezie commented Nov 13, 2024

chimezie commented Jun 7, 2024 •

edited

Loading