Optionally delegate classifiers to XGBoost for finetuning and inference #114

JackHopkins · 2023-12-04T01:48:04Z

Is your feature request related to a problem? Please describe.
LLMs are extremely inefficient at classification. XGBoost is better if the data is available. We could use the aligned data from the LLM to train an XGBoost model, which would be much faster to run.

Describe the solution you'd like
When the output types denote a classification task (i.e where the goal is to sample one type in a union of literal types, or an enum), we optionally distil the teacher model into a decision forest using the XGBoost library.

Additional context
We could represent student models as optional packages, sort of like drivers, that the user could install through PIP.

E.g pip3 install tanuki.py[xgboost]

The text was updated successfully, but these errors were encountered:

JackHopkins added the enhancement New feature or request label Dec 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optionally delegate classifiers to XGBoost for finetuning and inference #114

Optionally delegate classifiers to XGBoost for finetuning and inference #114

JackHopkins commented Dec 4, 2023

Optionally delegate classifiers to XGBoost for finetuning and inference #114

Optionally delegate classifiers to XGBoost for finetuning and inference #114

Comments

JackHopkins commented Dec 4, 2023