Definition
An open-source toolkit for fine-tuning Llama-family and other open language models.
A widely used open-source LLM fine-tuning library built on top of DeepSpeed and Hugging Face, supporting SFT, RLHF, and parameter-efficient methods.
An open-source toolkit for fine-tuning Llama-family and other open language models.
A widely used open-source LLM fine-tuning library built on top of DeepSpeed and Hugging Face, supporting SFT, RLHF, and parameter-efficient methods.