arxiv:2403.13372

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Published on Mar 20

· Submitted by

akhaliq on Mar 21

#2 Paper of the day

Authors:

Yaowei Zheng ,

,

Junhao Zhang ,

Yanhan Ye ,

Abstract

Efficient fine-tuning is vital for adapting large language models (LLMs) to downstream tasks. However, it requires non-trivial efforts to implement these methods on different models. We present LlamaFactory, a unified framework that integrates a suite of cutting-edge efficient training methods. It allows users to flexibly customize the fine-tuning of 100+ LLMs without the need for coding through the built-in web UI LlamaBoard. We empirically validate the efficiency and effectiveness of our framework on language modeling and text generation tasks. It has been released at https://github.com/hiyouga/LLaMA-Factory and already received over 13,000 stars and 1,600 forks.

View arXiv page View PDF Add to collection

Community

AdinaY

Mar 21

Impressive work🔥 The demo is user friendly and supports Chinese/English/Russian : https://huggingface.co/spaces/hiyouga/LLaMA-Board

Mar 22

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Models citing this paper 10

Browse 10 models citing this paper

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2403.13372 in a dataset README.md to link it from this page.

Spaces citing this paper 10

Collections including this paper 28