Skip to main content

Supported Models

Well-supported LLMs

Fine-tuning is currently available for the following models:

Model NameParametersArchitectureLicenseContext WindowSupported Fine-Tuning Context Window
mistral-7b7 billionMistralApache 2.03276832768
mistral-7b-instruct7 billionMistralApache 2.03276832768
mistral-7b-instruct-v0-27 billionMistralApache 2.03276832768
mixtral-8x7b46.7 billionMixtralApache 2.0327687168
mixtral-8x7b-v0-146.7 billionMixtralApache 2.0327687168
mixtral-8x7b-instruct-v0-146.7 billionMixtralApache 2.0327687168
llama-3-8b8 billionLlama-3Meta (request for commercial use)81928192
llama-3-8b-instruct8 billionLlama-3Meta (request for commercial use)81928192
llama-3-70b70 billionLlama-3Meta (request for commercial use)81928192
llama-3-70b-instruct70 billionLlama-3Meta (request for commercial use)81928192
llama-2-7b7 billionLlama-2Meta (request for commercial use)40964096
llama-2-7b-chat7 billionLlama-2Meta (request for commercial use)40964096
llama-2-13b13 billionLlama-2Meta (request for commercial use)40964096
llama-2-13b-chat13 billionLlama-2Meta (request for commercial use)40964096
llama-2-70b70 billionLlama-2Meta (request for commercial use)40964096
llama-2-70b-chat70 billionLlama-2Meta (request for commercial use)40964096
codellama-13b-instruct13 billionLlama-2Meta (request for commercial use)1638416384
codellama-70b-instruct70 billionLlama-2Meta (request for commercial use)40964096
codellama-7b7 billionLlama-2Meta (request for commercial use)40964096
codellama-7b-instruct7 billionLlama-2Meta (request for commercial use)40964096
gemma-2b2.5 billionGemmaGoogle81928192
gemma-2b-instruct2.5 billionGemmaGoogle81928192
gemma-7b8.5 billionGemmaGoogle81928192
gemma-7b-instruct8.5 billionGemmaGoogle81928192
zephyr-7b-beta7 billionMistralMIT3276832768
phi-22.78 billionPhi-2Microsoft20482048
phi-3-mini-4k-instruct3.92 billionPhi-3Microsoft40964096

Many of the latest OSS models are released in two variants:

  • Base model (llama-2-7b, etc): These are models that are primarily trained on the objective of text completion.
  • Instruction-Tuned (llama-2-7b-chat, mistral-7b-instruct, etc): These are models that have been further trained on (instruction, output) pairs in order to better respond to human instruction-styled inputs. The instructions effectively constrains the model’s output to align with the response characteristics or domain knowledge.

Best-Effort LLMs (via HuggingFace)

Best-effort fine-tuning is also offered for any Huggingface LLM meeting the following criteria:

  • Has the "Text Generation" and "Transformer" tags
  • Does not have a "custom_code" tag
  • Are not post-quantized (ex. model containing a quantization method such as "AWQ" in the name)
  • Has text inputs and outputs

"Best-effort" means we will try to support these models but it is not guaranteed.

Fine-tuning a custom LLM

  1. Get the Huggingface ID for your model by clicking the the copy icon on the custom base model's page, ex. "BioMistral/BioMistral-7B".

Huggingface screenshot

  1. Pass the Huggingface ID as the base_model.
# Create an adapter repository
repo = pb.repos.create(name="bio-summarizer", description="Bio News Summarizer", exists_ok=True)

# Start a fine-tuning job, blocks until training is finished
adapter = pb.adapters.create(
config=FinetuningConfig(
base_model="BioMistral/BioMistral-7B"
),
dataset="bio-dataset",
repo=repo,
description="initial model with defaults"
)
dedicated deployment needed for inference

Note that if you fine-tune a custom model not on our serverless deployments list, you'll need to deploy the custom base model as a dedicated deployment in order to run inference on your newly trained adapter.