
Configuration Guide

Comprehensive guide to all ModelForge configuration options.

Overview

ModelForge provides extensive configuration options for fine-tuning. This guide covers all available settings.

Basic Configuration

Required Fields

{
  "task": "text-generation",
  "model_name": "meta-llama/Llama-3.2-3B",
  "dataset": "/path/to/dataset.jsonl",
  "num_train_epochs": 3
}
  • task: Training task (text-generation, summarization, extractive-question-answering)
  • model_name: Hugging Face model ID or local path
  • dataset: Path to the training dataset in JSONL format
  • num_train_epochs: Number of training epochs
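
The dataset file contains one JSON object per line. The exact field names depend on ModelForge's dataset schema and are not covered in this guide; the "text" field below is an assumption used purely to illustrate the JSONL layout:

import json

# Hypothetical records: the "text" field name is an assumption, not
# ModelForge's documented schema. Only the one-object-per-line layout matters here.
records = [
    {"text": "Question: What is LoRA?\nAnswer: A parameter-efficient fine-tuning method."},
    {"text": "Question: What does QLoRA add?\nAnswer: 4-bit quantization of the base model."},
]

with open("dataset.jsonl", "w", encoding="utf-8") as f:
    for record in records:
        f.write(json.dumps(record) + "\n")  # one JSON object per line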

Provider Selection

{
  "provider": "huggingface"  // or "unsloth"
}
  • huggingface: Standard provider (default)
  • unsloth: 2x faster training (Linux/WSL/Docker only)

See Provider Documentation for details.

Strategy Selection

{
  "strategy": "sft"  // or "qlora", "rlhf", "dpo"
}
  • sft: Supervised fine-tuning (default)
  • qlora: Quantized LoRA for memory efficiency
  • rlhf: Reinforcement Learning from Human Feedback
  • dpo: Direct Preference Optimization

See Strategy Documentation for details.

Schema Validation Rules

ModelForge validates configuration combinations at startup:

  • DPO/RLHF strategies require "task": "text-generation" — using them with summarization or QA tasks will raise a validation error
  • Unsloth provider requires "task": "text-generation" — encoder-decoder models are not supported
  • Unsloth provider requires a fixed max_seq_length (cannot be -1)
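
These rules can be reproduced as simple pre-flight checks. The sketch below is not ModelForge's actual validator; it just restates the three rules above over a config dict loaded from JSON:

import json

def check_config(path: str) -> None:
    """Illustrative restatement of the documented validation rules."""
    with open(path) as f:
        cfg = json.load(f)

    task = cfg.get("task")
    strategy = cfg.get("strategy", "sft")
    provider = cfg.get("provider", "huggingface")

    if strategy in ("dpo", "rlhf") and task != "text-generation":
        raise ValueError(f"{strategy} requires task 'text-generation', got '{task}'")
    if provider == "unsloth":
        if task != "text-generation":
            raise ValueError("unsloth provider requires task 'text-generation'")
        if cfg.get("max_seq_length", -1) == -1:
            raise ValueError("unsloth provider requires a fixed max_seq_length (not -1)")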

Training Parameters

Epoch and Batch Settings

{
  "num_train_epochs": 3,
  "per_device_train_batch_size": 4,
  "per_device_eval_batch_size": 4,
  "gradient_accumulation_steps": 4
}
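
The effective batch size per optimizer step is the per-device batch size times the gradient accumulation steps (times the number of GPUs, if more than one). With the values above:

per_device_train_batch_size = 4
gradient_accumulation_steps = 4
num_gpus = 1  # assumption: single-GPU training

# 4 * 4 * 1 = 16 samples contribute to each optimizer update
effective_batch_size = per_device_train_batch_size * gradient_accumulation_steps * num_gpus
print(effective_batch_size)  # 16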

Learning Rate

{
  "learning_rate": 2e-4,
  "lr_scheduler_type": "cosine",
  "warmup_steps": 100
}

Optimization

{
  "optim": "adamw_torch",  // or "adamw_8bit", "sgd"
  "weight_decay": 0.01,
  "max_grad_norm": 1.0
}
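
These parameter names match Hugging Face's TrainingArguments. Assuming ModelForge passes them through unchanged (an assumption about its internals, not a documented guarantee), the learning-rate and optimizer settings above correspond to:

from transformers import TrainingArguments

# Sketch only; "./output" is a placeholder output directory.
args = TrainingArguments(
    output_dir="./output",
    learning_rate=2e-4,
    lr_scheduler_type="cosine",
    warmup_steps=100,
    optim="adamw_torch",  # "adamw_8bit" and "sgd" are the other values listed above
    weight_decay=0.01,
    max_grad_norm=1.0,
)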

LoRA Configuration

{
  "lora_r": 16,
  "lora_alpha": 32,
  "lora_dropout": 0.1,
  "target_modules": "all-linear"
}
  • lora_r: LoRA rank (8, 16, 32, 64)
  • lora_alpha: LoRA scaling factor (usually 2x the rank)
  • lora_dropout: Dropout rate applied to the LoRA layers
  • target_modules: Which modules to apply LoRA adapters to
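
These fields mirror the PEFT library's LoraConfig. As a sketch of the same settings in PEFT terms (assuming ModelForge forwards them to PEFT, which is typical for LoRA stacks but not confirmed here):

from peft import LoraConfig

lora_config = LoraConfig(
    r=16,                         # lora_r
    lora_alpha=32,                # lora_alpha
    lora_dropout=0.1,             # lora_dropout
    target_modules="all-linear",  # apply LoRA to every linear layer
    task_type="CAUSAL_LM",        # assumption: text-generation task
)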

Quantization

Note: Quantization requires the [quantization] extra: pip install modelforge-finetuning[quantization]

{
  "use_4bit": true,
  "use_8bit": false,
  "bnb_4bit_compute_dtype": "float16",
  "bnb_4bit_quant_type": "nf4"
}
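
The bnb_* fields mirror the options exposed by transformers' BitsAndBytesConfig. A sketch of the 4-bit settings above in that form (assuming a direct mapping):

import torch
from transformers import BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                     # use_4bit
    bnb_4bit_compute_dtype=torch.float16,  # bnb_4bit_compute_dtype
    bnb_4bit_quant_type="nf4",             # bnb_4bit_quant_type
)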

Mixed Precision

{
  "bf16": true,   // For Ampere+ GPUs (RTX 30xx/40xx)
  "fp16": false   // For older GPUs
}
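
bf16 needs hardware support (Ampere or newer); fp16 is the fallback for older GPUs. A quick way to check which one to enable:

import torch

# Enable bf16 only if the GPU supports it; otherwise fall back to fp16.
use_bf16 = torch.cuda.is_available() and torch.cuda.is_bf16_supported()
use_fp16 = torch.cuda.is_available() and not use_bf16
print({"bf16": use_bf16, "fp16": use_fp16})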

Sequence Length

{
  "max_seq_length": 2048  // or 512, 1024, 4096, 8192
}

Note: When using Unsloth, max_seq_length cannot be -1 (auto-inference).

Evaluation

{
  "eval_split": 0.2,
  "eval_steps": 100,
  "evaluation_strategy": "steps",
  "save_strategy": "steps",
  "save_steps": 500
}
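
eval_split holds out a fraction of the dataset for evaluation. As an illustration only (not ModelForge's internal code), a 0.2 split of a JSONL dataset with the datasets library looks like this:

from datasets import load_dataset

# Hold out 20% of the JSONL dataset for evaluation.
dataset = load_dataset("json", data_files="/path/to/dataset.jsonl", split="train")
splits = dataset.train_test_split(test_size=0.2, seed=42)
train_ds, eval_ds = splits["train"], splits["test"]
print(len(train_ds), len(eval_ds))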

Logging

{
  "logging_steps": 10,
  "logging_strategy": "steps",
  "report_to": "tensorboard"
}

Hardware Profiles

Instead of manual configuration, use hardware profiles:

{
  "compute_specs": "low_end"  // or "mid_range", "high_end"
}

See Hardware Profiles for details.

Complete Example

{
  "task": "text-generation",
  "model_name": "meta-llama/Llama-3.2-3B",
  "provider": "unsloth",
  "strategy": "qlora",
  "dataset": "/path/to/dataset.jsonl",
  "max_seq_length": 2048,
  "num_train_epochs": 3,
  "per_device_train_batch_size": 4,
  "gradient_accumulation_steps": 4,
  "learning_rate": 2e-4,
  "lora_r": 64,
  "lora_alpha": 16,
  "use_4bit": true,
  "bf16": true,
  "eval_split": 0.2,
  "eval_steps": 100,
  "save_steps": 500,
  "logging_steps": 10
}

Next Steps


For detailed API schema, see Training Configuration Schema.