LlamaForge

Fine-tuning pipeline for LLMs with LoRA, supporting CPU and GPU execution. Integrates with Ollama for model management and GGUF conversion.


⚠️ Project Status

Maturity: Alpha (v0.1.0) - Experimental research tool

What works well:

  • ✅ LoRA fine-tuning on 1-7B parameter models
  • ✅ GGUF conversion for Ollama deployment
  • ✅ Distributed training with SOLLOL integration
  • ✅ CPU and GPU execution with automatic detection
  • ✅ Interactive wizard and CLI modes

Known limitations:

  • ⚠️ Memory requirements: 16GB+ RAM for 7B models, OOM possible during merge step
  • ⚠️ Small datasets break models: Minimum 500-1000 samples required
  • ⚠️ No automated tests: Manual testing only, no CI/CD
  • ⚠️ Limited architecture support: Tested on Llama, Mistral, CodeLlama, Qwen
  • ⚠️ Not battle-tested: Limited production usage

Recommended for:

  • 🎓 Learning distributed training and LoRA fine-tuning
  • 🔬 Research and experimentation
  • 🏠 Personal projects with adequate hardware (16GB+ RAM)
  • 🛠️ Contributors who want to help mature the project

NOT recommended for:

  • ❌ Production training pipelines
  • ❌ Systems with <16GB RAM
  • ❌ Mission-critical workloads
  • ❌ Users unfamiliar with ML/PyTorch

Read before using: TECHNICAL_REALITY.md and TRAINING_ISSUES_ANALYSIS.md for honest documentation of limitations.


Overview

LlamaForge provides a streamlined workflow for fine-tuning large language models using Parameter-Efficient Fine-Tuning (PEFT) with LoRA. The system handles dataset preprocessing, training, and conversion to GGUF format for use with Ollama.
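
For orientation, the core of a LoRA fine-tuning run built on the transformers and peft libraries looks roughly like the sketch below. This is a minimal illustration, not LlamaForge's actual implementation; the model name and hyperparameters are placeholders matching the defaults listed later in this README.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model, TaskType

model_name = "mistralai/Mistral-7B-v0.1"  # placeholder base model
device = "cuda" if torch.cuda.is_available() else "cpu"  # automatic hardware detection

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).to(device)

# Wrap the base model with low-rank adapters; only the adapter weights are trained.
lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,              # adapter rank
    lora_alpha=16,    # scaling factor
    lora_dropout=0.05,
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of total parameters
# ... tokenize the dataset and run a standard Trainer loop on `model` ...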

Key Features

  • Automatic Hardware Detection: GPU acceleration when available, CPU fallback otherwise
  • Memory-Efficient Training: LoRA fine-tuning with gradient checkpointing
  • Flexible Dataset Loading: Supports JSON, JSONL, CSV, and plain text formats
  • Ollama Integration: Detects locally available models and exports to GGUF
  • CPU Optimization: Multi-threaded CPU training with aggressive memory optimizations
  • Interactive and CLI Modes: Choose between guided wizard or direct command-line usage

Installation

Prerequisites

  • Python 3.8+
  • 16GB+ RAM (for 7B parameter models)
  • 50GB+ disk space (for models and checkpoints)
  • Ollama installed (optional, for model detection and deployment)

Setup

git clone https://github.com/B-A-M-N/LlamaForge.git
cd LlamaForge
pip install -r requirements.txt

Quick Start

Interactive Mode

The interactive wizard guides you through model selection, dataset configuration, and training parameters:

python llamaforge_interactive.py

The wizard will:

  1. Scan for locally available Ollama models
  2. Help you select or specify a base model
  3. Configure dataset and training parameters
  4. Execute training and optional GGUF conversion

Command Line Mode

For direct execution with known parameters:

python llamaforge.py \
    --model mistralai/Mistral-7B-v0.1 \
    --data train.jsonl \
    --epochs 3 \
    --output finetuned-model.gguf

Usage

Basic Training

python llamaforge.py \
    --model mistralai/Mistral-7B-v0.1 \
    --data train.jsonl \
    --epochs 3

Advanced Configuration

python llamaforge.py \
    --model meta-llama/Llama-2-7b-hf \
    --data dataset.jsonl \
    --epochs 5 \
    --batch-size 2 \
    --gradient-accumulation 4 \
    --learning-rate 1e-4 \
    --lora-r 16 \
    --lora-alpha 32 \
    --max-length 1024 \
    --quantization q4_k_m \
    --output finetuned-model.gguf

Output Formats

GGUF (default): For Ollama deployment

python llamaforge.py --model MODEL --data DATA --output model.gguf

HuggingFace: For further processing or deployment

python llamaforge.py --model MODEL --data DATA --no-gguf
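
A model kept in HuggingFace format can be loaded back with the transformers library for further processing or inference. A minimal sketch (the output directory below is a placeholder):

from transformers import AutoModelForCausalLM, AutoTokenizer

model_dir = "./finetuned-model"  # placeholder: wherever the merged model was saved
tokenizer = AutoTokenizer.from_pretrained(model_dir)
model = AutoModelForCausalLM.from_pretrained(model_dir)

inputs = tokenizer("What is AI?", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))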

Dataset Formats

LlamaForge automatically detects and processes multiple dataset formats.

JSONL (Recommended)

{"prompt": "What is AI?", "completion": "Artificial Intelligence is..."}
{"instruction": "Translate to French", "input": "Hello", "output": "Bonjour"}
{"question": "What is 2+2?", "answer": "4"}

Supported field combinations:

  • prompt + completion
  • instruction + input + output
  • instruction + output
  • question + answer
  • text (for continued pre-training)
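
As a rough illustration of how these combinations collapse into a single training text, a normalization helper might look like the sketch below (build_text is hypothetical, not LlamaForge's actual loader, and the real preprocessing may apply prompt templates):

def build_text(record: dict) -> str:
    # Hypothetical normalization of the supported field combinations
    if "prompt" in record and "completion" in record:
        return record["prompt"] + "\n" + record["completion"]
    if "instruction" in record:
        parts = [record["instruction"], record.get("input", ""), record["output"]]
        return "\n".join(part for part in parts if part)
    if "question" in record and "answer" in record:
        return record["question"] + "\n" + record["answer"]
    return record["text"]  # plain text for continued pre-training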

CSV

prompt,completion
"What is AI?","Artificial Intelligence is..."
"Explain Python","Python is a programming language..."

Plain Text

Each line is treated as a separate training example, which is useful for continued pre-training on domain-specific text.

Parameters

Required Arguments

Argument   Description
--model    Base model (HuggingFace identifier or local path)
--data     Path to training dataset file

Training Configuration

Argument                  Default   Description
--epochs                  3         Number of training epochs
--batch-size              1         Training batch size
--gradient-accumulation   4         Gradient accumulation steps
--learning-rate           2e-4      Learning rate
--max-length              512       Maximum sequence length

LoRA Configuration

Argument         Default   Description
--lora-r         8         LoRA rank (adapter dimension)
--lora-alpha     16        LoRA scaling factor
--lora-dropout   0.05      Dropout probability

Output Configuration

Argument         Default          Description
--output         auto-generated   Output file path
--quantization   q4_k_m           GGUF quantization method
--no-gguf        False            Skip GGUF conversion

Training Pipeline

The system executes the following steps:

  1. Model Loading: Loads base model from HuggingFace or local cache
  2. Dataset Processing: Automatically detects format and structures data
  3. LoRA Initialization: Configures parameter-efficient adapters
  4. Training: Executes fine-tuning with gradient checkpointing
  5. Adapter Merging: Combines LoRA weights with base model
  6. GGUF Conversion: Quantizes and converts to GGUF format (if enabled)
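
Step 5 is the memory-intensive part flagged in the project status above; with the peft library, merging typically amounts to a merge_and_unload call, roughly as sketched below (model and directory names are placeholders):

from peft import PeftModel
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.1")  # placeholder base model
model = PeftModel.from_pretrained(base, "./lora-adapter")                 # placeholder adapter dir
merged = model.merge_and_unload()         # folds the LoRA weights into the base weights
merged.save_pretrained("./merged-model")  # input for the optional GGUF conversion step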

Distributed Training

LlamaForge supports distributed training across multiple nodes using PyTorch DDP and SOLLOL for node discovery.

Quick Start

# Automatic node discovery and launch
python launch_distributed_training_direct.py \
    --model TinyLlama/TinyLlama-1.1B-Chat-v1.0 \
    --dataset examples/datasets/alpaca_1k.jsonl \
    --epochs 1

For detailed distributed training setup, see DISTRIBUTED_TRAINING_SOLLOL.md.
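
For context, PyTorch DDP wraps the model so that each process trains on its own shard of the data and gradients are averaged across ranks on every backward pass. A minimal, framework-level sketch (not the SOLLOL launcher itself; the Linear layer stands in for the LoRA-wrapped model):

import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

# Typically launched via torchrun, which sets RANK, WORLD_SIZE, and MASTER_ADDR.
dist.init_process_group(backend="gloo")  # use "nccl" when every node has a GPU

model = torch.nn.Linear(10, 10)  # stand-in for the LoRA-wrapped model
ddp_model = DDP(model)           # gradients are all-reduced across ranks on backward()

# ... run the usual training loop on this rank's shard of the dataset ...

dist.destroy_process_group()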

Use with Ollama

After training completes with GGUF output:

# Create Modelfile
echo "FROM ./finetuned-model.gguf" > Modelfile

# Import to Ollama
ollama create my-finetuned-model -f Modelfile

# Run inference
ollama run my-finetuned-model "Your prompt here"

Examples

Code Generation Fine-Tuning

python llamaforge.py \
    --model codellama/CodeLlama-7b-hf \
    --data examples/datasets/code_alpaca_full.jsonl \
    --max-length 2048 \
    --epochs 3 \
    --lora-r 16

Instruction Following

python llamaforge.py \
    --model mistralai/Mistral-7B-v0.1 \
    --data examples/datasets/alpaca_gpt4.jsonl \
    --epochs 3 \
    --learning-rate 1e-4

Chain-of-Thought Reasoning

python llamaforge.py \
    --model meta-llama/Llama-2-7b-hf \
    --data examples/datasets/gsm8k_cot.jsonl \
    --epochs 5 \
    --max-length 1024

Performance Characteristics

CPU Training

  • 7B Model: ~2-4 hours per epoch (dataset and hardware dependent)
  • Memory: ~16-20GB RAM for 7B models with LoRA
  • Optimization: Automatic CPU core utilization and memory management

GPU Training

  • 7B Model: ~15-30 minutes per epoch (on modern GPU)
  • Memory: ~12-16GB VRAM for 7B models
  • Multiple GPUs: Automatic data parallelism when available

Memory Management

Model Size   Minimum RAM   Recommended RAM
1-3B         8GB           12GB
7B           16GB          24GB
13B          32GB          48GB

If you encounter OOM errors:

  • Reduce --batch-size to 1
  • Decrease --max-length
  • Lower --lora-r (e.g., 4 or 8)
  • Increase --gradient-accumulation (the effective batch size is --batch-size × --gradient-accumulation, so accumulation keeps the effective batch while lowering peak memory)

Project Structure

LlamaForge/
├── src/
│   ├── lora_trainer.py          # Core training logic
│   ├── dataset_loader.py        # Dataset preprocessing
│   ├── gguf_converter.py        # GGUF conversion
│   ├── ollama_utils.py          # Ollama integration
│   └── sollol_integration.py    # Distributed training
├── examples/
│   └── datasets/                # Example datasets
├── llamaforge.py                # Main CLI
├── llamaforge_interactive.py    # Interactive wizard
├── launch_distributed_training_direct.py  # Distributed launcher
└── requirements.txt

Documentation

Limitations

  • Model Support: Primarily tested with Llama, Mistral, CodeLlama, and Qwen architectures
  • Dataset Size: In-memory loading may be problematic for very large datasets (>1GB)
  • Quantization: GGUF conversion requires llama.cpp compatibility
  • Distributed Training: Requires manual setup on worker nodes

Troubleshooting

Import Errors

Ensure all dependencies are installed:

pip install -r requirements.txt --upgrade

Out of Memory

Reduce memory usage:

python llamaforge.py \
    --model MODEL \
    --data DATA \
    --batch-size 1 \
    --max-length 256 \
    --lora-r 4

Slow Training

For CPU training:

  • Use smaller batch sizes with higher gradient accumulation
  • Reduce max sequence length
  • Consider using a smaller base model

GGUF Conversion Fails

Ensure llama.cpp is installed:

git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp && make

Contributing

Contributions are welcome. Please:

  1. Fork the repository
  2. Create a feature branch
  3. Add tests for new functionality
  4. Ensure existing tests pass
  5. Submit a pull request

License

MIT License - See LICENSE file for details

Acknowledgments
