Adrian Flores Cortes 3def230d58 Initial commit: local-llm-agent infrastructure project
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-02 16:42:45 -06:00

LoRA Adapters

This directory contains LoRA (Low-Rank Adaptation) adapters for project-specific fine-tuning.

Directory Structure

lora-adapters/
├── README.md           # This file
├── erp-core/           # ERP Core domain adapter
│   ├── adapter_config.json
│   └── adapter_model.safetensors
├── trading/            # Trading platform adapter
│   ├── adapter_config.json
│   └── adapter_model.safetensors
└── {project-name}/     # Additional project adapters
    ├── adapter_config.json
    └── adapter_model.safetensors
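Tooling can enumerate this layout by treating every subdirectory with an adapter_config.json as an adapter. A minimal sketch (the function name is illustrative, not part of any existing script):

```python
import json
from pathlib import Path

def discover_adapters(root: str) -> dict[str, dict]:
    """Map adapter name -> parsed adapter_config.json for every
    subdirectory of `root` that contains a config file."""
    adapters = {}
    for config_path in Path(root).glob("*/adapter_config.json"):
        with open(config_path) as f:
            adapters[config_path.parent.name] = json.load(f)
    return adapters
```

The adapter name is simply the directory name, which matches how the adapters are registered with vLLM below.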

Creating LoRA Adapters

Prerequisites

  • Base model: mistralai/Mistral-7B-Instruct-v0.2 (or compatible)
  • Training data in JSONL format
  • PEFT library for training
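Each JSONL record is one JSON object per line. A minimal chat-style example (the field names here are illustrative — use whatever schema your training script expects):

```jsonl
{"messages": [{"role": "user", "content": "How do I create an invoice?"}, {"role": "assistant", "content": "Open the Invoices module and choose New Invoice."}]}
{"messages": [{"role": "user", "content": "Where do I see stock levels?"}, {"role": "assistant", "content": "Stock levels are shown per warehouse in the Inventory module."}]}
```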

Training Example

from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load base model
model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-Instruct-v0.2")
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.2")

# Configure LoRA
lora_config = LoraConfig(
    r=64,                      # Rank
    lora_alpha=128,            # Alpha scaling
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM"
)

# Apply LoRA
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # sanity check: only a small fraction should be trainable

# Train...
# Save adapter (writes adapter_config.json and adapter_model.safetensors)
model.save_pretrained("lora-adapters/your-adapter")

Required Files

  1. adapter_config.json: LoRA configuration

    {
      "base_model_name_or_path": "mistralai/Mistral-7B-Instruct-v0.2",
      "peft_type": "LORA",
      "task_type": "CAUSAL_LM",
      "r": 64,
      "lora_alpha": 128,
      "lora_dropout": 0.05,
      "target_modules": ["q_proj", "k_proj", "v_proj", "o_proj"]
    }
    
  2. adapter_model.safetensors: LoRA weights

Using LoRA Adapters with vLLM

Configuration

Adapters are automatically mounted in the vLLM container:

# docker-compose.vllm.yml
volumes:
  - ./lora-adapters:/lora-adapters:ro
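Mounting alone does not register the adapters: the vLLM server must also be started with LoRA support enabled. A hedged sketch of the relevant server flags (flag names per vLLM's OpenAI-compatible server; the adapter paths follow the mount above):

```yaml
# docker-compose.vllm.yml -- sketch, values are illustrative
command: >
  --model mistralai/Mistral-7B-Instruct-v0.2
  --enable-lora
  --lora-modules erp-core=/lora-adapters/erp-core trading=/lora-adapters/trading
  --max-loras 2
  --max-lora-rank 64
```

Note that vLLM's default maximum LoRA rank is lower than 64, so --max-lora-rank typically must be raised to serve adapters trained with the r=64 configuration above.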

API Usage

vLLM's OpenAI-compatible server selects a LoRA adapter by its registered name, passed in the standard "model" field:

# Chat using the erp-core adapter
curl -X POST http://localhost:3160/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "erp-core",
    "messages": [
      {"role": "user", "content": "How do I create an invoice in the ERP system?"}
    ]
  }'

Listing Available Adapters

# Registered LoRA adapters appear alongside the base model
curl http://localhost:3160/v1/models

Project-Specific Adapters

erp-core

  • Purpose: ERP domain knowledge (invoices, inventory, accounting)
  • Training data: ERP documentation, code, user interactions
  • Base model: Mistral-7B-Instruct

trading

  • Purpose: Trading platform domain (orders, positions, market data)
  • Training data: Trading documentation, API specs, user queries
  • Base model: Mistral-7B-Instruct

Best Practices

  1. Keep adapters small: LoRA adapters should be < 100MB
  2. Test locally first: Verify adapter loads correctly
  3. Version control: Track adapter versions separately
  4. Documentation: Document training data and hyperparameters
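The size guideline follows directly from the rank: each target module of shape (d_out, d_in) gains two low-rank factors, A (r x d_in) and B (d_out x r). A back-of-the-envelope calculation using Mistral-7B's published dimensions (hidden size 4096, 32 layers, 1024-dim k/v projections under grouped-query attention), assuming fp16 weights and ignoring config overhead:

```python
def lora_params(r: int, shapes: list[tuple[int, int]], n_layers: int) -> int:
    """Total LoRA parameters: A (r x d_in) + B (d_out x r) per module per layer."""
    per_layer = sum(r * d_in + d_out * r for d_out, d_in in shapes)
    return per_layer * n_layers

# Mistral-7B attention projections as (d_out, d_in): q, k, v, o
shapes = [(4096, 4096), (1024, 4096), (1024, 4096), (4096, 4096)]

params = lora_params(r=64, shapes=shapes, n_layers=32)
size_mb = params * 2 / 2**20  # fp16 = 2 bytes per parameter
print(f"r=64: {params:,} params, ~{size_mb:.0f} MiB")                  # ~104 MiB
print(f"r=32: ~{lora_params(32, shapes, 32) * 2 / 2**20:.0f} MiB")     # ~52 MiB
```

At r=64 over the four attention projections the adapter already sits right around the 100 MB guideline; halving the rank halves the size, which is why the memory troubleshooting tips below suggest r=32.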

Troubleshooting

Adapter not loading

  1. Check file permissions
  2. Verify adapter_config.json matches base model
  3. Check vLLM logs: docker logs local-llm-vllm
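Steps 1 and 2 can be scripted. A minimal sketch that checks an adapter directory against the base model the server is expected to run (the helper name is illustrative):

```python
import json
from pathlib import Path

def check_adapter(adapter_dir: str, expected_base: str) -> list[str]:
    """Return a list of problems found with an adapter directory."""
    problems = []
    config_path = Path(adapter_dir) / "adapter_config.json"
    if not config_path.is_file():
        return [f"missing {config_path}"]
    config = json.loads(config_path.read_text())
    recorded = config.get("base_model_name_or_path")
    if recorded != expected_base:
        problems.append(f"base model mismatch: adapter was trained against {recorded!r}")
    if not (Path(adapter_dir) / "adapter_model.safetensors").is_file():
        problems.append("missing adapter_model.safetensors")
    return problems
```

An empty list means the directory has both required files and its recorded base model matches.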

Memory issues

  1. Reduce max_loras in docker-compose
  2. Use smaller LoRA rank (r=32 instead of r=64)
  3. Enable LoRA merging for inference
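Merging folds the low-rank update into the base weights, W' = W + (alpha/r) * B @ A, so inference needs no extra LoRA matmuls; in PEFT this is `merge_and_unload()`. A toy numeric check of that identity in plain Python (tiny hand-picked matrices, no framework required):

```python
def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def matvec(M, x):
    return [sum(m * v for m, v in zip(row, x)) for row in M]

# Tiny base weight W (2x2) with a rank-1 LoRA update B (2x1) @ A (1x2)
W = [[1.0, 0.0], [0.0, 1.0]]
B = [[2.0], [1.0]]
A = [[0.5, -0.5]]
scale = 4.0 / 1  # alpha / r

x = [3.0, 7.0]
# Unmerged path: W @ x + scale * B @ (A @ x)
unmerged = [w + scale * b for w, b in zip(matvec(W, x), matvec(B, matvec(A, x)))]
# Merged path: (W + scale * B @ A) @ x
BA = matmul(B, A)
W_merged = [[w + scale * ba for w, ba in zip(wr, br)] for wr, br in zip(W, BA)]
merged = matvec(W_merged, x)
assert all(abs(u - m) < 1e-9 for u, m in zip(unmerged, merged))
```

The trade-off: a merged model drops the per-request adapter weights (saving memory and compute) but can no longer switch adapters at runtime, so it suits single-adapter deployments.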

References