Mistral LLM Model Integration

Mistral is a family of high-performance open-source language models that can be used with OpenRegister through the Ollama or Hugging Face integrations.

Overview

Mistral models are available in multiple sizes and can be run locally using:

  • Ollama: Simple setup, native API
  • Hugging Face TGI/vLLM: OpenAI-compatible API, optimized for production

Model Variants

Model                 Size   Parameters    Use Case                      Memory Required
Mistral 7B            7B     7 billion     General purpose, RAG          16GB
Mistral 7B Instruct   7B     7 billion     Chat, instructions            16GB
Mixtral 8x7B          47B    47 billion    High quality, complex tasks   48GB+
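
Before pulling a model, it is worth confirming that the host can actually meet these memory requirements. A minimal check, assuming the Ollama container is named openregister-ollama as in the commands below:

# Check total and available host memory (Linux)
free -h

# Check whether a memory limit is set on the Ollama container
# (0 means no limit; the container may use all host memory)
docker inspect openregister-ollama --format '{{.HostConfig.Memory}}'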

Using Mistral with Ollama

Quick Start

# Pull Mistral model
docker exec openregister-ollama ollama pull mistral:7b

# Or Mistral Instruct (recommended for chat)
docker exec openregister-ollama ollama pull mistral:latest
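
Once the pull completes, you can smoke-test the model over Ollama's native HTTP API, assuming the default port 11434 is published on the host:

# Send a one-off prompt to the native /api/generate endpoint
curl http://localhost:11434/api/generate -d '{
  "model": "mistral:latest",
  "prompt": "Say hello in one sentence.",
  "stream": false
}'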

Configuration

  1. Navigate to Settings → OpenRegister → LLM Configuration
  2. Select Ollama as provider
  3. Configure:
    • Ollama URL: http://openregister-ollama:11434
    • Chat Model: mistral:latest or mistral:7b
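
To confirm that OpenRegister can reach Ollama at that URL, curl the /api/tags endpoint (which lists installed models) from inside the Docker network. The Nextcloud container name below is an assumption; substitute your own, and note that curl must be available in that image:

# List installed models from inside the Docker network
# (container name is an example; use your Nextcloud app container)
docker exec openregister-nextcloud curl -s http://openregister-ollama:11434/api/tags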

See Ollama Integration for detailed setup instructions.

Using Mistral with Hugging Face

Quick Start

# Start TGI with Mistral (using huggingface profile)
docker-compose -f docker-compose.dev.yml --profile huggingface up -d tgi-mistral

# Or start vLLM with Mistral (if configured)
docker-compose -f docker-compose.dev.yml --profile huggingface up -d vllm-mistral
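
TGI and vLLM download the model weights on first start, which can take several minutes. You can follow progress in the logs and probe readiness via the /health endpoint both servers expose (the host port below is an assumption; check the port mapping in your compose file):

# Follow the container logs until the model finishes loading
docker-compose -f docker-compose.dev.yml logs -f tgi-mistral

# Both TGI and vLLM return 200 from /health once the model is ready
curl -i http://localhost:8080/health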

Configuration

  1. Navigate to Settings → OpenRegister → LLM Configuration
  2. Select OpenAI as provider (TGI/vLLM are OpenAI-compatible)
  3. Configure:
    • Base URL: http://tgi-mistral:80 (TGI) or http://vllm-mistral:8000 (vLLM)
    • Model: mistral-7b-instruct
    • API Key: dummy (not used for local)
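
Because both servers speak the OpenAI chat completions protocol, you can test the endpoint directly with curl before wiring it into OpenRegister. A quick smoke test against the TGI service, run from a container on the same Docker network (swap in http://vllm-mistral:8000 for vLLM, or use the host-mapped port from outside the network):

# Minimal OpenAI-compatible chat completion request
curl http://tgi-mistral:80/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer dummy" \
  -d '{
    "model": "mistral-7b-instruct",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'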

See Hugging Face Integration for detailed setup instructions.

Use Cases

1. General Purpose Chat

Mistral excels at:

  • Conversational AI
  • Question answering
  • Text generation
  • Code generation

2. RAG (Retrieval Augmented Generation)

Use Mistral with OpenRegister's RAG features:

  • Answer questions using your data
  • Context-aware responses
  • Citation support

3. Function Calling

Mistral supports function calling for:

  • Object search
  • Object creation
  • Object updates
  • Register queries
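
Function calling uses the same OpenAI-compatible tools schema: the request includes a list of tool definitions, and the model responds with a tool call when it decides to invoke one. The sketch below is illustrative only; the tool name search_objects and its parameters are hypothetical stand-ins, not OpenRegister's actual tool definitions, and tool calling may need to be explicitly enabled on the inference server:

# Example tools request (tool name and parameters are hypothetical)
curl http://vllm-mistral:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "mistral-7b-instruct",
    "messages": [{"role": "user", "content": "Find objects about permits"}],
    "tools": [{
      "type": "function",
      "function": {
        "name": "search_objects",
        "description": "Search objects in a register",
        "parameters": {
          "type": "object",
          "properties": {"query": {"type": "string"}},
          "required": ["query"]
        }
      }
    }]
  }'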

Performance Comparison

Setup    Speed             Quality      Ease of Use
Ollama   ⚡⚡⚡ Fast        ⭐⭐⭐⭐     ⭐⭐⭐⭐⭐ Easy
TGI      ⚡⚡ Fast          ⭐⭐⭐⭐⭐   ⭐⭐⭐ Medium
vLLM     ⚡⚡⚡ Very Fast   ⭐⭐⭐⭐⭐   ⭐⭐ Medium

For Development

Use Ollama with Mistral:

  • Easiest setup
  • Good performance
  • Native API

For Production

Use TGI or vLLM with Mistral:

  • Better throughput
  • OpenAI-compatible API
  • Optimized inference

Troubleshooting

Model Not Found (Ollama)

# List available models
docker exec openregister-ollama ollama list

# Pull Mistral if missing
docker exec openregister-ollama ollama pull mistral:latest

# Verify model name includes tag
docker exec openregister-ollama ollama show mistral:latest

Slow Performance

Solutions:

  1. Use GPU acceleration (10-100x faster)
  2. Use Mistral 7B instead of Mixtral 8x7B
  3. Ensure models are loaded in memory
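
The first and third points can be checked directly from the CLI: ollama ps shows which models are currently loaded and whether they run on GPU or CPU, and the OLLAMA_KEEP_ALIVE environment variable controls how long a model stays resident after the last request:

# Show models currently loaded in memory, with GPU/CPU placement
docker exec openregister-ollama ollama ps

# Keep models resident longer (example: 1 hour) by setting
# OLLAMA_KEEP_ALIVE on the Ollama container, e.g. in docker-compose:
#   environment:
#     - OLLAMA_KEEP_ALIVE=1h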

Further Reading

  • Ollama Integration
  • Hugging Face Integration

Support

For issues specific to:

  • Ollama setup: see the Ollama Integration documentation
  • TGI/vLLM setup: see the Hugging Face Integration documentation