Perplexica - AI-Powered Search Engine

Perplexica is a self-hosted AI-powered search engine that combines traditional search with Large Language Models to provide intelligent, conversational search results.

Overview

| Setting | Value |
| --- | --- |
| Host | Homelab VM (192.168.0.210) |
| Port | 4785 |
| Image | `itzcrazykns1337/perplexica:latest` |
| Web UI | http://192.168.0.210:4785 |
| Settings | http://192.168.0.210:4785/settings |
| Stack File | `hosts/vms/homelab-vm/perplexica.yaml` |

Features

  • AI-Powered Search: Combines web search with LLM reasoning
  • Multiple LLM Support: OpenAI, Ollama, Anthropic, Gemini, Groq, LM Studio
  • Integrated SearXNG: Self-hosted search engine for privacy
  • Media Search: Automatic image and video search
  • Chat History: Persistent conversation storage
  • Custom System Instructions: Personalize AI behavior

Current Configuration

LLM Providers

Perplexica is currently configured with:

  1. Transformers (Built-in)

    • Embedding models for semantic search
    • No external API needed
  2. Ollama - Atlantis (Primary)

    • Base URL: http://192.168.0.200:11434 (local network)
    • Public URL: https://ollama.vish.gg (Cloudflare proxy - may timeout)
    • Available models:
      • qwen2.5:3b - Fast, efficient
      • qwen2.5:1.5b - Very fast, lightweight
      • llama3.2:3b - Good balance
      • mistral:7b - Strong reasoning
      • codellama:7b - For code-related searches
      • And 15+ more models
  3. Ollama - Seattle (Secondary/Backup)

    • Base URL: http://100.82.197.124:11434 (Tailscale VPN)
    • Hosted on Contabo VPS (CPU-only inference)
    • Available models:
      • qwen2.5:1.5b - Fast, lightweight
    • Purpose: Load distribution and redundancy
    • See: hosts/vms/seattle/README-ollama.md

SearXNG Integration

Perplexica includes a built-in SearXNG instance:

  • Runs internally on port 8080
  • Aggregates results from multiple search engines
  • Provides privacy-focused web search
  • Automatically started with the container

Setup & Deployment

Docker Compose

```yaml
services:
  perplexica:
    image: itzcrazykns1337/perplexica:latest
    container_name: perplexica
    ports:
      - "4785:3000"
    environment:
      - OLLAMA_BASE_URL=http://192.168.0.200:11434
    volumes:
      - perplexica-data:/home/perplexica/data
    restart: unless-stopped

volumes:
  perplexica-data:
```

Important: The OLLAMA_BASE_URL environment variable configures which Ollama instance Perplexica uses.

Current Configuration (February 2026):

  • Using Seattle Ollama: http://100.82.197.124:11434 (via Tailscale)
  • This distributes LLM inference load to the Contabo VPS
  • CPU-only inference (~8-12 tokens/second)
  • Zero additional cost (VPS already running)
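With two Ollama instances available, a small failover helper can pick whichever one is reachable. This is a sketch under stated assumptions: the two base URLs are the Atlantis and Seattle instances documented above, and the reachability probe is injected as a callable so the logic works without a live network (in practice it might GET `{url}/api/tags` with a short timeout — that helper is hypothetical, not part of Perplexica).

```python
# Sketch: pick the first reachable Ollama instance. The probe is
# injected so the selection logic is testable without the network.
from typing import Callable, Iterable, Optional

OLLAMA_INSTANCES = [
    "http://192.168.0.200:11434",   # Atlantis (primary, documented above)
    "http://100.82.197.124:11434",  # Seattle (backup, CPU-only)
]

def pick_ollama(urls: Iterable[str], is_up: Callable[[str], bool]) -> Optional[str]:
    """Return the first base URL whose probe succeeds, or None if all are down."""
    for url in urls:
        if is_up(url):
            return url
    return None
```

A cron job could run this and rewrite `OLLAMA_BASE_URL` accordingly; Perplexica itself does not do automatic failover between providers.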

Recent Fixes

January 2026 - Networking Simplification

The configuration was simplified to resolve networking issues:

Before (had issues):

```yaml
services:
  perplexica:
    extra_hosts:
      - "host.docker.internal:host-gateway"
    network_mode: bridge
```

After (working):

```yaml
services:
  perplexica:
    # Uses default bridge network
    # No extra_hosts needed
```

What Changed:

  • Removed extra_hosts configuration (not needed for external Ollama access)
  • Removed explicit network_mode: bridge (uses default)
  • Simplified networking works better with container DNS

February 2026 - Cloudflare Timeout Fix

Fixed LLM query timeouts by using local Ollama URL:

Problem:

  • Using https://ollama.vish.gg caused Cloudflare 524 timeouts
  • LLM queries took longer than Cloudflare's timeout limit
  • Searches stuck in "answering" state indefinitely

Solution:

```yaml
environment:
  - OLLAMA_BASE_URL=http://192.168.0.200:11434
```

Result:

  • Direct local network connection to Ollama
  • No Cloudflare proxy delays
  • Fast, reliable LLM responses
  • Searches complete successfully

Configuration

Adding LLM Providers

  1. Navigate to http://192.168.0.210:4785/settings
  2. Click "Model Providers"
  3. Add a new provider:

Example: Ollama Seattle (Secondary Instance)

```json
{
  "name": "Ollama Seattle",
  "type": "ollama",
  "baseURL": "http://100.82.197.124:11434",
  "apiKey": ""
}
```

Benefits:

  • Load distribution across multiple Ollama instances
  • Redundancy if primary Ollama is down
  • Access to models hosted on seattle VM

Example: Local LM Studio

```json
{
  "name": "LM Studio",
  "type": "lmstudio",
  "baseURL": "http://100.98.93.15:1234",
  "apiKey": "lm-studio"
}
```

Example: OpenAI

```json
{
  "name": "OpenAI",
  "type": "openai",
  "apiKey": "sk-...",
  "baseURL": "https://api.openai.com/v1"
}
```
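Before pasting a provider entry into the settings UI, it can help to sanity-check it. The required keys below mirror the JSON examples above; whether Perplexica enforces exactly this schema is an assumption, so treat this as a pre-flight lint, not a validator of record.

```python
# Sketch: lint a provider entry before pasting it into Settings.
# REQUIRED_KEYS mirrors the JSON examples in this document (assumption).
REQUIRED_KEYS = {"name", "type", "baseURL", "apiKey"}

def validate_provider(entry: dict) -> list:
    """Return a list of problems; an empty list means the entry looks OK."""
    problems = [f"missing key: {k}" for k in sorted(REQUIRED_KEYS - entry.keys())]
    if entry.get("type") == "openai" and not entry.get("apiKey", "").startswith("sk-"):
        problems.append("openai entries usually need an sk-... apiKey")
    return problems
```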

Custom System Instructions

Add personalized behavior in Settings → Personalization:

```
Respond in a friendly and concise tone.
Format answers as bullet points when appropriate.
Focus on technical accuracy for programming questions.
```

Usage

  1. Open http://192.168.0.210:4785
  2. Enter your search query
  3. Select search mode:
    • Web Search - General internet search
    • Academic - Research papers and publications
    • YouTube - Video search
    • Code - GitHub and programming resources

Advanced Features

Auto Media Search:

  • Automatically finds relevant images and videos
  • Enable in Settings → Preferences

Weather Widget:

  • Shows current weather on homepage
  • Toggle in Settings → Preferences

News Widget:

  • Recent news headlines on homepage
  • Toggle in Settings → Preferences

Data Persistence

Perplexica stores data in a Docker volume:

```
# Location: perplexica-data volume
/home/perplexica/data/
├── config.json       # App configuration & LLM providers
└── db.sqlite         # Chat history and conversations
```

Backup

```bash
# Back up perplexica data
docker run --rm -v perplexica-data:/data -v "$(pwd)":/backup alpine \
  tar czf /backup/perplexica-backup.tar.gz /data

# Restore
docker run --rm -v perplexica-data:/data -v "$(pwd)":/backup alpine \
  tar xzf /backup/perplexica-backup.tar.gz -C /
```
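If the backup command above runs on a schedule, old archives accumulate. A small retention helper can decide which ones to prune. This sketch assumes the archives are named with a sortable timestamp (e.g. `perplexica-backup-2026-02-01.tar.gz`) — the dated naming is an assumption, since the commands above write a single fixed filename.

```python
# Sketch: keep only the newest N backup archives.
# Assumes filenames embed a lexicographically sortable date (assumption).
def backups_to_delete(filenames: list, keep: int = 5) -> list:
    """Return the archives to prune once the newest `keep` are retained."""
    newest_first = sorted(filenames, reverse=True)
    return newest_first[keep:]
```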

API Access

Configuration API

```bash
# Get current configuration
curl http://192.168.0.210:4785/api/config

# Returns LLM providers, preferences, etc.
```

Search API

# Perform a search (requires authentication)
curl -X POST http://192.168.0.210:4785/api/search \
  -H "Content-Type: application/json" \
  -d '{
    "query": "what is kubernetes",
    "mode": "web"
  }'
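The same request can be built from Python with the standard library. The payload fields match the curl example above; whether the endpoint accepts additional fields is an assumption. The send step is left commented out so the snippet stays network-free outside the homelab.

```python
# Sketch: build the search request shown above using only the stdlib.
import json
import urllib.request

def build_search_request(query: str, mode: str = "web") -> urllib.request.Request:
    """Construct (but do not send) a POST to Perplexica's search API."""
    body = json.dumps({"query": query, "mode": mode}).encode()
    return urllib.request.Request(
        "http://192.168.0.210:4785/api/search",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_search_request("what is kubernetes")
# urllib.request.urlopen(req)  # uncomment when running on the homelab network
```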

Monitoring

Container Status

```bash
# Check if running
docker ps | grep perplexica

# View logs
docker logs perplexica

# Follow logs
docker logs -f perplexica
```

Health Check

```bash
# Test HTTP response
curl -I http://192.168.0.210:4785

# Expected: HTTP/1.1 200 OK
```
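For a scripted health check (e.g. from cron), the status line returned by `curl -I` can be parsed directly. Treating only 2xx responses as healthy is a choice made here, not Perplexica's documented contract.

```python
# Sketch: decide health from the HTTP status line that `curl -I` prints.
def is_healthy(status_line: str) -> bool:
    """Return True for a 2xx status line like 'HTTP/1.1 200 OK'."""
    parts = status_line.split()
    return len(parts) >= 2 and parts[1].startswith("2")
```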

Troubleshooting

SearXNG Errors

If you see `ERROR:searx.engines` in the logs:

```bash
# Check internal SearXNG
docker exec perplexica curl http://localhost:8080

# These errors are normal and non-critical:
# - "loading engine ahmia failed" (Tor engine)
# - "loading engine torch failed" (Tor engine)
# - "X-Forwarded-For nor X-Real-IP header is set"
```

LLM Connection Issues

Problem: "Failed to connect to LLM provider"

Solution:

  1. Verify the provider URL is accessible from the container
  2. Check that the API key is correct
  3. For Ollama, verify the required models are pulled:
     `curl https://ollama.vish.gg/api/tags`

Container Won't Start

```bash
# Check logs for errors
docker logs perplexica

# Common issues:
# - Port 4785 already in use
# - Volume mount permissions
# - Database corruption (delete and recreate volume)
```
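The "port already in use" case can be checked programmatically before recreating the container. This is a minimal sketch using only the standard library; it works for any TCP port, not just 4785.

```python
# Sketch: check whether a host TCP port is already bound.
import socket

def port_in_use(port: int, host: str = "127.0.0.1") -> bool:
    """Return True if something is listening on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        s.settimeout(0.5)
        return s.connect_ex((host, port)) == 0
```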

Database Issues

If chat history is corrupted:

```bash
# Stop container
docker stop perplexica

# Remove the database (back it up first if you want to keep it)
docker run --rm -v perplexica-data:/data alpine rm /data/db.sqlite

# Restart (a new database will be created)
docker start perplexica
```

Privacy Considerations

What Data Leaves Your Network?

When using external LLM APIs (OpenAI, Anthropic, etc.):

  • Your search queries
  • Chat history for context
  • Search results fed to the LLM

Keeping Everything Local

For maximum privacy, use local models:

  1. Use Ollama (as currently configured)

    • Inference runs on your own hardware - no data is sent to external LLM APIs
    • Use http://192.168.0.200:11434 to keep traffic entirely on the local network; https://ollama.vish.gg reaches the same instance but routes through the Cloudflare proxy
  2. Use LM Studio (Tailscale network)

    • http://100.98.93.15:1234/v1
    • Local inference
    • Private to your network

Performance Tips

  1. Choose appropriate models:

    • qwen2.5:1.5b - Fastest, basic queries
    • qwen2.5:3b - Good balance
    • mistral:7b - Better quality, slower
  2. Disable auto media search if you don't need images/videos

  3. Use SearXNG directly for simple searches (bypass AI)

  4. Limit context window in system instructions to reduce token usage
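Tip 1 can be automated with a rough query-routing heuristic over the models listed above. The keyword list and word-count threshold are invented for illustration; tune them to your workload.

```python
# Sketch: map query intent to one of the Ollama models documented above.
# Keywords and thresholds are assumptions, not Perplexica behavior.
def pick_model(query: str) -> str:
    q = query.lower()
    if any(kw in q for kw in ("code", "function", "bug", "error")):
        return "codellama:7b"   # code-related searches
    if len(q.split()) > 25:
        return "mistral:7b"     # long, reasoning-heavy queries
    return "qwen2.5:3b"         # fast default for everything else
```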

Integration Ideas

Home Assistant

Create a custom command to search via Perplexica:

```yaml
# In Home Assistant configuration.yaml
shell_command:
  perplexica_search: 'curl -X POST http://192.168.0.210:4785/api/search -H "Content-Type: application/json" -d "{\"query\": \"{{ query }}\"}"'
```

Alfred/Raycast (macOS)

Create a workflow to search directly from your launcher.

Custom Dashboard

Embed the search interface in your homelab dashboard:

```html
<iframe src="http://192.168.0.210:4785" width="100%" height="800px"></iframe>
```

Updates

Manual Update

```bash
# Pull latest image
docker pull itzcrazykns1337/perplexica:latest

# Recreate container (GitOps handles this)
docker compose -f hosts/vms/homelab-vm/perplexica.yaml up -d
```

Automatic Updates

Managed via GitOps + Watchtower:

  • GitOps polls repo every 5 minutes
  • Watchtower updates :latest images automatically
  • No manual intervention needed
Related Services

  • Ollama (Atlantis/ollama/) - Local LLM inference
  • OpenHands (homelab_vm/openhands.yaml) - AI coding agent
  • Redlib (homelab_vm/redlib.yaml) - Reddit privacy frontend
  • SearXNG (built into Perplexica) - Privacy-focused search

Status: Fully operational
Last Updated: February 2026
Maintained By: GitOps (Portainer)