# Perplexica - AI-Powered Search Engine
Perplexica is a self-hosted AI-powered search engine that combines traditional search with Large Language Models to provide intelligent, conversational search results.
## Overview
| Setting | Value |
|---------|-------|
| **Host** | Homelab VM (192.168.0.210) |
| **Port** | 4785 |
| **Image** | `itzcrazykns1337/perplexica:latest` |
| **Web UI** | http://192.168.0.210:4785 |
| **Settings** | http://192.168.0.210:4785/settings |
| **Stack File** | `hosts/vms/homelab-vm/perplexica.yaml` |
## Features
- **AI-Powered Search**: Combines web search with LLM reasoning
- **Multiple LLM Support**: OpenAI, Ollama, Anthropic, Gemini, Groq, LM Studio
- **Integrated SearXNG**: Self-hosted search engine for privacy
- **Media Search**: Automatic image and video search
- **Chat History**: Persistent conversation storage
- **Custom System Instructions**: Personalize AI behavior
## Current Configuration
### LLM Providers
Perplexica is currently configured with:
1. **Transformers** (Built-in)
   - Embedding models for semantic search
   - No external API needed
2. **Ollama - Atlantis** (Primary)
   - Base URL: `http://192.168.0.200:11434` (local network)
   - Public URL: `https://ollama.vish.gg` (Cloudflare proxy - may time out)
   - Available models:
     - `qwen2.5:3b` - Fast, efficient
     - `qwen2.5:1.5b` - Very fast, lightweight
     - `llama3.2:3b` - Good balance
     - `mistral:7b` - Strong reasoning
     - `codellama:7b` - For code-related searches
     - ...and 15+ more models
3. **Ollama - Seattle** (Secondary/Backup)
   - Base URL: `http://100.82.197.124:11434` (Tailscale VPN)
   - Hosted on a Contabo VPS (CPU-only inference)
   - Available models:
     - `qwen2.5:1.5b` - Fast, lightweight
   - Purpose: load distribution and redundancy
   - See: `hosts/vms/seattle/README-ollama.md`
### SearXNG Integration
Perplexica includes a built-in SearXNG instance:
- Runs internally on port 8080
- Aggregates results from multiple search engines
- Provides privacy-focused web search
- Automatically started with the container
## Setup & Deployment
### Docker Compose
```yaml
services:
  perplexica:
    image: itzcrazykns1337/perplexica:latest
    container_name: perplexica
    ports:
      - "4785:3000"
    environment:
      - OLLAMA_BASE_URL=http://192.168.0.200:11434
    volumes:
      - perplexica-data:/home/perplexica/data
    restart: unless-stopped

volumes:
  perplexica-data:
```
**Important:** The `OLLAMA_BASE_URL` environment variable configures which Ollama instance Perplexica uses.
**Current Configuration (February 2026):**
- Using Seattle Ollama: `http://100.82.197.124:11434` (via Tailscale)
- This distributes LLM inference load to the Contabo VPS
- CPU-only inference (~8-12 tokens/second)
- Zero additional cost (VPS already running)
### Recent Fixes
#### January 2026 - Networking Simplification
The configuration was simplified to resolve networking issues:
**Before (❌ Had Issues):**
```yaml
services:
  perplexica:
    extra_hosts:
      - "host.docker.internal:host-gateway"
    network_mode: bridge
```
**After (✅ Working):**
```yaml
services:
  perplexica:
    # Uses the default bridge network
    # No extra_hosts needed
```
**What Changed:**
- Removed `extra_hosts` configuration (not needed for external Ollama access)
- Removed explicit `network_mode: bridge` (uses default)
- Simplified networking works better with container DNS
#### February 2026 - Cloudflare Timeout Fix
Fixed LLM query timeouts by using local Ollama URL:
**Problem:**
- Using `https://ollama.vish.gg` caused Cloudflare 524 timeouts
- LLM queries took longer than Cloudflare's timeout limit
- Searches stuck in "answering" state indefinitely
**Solution:**
```yaml
environment:
  - OLLAMA_BASE_URL=http://192.168.0.200:11434
```
**Result:**
- Direct local network connection to Ollama
- No Cloudflare proxy delays
- Fast, reliable LLM responses
- Searches complete successfully
## Configuration
### Adding LLM Providers
1. Navigate to http://192.168.0.210:4785/settings
2. Click "Model Providers"
3. Add a new provider:
#### Example: Ollama Seattle (Secondary Instance)
```json
{
  "name": "Ollama Seattle",
  "type": "ollama",
  "baseURL": "http://100.82.197.124:11434",
  "apiKey": ""
}
```
Benefits:
- Load distribution across multiple Ollama instances
- Redundancy if primary Ollama is down
- Access to models hosted on the Seattle VPS
#### Example: Local LM Studio
```json
{
  "name": "LM Studio",
  "type": "lmstudio",
  "baseURL": "http://100.98.93.15:1234",
  "apiKey": "lm-studio"
}
```
#### Example: OpenAI
```json
{
  "name": "OpenAI",
  "type": "openai",
  "apiKey": "sk-...",
  "baseURL": "https://api.openai.com/v1"
}
```
### Custom System Instructions
Add personalized behavior in Settings → Personalization:
```
Respond in a friendly and concise tone.
Format answers as bullet points when appropriate.
Focus on technical accuracy for programming questions.
```
## Usage
### Basic Search
1. Open http://192.168.0.210:4785
2. Enter your search query
3. Select a search mode:
   - **Web Search** - General internet search
   - **Academic** - Research papers and publications
   - **YouTube** - Video search
   - **Code** - GitHub and programming resources
### Advanced Features
**Auto Media Search:**
- Automatically finds relevant images and videos
- Enable in Settings → Preferences
**Weather Widget:**
- Shows current weather on homepage
- Toggle in Settings → Preferences
**News Widget:**
- Recent news headlines on homepage
- Toggle in Settings → Preferences
## Data Persistence
Perplexica stores data in a Docker volume:
```bash
# Location: perplexica-data volume
/home/perplexica/data/
├── config.json # App configuration & LLM providers
└── db.sqlite # Chat history and conversations
```
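The schema of `db.sqlite` isn't documented here, so a safe way to explore it is to list its tables without assuming any column layout. A minimal Python sketch (run it against a copy of the file, not the live database):

```python
import sqlite3

def list_tables(db_path: str) -> list[str]:
    """Return the table names in a SQLite file, without assuming
    anything about Perplexica's (undocumented) schema."""
    with sqlite3.connect(db_path) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    return [name for (name,) in rows]

# Example: copy db.sqlite out of the volume first, then
# print(list_tables("db.sqlite"))
```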
### Backup
```bash
# Backup perplexica data
docker run --rm -v perplexica-data:/data -v $(pwd):/backup alpine tar czf /backup/perplexica-backup.tar.gz /data
# Restore
docker run --rm -v perplexica-data:/data -v $(pwd):/backup alpine tar xzf /backup/perplexica-backup.tar.gz -C /
```
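Before relying on an archive for a restore, it is worth listing its contents; a small Python sketch using the stdlib `tarfile` module (the path is whatever you passed to the backup command above):

```python
import tarfile

def archive_members(path: str) -> list[str]:
    """Return the member names inside a .tar.gz backup, so a restore
    isn't attempted against an empty or truncated archive."""
    with tarfile.open(path, "r:gz") as tar:
        return tar.getnames()

# Usage: archive_members("perplexica-backup.tar.gz")
# should include entries like data/config.json and data/db.sqlite
```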
## API Access
### Configuration API
```bash
# Get current configuration
curl http://192.168.0.210:4785/api/config
# Returns LLM providers, preferences, etc.
```
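For scripted access, the same endpoint can be fetched from Python. The response schema isn't documented here, so this sketch only decodes the JSON and leaves interpretation to the caller:

```python
import json
import urllib.request

def get_config(base_url: str = "http://192.168.0.210:4785") -> dict:
    """Fetch Perplexica's current configuration from /api/config."""
    with urllib.request.urlopen(f"{base_url}/api/config", timeout=10) as resp:
        return json.loads(resp.read())

# Inspect the top-level keys rather than assuming a schema:
# print(sorted(get_config().keys()))
```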
### Search API
```bash
# Perform a search (requires authentication)
curl -X POST http://192.168.0.210:4785/api/search \
  -H "Content-Type: application/json" \
  -d '{
    "query": "what is kubernetes",
    "mode": "web"
  }'
```
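The same request can be issued from Python. The endpoint and body fields mirror the curl example above; the response shape is an assumption, so it is returned as a raw dict:

```python
import json
import urllib.request

PERPLEXICA_URL = "http://192.168.0.210:4785"

def build_search_payload(query: str, mode: str = "web") -> bytes:
    """Serialize a search request body matching the curl example."""
    return json.dumps({"query": query, "mode": mode}).encode("utf-8")

def search(query: str, mode: str = "web", timeout: float = 120.0) -> dict:
    """POST a search to Perplexica and return the decoded JSON response."""
    req = urllib.request.Request(
        f"{PERPLEXICA_URL}/api/search",
        data=build_search_payload(query, mode),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return json.loads(resp.read())
```

The generous timeout matters here: LLM-backed searches can take far longer than a typical HTTP request, which is exactly what tripped the Cloudflare proxy.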
## Monitoring
### Container Status
```bash
# Check if running
docker ps | grep perplexica
# View logs
docker logs perplexica
# Follow logs
docker logs -f perplexica
```
### Health Check
```bash
# Test HTTP response
curl -I http://192.168.0.210:4785
# Expected: HTTP/1.1 200 OK
```
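The same check can be scripted for cron or a monitoring job; a minimal Python sketch that reduces the curl probe to a boolean:

```python
import urllib.request

PERPLEXICA_URL = "http://192.168.0.210:4785"

def http_ok(url: str, timeout: float = 5.0) -> bool:
    """Return True if `url` answers with HTTP 200 within `timeout` seconds."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except OSError:
        return False

# Usage: http_ok(PERPLEXICA_URL) is True when the container is healthy
```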
## Troubleshooting
### SearXNG Errors
If you see `ERROR:searx.engines` in logs:
```bash
# Check internal SearXNG
docker exec perplexica curl http://localhost:8080
# These errors are normal and non-critical:
# - "loading engine ahmia failed" (Tor engine)
# - "loading engine torch failed" (Tor engine)
# - "X-Forwarded-For nor X-Real-IP header is set"
```
### LLM Connection Issues
**Problem:** "Failed to connect to LLM provider"
**Solution:**
1. Verify the provider URL is accessible from the container
2. Check API key is correct
3. For Ollama, ensure models are pulled:
```bash
# Query the local Atlantis instance directly (the Cloudflare URL can time out; see the February 2026 fix above)
curl http://192.168.0.200:11434/api/tags
```
### Container Won't Start
```bash
# Check logs for errors
docker logs perplexica
# Common issues:
# - Port 4785 already in use
# - Volume mount permissions
# - Database corruption (delete and recreate volume)
```
### Database Issues
If chat history is corrupted:
```bash
# Stop container
docker stop perplexica
# Back up, then remove the database
docker run --rm -v perplexica-data:/data alpine sh -c "cp /data/db.sqlite /data/db.sqlite.bak && rm /data/db.sqlite"
# Restart (will create new database)
docker start perplexica
```
## Privacy Considerations
### What Data Leaves Your Network?
When using external LLM APIs (OpenAI, Anthropic, etc.):
- Your search queries
- Chat history for context
- Search results fed to the LLM
### Keeping Everything Local
For maximum privacy, use local models:
1. **Use Ollama** (as currently configured)
   - `http://192.168.0.200:11434` is the local Atlantis instance (`https://ollama.vish.gg` is the same instance behind a Cloudflare proxy)
   - No data sent to external APIs
   - All processing happens on your hardware
2. **Use LM Studio** (Tailscale network)
   - `http://100.98.93.15:1234/v1`
   - Local inference
   - Private to your network
## Performance Tips
1. **Choose appropriate models:**
   - `qwen2.5:1.5b` - Fastest, basic queries
   - `qwen2.5:3b` - Good balance
   - `mistral:7b` - Better quality, slower
2. **Disable auto media search** if you don't need images/videos
3. **Use SearXNG directly** for simple searches (bypass AI)
4. **Limit context window** in system instructions to reduce token usage
## Integration Ideas
### Home Assistant
Create a custom command to search via Perplexica:
```yaml
# In Home Assistant configuration.yaml
shell_command:
  perplexica_search: 'curl -X POST http://192.168.0.210:4785/api/search -H "Content-Type: application/json" -d "{\"query\": \"{{ query }}\"}"'
```
### Alfred/Raycast (macOS)
Create a workflow to search directly from your launcher.
### Custom Dashboard
Embed the search interface in your homelab dashboard:
```html
<iframe src="http://192.168.0.210:4785" width="100%" height="800px"></iframe>
```
## Updates
### Manual Update
```bash
# Pull latest image
docker pull itzcrazykns1337/perplexica:latest
# Recreate container (GitOps handles this)
docker compose -f hosts/vms/homelab-vm/perplexica.yaml up -d
```
### Automatic Updates
Managed via GitOps + Watchtower:
- GitOps polls repo every 5 minutes
- Watchtower updates `:latest` images automatically
- No manual intervention needed
## Related Services
- **Ollama** (`Atlantis/ollama/`) - Local LLM inference
- **OpenHands** (`homelab_vm/openhands.yaml`) - AI coding agent
- **Redlib** (`homelab_vm/redlib.yaml`) - Reddit privacy frontend
- **SearXNG** (Built into Perplexica) - Privacy-focused search
## References
- [Perplexica GitHub](https://github.com/ItzCrazyKns/Perplexica)
- [SearXNG Documentation](https://docs.searxng.org/)
- [Ollama Models](https://ollama.com/library)
---
**Status:** ✅ Fully operational
**Last Updated:** February 2026
**Maintained By:** GitOps (Portainer)