Sanitized mirror from private repository - 2026-04-01 04:44:34 UTC
docs/hardware/compute-hosts.md
# 💻 Compute Hosts

*Physical and virtual compute infrastructure for the homelab*

## Overview

This page documents the compute hosts that provide processing power for the homelab's containers, VMs, and services.

## Physical Hosts

### Intel NUC Systems

- **Model**: Intel NUC 11 Pro
- **CPU**: Intel Core i7-1165G7
- **RAM**: 32 GB DDR4
- **Storage**: 1 TB NVMe SSD
- **Role**: Container orchestration, development

### Raspberry Pi Cluster

- **Model**: Raspberry Pi 5 (8 GB)
- **Quantity**: Multiple units
- **Role**: Edge computing, IoT services, monitoring
- **OS**: Raspberry Pi OS / Ubuntu Server

### Mini PCs

- **Various models** for specific workloads
- **Low power consumption** for 24/7 operation
- **Specialized roles** (networking, monitoring, etc.)

## Virtual Machines

### Proxmox VE Cluster

- **Hypervisor**: Proxmox Virtual Environment
- **High Availability**: Cluster configuration
- **Live Migration**: VM mobility between hosts
- **Backup Integration**: Automated VM backups

### VM Categories

#### Production VMs

- **Web services**: Nginx, application servers
- **Databases**: PostgreSQL, Redis, MongoDB
- **Monitoring**: Prometheus, Grafana stack
- **Communication**: Matrix, Mattermost, email

#### Development VMs

- **CI/CD**: Gitea runners, build environments
- **Testing**: Isolated testing environments
- **Staging**: Pre-production deployments

#### Specialized VMs

- **Gaming servers**: Minecraft, game hosting
- **Media processing**: Transcoding, conversion
- **AI/ML**: Machine learning workloads

## Resource Allocation

### CPU Distribution

- **Production services**: 60% allocation
- **Development/testing**: 25% allocation
- **Monitoring/management**: 15% allocation
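
As a rough illustration of how these shares translate into a core budget, the sketch below splits a pool of vCPUs by the percentages above. The pool size of 20 and the names `CPU_SHARES` and `split_cpu` are assumptions made for the example, not values taken from this homelab.

```python
# Hypothetical sketch: turn the percentage shares above into vCPU budgets.
CPU_SHARES = {
    "production": 60,   # percent, from the CPU Distribution list
    "development": 25,
    "monitoring": 15,
}


def split_cpu(total_vcpus: int, shares: dict[str, int] = CPU_SHARES) -> dict[str, float]:
    """Return the vCPU budget per category for a pool of `total_vcpus`."""
    if sum(shares.values()) != 100:
        raise ValueError("shares must add up to 100%")
    return {name: total_vcpus * pct / 100 for name, pct in shares.items()}


if __name__ == "__main__":
    # 20 vCPUs is an assumed pool size used only for illustration.
    for name, vcpus in split_cpu(20).items():
        print(f"{name}: {vcpus:.1f} vCPUs")
```

The function rejects share tables that do not add up to 100%, which catches a category being added without rebalancing the others.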

### Memory Management

- **Container workloads**: Dynamic allocation
- **VM reservations**: Guaranteed minimums
- **Overcommit ratios**: Optimized for workload
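
One way to reason about reservations and overcommit is to compare the sum of guaranteed minimums and the sum of configured maximums against a host's physical RAM. The sketch below does that for a 32 GB host (the NUC figure listed earlier); the per-VM sizes and the function names are hypothetical.

```python
# Hypothetical sketch: check reservations and compute a memory overcommit ratio.

def overcommit_ratio(vm_max_gb: list[float], host_ram_gb: float) -> float:
    """Ratio of memory promised to VMs (their maximums) to physical RAM."""
    if host_ram_gb <= 0:
        raise ValueError("host_ram_gb must be positive")
    return sum(vm_max_gb) / host_ram_gb


def reservations_fit(vm_min_gb: list[float], host_ram_gb: float) -> bool:
    """True if every VM's guaranteed minimum can be backed by physical RAM."""
    return sum(vm_min_gb) <= host_ram_gb


if __name__ == "__main__":
    host_ram_gb = 32.0               # matches the 32 GB NUC described above
    vm_max_gb = [16.0, 16.0, 8.0, 8.0]  # assumed VM maximums
    vm_min_gb = [8.0, 8.0, 4.0, 4.0]    # assumed guaranteed minimums
    print("overcommit ratio:", overcommit_ratio(vm_max_gb, host_ram_gb))
    print("reservations fit:", reservations_fit(vm_min_gb, host_ram_gb))
```

A ratio above 1.0 means the host is overcommitted; that is only safe while the guaranteed minimums still fit in physical memory.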

### Storage Tiers

- **NVMe SSD**: High-performance workloads
- **SATA SSD**: General purpose storage
- **HDD**: Bulk storage, backups

## Network Configuration

### Management Networks

- **IPMI/iLO**: Out-of-band management
- **Admin VLAN**: Management interfaces
- **Monitoring**: SNMP, logging

### Service Networks

- **Production VLAN**: Live services
- **Development VLAN**: Testing environments
- **DMZ**: Public-facing services

## High Availability

### Clustering

- **Container orchestration**: Docker Swarm/K8s
- **Database clustering**: PostgreSQL HA
- **Load balancing**: HAProxy, Nginx

### Failover Procedures

- **Automatic failover**: Critical services
- **Manual procedures**: Complex applications
- **Recovery testing**: Regular DR drills

## Monitoring & Alerting

### System Metrics

- **CPU, memory, disk usage**
- **Network performance**
- **Temperature monitoring**
- **Power consumption**

### Health Checks

- **Service availability**
- **Performance thresholds**
- **Capacity planning**
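
A performance-threshold check can be as simple as comparing sampled metrics against per-metric limits. The sketch below shows the idea; the metric names and the limit values are assumptions for illustration, since this page does not state the thresholds actually in use.

```python
# Hypothetical sketch: flag metrics that exceed their configured thresholds.
THRESHOLDS = {
    "cpu_percent": 90.0,     # assumed limits, not taken from this document
    "memory_percent": 85.0,
    "disk_percent": 80.0,
}


def breached(metrics: dict[str, float],
             thresholds: dict[str, float] = THRESHOLDS) -> list[str]:
    """Return the sorted names of metrics whose value exceeds its limit.

    Metrics missing from the sample are treated as 0.0 and never flagged.
    """
    return sorted(
        name for name, limit in thresholds.items()
        if metrics.get(name, 0.0) > limit
    )


if __name__ == "__main__":
    sample = {"cpu_percent": 95.0, "memory_percent": 50.0, "disk_percent": 81.0}
    print(breached(sample))
```

In a Prometheus and Grafana setup like the one listed under Production VMs, the same comparison would normally live in alerting rules rather than in a script.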

## Maintenance

### Update Schedules

- **Security patches**: Weekly
- **System updates**: Monthly
- **Firmware updates**: Quarterly
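
The cadences above can be turned into concrete due dates with a small helper. This is a minimal sketch that approximates "monthly" as 30 days and "quarterly" as 90 days; the task keys and the function name `next_due` are hypothetical.

```python
# Hypothetical sketch: compute the next due date for each maintenance task.
from datetime import date, timedelta

# Weekly / monthly / quarterly from the list above, approximated in days.
INTERVAL_DAYS = {
    "security_patches": 7,
    "system_updates": 30,    # approximation of one month
    "firmware_updates": 90,  # approximation of one quarter
}


def next_due(task: str, last_run: date) -> date:
    """Return the date on which `task` is next due after `last_run`."""
    return last_run + timedelta(days=INTERVAL_DAYS[task])


if __name__ == "__main__":
    last_run = date(2026, 4, 1)  # example date only
    for task in INTERVAL_DAYS:
        print(f"{task}: next due {next_due(task, last_run).isoformat()}")
```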

### Hardware Maintenance

- **Cleaning schedules**
- **Component replacement**
- **Capacity upgrades**

---

**Status**: ✅ All compute hosts operational with monitoring coverage