Operación Privada AI-LAB
Runbook operativo interno del laboratorio AI-LAB.
Documento interno operativo del laboratorio AI-LAB.
Infraestructura Principal
Section titled “Infraestructura Principal”| Servicio | Host | Puerto | Estado |
|---|---|---|---|
| AI-LAB Main Node | 192.168.1.30 | - | ONLINE |
| Main LM Studio | 192.168.1.200 | 1234 | ONLINE |
| GPU Node RX9070XT | 192.168.1.50 | 1234 | ONLINE |
| GPU Node RX7900XT | 192.168.1.60 | 1234 | OFFLINE |
| Router API | 192.168.1.30 | 8008 | ONLINE |
| OpenCode | 192.168.1.30 | 4096 | ONLINE |
| Traefik | 192.168.1.30 | 80/443 | ONLINE |
| Qdrant | 192.168.1.30 | 6333 | ONLINE |
| Open WebUI | 192.168.1.30 | 3000 | ONLINE |
| Portainer | 192.168.1.30 | 9443 | ONLINE |
Router Cognitivo
Section titled “Router Cognitivo”Estado
Section titled “Estado”Servicio:
sudo systemctl status ialab-router-apiRestart:
sudo systemctl restart ialab-router-apiLogs:
journalctl -u ialab-router-api -fHealth:
curl http://127.0.0.1:8008/healthModelos:
curl http://127.0.0.1:8008/v1/modelsTest chat:
curl -s http://127.0.0.1:8008/v1/chat/completions \ -H "Content-Type: application/json" \ -d '{ "model": "ailab-router/auto", "messages": [ { "role": "user", "content": "di hola" } ] }'OpenCode
Section titled “OpenCode”Lanzar OpenCode
Section titled “Lanzar OpenCode”/opt/ai-lab/runtime/opencode_ui.shMatar proceso
Section titled “Matar proceso”pkill -f opencodeVer proceso
Section titled “Ver proceso”ps aux | grep opencodeAcceso web
Section titled “Acceso web”http://192.168.1.30:4096LM Studio
Section titled “LM Studio”Endpoint REST
Section titled “Endpoint REST”http://IP:1234Ver modelos
Section titled “Ver modelos”curl http://192.168.1.200:1234/v1/modelsTest chat directo
Section titled “Test chat directo”curl -s http://192.168.1.200:1234/v1/chat/completions \ -H "Content-Type: application/json" \ -d '{ "model": "google/gemma-4-e4b", "messages": [ { "role": "user", "content": "hola" } ] }'Snapshot Runtime
Section titled “Snapshot Runtime”Generar snapshot
Section titled “Generar snapshot”sudo systemctl restart ialab-live-stateVer snapshot actual
Section titled “Ver snapshot actual”cat /opt/ai-lab-data/snapshots/current/system_snapshot.json | jqVer estado LLM
Section titled “Ver estado LLM”cat /opt/ai-lab-data/snapshots/current/system_snapshot.json | jq '.llm'Docker
Section titled “Docker”Contenedores
Section titled “Contenedores”docker psdocker network lsReiniciar Traefik
Section titled “Reiniciar Traefik”docker restart traefikTraefik
Section titled “Traefik”Dashboard
Section titled “Dashboard”http://192.168.1.30:8080Configuración
Section titled “Configuración”/opt/ai-lab/stacks/traefik/docker-compose.ymlReiniciar stack
Section titled “Reiniciar stack”cd /opt/ai-lab/stacks/traefikdocker compose up -dCloudflare Tunnel
Section titled “Cloudflare Tunnel”Estado
Section titled “Estado”systemctl status cloudflaredjournalctl -u cloudflared -fAstro / ialab-docs
Section titled “Astro / ialab-docs”Desarrollo
Section titled “Desarrollo”npm run dev -- --hostnpm run buildReiniciar docs
Section titled “Reiniciar docs”sudo systemctl restart ialab-docsjournalctl -u ialab-docs -fSnapshot rápido
Section titled “Snapshot rápido”git add .git commit -m "snapshot"Estado
Section titled “Estado”git statusArquitectura Operativa
Section titled “Arquitectura Operativa”flowchart LR User --> Cloudflare Cloudflare --> Tunnel Tunnel --> Traefik Traefik --> Astro Traefik --> OpenCode Traefik --> OpenWebUI Traefik --> RouterAPI RouterAPI --> LMStudioLocal RouterAPI --> RX9070XT RouterAPI --> RX7900XT RouterAPI --> Qdrant OpenCode --> RouterAPI OpenWebUI --> RouterAPI
Estado Actual del LAB
Section titled “Estado Actual del LAB”- Main LM Studio operativo
- Router OpenAI-compatible funcional
- OpenCode operativo
- Traefik + Cloudflare activos
- Observabilidad viva
- Arquitectura distribuida parcial
- RX9070XT pendiente de estabilización
- RX7900XT apagado
- Failover inteligente real
- Auto healthcheck por request
- Métricas Prometheus
- Grafana NOC
- Routing cognitivo avanzado
- Persistencia de memoria
- Multi-agent runtime
- Balanceo automático GPU