Some checks failed
Release / semantic-release (push) Successful in 19s
tests / Unit tests (Linux, Python 3.11) (push) Successful in 9m54s
Release / build-linux (push) Failing after 7m14s
Release / build-windows (push) Has been cancelled
Release / build-macos (arm64, macos-latest) (push) Has been cancelled
Release / build-macos (x64, macos-15-intel) (push) Has been cancelled
Release / release-main (push) Has been cancelled
Release / release-develop (push) Has been cancelled
GPU acceleration is now on by default and verified end-to-end on the Blackwell RTX 5050 (sm_120): - Ollama offloads 100% to GPU (log: library=CUDA compute=12.0, BLACKWELL_NATIVE_FP4=1). compose passes GPU via CDI (devices: nvidia.com/gpu=all) to both ollama and javis. - Whisper STT on GPU: faster-whisper>=1.1.0 + nvidia-cublas/cudnn cu12, LD_LIBRARY_PATH baked into the image. Verified float16 transcribe on sm_120; bridge auto-falls back to CPU when no GPU is present. - Model: default chat model -> qwen3:8b (best 8GB-VRAM tool-calling, ~5GB Q4). Embed stays nomic-embed-text. - README documents the host one-time setup (nvidia-container-toolkit + `nvidia-ctk cdi generate`) and GPU on/off. Verified: image builds; GPU visible in both containers via compose; ollama ps = 100% GPU; faster-whisper cuda OK + CPU fallback OK; bridge /health 200.
43 lines
1.7 KiB
Bash
Executable File
43 lines
1.7 KiB
Bash
Executable File
#!/usr/bin/env bash
|
|
# Container entrypoint: render config from env, set the VNC password, ensure the
|
|
# Piper voice exists, then hand off to supervisord (which runs the desktop,
|
|
# bridge, and bot).
|
|
set -euo pipefail
|
|
|
|
# --- Defaults (override via .env / compose) ---
|
|
: "${VNC_PASSWORD:=javis123}"
|
|
: "${VNC_RESOLUTION:=1920x1080}"
|
|
: "${OLLAMA_BASE_URL:=http://ollama:11434}"
|
|
: "${OLLAMA_CHAT_MODEL:=qwen3:8b}"
|
|
: "${OLLAMA_EMBED_MODEL:=nomic-embed-text}"
|
|
: "${WHISPER_MODEL:=small}"
|
|
: "${WHISPER_DEVICE:=cuda}"
|
|
: "${WHISPER_COMPUTE_TYPE:=float16}"
|
|
: "${JARVIS_DB_PATH:=/data/jarvis.db}"
|
|
: "${BRIDGE_HOST:=0.0.0.0}"
|
|
: "${BRIDGE_PORT:=8765}"
|
|
: "${PIPER_VOICE:=en_GB-alan-medium}"
|
|
: "${PIPER_VOICE_DIR:=/opt/piper-voices}"
|
|
: "${TTS_PIPER_MODEL_PATH:=${PIPER_VOICE_DIR}/${PIPER_VOICE}.onnx}"
|
|
|
|
export VNC_RESOLUTION OLLAMA_BASE_URL OLLAMA_CHAT_MODEL OLLAMA_EMBED_MODEL \
|
|
WHISPER_MODEL WHISPER_DEVICE WHISPER_COMPUTE_TYPE JARVIS_DB_PATH \
|
|
PIPER_VOICE PIPER_VOICE_DIR TTS_PIPER_MODEL_PATH BRIDGE_HOST BRIDGE_PORT
|
|
|
|
mkdir -p /data /app/config "$(dirname "$JARVIS_DB_PATH")"
|
|
|
|
# --- VNC password file ---
|
|
mkdir -p /root/.vnc
|
|
echo "$VNC_PASSWORD" | tigervncpasswd -f > /root/.vnc/passwd
|
|
chmod 600 /root/.vnc/passwd
|
|
|
|
# --- Render jarvis brain config from template ---
|
|
envsubst < /app/docker/jarvis-config.template.json > /app/config/jarvis.json
|
|
export JARVIS_CONFIG_PATH=/app/config/jarvis.json
|
|
|
|
# --- Ensure the Piper voice exists (best effort) ---
|
|
bash /app/docker/download-piper.sh || echo "[entrypoint] piper download failed; TTS may be unavailable"
|
|
|
|
echo "[entrypoint] display=$DISPLAY ollama=$OLLAMA_BASE_URL whisper=$WHISPER_MODEL/$WHISPER_DEVICE"
|
|
exec supervisord -c /app/docker/supervisord.conf
|