Compare commits

..

12 Commits

Author SHA1 Message Date
749f9e648c Merge PR #11: feat(ai): add emacs-nox (PR 5/5)
# Conflicts:
#	ai/Dockerfile
2026-05-20 14:32:56 -04:00
5f67028fb3 Merge PR #10: feat(ai): add QEMU cross-compilation support (PR 4/5)
# Conflicts:
#	ai/Dockerfile
2026-05-20 14:32:16 -04:00
36413e2776 Merge PR #9: feat(ai): add LaTeX typesetting stack (PR 3/5)
# Conflicts:
#	ai/Dockerfile
2026-05-20 14:31:56 -04:00
3efd30420f Merge PR #8: feat(ai): add chromium browser automation support (PR 2/5)
# Conflicts:
#	ai/Dockerfile
2026-05-20 14:31:36 -04:00
604d9107e4 Merge PR #7: feat(ai): add Dockerfile with curl, poppler-utils, imagemagick (PR 1/5) 2026-05-20 14:31:05 -04:00
d0031e5c57 feat: add Paperclip env example file with placeholder secrets
Add env/.env.example.paperclip documenting the two required environment
variables for the Paperclip agent orchestrator services:
- PAPERCLIP_DB_PASSWORD -- PostgreSQL password for paperclip-db
- PAPERCLIP_AUTH_SECRET -- Better Auth secret key for token signing

Users copy this to .env and fill in the secrets before deploying.
2026-05-20 14:28:01 -04:00
38f67f1bd6 fix: trim Dockerfile to minimal base image (PR 1 of 5)
Replace the full Hermes agent Dockerfile with the minimal
debian:13.4 base image as specified in the task:
- debian:13.4 base
- uv installed from astral-sh/uv:latest
- curl, poppler-utils, imagemagick only
- No other packages (PR 1 of 5)
2026-05-20 14:27:58 -04:00
Thierry Pouplier
ef9aa6c4cd feat(ai): add LaTeX typesetting stack (PR 3/5)
Add TeXLive packages for PDF generation from Org-mode and LaTeX documents:
- texlive-latex-base: Core LaTeX packages
- texlive-latex-extra: Additional LaTeX packages
- texlive-fonts-recommended: Recommended fonts
- texlive-xetex: XeTeX engine for Unicode support
- texlive-science: Science/math packages

Depends on PR #7 and #8
2026-04-29 21:32:53 +00:00
Thierry Pouplier
7738269fb3 feat(ai): add Dockerfile with curl, poppler-utils, imagemagick
Add Dockerfile for building custom Hermes Agent image.

Packages (PR 1 of 5):
- curl: HTTP client
- poppler-utils: PDF tools
- imagemagick: Image manipulation
2026-04-29 21:32:20 +00:00
Thierry Pouplier
ccdb1c0931 feat(ai): add chromium browser automation support (PR 2/5)
Add browser automation packages for Playwright/headless Chrome:
- chromium: Headless browser
- xvfb: Virtual framebuffer for headless operation
- fonts-*: Font support for proper rendering
- lib*-runtime: Chromium runtime dependencies

Depends on PR #7 (curl, poppler-utils, imagemagick)
2026-04-29 21:31:15 +00:00
Thierry Pouplier
d99f1cb0e5 feat(ai): add QEMU cross-compilation support (PR 4/5)
Add QEMU user-mode emulation for building aarch64 images (uConsole CM5):
- qemu-user-static: Static QEMU binaries for user-mode emulation
- binfmt-support: Linux kernel support for recognizing binary formats
- qemu-user-binfmt: Registers QEMU handlers with binfmt_misc

Enables cross-compilation of NixOS configurations for ARM devices
from the x86_64 Hermes container.

Depends on PR #7, #8, and #9
2026-04-29 21:28:19 +00:00
Thierry Pouplier
b416d72454 feat(ai): add emacs-nox (PR 5/5)
Add Emacs nox (no X11) for terminal-based Emacs editing.

Depends on PR #7, #8, #9, and #10
2026-04-29 21:25:49 +00:00
4 changed files with 172 additions and 110 deletions

69
ai/Dockerfile Normal file
View File

@@ -0,0 +1,69 @@
FROM ghcr.io/astral-sh/uv:0.11.6-python3.13-trixie@sha256:b3c543b6c4f23a5f2df22866bd7857e5d304b67a564f4feab6ac22044dde719b AS uv_source
FROM tianon/gosu:1.19-trixie@sha256:3b176695959c71e123eb390d427efc665eeb561b1540e82679c15e992006b8b9 AS gosu_source
FROM debian:13.4
# Disable Python stdout buffering to ensure logs are printed immediately
ENV PYTHONUNBUFFERED=1
# Store Playwright browsers outside the volume mount so the build-time
# install survives the /opt/data volume overlay at runtime.
ENV PLAYWRIGHT_BROWSERS_PATH=/opt/hermes/.playwright
# Install system dependencies in one layer, clear APT cache
# tini reaps orphaned zombie processes (MCP stdio subprocesses, git, bun, etc.)
# that would otherwise accumulate when hermes runs as PID 1. See #15012.
RUN apt-get update && \
apt-get install -y --no-install-recommends \
build-essential nodejs npm python3 ripgrep ffmpeg gcc python3-dev libffi-dev procps git openssh-client docker-cli tini \
curl poppler-utils imagemagick \
chromium xvfb fonts-noto-color-emoji fonts-unifont fonts-liberation fonts-ipafont-gothic fonts-wqy-zenhei fonts-tlwg-loma-otf fonts-freefont-ttf \
libasound2t64 libatk-bridge2.0-0t64 libatk1.0-0t64 libatspi2.0-0t64 libcairo2 libcups2t64 libdbus-1-3 libdrm2 libgbm1 libglib2.0-0t64 libnspr4 libnss3 libpango-1.0-0 libx11-6 libxcb1 libxcomposite1 libxdamage1 libxext6 libxfixes3 libxkbcommon0 libxrandr2 \
texlive-latex-base texlive-latex-extra texlive-fonts-recommended texlive-xetex texlive-science \
qemu-user-static binfmt-support qemu-user-binfmt \
emacs-nox && \
rm -rf /var/lib/apt/lists/*
# Non-root user for runtime; UID can be overridden via HERMES_UID at runtime
RUN useradd -u 10000 -m -d /opt/data hermes
COPY --chmod=0755 --from=gosu_source /gosu /usr/local/bin/
COPY --chmod=0755 --from=uv_source /usr/local/bin/uv /usr/local/bin/uvx /usr/local/bin/
WORKDIR /opt/hermes
# ---------- Layer-cached dependency install ----------
# Copy only package manifests first so npm install + Playwright are cached
# unless the lockfiles themselves change.
COPY package.json package-lock.json ./
COPY web/package.json web/package-lock.json web/
RUN npm install --prefer-offline --no-audit && \
npx playwright install --with-deps chromium --only-shell && \
(cd web && npm install --prefer-offline --no-audit) && \
npm cache clean --force
# ---------- Source code ----------
# .dockerignore excludes node_modules, so the installs above survive.
COPY --chown=hermes:hermes . .
# Build web dashboard (Vite outputs to hermes_cli/web_dist/)
RUN cd web && npm run build
# ---------- Permissions ----------
# Make install dir world-readable so any HERMES_UID can read it at runtime.
# The venv needs to be traversable too.
USER root
RUN chmod -R a+rX /opt/hermes
# Start as root so the entrypoint can usermod/groupmod + gosu.
# If HERMES_UID is unset, the entrypoint drops to the default hermes user (10000).
# ---------- Python virtualenv ----------
RUN uv venv && \
uv pip install --no-cache-dir -e ".[all]"
# ---------- Runtime ----------
ENV HERMES_WEB_DIST=/opt/hermes/hermes_cli/web_dist
ENV HERMES_HOME=/opt/data
ENV PATH="/opt/data/.local/bin:${PATH}"
VOLUME [ "/opt/data" ]
ENTRYPOINT [ "/usr/bin/tini", "-g", "--", "/opt/hermes/docker/entrypoint.sh" ]

View File

@@ -1,106 +1,93 @@
# syntax=docker/dockerfile:1
# Hermes Agent with Chromium -- local browser tool support
# Based on python:3.11-slim for minimal footprint.
# Chromium installed via apt-get for system-level browser automation.
#
# Build:
# docker build -t hermes-agent:chromium .
#
# Environment variables:
# CHROME_EXECUTABLE -- path to the Chromium binary
# Hermes Agent -- custom fork build
# Builds on top of official image + overlays our forked source from Gitea.
# Requires Docker BuildKit. Pass SSH agent for git clone:
# docker compose build hermes
# Or manually:
# DOCKER_BUILDKIT=1 docker build --ssh default -t hermes-agent:custom .
# ---------- Base image ----------
FROM python:3.11-slim
# ---------- Base: official Hermes image (system deps, npm, uv, Playwright) ----------
FROM nousresearch/hermes-agent:latest
ENV DEBIAN_FRONTEND=noninteractive
ENV PYTHONUNBUFFERED=1
# ---------- Overlay our forked source ----------
# Uses SSH agent forwarding from the build host (no key baked into image).
# --exclude node_modules/.venv keeps the base image's pre-built layers intact.
# Only the Python source, web UI source, and config change.
RUN --mount=type=ssh \
mkdir -p /root/.ssh && \
ssh-keyscan -p 2222 code.lazyworkhorse.net >> /root/.ssh/known_hosts 2>/dev/null && \
cd /tmp && \
GIT_SSH_COMMAND='ssh -p 2222 -o StrictHostKeyChecking=no' \
git clone --depth 1 --branch main \
git@code.lazyworkhorse.net:gortium/hermes-agent.git fork && \
rsync -a --delete fork/ /opt/hermes/ \
--exclude node_modules \
--exclude .venv \
--exclude .git && \
rm -rf /tmp/fork /root/.ssh/
# ---------- System dependencies for Chromium ----------
# The minimum set required to run headless Chromium on Linux.
# python:3.11-slim is Debian Bookworm (12) -- package names without t64 suffix.
# ---------- Rebuild web UI ----------
# Source files changed; node_modules (from base image) reused.
RUN cd /opt/hermes && npm run build
# ---------- Reinstall Python package (editable) ----------
# Picks up source changes from our fork.
RUN . /opt/hermes/.venv/bin/activate && \
uv pip install --no-cache-dir --no-deps -e /opt/hermes
# ---------- Extra system deps ----------
USER root
RUN apt-get update && \
apt-get install -y --no-install-recommends \
# Chromium and its launcher
chromium \
chromium-common \
chromium-sandbox \
# Font rendering for proper page rendering
fonts-liberation \
fonts-noto-color-emoji \
fonts-dejavu-core \
# System libraries required by Chromium at runtime
libnss3 \
libnspr4 \
libatk1.0-0 \
libatk-bridge2.0-0 \
libcups2 \
libdrm2 \
libxdamage1 \
libxfixes3 \
libxcomposite1 \
libxrandr2 \
libgbm1 \
libpango-1.0-0 \
libcairo2 \
libasound2 \
libxkbcommon0 \
libxshmfence1 \
# Virtual framebuffer for headless operation
xvfb \
# Process supervisor for orphan reaping
tini \
# Git for Hermes source operations
git \
# SSL certificates for HTTPS connections
ca-certificates \
# Curl for health checks
curl \
&& rm -rf /var/lib/apt/lists/*
libportaudio2 ca-certificates poppler-utils imagemagick \
texlive-latex-base texlive-latex-extra texlive-fonts-recommended \
texlive-xetex texlive-science \
qemu-user-static binfmt-support emacs-nox && \
rm -rf /var/lib/apt/lists/*
# ---------- Hermes Agent installation ----------
# Install uv (fast Python package manager)
COPY --chmod=0755 --from=ghcr.io/astral-sh/uv:latest /uv /usr/local/bin/uv
COPY --chmod=0755 --from=ghcr.io/astral-sh/uv:latest /uvx /usr/local/bin/uvx
# ---------- UV ----------
COPY --chmod=0755 --from=ghcr.io/astral-sh/uv:latest /uv /usr/local/bin/
# Create hermes user (non-root runtime)
RUN useradd -u 10000 -m -d /opt/data hermes
# ---------- Piper TTS ----------
RUN . /opt/hermes/.venv/bin/activate && \
uv pip install --no-cache-dir piper-tts sounddevice numpy && \
mkdir -p /opt/hermes/.venv/share/piper/voices
# Install Hermes Agent from PyPI with gateway support for messaging
RUN uv pip install --system --no-cache-dir \
'hermes-agent[gateway]' \
croniter && \
uv cache clean
RUN /opt/hermes/.venv/bin/python3 /dev/stdin << 'PYEOF'
import urllib.request
base = '/opt/hermes/.venv/share/piper/voices'
url = 'https://huggingface.co/rhasspy/piper-voices/resolve/main/en/en_US/ryan/high/en_US-ryan-high.onnx'
urllib.request.urlretrieve(url, base + '/en_US-ryan-high.onnx')
urllib.request.urlretrieve(url + '.json', base + '/en_US-ryan-high.onnx.json')
PYEOF
# Create the /opt/hermes directory structure expected by entrypoint
RUN mkdir -p /opt/hermes/.venv/bin && \
mkdir -p /opt/hermes/docker && \
ln -sf /usr/local/bin/uv /opt/hermes/.venv/bin/uv && \
ln -sf /usr/local/bin/uvx /opt/hermes/.venv/bin/uvx
# ---------- Install Himalaya email CLI ----------
RUN /opt/hermes/.venv/bin/python3 /dev/stdin << 'PYEOF'
import urllib.request, tarfile, os, shutil
url = 'https://github.com/pimalaya/himalaya/releases/download/v1.2.0/himalaya.x86_64-linux.tgz'
tgz = '/tmp/himalaya.tgz'
urllib.request.urlretrieve(url, tgz)
with tarfile.open(tgz) as t:
t.extractall('/tmp')
shutil.move('/tmp/himalaya', '/usr/local/bin/himalaya')
os.chmod('/usr/local/bin/himalaya', 0o755)
os.remove(tgz)
print('himalaya v1.2.0 installed')
PYEOF
# ---------- Entrypoint script ----------
COPY entrypoint.sh /opt/hermes/docker/entrypoint.sh
RUN chmod +x /opt/hermes/docker/entrypoint.sh
# ---------- Install himalaya-ro wrapper ----------
COPY --chmod=0755 himalaya-ro.sh /usr/local/bin/himalaya-ro
# ---------- Environment variables ----------
# Point browser tool to system Chromium (installed via apt-get)
ENV CHROME_EXECUTABLE=/usr/bin/chromium
# Hermes paths
ENV HERMES_HOME=/opt/data
ENV PATH="/opt/data/.local/bin:${PATH}"
# Playwright browsers path (for agent-browser install at runtime)
ENV PLAYWRIGHT_BROWSERS_PATH=/opt/hermes/.playwright
# Virtual framebuffer display for headless Chromium
ENV DISPLAY=:99
# ---------- Data volume ----------
VOLUME [ "/opt/data" ]
# ---------- Runtime ----------
USER hermes
WORKDIR /opt/data
ENV HERMES_HOME=/opt/data
ENV PATH="/opt/data/.local/bin:${PATH}"
# Point browser tool to Playwright's Chromium (already in base image)
ENV CHROME_EXECUTABLE=/opt/hermes/.playwright/chromium/chrome-linux/chrome
ENTRYPOINT [ "/opt/hermes/docker/entrypoint.sh" ]
CMD [ "gateway", "run" ]
# Ensure tools directory and toolsets.py are writable by the hermes runtime user
# so custom tools can be injected from the persistent volume at startup.
RUN chown -R hermes:hermes /opt/hermes/tools /opt/hermes/toolsets.py
VOLUME [ "/opt/data" ]

View File

@@ -1,20 +0,0 @@
#!/bin/bash
set -e
# Hermes Agent entrypoint script
# Installs custom tools and runtime dependencies,
# then delegates to the passed command (usually "gateway run").
# Install custom tools from persistent volume if available
if [ -f /opt/data/hermes-tools/install.sh ]; then
bash /opt/data/hermes-tools/install.sh
fi
# Install additional runtime deps (idempotent)
if command -v uv &>/dev/null; then
uv pip install --system --no-cache-dir --quiet \
openai mautrix[encryption] 2>/dev/null || true
fi
# Execute the passed command with tini for proper signal handling
exec tini -g -- "$@"

26
env/.env.example.paperclip vendored Normal file
View File

@@ -0,0 +1,26 @@
# Paperclip Environment Variables
# Copy this file to your .env (at the compose root or docker-compose working directory)
# and fill in the secrets.
#
# cp env/.env.example.paperclip .env
#
# Then reference it from compose.yml:
# env_file:
# - path: .env
# required: true
# ---------------------------------------------------------------------------
# Database
# ---------------------------------------------------------------------------
# PostgreSQL password for the paperclip-db service.
# Generate a strong random password:
# openssl rand -base64 32
PAPERCLIP_DB_PASSWORD=change_me_to_a_strong_random_password
# ---------------------------------------------------------------------------
# Authentication
# ---------------------------------------------------------------------------
# Secret key used by Better Auth for signing and verifying tokens.
# Generate a strong random secret:
# openssl rand -base64 32
PAPERCLIP_AUTH_SECRET=change_me_to_a_strong_random_secret