refactor: split ai/ into hermes/ and ollama/ directories with gfx906 build #19
Reference in New Issue
Block a user
No description provided.
Delete Branch "feat/ollama-gfx906"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
Organize the ai/ stack into subdirectories and add the custom ollama build for MI50 (gfx906).
Changes
Directory structure
ai/hermes/Dockerfile
nousresearch/hermes-agent:latestas base, adds extras on topai/ollama/Dockerfile (NEW)
gfx906:xnack-for AMD MI50 GPUs--preset 'ROCm 6'for proper GPU kernel code generation (notCC=hipcc)-DCMAKE_HIP_COMPILER=""for CPU step (prevents HIP poisoning GCC)-DCMAKE_PREFIX_PATH="/opt/rocm"for find_package(hip)-DCMAKE_INSTALL_PREFIX=/build/distat configure timeCOPY /build/dist/lib/ollama/(correct nesting, not the double-bug)ai/compose.yml
build: ./hermes(was./)build: ./ollama, image tagrocm-gfx906privileged: truefrom ollama (devices + group_add is sufficient)OLLAMA_FLASH_ATTENTION=1(was 0 - critical for MI50 throughput)Cleanup
Verification
Build test
ollama GPU check
Related
assets/ollama/Dockerfilethat was in the infra repo781d98046btod34a4d3647refactor: split ai/ into hermes/ and ollama/ directoriesto refactor: split ai/ into hermes/ and ollama/ directories with gfx906 build