OSI-openLocal inference and "what runs on my machine"

claude-code-local

nicedreamzapp

MLX-native proxy that runs the Claude Code CLI against local models on Apple Silicon, fully offline.

2.9k stars(as of 2026-06-26)View on GitHub

Overview

What is claude-code-local?

A roughly 1,000-line MLX-native server that exposes an Anthropic-API-compatible endpoint so the Claude Code CLI runs against local models (Gemma, Llama, Qwen, DeepSeek) entirely on Apple Silicon and offline. It adds launchers for code, browser-agent, voice and iMessage modes plus tool-call recovery, and is self-contained: git clone plus a setup script, no purchase required.

Analysis

Pros & Cons

Pros

MIT-licensed and genuinely self-hostable offline, a good fit for NDA or air-gapped work
MLX-native rather than a generic wrapper
Concrete multi-mode launchers (code, browser, voice) lower the setup barrier

Cons

Single maintainer who also sells a commercial service built on it, a sustainability and conflict risk (the software itself is free and MIT)
Hard Apple Silicon plus high-RAM requirement excludes most users
v0.1.0 with only self-reported, unverified performance and reliability numbers

License

MIT (OSI-open)

When it is interesting

You have a high-RAM Apple Silicon Mac and need Claude Code workflows with zero data leaving the device.

When it is too early

If you need cross-platform support, multi-maintainer stability, or independently verified throughput.

This repo featured in the 2026-07 edition of the Open-Source AI Radar.

Similar repositories

oMLX

jundot

16.6k

macOS-native LLM inference server for Apple Silicon with continuous batching and SSD-tiered caching.

OSI-openLocal inference and "what runs on my machine"

apfel

Arthur-Ficial

5.8k

Expose the on-device Apple Intelligence model on macOS 26 as a zero-setup OpenAI-compatible local API.

OSI-openLocal inference and "what runs on my machine"

shimmy

Michael-A-Kuykendall

5.3k

Pure-Rust local inference engine with an OpenAI-compatible API, shipped as one binary.

OSI-openLocal inference and "what runs on my machine"