As AI fashions more and more turn out to be commoditized, startups are racing to construct the software program layer that sits on high of them. One attention-grabbing entrant into this area is Osaurus, an open supply, Apple-only LLM server that lets customers transfer between totally different native AI fashions, both domestically or within the cloud, whereas retaining their information and instruments all on their very own {hardware}.
Osaurus advanced out of the concept for a desktop AI companion, Dinoki, which Osaurus co-founder Terence Pae described as a kind of “AI-powered Clippy.” Dinoki’s clients had requested him why they need to purchase the app in the event that they nonetheless needed to pay for tokens — the utilization items AI corporations cost for processing prompts and producing responses.
That bought Pae considering extra deeply about working AI domestically.
“That’s how Osaurus began,” Pae, beforehand a software program engineer at Tesla and Netflix, advised TechCrunch over a name. The thought, he defined, was to attempt to run an AI assistant domestically. “You are able to do just about every thing in your Mac domestically, like searching your information, accessing your browser, accessing your system configurations. I figured this could be a good way to place Osaurus as a private AI for people.”
Pae started constructing the instrument in public as an open-source project, including options and fixing bugs alongside the best way.

As we speak, Osaurus can flexibly join with domestically hosted AI fashions or cloud suppliers like OpenAI and Anthropic. Customers can freely select which AI fashions they’re utilizing, and preserve different points of the AI expertise on their very own {hardware}, just like the fashions’ personal reminiscence, or their information and instruments.
On condition that totally different AI fashions have totally different strengths, the benefit of this method is that customers can swap to the AI mannequin that most closely fits their wants.
Such a construction makes Osaurus what’s known as a “harness” — a management layer that connects totally different AI fashions, instruments, and workflows by means of a single interface, just like instruments like OpenClaw or Hermes. Nevertheless, the distinction is that such instruments are sometimes aimed toward builders who know their approach round a terminal. And typically, like within the case of OpenClaw, they could pose safety points and holes to fret about.
Osaurus, in the meantime, presents an easy-to-use interface that customers can use, and addresses safety issues by working issues in a hardware-isolated, digital sandbox. This limits the AI to a sure scope, retaining your pc and information protected.

After all, the observe of working AI fashions in your machine remains to be in its early days, provided that it’s closely resource-intensive and hardware-dependent. To run native fashions, your system will want at the very least 64 GB of RAM. For working bigger fashions, like DeepSeek v4, Pae recommends techniques with about 128 GB of RAM.
However Pae believes native AI’s wants will come down in time.
“I can see the potential of it, as a result of the intelligence per wattage — which is just like the metric for native AI — has been going up considerably. It’s by itself curve of innovation. Final 12 months, native AI might barely end sentences, however right now it may well really run instruments, write code, entry your browser, and order stuff from Amazon […] it’s simply getting higher and higher,” he stated.

Osaurus right now can run MiniMax M2.5, Gemma 4, Qwen3.6, GPT-OSS, Llama, DeepSeek V4, and different fashions. It additionally helps Apple’s on-device basis fashions, Liquid AI’s LFM household of on-device fashions, and within the cloud, it may well connect with OpenAI, Anthropic, Gemini, xAI/Grok, Venice AI, OpenRouter, Ollama, and LM Studio.
As a full MCP (Mannequin Context Protocol) server, you may give any MCP-compatible consumer entry to your instruments as properly. Plus, it ships with over 20 native plugins for Mail, Calendar, Imaginative and prescient, macOS Use, XLSX, PPTX, Browser, Music, Git, Filesystem, Search, Fetch, and extra.
Extra just lately, Osaurus was up to date to incorporate voice capabilities as properly.
For the reason that venture went reside practically a 12 months in the past, it has been downloaded north of 112,000 occasions, in accordance with its website.
Presently, Osaurus’ founders (who embody co-founder Sam Yoo) are collaborating within the New York-based startup accelerator Alliance. They’re additionally fascinated about subsequent steps, which might see Osaurus being supplied to companies, like these within the authorized area or in healthcare, the place working native LLMs might deal with privateness issues.
As the ability of native AI fashions grows, the workforce believes it might decrease the demand for AI information facilities.
“We’re seeing this explosive development within the AI area the place [cloud AI providers] should scale up utilizing information facilities and infrastructure, however we really feel like individuals haven’t actually seen the worth of the native AI but,” Pae stated. “As a substitute of counting on the cloud, they will really deploy a Mac Studio on-prem, and it ought to use considerably much less energy. You continue to have the capabilities of the cloud, however you’ll not be depending on a knowledge middle to have the ability to run that AI,” he added.
Once you buy by means of hyperlinks in our articles, we may earn a small commission. This doesn’t have an effect on our editorial independence.

