Generative AI

How do you run AI agents privately, without sending your data to the cloud?

Konstantin Semenenko

June 16, 2026

minutes read

You run AI agents privately by keeping the model and the data inside infrastructure you control: local models on your own machines for individuals, and on-prem or private-cloud deployment with bring-your-own-model for teams. Private AI is no longer a downgrade.

AI agents are useful enough that regulated teams want them, and sensitive enough that those same teams cannot pipe their data through someone else's API. For finance, healthcare, and anyone under real compliance, "just use the cloud model" is not an option. Here is how to run capable AI agents privately, and what you actually give up by doing it.

This is something we build products for, so the options below are concrete, not hypothetical.

‍

Why can't regulated teams use cloud AI agents freely?

Because the data is the problem, not the model. Under HIPAA, GDPR, or financial-compliance rules, sending patient records, personal data, or financial detail to a third-party API can be a violation regardless of how good the model is. Even when a provider promises not to train on your data, you are still moving regulated data outside your control, and that movement is what the rules govern.

So the question for these teams is not "which model is smartest" but "which setup keeps the data inside the boundary." A brilliant model you are not allowed to send data to is useless.

‍

What does "private AI" actually mean?

It means the model runs where your data already lives, under your control, not on someone else's servers. In practice that spans a range: fully local on a single machine, on-prem inside your own datacenter, or a private cloud you control, with bring-your-own-model so you choose what runs. The common thread is that your data does not leave your boundary to get an answer. Private AI is not a weaker model. It is the same work done inside your own boundary.

‍

Can you run capable agents locally?

Yes, and that is the part that changed. You can now run multiple AI agents on local hardware, including open models, with no cloud call at all. dotPilot, our open-source local agent orchestrator, runs several agents locally, connects to tools like Codex CLI, Claude Code, Copilot, and Gemini, or to local models through LLamaSharp and ONNX, and stays fully private with no cloud required. It is built on C#/.NET and the Microsoft Agent Framework.

For an individual engineer or a small regulated team, local agents cover a lot of real work without a single byte leaving the machine.

‍

What about enterprise scale?

That is a deployment problem, and it has a deployment answer. When a whole organization needs private AI, you move from local to a controlled platform: role-based agents, bring-your-own-model, and on-prem or private-cloud deployment, so the capability scales without the data leaving. AI Base, our private AI platform, is built for exactly this: secure, compliant AI inside finance, healthcare, and enterprise infrastructure.

The point is that "private" and "enterprise-grade" are not opposites. You can have both, with the right architecture.

‍

What do you give up, and what don't you?

You give up some convenience: you run the infrastructure, you manage the models, and the very largest frontier models may still live behind an API you cannot use for regulated data. What you do not give up is capability for most tasks, because local and open models have closed much of the gap, and you keep full control of your data and your compliance posture.

For regulated work, that trade is usually easy: a slightly smaller model you are allowed to use beats the best model you are not.

‍

How do you build a private agent setup?

A practical path:

Decide the boundary: what data must never leave, and where it lives.
Start local where you can: run agents on your own hardware with open or local models for sensitive tasks.
Scale to a private platform when the org needs it: on-prem or private cloud, role-based access, bring-your-own-model.
Keep the same discipline as any AI code: rules, verification, and a human gate, so private does not mean unreviewed.

dotPilot is open source if you want to start local (dotpilot.managed-code.com), and AI Base is the enterprise path (aibase.fr). If you want help designing the boundary before you build, that is where AI Discovery starts.

‍

News & Insights

View all

You know what you want to build. Let's go ship it.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

How do you run AI agents privately, without sending your data to the cloud?

Why can't regulated teams use cloud AI agents freely?

What does "private AI" actually mean?

Can you run capable agents locally?

What about enterprise scale?

What do you give up, and what don't you?

How do you build a private agent setup?

News & Insights

How much does AI save in customer support?

The AI productivity paradox: why time saved isn't money saved

AI ROI by industry: where the returns are highest

You know what you want to build. Let's go ship it.

managed code