Privacy-First Dev Copilot: Deep Dive¶

Date: March 10, 2026 Previous Verdict: KILL (Tabby, Continue, Cody exist. Narrow wedge.) Revised Verdict: CONDITIONAL GO (39/50) with seed funding Trigger: Founder willing to raise seed + build larger team. Air-gapped niche is structurally protected.

The Platform Risk: What It Actually Means¶

The General Market Is Absorbed (This Is Dead)¶

Platform	Valuation/ARR	Users/Share	Key 2025-2026 Moves
GitHub Copilot	Part of Microsoft	20M users, 42% share, 90% Fortune 100	Agent Mode, Mission Control, multi-model (Claude/Gemini/GPT), built-in security scanning, Copilot Metrics GA
Cursor	$29.3B val, $2B+ ARR	~18-25% share, Fortune 500	NVIDIA + Google as strategic investors. Fastest B2B SaaS to $1B ARR ever (24 months)
Windsurf/Codeium	Carved up	Dead as independent	Google $2.4B acqui-hire (July 2025) + Cognition $250M for remaining assets

Features that were startups 18 months ago are now platform checkboxes: agent mode, multi-file edits, code review, test generation, security scanning, multi-model support. You cannot build a general-purpose copilot startup.

The Hard Architectural Boundary (This Is Alive)¶

Neither Cursor nor GitHub Copilot can serve air-gapped environments. Neither has a roadmap to do so.

GitHub Copilot: Requires active network to Microsoft-hosted APIs. No offline mode. Copilot is NOT available on GitHub Enterprise Server (on-premises). The "self-hosted runner" support (Oct 2025) is for CI compute, not inference.
Cursor: Cloud-native on AWS. "Privacy Mode" = zero retention on Cursor servers, but processing still happens there. No on-prem architecture.
Why this won't change: Microsoft won't cannibalize Azure/GitHub.com revenue. Cursor won't rearchitect for a niche market. This is a business model conflict, not a feature gap.

What Cloud Tools CAN Offer (Partial Measures)¶

Amazon Q Developer: Agent within your VPC (hybrid), but inference still routes through AWS endpoints. Not true air-gap.
Google Gemini Code Assist: VPC Service Controls + GitHub Enterprise Server integration, but AI processing on Google's infrastructure.
These serve "compliance theater" needs but not genuine ITAR/CMMC requirements.

Who Cannot Use Cloud AI Coding Tools (The Protected Market)¶

Defense Contractors (ITAR/CMMC)¶

Criminal liability for sending defense-related source code to cloud AI services without explicit authorization. CMMC finalized autumn 2025, now applies to entire US defense industrial base. - Primes: Lockheed Martin, Raytheon, Northrop Grumman, L3Harris, Booz Allen, SAIC - Classified program offices - Intelligence community contractors - 500K-700K software engineers who legally cannot use Cursor/Copilot

Financial Services¶

Data residency requirements (GDPR, national regulations)
OCC/FRB model risk guidance (SR 11-7) requiring auditability
Source code as trade secret under contractual obligations
Goldman Sachs, JPMorgan, Deutsche Bank actively evaluating on-prem options

Healthcare¶

HIPAA: dev environments indexing clinical systems can expose PHI in prompts
FDA-regulated medical device software faces additional scrutiny
Average healthcare breach: $7.42M

Sovereign AI (Government)¶

EU: eIDAS 2.0, GDPR, EU AI Act driving non-US-cloud requirements
France's SNCF already uses Mistral Code
FedRAMP required for US federal civilian agencies

Competitive Landscape: Self-Hosted Players (March 2026)¶

Tabnine Enterprise -- The Incumbent¶

Status: Enterprise-only (killed free tier April 2025). All-in on self-hosted.
Key win: Named Gartner Magic Quadrant "Visionary" (Sept 2025) -- significant sales validation
Dell/NVIDIA partnership (GTC 2025): Turnkey air-gapped deployment on Dell PowerEdge R760xa (L40S) and XE9680 (H100). Hardware-software bundle.
Enterprise Context Engine (GA Feb 2026): Connects AI to repos, Jira, docs without exposing data to public models
Deployment: SaaS, private VPC, on-prem, fully air-gapped (zero outbound connections)
Weakness: Wraps third-party models -- no proprietary model. As Mistral Code gains ground, model quality becomes a liability.

Mistral Code -- The Model Quality Leader¶

Launched: June 2025, air-gapped GA Q3 2025
Stack: Codestral (completion) + Codestral Embed (search) + Devstral (agentic) + Mistral Medium (chat)
Fine-tuning on private code: Customers can train on internal repos. Dramatically improves accuracy.
Customers: Capgemini, Abanca (bank), SNCF (French railway)
80+ programming languages
Weakness: Strong EU, weak US defense presence. No FedRAMP. No turnkey hardware bundle.

Tabby (TabbyML) -- Best Open-Source¶

GitHub: 33K stars, v0.32.0 (Jan 2026), consistent releases
Funding: Only $3.2M seed (2023). No subsequent rounds identified.
Architecture: Fully self-hostable Rust binary. Code completion, chat, inline editing, Answer Engine.
Enterprise features: LDAP auth, GitLab MR indexing, REST APIs, custom branding
Model support: Any GGUF/GGML model via Ollama/llama.cpp. IBM Granite, CodeLlama, StarCoder, DeepSeek Coder.
License: Apache-2.0 (commercially usable)
Weakness: Tiny team, no sales infrastructure, no hardware partnerships.

Others¶

Player	Status	Notes
IBM Granite/Bob	Active. Bob 1.0 launched March 2026	Multi-model (Granite + Claude + Llama + Mistral). Enterprise distribution. Not a focused copilot.
Augment Code	$479M raised, ISO 42001 certified	VPC/on-prem options. 200K token context. Credit-based pricing $20-$200/mo.
Sourcegraph Cody	Enterprise-only. Killed consumer July 2025	Launched "Amp" replacement. Leverages code search/intelligence. $2.6B valuation.
Continue.dev	Pivoted away from copilot	Now "source-controlled AI checks in CI." $5.6M funding. IDE extension is legacy.
Embedder	Small, defense-focused	Claims Tesla, NVIDIA, General Dynamics engineers. Air-gapped tiers. Under-the-radar.
Coder.com	Self-hosted CDE platform	Not a copilot but infrastructure for AI agents. MCP integration. Defense/finance customers.
FauxPilot	Effectively dead	Community project. Uses old CodeGen models. Not enterprise-grade.

What Killed CodeGate (Archived June 2025)¶

CodeGate (by Stacklok, founded by Kubernetes co-creator Craig McLuckie) was a local proxy between IDE and AI assistant. It redacted secrets, protected PII, scanned for vulnerable libraries.

Why it died: 1. Middleware approach was wrong. Cursor/Copilot added native security scanning, eroding the value of a third-party shim. 2. Adoption friction. Requiring every developer to install and maintain a local daemon was a UX burden. 3. Wrong abstraction layer. The real problem isn't "intercept the prompt" -- it's "don't send the prompt to the cloud at all." On-prem inference eliminates CodeGate's entire threat model. 4. Stacklok pivoted to enterprise MCP platform -- governance for AI agent tool access. The leadership concluded agent-tool-access governance is a larger, more durable problem than prompt security for code completion.

Lesson: Don't build middleware on top of cloud tools. Build the self-hosted alternative.

TAM Estimation¶

By Segment¶

Segment	Developers	Price Range	Annual TAM
US Defense (ITAR/CMMC)	500K-700K	$25-50/seat/mo	$1.5B-$4.2B
Financial Services (global)	1M+ (10-15% near-term)	$20-40/seat/mo	$2.4B-$7.2B
Healthcare	200K+	$20-40/seat/mo	$0.5B-$1.0B
Sovereign/Government	300K+	$25-50/seat/mo	$0.9B-$1.8B

Realistic addressable (next 3 years): $2-5B/year, mostly uncaptured. Current penetration: Near zero. These buyers are watching non-regulated colleagues use Cursor with visible productivity gains but have no compliant equivalent.

Pricing Benchmarks¶

Sourcegraph Cody Enterprise: $59/user/month ($708/year)
Tabnine air-gapped enterprise: $50-80/user/month (estimated)
100-engineer defense contractor: $60K-$96K/year
500-engineer organization: $300K-$480K/year

The Unfilled Gap¶

No single product combines all four: 1. Cursor-quality developer experience 2. True air-gap (zero outbound connections, not VPC theater) 3. ITAR/CMMC compliance documentation + FedRAMP posture 4. Turnkey hardware-software bundle

Player	DX Quality	True Air-Gap	ITAR/CMMC Compliance	HW Bundle
Tabnine	Medium	Yes	Partial	Yes (Dell/NVIDIA)
Mistral Code	High	Yes	No (EU-focused)	No
Tabby	Medium	Yes	No	No
IBM	Low-medium	Yes	Partial (existing gov relationships)	Yes (via IBM infra)
Gap	High	Yes	Yes (full CMMC + FedRAMP)	Yes

What to Build¶

Not "another coding assistant." The compliance packaging + DX layer on open-source inference:

CMMC/ITAR compliance documentation package (SSP templates, audit log formats, approved model list management, ISSO/ISSM approval workflows) -- the gap CodeGate left when it died
Admin dashboard for policy management, model governance, usage analytics
Turnkey deployment on standard GPU hardware (Dell/Supermicro + NVIDIA)
IDE integration leveraging Tabby (Apache-2.0) or building custom on Ollama
Fine-tuning pipeline for customer-specific codebases (competitive with Mistral Code)

Revenue: $1K-$5K/month per enterprise deployment. Defense contracts at $50K-$500K/year.

Founder Fit Assessment¶

Dimension	Rating	Evidence
Technical capability	5/5	llama.cpp/CUDA, MCP integration, local inference expertise
Domain knowledge	3/5	Dev tools (Uber), but no defense procurement experience
Market access	2/5	No existing defense contractor relationships
Speed to MVP	4/5	Compliance packaging on Tabby/Ollama is weeks, not months
Competitive positioning	3/5	Tabnine + Mistral Code are well-funded, credible incumbents

Why This Scores 39/50 (Lower Than Edge MLOps at 42)¶

Tabnine is formidable. Gartner-recognized, Dell/NVIDIA bundles, years of enterprise sales.
Mistral Code has better models. If they pursue US defense, they compress the niche.
FedRAMP is $200K-$500K and 12-18 months. Can consume seed runway before first sale.
Defense sales cycles are 12-18 months. Seed needs to cover that.
Your domain expertise is compliance (Cash App), not defense procurement. Different buyer persona.
No proprietary model advantage. You'd be packaging open-source models -- differentiation is in compliance/DX, not model quality.

Signals to Monitor¶

GitHub Copilot on-prem announcement: Low probability but catastrophic. Would compress the entire niche.
Mistral Code US expansion / FedRAMP pursuit: Major threat to Tabnine and any new entrant.
Tabby Series A: Would signal ability to build sales infrastructure.
ITAR enforcement action involving cloud AI: A breach disclosure catalyzes demand faster than any sales motion.
DeepSeek Coder V3 / Qwen3-Code quality: As open-weight models match frontier quality, self-hosting economics improve for everyone.

Sources¶

TabbyML GitHub, website, TechCrunch ($3.2M raise)
Continue.dev: Tracxn, GitHub, pricing page
Sourcegraph: blog (Cody plan changes), Wikipedia, TrueUp (layoffs)
GitHub Copilot: GitHub docs, changelog (self-hosted runners Oct 2025)
Cursor: CNBC ($29.3B valuation), TechCrunch ($2B ARR), SaaStr ($1B ARR milestone)
Windsurf: DeepLearning.AI The Batch, TechRepublic (Google $2.4B)
Mistral Code: mistral.ai blog, VentureBeat
Tabnine: blog (GTC 2025, air-gapped, Enterprise Context Engine), GlobeNewswire
CodeGate: GitHub (archived), Stacklok website (MCP pivot)
IBM: developer.ibm.com, IT Jungle (Bob 1.0)
Augment Code: TechCrunch ($252M launch)
Market sizing: GlobeNewswire/SNS Insider, Mordor Intelligence
Air-gapped analysis: IntuitionLabs, TrueFoundry (ITAR guide)
Coder.com: blog (enterprise AI platform)