Skip to content

Privacy-First Dev Copilot: Deep Dive

Date: March 10, 2026 Previous Verdict: KILL (Tabby, Continue, Cody exist. Narrow wedge.) Revised Verdict: CONDITIONAL GO (39/50) with seed funding Trigger: Founder willing to raise seed + build larger team. Air-gapped niche is structurally protected.


The Platform Risk: What It Actually Means

The General Market Is Absorbed (This Is Dead)

Platform Valuation/ARR Users/Share Key 2025-2026 Moves
GitHub Copilot Part of Microsoft 20M users, 42% share, 90% Fortune 100 Agent Mode, Mission Control, multi-model (Claude/Gemini/GPT), built-in security scanning, Copilot Metrics GA
Cursor $29.3B val, $2B+ ARR ~18-25% share, Fortune 500 NVIDIA + Google as strategic investors. Fastest B2B SaaS to $1B ARR ever (24 months)
Windsurf/Codeium Carved up Dead as independent Google $2.4B acqui-hire (July 2025) + Cognition $250M for remaining assets

Features that were startups 18 months ago are now platform checkboxes: agent mode, multi-file edits, code review, test generation, security scanning, multi-model support. You cannot build a general-purpose copilot startup.

The Hard Architectural Boundary (This Is Alive)

Neither Cursor nor GitHub Copilot can serve air-gapped environments. Neither has a roadmap to do so.

  • GitHub Copilot: Requires active network to Microsoft-hosted APIs. No offline mode. Copilot is NOT available on GitHub Enterprise Server (on-premises). The "self-hosted runner" support (Oct 2025) is for CI compute, not inference.
  • Cursor: Cloud-native on AWS. "Privacy Mode" = zero retention on Cursor servers, but processing still happens there. No on-prem architecture.
  • Why this won't change: Microsoft won't cannibalize Azure/GitHub.com revenue. Cursor won't rearchitect for a niche market. This is a business model conflict, not a feature gap.

What Cloud Tools CAN Offer (Partial Measures)

  • Amazon Q Developer: Agent within your VPC (hybrid), but inference still routes through AWS endpoints. Not true air-gap.
  • Google Gemini Code Assist: VPC Service Controls + GitHub Enterprise Server integration, but AI processing on Google's infrastructure.
  • These serve "compliance theater" needs but not genuine ITAR/CMMC requirements.

Who Cannot Use Cloud AI Coding Tools (The Protected Market)

Defense Contractors (ITAR/CMMC)

Criminal liability for sending defense-related source code to cloud AI services without explicit authorization. CMMC finalized autumn 2025, now applies to entire US defense industrial base. - Primes: Lockheed Martin, Raytheon, Northrop Grumman, L3Harris, Booz Allen, SAIC - Classified program offices - Intelligence community contractors - 500K-700K software engineers who legally cannot use Cursor/Copilot

Financial Services

  • Data residency requirements (GDPR, national regulations)
  • OCC/FRB model risk guidance (SR 11-7) requiring auditability
  • Source code as trade secret under contractual obligations
  • Goldman Sachs, JPMorgan, Deutsche Bank actively evaluating on-prem options

Healthcare

  • HIPAA: dev environments indexing clinical systems can expose PHI in prompts
  • FDA-regulated medical device software faces additional scrutiny
  • Average healthcare breach: $7.42M

Sovereign AI (Government)

  • EU: eIDAS 2.0, GDPR, EU AI Act driving non-US-cloud requirements
  • France's SNCF already uses Mistral Code
  • FedRAMP required for US federal civilian agencies

Competitive Landscape: Self-Hosted Players (March 2026)

Tabnine Enterprise -- The Incumbent

  • Status: Enterprise-only (killed free tier April 2025). All-in on self-hosted.
  • Key win: Named Gartner Magic Quadrant "Visionary" (Sept 2025) -- significant sales validation
  • Dell/NVIDIA partnership (GTC 2025): Turnkey air-gapped deployment on Dell PowerEdge R760xa (L40S) and XE9680 (H100). Hardware-software bundle.
  • Enterprise Context Engine (GA Feb 2026): Connects AI to repos, Jira, docs without exposing data to public models
  • Deployment: SaaS, private VPC, on-prem, fully air-gapped (zero outbound connections)
  • Weakness: Wraps third-party models -- no proprietary model. As Mistral Code gains ground, model quality becomes a liability.

Mistral Code -- The Model Quality Leader

  • Launched: June 2025, air-gapped GA Q3 2025
  • Stack: Codestral (completion) + Codestral Embed (search) + Devstral (agentic) + Mistral Medium (chat)
  • Fine-tuning on private code: Customers can train on internal repos. Dramatically improves accuracy.
  • Customers: Capgemini, Abanca (bank), SNCF (French railway)
  • 80+ programming languages
  • Weakness: Strong EU, weak US defense presence. No FedRAMP. No turnkey hardware bundle.

Tabby (TabbyML) -- Best Open-Source

  • GitHub: 33K stars, v0.32.0 (Jan 2026), consistent releases
  • Funding: Only $3.2M seed (2023). No subsequent rounds identified.
  • Architecture: Fully self-hostable Rust binary. Code completion, chat, inline editing, Answer Engine.
  • Enterprise features: LDAP auth, GitLab MR indexing, REST APIs, custom branding
  • Model support: Any GGUF/GGML model via Ollama/llama.cpp. IBM Granite, CodeLlama, StarCoder, DeepSeek Coder.
  • License: Apache-2.0 (commercially usable)
  • Weakness: Tiny team, no sales infrastructure, no hardware partnerships.

Others

Player Status Notes
IBM Granite/Bob Active. Bob 1.0 launched March 2026 Multi-model (Granite + Claude + Llama + Mistral). Enterprise distribution. Not a focused copilot.
Augment Code $479M raised, ISO 42001 certified VPC/on-prem options. 200K token context. Credit-based pricing $20-$200/mo.
Sourcegraph Cody Enterprise-only. Killed consumer July 2025 Launched "Amp" replacement. Leverages code search/intelligence. $2.6B valuation.
Continue.dev Pivoted away from copilot Now "source-controlled AI checks in CI." $5.6M funding. IDE extension is legacy.
Embedder Small, defense-focused Claims Tesla, NVIDIA, General Dynamics engineers. Air-gapped tiers. Under-the-radar.
Coder.com Self-hosted CDE platform Not a copilot but infrastructure for AI agents. MCP integration. Defense/finance customers.
FauxPilot Effectively dead Community project. Uses old CodeGen models. Not enterprise-grade.

What Killed CodeGate (Archived June 2025)

CodeGate (by Stacklok, founded by Kubernetes co-creator Craig McLuckie) was a local proxy between IDE and AI assistant. It redacted secrets, protected PII, scanned for vulnerable libraries.

Why it died: 1. Middleware approach was wrong. Cursor/Copilot added native security scanning, eroding the value of a third-party shim. 2. Adoption friction. Requiring every developer to install and maintain a local daemon was a UX burden. 3. Wrong abstraction layer. The real problem isn't "intercept the prompt" -- it's "don't send the prompt to the cloud at all." On-prem inference eliminates CodeGate's entire threat model. 4. Stacklok pivoted to enterprise MCP platform -- governance for AI agent tool access. The leadership concluded agent-tool-access governance is a larger, more durable problem than prompt security for code completion.

Lesson: Don't build middleware on top of cloud tools. Build the self-hosted alternative.


TAM Estimation

By Segment

Segment Developers Price Range Annual TAM
US Defense (ITAR/CMMC) 500K-700K $25-50/seat/mo $1.5B-$4.2B
Financial Services (global) 1M+ (10-15% near-term) $20-40/seat/mo $2.4B-$7.2B
Healthcare 200K+ $20-40/seat/mo $0.5B-$1.0B
Sovereign/Government 300K+ $25-50/seat/mo $0.9B-$1.8B

Realistic addressable (next 3 years): $2-5B/year, mostly uncaptured. Current penetration: Near zero. These buyers are watching non-regulated colleagues use Cursor with visible productivity gains but have no compliant equivalent.

Pricing Benchmarks

  • Sourcegraph Cody Enterprise: $59/user/month ($708/year)
  • Tabnine air-gapped enterprise: $50-80/user/month (estimated)
  • 100-engineer defense contractor: $60K-$96K/year
  • 500-engineer organization: $300K-$480K/year

The Unfilled Gap

No single product combines all four: 1. Cursor-quality developer experience 2. True air-gap (zero outbound connections, not VPC theater) 3. ITAR/CMMC compliance documentation + FedRAMP posture 4. Turnkey hardware-software bundle

Player DX Quality True Air-Gap ITAR/CMMC Compliance HW Bundle
Tabnine Medium Yes Partial Yes (Dell/NVIDIA)
Mistral Code High Yes No (EU-focused) No
Tabby Medium Yes No No
IBM Low-medium Yes Partial (existing gov relationships) Yes (via IBM infra)
Gap High Yes Yes (full CMMC + FedRAMP) Yes

What to Build

Not "another coding assistant." The compliance packaging + DX layer on open-source inference:

  1. CMMC/ITAR compliance documentation package (SSP templates, audit log formats, approved model list management, ISSO/ISSM approval workflows) -- the gap CodeGate left when it died
  2. Admin dashboard for policy management, model governance, usage analytics
  3. Turnkey deployment on standard GPU hardware (Dell/Supermicro + NVIDIA)
  4. IDE integration leveraging Tabby (Apache-2.0) or building custom on Ollama
  5. Fine-tuning pipeline for customer-specific codebases (competitive with Mistral Code)

Revenue: $1K-$5K/month per enterprise deployment. Defense contracts at $50K-$500K/year.


Founder Fit Assessment

Dimension Rating Evidence
Technical capability 5/5 llama.cpp/CUDA, MCP integration, local inference expertise
Domain knowledge 3/5 Dev tools (Uber), but no defense procurement experience
Market access 2/5 No existing defense contractor relationships
Speed to MVP 4/5 Compliance packaging on Tabby/Ollama is weeks, not months
Competitive positioning 3/5 Tabnine + Mistral Code are well-funded, credible incumbents

Why This Scores 39/50 (Lower Than Edge MLOps at 42)

  1. Tabnine is formidable. Gartner-recognized, Dell/NVIDIA bundles, years of enterprise sales.
  2. Mistral Code has better models. If they pursue US defense, they compress the niche.
  3. FedRAMP is $200K-$500K and 12-18 months. Can consume seed runway before first sale.
  4. Defense sales cycles are 12-18 months. Seed needs to cover that.
  5. Your domain expertise is compliance (Cash App), not defense procurement. Different buyer persona.
  6. No proprietary model advantage. You'd be packaging open-source models -- differentiation is in compliance/DX, not model quality.

Signals to Monitor

  • GitHub Copilot on-prem announcement: Low probability but catastrophic. Would compress the entire niche.
  • Mistral Code US expansion / FedRAMP pursuit: Major threat to Tabnine and any new entrant.
  • Tabby Series A: Would signal ability to build sales infrastructure.
  • ITAR enforcement action involving cloud AI: A breach disclosure catalyzes demand faster than any sales motion.
  • DeepSeek Coder V3 / Qwen3-Code quality: As open-weight models match frontier quality, self-hosting economics improve for everyone.

Sources

  • TabbyML GitHub, website, TechCrunch ($3.2M raise)
  • Continue.dev: Tracxn, GitHub, pricing page
  • Sourcegraph: blog (Cody plan changes), Wikipedia, TrueUp (layoffs)
  • GitHub Copilot: GitHub docs, changelog (self-hosted runners Oct 2025)
  • Cursor: CNBC ($29.3B valuation), TechCrunch ($2B ARR), SaaStr ($1B ARR milestone)
  • Windsurf: DeepLearning.AI The Batch, TechRepublic (Google $2.4B)
  • Mistral Code: mistral.ai blog, VentureBeat
  • Tabnine: blog (GTC 2025, air-gapped, Enterprise Context Engine), GlobeNewswire
  • CodeGate: GitHub (archived), Stacklok website (MCP pivot)
  • IBM: developer.ibm.com, IT Jungle (Bob 1.0)
  • Augment Code: TechCrunch ($252M launch)
  • Market sizing: GlobeNewswire/SNS Insider, Mordor Intelligence
  • Air-gapped analysis: IntuitionLabs, TrueFoundry (ITAR guide)
  • Coder.com: blog (enterprise AI platform)