
Shipping AI-Assisted IDEs Without Bleeding Secrets

Alprina Security Team

When a Helpful Autocomplete Became an Incident

During a routine sprint, one of our senior engineers opened VS Code to tighten up a payments route. Copilot Enterprise dutifully hoovered up every open buffer, including config/staging.env, an AWS keypair sitting in an editor tab, and a migration script referencing dormant PII tables. A minute later those files showed up in the telemetry payload sent to the vendor's cloud. Worse, the completion it returned "simplified" the handler by deleting a call to verifySessionSignature() because the model misread a comment. Neither problem would have surfaced if code review hadn't flagged the missing auth guard. Instead, we spent the rest of the day rotating keys, scrubbing logs, and writing the policy we should have shipped before giving an LLM full sight of the repo.

If you are rolling out AI pair-programming inside regulated or security-critical workspaces, assume three classes of damage: context exfiltration (anything the assistant reads can leave the laptop), silent integrity issues (the assistant edits security boundaries confidently but incorrectly), and zero reviewer context (diffs contain changes without explaining why the bot made them). This article lays out the layered guardrails we now require before any IDE extension gets access to our source tree.

Problem Deep Dive - Where IDE Assistants Leak

LLM extensions touch the IDE in four places:

  1. Context harvesting. Extensions like Copilot, Cody, and Codeium read open buffers, git diff, clipboard contents, command palettes, and sometimes entire project trees to build prompts. Anything the developer opens, even out of curiosity, goes upstream unless you filter it.
  2. Code application. Suggestions typically insert or delete AST nodes without understanding trust boundaries. We've seen completions remove authorize() middleware, soften regex validation, or swap constant-time comparisons for naïve equality.
  3. Telemetry. Many enterprise deployments keep human-in-the-loop traces for training. If you let the assistant read staged data or production logs, you just created an unmanaged data export.
  4. Reviewer visibility. Diffs show the code change but not what context the model saw, which files it read, or which policy rules blocked earlier completions. Without breadcrumbs, reviewers cannot decide whether to trust the edit.

The takeaway: treat an IDE assistant like any untrusted service with read/write access. You need allowlists, policy engines, audit trails, and fallback paths before enabling it outside hobby repos.

Technical Solutions - Guardrails in Layers

1. Fence the Filesystem First

Give the assistant a curated view of the workspace. In VS Code you can intercept document events and refuse to ingest sensitive paths:

import * as vscode from "vscode";
import { minimatch } from "minimatch";

const SAFE = ["src/**", "lib/**", "docs/**", "*.md"];
const DENY = ["**/*.env", "secrets/**", "config/**", "**/id_rsa", "tmp/**"];

function shouldShare(uri: vscode.Uri): boolean {
  const rel = vscode.workspace.asRelativePath(uri, false);
  return SAFE.some((p) => minimatch(rel, p)) && !DENY.some((p) => minimatch(rel, p));
}

vscode.workspace.onDidOpenTextDocument((doc) => {
  if (!shouldShare(doc.uri)) {
    // Count the block so security can see what the assistant tried to read.
    telemetry.bump("blocked_file", doc.uri.fsPath);
    return;
  }
  contextCache.ingest(doc);
});

Apply similar logic in JetBrains via VirtualFileListener and in Zed by wrapping the language server's document store. Ship the allow/deny lists from a central repo so security can update them without shipping a new extension build.
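
As a sketch of that central distribution piece, the extension can pull the globs at activation and on an interval instead of bundling them; the policy endpoint, response shape, and refresh cadence below are assumptions, not a real Alprina API:

// Refresh allow/deny globs from a central policy service (hypothetical endpoint).
const POLICY_URL = "https://policies.internal.example.com/ide-context.json";

// Mutable copy of the lists used by shouldShare(); starts from conservative defaults.
const policy = {
  safe: ["src/**"],
  deny: ["**/*.env", "secrets/**"],
};

async function refreshPolicies(): Promise<void> {
  const res = await fetch(POLICY_URL);
  if (!res.ok) return; // keep the last known-good lists on failure
  const body = (await res.json()) as { safe: string[]; deny: string[] };
  policy.safe = body.safe;
  policy.deny = body.deny;
}

// Refresh at activation, then hourly, so security can tighten rules without
// shipping a new extension build.
refreshPolicies();
setInterval(refreshPolicies, 60 * 60 * 1000);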

Terminal capture needs the same treatment. Run the assistant behind a local proxy (mitmproxy, Envoy, or a tiny Rust service) that replaces known secret patterns before traffic leaves the machine:

import re

PATTERNS = [
    re.compile(rb"aws_access_key_id=AKIA[0-9A-Z]{16}"),
    re.compile(rb"-----BEGIN PRIVATE KEY-----[\s\S]+?-----END PRIVATE KEY-----"),
]

def scrub(payload: bytes) -> bytes:
    """Replace known secret patterns before the request leaves the machine."""
    for pattern in PATTERNS:
        payload = pattern.sub(b"[REDACTED]", payload)
    return payload

2. Enforce Code-Integrity Policies on Every Completion

Before applying a completion, parse the edit and run targeted policies. For example, prevent Express routes from losing authorize() middleware:

import traverse from "@babel/traverse";
import * as t from "@babel/types";

function assertAuth(ast: t.File) {
  traverse(ast, {
    CallExpression(path) {
      const callee = path.node.callee;
      // Only inspect calls like router.post(...) / app.post(...).
      if (
        t.isMemberExpression(callee) &&
        t.isIdentifier(callee.property) &&
        callee.property.name === "post"
      ) {
        const args = path.node.arguments;
        const hasAuth = args.some((arg) => t.isIdentifier(arg) && arg.name === "authorize");
        if (!hasAuth) {
          throw new Error("AI edit removed authorize() middleware");
        }
      }
    },
  });
}

Reject the edit in-editor and show the policy violation inline. Developers still get velocity, but the assistant can't silently gut defenses.
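
A minimal sketch of how that check might gate the apply step, assuming the proposed completion is available as a source string; validateCompletion and showPolicyViolation are hypothetical names, not part of any extension API:

import { parse } from "@babel/parser";

// Hypothetical hook the extension runs before applying an AI edit to the buffer.
function validateCompletion(proposedSource: string): boolean {
  const ast = parse(proposedSource, {
    sourceType: "module",
    plugins: ["typescript"],
  });
  try {
    assertAuth(ast);                  // policy from above
    return true;                      // safe to apply the edit
  } catch (err) {
    showPolicyViolation(String(err)); // surface the violation inline (hypothetical helper)
    return false;                     // reject the completion in-editor
  }
}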

3. Record Context Metadata for Reviewers

When the assistant applies a diff, capture metadata alongside the edit:

{
  "assistant": "copilot-enterprise",
  "model": "gpt-4o",
  "prompt": {
    "files": ["src/routes/payments.ts"],
    "truncated_bytes": 2048,
    "blocked": ["config/staging.env"]
  },
  "policy_version": "2024-08-15"
}

Store this in .vscode/ai-edits.log or .idea/ai-history.json and commit it with the change. Reviewers see what the bot saw, which policies fired, and can reconstruct the edit if needed.
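
On the logging side, a small sketch that appends one newline-delimited JSON record per applied edit to .vscode/ai-edits.log; the record shape mirrors the example above and the helper name is an assumption:

import { appendFileSync } from "node:fs";
import { join } from "node:path";

interface AiEditRecord {
  assistant: string;
  model: string;
  prompt: { files: string[]; truncated_bytes: number; blocked: string[] };
  policy_version: string;
}

// One JSON object per line; the file is committed alongside the change so
// reviewers can reconstruct exactly what context the assistant used.
function logAiEdit(workspaceRoot: string, record: AiEditRecord): void {
  const logPath = join(workspaceRoot, ".vscode", "ai-edits.log");
  appendFileSync(logPath, JSON.stringify(record) + "\n");
}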

4. Offer Local-Only Modes for Restricted Repos

Bundle a local fallback: embeddings via sqlite-vec, inference via llama.cpp or Ollama, and a retrieval index built on sanitized source. Gate this mode with repo metadata (e.g., .ai-policy.yml) so regulated projects never send context to the cloud. Engineers still get autocomplete; secrets stay on disk.
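
One way to wire that gate, sketched below; the .ai-policy.yml field name (mode) and its values are assumptions rather than an established schema:

import { existsSync, readFileSync } from "node:fs";
import { join } from "node:path";
import { parse as parseYaml } from "yaml";

type AiMode = "cloud" | "local-only" | "disabled";

// Read .ai-policy.yml from the repo root. Default to local-only when the file
// is missing so a misconfigured repo fails closed instead of leaking context.
function resolveAiMode(workspaceRoot: string): AiMode {
  const policyPath = join(workspaceRoot, ".ai-policy.yml");
  if (!existsSync(policyPath)) return "local-only";
  const policy = parseYaml(readFileSync(policyPath, "utf8")) as { mode?: AiMode };
  return policy.mode ?? "local-only";
}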

5. Issue Scoped Tokens via a Proxy

Never hand long-lived enterprise tokens to the extension. Instead, require every request to go through a proxy that introspects and scopes usage:

POST /v1/introspect
Authorization: Bearer dev_xxx

{ "repo": "payments-service", "classification": "internal", "expires_at": 172488 } 

The proxy drops any request that references disallowed paths or exceeds context budgets. Rotate developer tokens automatically; log every allowed/blocked request for audits.
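
A compressed sketch of the proxy's decision, assuming the extension's request body lists the files and byte count it wants to send; the field names and budget are illustrative:

import { minimatch } from "minimatch";

interface CompletionRequest {
  repo: string;
  files: string[];      // workspace-relative paths the extension wants to include
  contextBytes: number; // total prompt context size
}

const MAX_CONTEXT_BYTES = 64 * 1024;

// Returns false when the request touches a denied path or exceeds the context
// budget; the caller responds 403 and logs the attempt.
function allowRequest(req: CompletionRequest, denyGlobs: string[]): boolean {
  const touchesDenied = req.files.some((f) => denyGlobs.some((g) => minimatch(f, g)));
  if (touchesDenied) return false;
  if (req.contextBytes > MAX_CONTEXT_BYTES) return false;
  return true;
}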

Testing & Verification

  1. Secret scanners on cache directories. Nightly job runs gitleaks --no-git against .ai-cache/** to ensure the assistant cache never contains secrets.
  2. Policy unit tests. Maintain fixtures of known-bad edits (missing auth, weaker regex) and ensure the policy engine rejects them; see the sketch after this list.
  3. Proxy integration tests. Simulate extension traffic with blocked files and verify the proxy returns HTTP 403.
  4. Prompt-injection drills. Drop PROMPT.md files instructing the model to dump secrets. Guards should veto the request and emit telemetry.
  5. Dashboarding. Track allowed vs. blocked reads, average context size, and local-mode usage. Alerts fire when a repo suddenly produces large blocked-read spikes.
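
For item 2, a minimal Jest-style test that feeds a known-bad fixture to the assertAuth policy from earlier; the fixture and runner setup are assumptions:

import { parse } from "@babel/parser";

// Known-bad fixture: an Express route whose authorize() middleware was removed.
const MISSING_AUTH_FIXTURE = `
  router.post("/payments", async (req, res) => {
    res.json(await createPayment(req.body));
  });
`;

test("rejects edits that drop authorize() middleware", () => {
  const ast = parse(MISSING_AUTH_FIXTURE, { sourceType: "module", plugins: ["typescript"] });
  expect(() => assertAuth(ast)).toThrow("AI edit removed authorize() middleware");
});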

Common Questions

Do developers have to label every file? No. Start with conservative defaults (src/** allowed) plus auto-generated deny lists from git-secrets. Give teams an override file for legitimate cases.

Won't constant blocking annoy engineers? Provide inline explanations and a "request access" workflow. The goal is friction when secrets are at risk, not permanent denial.

Is running a local model worth the cost? For restricted repositories, yes. Even a smaller open model beats the risk of streaming PII to a vendor.

How do we ensure policies stay current? Version them. Each edit log stores policy_version; when the rules change, regenerate or re-validate outstanding completions so reviews stay aligned with the current policy.

Conclusion

AI-assisted IDEs are worth the productivity gains, but only if you treat them like any untrusted service with repo-wide privileges. Put fences around the filesystem, inspect every edit with targeted policies, capture provenance for reviewers, offer local-only modes for sensitive work, and route all traffic through scoped proxies. With those pieces in place, your developers can keep vibe-coding with copilots while your secrets and auth guards stay exactly where they belong.