Secure Vibe Coding in 2026: The Files, Prompts and Rules of Use and Research

“For every 10 working features your AI ships, roughly 9 are exploitable.” — CMU Research, 2025
Overview
AI is getting better and building apps is easier than ever, but security isn’t keeping pace.
Secure vibe coding is the future, but even a routine change (a refactor, a bad flow, a feature update) can break something, introduce a novel vulnerability, or open a completely new attack surface. It happens to me on my side gigs and projects, and it has certainly happened to you many more times in a production lifecycle.
You prompted Cursor/Claude Code. It produced an architecture that looked flawless. You passed the check and deployed it. A few commits or PR merges later, one value gets changed, core logic or the database is exposed, and someone can bypass the subscription paywall. (This happened to me recently, and it was very bad. :/) Or maybe your agent starts deleting things on its own, like the Replit agent that nuked a production database even after being explicitly told to freeze changes, or OpenClaw clearing an inbox folder!
The thing is, AI knows what SQLi and XSS are. It knows RLS (Row Level Security). It considers everything, but it doesn’t apply it consistently. It has more information than any one person could hold, yet nobody tells it which policy has to apply where. That is why half the vibe-coded apps out there still leak user data when you tweak a user ID, even with RLS turned on.
This post is about the fixes and research I have been doing: how to run agentic engineering in 2026 without shipping the next breach. I divided it into 3 stages/tiers. Choose the tier that matches your current needs.
- Tier 1 : Personal projects, MVPs, weekend builds
- Tier 2 : Launched for real users, real database, but still at small scale
- Tier 3 : Mid-size production scale, agents with tools, dozens of teammates shipping daily on VCS
Put every file I talk about here in the root of your project; your agent will pick it up. (The defaults are safe for the Tier 1 category; for the other tiers you will need to adapt them, but the concept stays the same.)
How These Files Actually Work
- Cursor reads .cursorrules and AGENTS.md automatically
- Claude Code reads CLAUDE.md automatically, plus anything in .claude/
- Windsurf reads .windsurfrules
- GitHub Copilot reads .github/copilot-instructions.md
- Codex CLI reads agent definitions from ~/.codex/agents/ (global) or .codex/agents/ (project)
Tier 1: Personal Projects, MVPs
This is for people who build things casually, create MVPs for a product, or do college projects. A lot of this is basic, but I have seen many people who aren’t deep into security, or just aren’t up to date, skip these things or simply not know about them, given how much is happening every day. (This will be especially helpful for you, AJ.)
Do these 9 things and your MVP is already a lot more secure.
1. Drop CLAUDE.md in Your Project Root
This is really the most important thing in the post. Seriously. Keep it under 200 lines (for a better context window, and because longer files get ignored! Many people have reported this in the Claude community).
What you may not know: wrap critical sections in <important> tags so they survive context compaction.
You will find this file here . All files are available in this GitHub repo .

sample CLAUDE.md file
The dependency block is the most important part! It will save you from many supply chain attacks right at the prompt layer.
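For illustration, a minimal sketch of what such a dependency block might look like (the exact wording here is my assumption; adapt the version in the repo file):

```markdown
<important>
## Dependencies
- Never add a new dependency without asking me first.
- Before suggesting an install, verify the package actually exists on
  npm/PyPI and has a real download and maintenance history.
- Pin exact versions; no "latest" and no loose ranges for
  security-critical packages.
- Never run install scripts from a package you have not named to me.
</important>
```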
2. Use Plan Mode Before Every Feature. Always
Most people use it at the start of a project (me included) and then stop using it when implementing a new feature or making a huge change.
It’s simple math. If a feature involves 20 decisions and the agent is 80% accurate per decision, end-to-end accuracy is 0.8^20 ≈ 1.15%. Plan mode is a really simple thing that saves a lot of tokens, and you can review every code change it is about to make. My workflow now:
1. Enter plan mode
2. Type: "I want to add [feature]. Read CLAUDE.md and the relevant files.
Ask me 3-5 clarifying questions about edge cases and security before
proposing a plan."
3. Answer the questions
4. Get the plan, edit it inline if needed
5. Exit plan mode, let it implement
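The compounding effect behind that 1.15% figure can be checked in a couple of lines:

```typescript
// Per-decision accuracy compounds multiplicatively across a feature's decisions:
// 20 decisions at 80% each leaves very little end-to-end reliability.
function endToEndAccuracy(perDecision: number, decisions: number): number {
  return Math.pow(perDecision, decisions);
}

console.log(endToEndAccuracy(0.8, 20)); // ≈ 0.0115, i.e. ~1.15%
```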
3. Create .gitignore Before the First Commit
If the LLM forgets to create it before committing, secrets end up in git history. Also don’t forget to include .claude/sessions/ or .codex/sessions/, as they contain metadata from other chats, and check your LLM’s .claude/settings file as well. (I got a bug through this on an open-source project, since the LLM was auto-accepting edits even in default mode.)
Sample .gitignore file here.
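If you don’t want to grab the repo file right away, a minimal starting point might look like this (trim it to your stack):

```gitignore
# secrets
.env
.env.*
!.env.example

# agent session metadata
.claude/sessions/
.codex/sessions/

# build output and dependencies
node_modules/
dist/
```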
4. Use Managed Auth From the Start
Pick any one: Auth0, Clerk, Supabase Auth, Lucia, Firebase Auth. Then tell the agent: “Use Auth0 for authentication. Protect all routes with server middleware. No client-side auth checks and no custom JWT.”
You may not believe it, but many breaches happen because of this. Enrichlead was shut down completely within 2 days because of it. A few examples:
- Enrichlead — March 2025 (Source: Reddit )
- Base44 — July 2025 (Source: WIZ )
- Tea app breach — July 2025 (Source: Hacker News )
- Cursor CurXecute / MCPoison — August 2025 (Source: Tenable )
- Loveable exposure — April 2026 (Source: Loveable.dev )
5. Create a Sample .env.example Template Before Committing
If the AI pastes a key directly into code, catch it immediately and tell it to move it. The CLAUDE.md rules make this rare, but it still slips through.
Sample file here .
6. Private by Default Deploys
Vercel, Netlify, Supabase, Railway: they all have a “preview deployments are private” option. (I only learned about this recently.)
Also: no console.log(user) or console.log(session) in production. If your style is to debug via logging, then don’t forget to remove it.
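If logging request context is part of your debugging style, a small redaction helper keeps user and session objects out of your logs. This is a generic sketch; the field names are assumptions you should adjust to your data model:

```typescript
// Strip sensitive fields before anything reaches console or a log drain.
const SENSITIVE = new Set(["password", "token", "session", "apiKey", "email"]);

function redact(obj: Record<string, unknown>): Record<string, unknown> {
  const out: Record<string, unknown> = {};
  for (const [key, value] of Object.entries(obj)) {
    out[key] = SENSITIVE.has(key) ? "[REDACTED]" : value;
  }
  return out;
}

console.log(redact({ id: 42, email: "a@b.c", token: "sk-123" }));
// → { id: 42, email: '[REDACTED]', token: '[REDACTED]' }
```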
7. A Self-Review System
Use a new LLM (preferably GPT 5.5 — xhigh) or at least a new session for this:
Now act as a Senior Security Engineer reviewing the code in the commit history.
Be adversarial. Don't reassure me. Check for:
1. Injection (SQL, XSS, command, prompt, etc)
2. Auth/authorization bypasses (especially client-side checks)
3. Secrets exposure (logs, errors, env-like vars hitting the client)
4. Missing input validation
5. Insecure defaults (open CORS, no rate limit, wildcard permissions)
6. Hallucinated dependencies (verify each new import exists on npm/PyPI)
For each finding: file, line, severity, fix. No reassurance.
8. CAPTCHA
Cloudflare Turnstile. It’s free and invisible, with just one tag. Put it on every public form. It will save your future cloud bills.
It’s really powerful against headless browsers and especially scrapers. (That’s why I hate them too!)
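The widget tag alone isn’t enough; the token it produces has to be verified server-side against Cloudflare’s siteverify endpoint. A sketch (the Fetcher indirection is only there so the network call can be stubbed; in production you would pass the real fetch):

```typescript
// Server-side verification of a Cloudflare Turnstile token.
const VERIFY_URL = "https://challenges.cloudflare.com/turnstile/v0/siteverify";

type Fetcher = (url: string, init: { method: string; body: URLSearchParams }) =>
  Promise<{ json(): Promise<{ success: boolean }> }>;

async function verifyTurnstile(
  token: string,   // value posted by the widget with the form
  secret: string,  // your Turnstile secret key, from env/vault
  fetcher: Fetcher,
): Promise<boolean> {
  const body = new URLSearchParams({ secret, response: token });
  const res = await fetcher(VERIFY_URL, { method: "POST", body });
  return (await res.json()).success === true; // reject the submission when false
}
```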
9. Verify the Package Exists Before You Install It
This is why supply chain attacks are hitting so hard, and slopsquatting is eating a lot of people in 2026. LLMs still sometimes invent package names that don’t exist or aren’t popular enough, and auto-install turns that into pulling an attacker-controlled npm/PyPI package.
January 2026, react-codeshift (source: Aikido security research):
A hallucinated npm package called react-codeshift spread to 237 repos through AI-generated agent skill files. Nobody created it on purpose. The AI invented the name, somebody could have registered it, and every agent that read those skill files would have installed it.
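A cheap guard before any install is to ask the public registry whether the name exists at all. This sketch queries the npm registry metadata endpoint; the Fetch indirection is only for testability, and the 404 heuristic is mine, not an official API contract:

```typescript
// Check a package name against the public npm registry before installing.
// A 404 means the name does not exist, i.e. possibly hallucinated; a name
// that exists but was registered very recently is also a slopsquatting flag.
type Fetch = (url: string) => Promise<{ status: number }>;

async function packageExists(name: string, fetcher: Fetch): Promise<boolean> {
  const res = await fetcher(`https://registry.npmjs.org/${encodeURIComponent(name)}`);
  return res.status === 200;
}
```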

Tier 2: Real Users, Real Database, but at Small Scale
You have made some progress and launched to the real world, and now you have Postgres, Stripe, and real traffic in your analytics (maybe 50–100 paying users).
Now you need proper architecture and threat modelling in your code. For this tier I came up with these 10 additional methods on top of the previous 9.
Things get real from here!
1. Write Planning Files BEFORE the LLM Writes Code
Plan mode alone isn’t sufficient now. We need solid files to provide context and save tokens. The more you let your model guess, the more hallucination and security risk you get. I came up with a 4-file architecture for this, plus a bonus file:
- PRD.md — what you are building and why
- ARCHITECTURE.md — DFD flow, stacks, components, and how they connect
- DATA_MODEL.md — all your tables, relations, RLS policies
- THREAT_MODEL.md — what you are protecting and from whom
And add them to your CLAUDE.md / AGENTS.md so the agent reads them at the start of every new session. (Play it smart here: write as much of these files yourself as you can, so they contain only what’s required.)
You will find all files on this GitHub repo, here .

sample THREAT_MODEL.md file
The threat model file matters most for preventing breaches: indirect prompt injection hit 84% success rates against Cursor and Copilot in Liu et al.’s November 2025 study, and that file (mostly) protects you from it. The MCP STDIO issue is a whole storm of its own, covered in Tier 3.
BONUS — Indexes of Functions (IOF)
I noticed one more useful thing: Indexes of Functions.
The idea is simple. Your codebase is a bit mature now, with thousands of LOC, and your agent burns more tokens every time it reads a file. Simple fix: create a small .md next to every big source file, listing every function with its parameters and a one-line purpose. So db.py gets db.md and api.py gets api.md.
You will consume more tokens generating them, but you can use local LLMs or a cheap model for this (I don’t consider Gemini 3.1 Pro a SOTA model, it’s just stupidly dumb and slow). That’s it. Then add a line to your CLAUDE.md: “read db.md to find what you are looking for in db.py when adding a feature related to it.”

sample bonus file — database.py as db.md
2. Have RLS From Day One and Then Break It
Every Supabase tutorial says to turn on RLS, and that has become the new normal for small architectures. But does the policy actually work? The AI knows how to test it, but we need to tell it to:
You are now User B with the anon key and a valid session.
User A has rows you should NOT be able to read.
Try to read them. List every query, header, and parameter you'd use.
Then fix the RLS to make those queries return nothing.
Then try again as User B and confirm.
Reya Vir did a great job highlighting this problem with her own methods. She says, “Coding agents are optimized to eliminate errors, not understand them.” Which means that if a “Permission Denied” error fires during a DB query, the agent will happily resolve it by setting the policy to USING (true). The prompt above catches exactly that.
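For reference, the shape the prompt should converge on looks something like this in Postgres/Supabase (table and column names here are illustrative, and auth.uid() is Supabase’s helper):

```sql
alter table notes enable row level security;

-- Owner-only reads: never USING (true) on user data.
create policy "notes_owner_select" on notes
  for select
  using (auth.uid() = user_id);
```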
3. Rate Limit on the Backend!
Please never rate limit on the frontend. It’s security theater. AI is smart enough to implement it on the backend, but you need to say so! Otherwise, depending on your other memory with the LLM and your way of working, it may use frontend rate limiting for convenience.
I won’t go deep into why backend rate limiting is superior; express-rate-limit alone may be all you need. Read this Reddit post (the comments and OP’s question) for more.
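express-rate-limit is the easy path; for intuition, backend rate limiting is essentially a counter keyed by client that lives on the server. A dependency-free fixed-window sketch (a real deployment would key on IP or user ID and share state, e.g. in Redis):

```typescript
// Fixed-window rate limiter. Because the state is server-side, a client
// cannot bypass it the way it can bypass any frontend check.
class RateLimiter {
  private hits = new Map<string, { count: number; windowStart: number }>();
  constructor(private limit: number, private windowMs: number) {}

  allow(key: string, now: number = Date.now()): boolean {
    const entry = this.hits.get(key);
    if (!entry || now - entry.windowStart >= this.windowMs) {
      this.hits.set(key, { count: 1, windowStart: now }); // new window
      return true;
    }
    entry.count += 1;
    return entry.count <= this.limit; // over the limit → respond 429
  }
}
```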
4. Sensitive API Calls Go Through Your Backend. Always
Stripe, SendGrid, OpenAI, Anthropic, Twilio — backend only. The agent MAY suggest a “convenient” frontend fetch with the key in a meta tag. It’s never convenient. It’s a $900 bill.
5. Move Secrets From .env to Real Vault
Having secrets only in a .env file is no longer ideal now that you have real users and data. AWS Secrets Manager, GCP Secret Manager, Doppler, and Infisical all offer a vault to handle secrets. Any one of them costs less than a $20 LLM plan.
6. Security Agent
Now we need a dedicated security agent to review the codebase regularly. Create .claude/agents/security-reviewer.md.
You can find this file here .

sample security-reviewer.md file
Then in your session: “Use the security-reviewer subagent to review my last commit.” It cannot write; it only reports, and it can say NO. This is the basic helper-model pattern at the filesystem level.
7. Block Sketchy Installs at the Hook Layer
Now we need to be careful with how our LLM’s settings.json file is written. A sample file is here.

sample LLM settings.json
The basic idea: it blocks any package install until you confirm it; the LLM cannot decide to skip the check. The deny entries for --no-verify and -n really matter — this was found as a real bypass in a Claude Code GH issue, March 2026.
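A hedged sketch of the shape such a settings.json can take, using Claude Code’s permission patterns (verify the exact syntax against the repo file and the current docs before relying on it):

```json
{
  "permissions": {
    "ask": [
      "Bash(npm install:*)",
      "Bash(pip install:*)",
      "Bash(npx:*)"
    ],
    "deny": [
      "Bash(git commit --no-verify:*)",
      "Bash(git commit -n:*)"
    ]
  }
}
```

Installs land in "ask" so you confirm each one; the hook-bypass flags land in "deny" so they can never run.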
8. Pre-Commit Hooks for Secrets and Obvious Vulns
These tools should be in place when you start building. (Optional but recommended.)
brew install gitleaks
pip install pre-commit
pre-commit install
And a .pre-commit-config.yaml, which you will find here on the GitHub repo.
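At minimum, the config needs the gitleaks hook; a small sketch (pin rev to the latest gitleaks release rather than the placeholder used here):

```yaml
repos:
  - repo: https://github.com/gitleaks/gitleaks
    rev: v8.18.4
    hooks:
      - id: gitleaks
```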

sample .pre-commit-config.yaml
9. Security Scan in CI on Every PR
A security workflow must be there in your GitHub repo. I am giving a simple file with Snyk, SBOM, and Semgrep, but you should customize it to your requirements.
Sample file can be found here .

sample security.yaml
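A stripped-down sketch of the Semgrep part of such a workflow (the Snyk and SBOM jobs from the repo file would sit alongside it; --error fails the job on findings):

```yaml
name: security
on: [pull_request]

jobs:
  semgrep:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: pip install semgrep
      - run: semgrep scan --config auto --error
```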
10. Reset Context
This should be a no-brainer, and since Opus 4.7, managing the context window is a real struggle. Compaction eats a lot of tokens, and accuracy drops as the context window grows. Since you already have the files above, you don’t need to re-provide context from scratch in every new session.
Tier 3: Mid-Size Production, Agents With Tools, and Dozens of Teammates on VCS
You need to be very careful here if you are using AI for your code, because things can go wrong very fast. You shouldn’t vibe code a feature casually. From my research, here are 10 ways to make your vibe coding SECURE.
1. JIT Permissions for Every Agent
Just as with normal user accounts, we treat agents/subagents the same. No long-lived service accounts with role:admin (you don’t have any unused service account lying around, right? 👀)
At minimum, follow RBAC strictly. Agents must be scoped to exactly the resources and operations they need, with ephemeral tokens via AWS STS, GCP short-lived tokens, or GitHub fine-grained PATs.
2. Run the Security Agent in the CI, Not Just Locally
The Tier 2 security reviewer agent ran on your machine; in Tier 3 it runs in the CI pipeline on every PR. Pair it with the code reviewer agent (for correctness) and the docs reviewer agent (for consistency). These three agents running in parallel, each reading your full PR with their own context, should make your PR code safe before merging.
3. Human in the Loop
Before any action involving deletes, payments, permission changes, customer-facing sends, or production deploys, the LLM should draft a summary and you validate it. The Replit incident happened because there was no human in the loop.
4. Policy-as-Code (PaC)
Use OPA or Cedar. Policies live in VCS like code, are reviewable, and state things like: “This agent can call this tool only when these conditions hold.”
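In OPA’s Rego, such a rule might look like this sketch (the agent, tool, and path names are made up for illustration; your input document will differ):

```rego
package agent.authz

import rego.v1

default allow := false

# The deploy agent may only call read-only tools inside the workspace.
allow if {
    input.agent == "deploy-agent"
    input.tool in {"read_file", "list_dir"}
    startswith(input.path, "/workspace/")
}
```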
5. Scan IaC in CI on Every Terraform/K8s Change
You are most likely doing this already, I hope! But still be extra careful, because supply chain attacks didn’t spare Trivy either (March 2026).
Block the merge on privileged: true, 0.0.0.0/0, Action: "*", missing aws_s3_bucket_public_access_block, and the rest. The agent has seen these in training data thousands of times.
6. MCP vs CLI — Better Approach
As we know MCP is going down on the hype and it’s not that useful because it basically gets context of every tool on the first run and you burn on the context window. That’s where Corsair study and their own method comes into picture.
The fundamental solution they are providing is calling only required tools by their own way. I don’t have any affiliations with them but the idea is really nice to implement and that really does consume a lot less tokens and context window.
7. MCP Servers — Read This Carefully
In April 2026, OX Security disclosed a design-level command injection across Anthropic’s official MCP SDKs in Python, TypeScript, Java, and Rust. Roughly 200,000 vulnerable instances were affected. Cursor, Windsurf, Claude Code, Gemini-CLI, GitHub Copilot — all were affected through their MCP JSON config. Windsurf CVE-2026-30615 is a 0-click: open a public repo with a malicious MCP config and get RCE.
If your stack uses an MCP server, do all of this (HIGHLY RECOMMENDED):
- Disable STDIO MCP servers if you don’t need them.
- If you must use STDIO, allowlist commands by absolute path. Don’t accept relative paths or env wrappers.
- Never install MCP servers from public marketplaces without review.
- Sandbox every MCP server in a container. Read-only filesystem except the working dir. No outbound network except an allowlist.
- Patch list: LiteLLM ( CVE-2026-30623 ), Flowise ( CVE-2026-40933 ), Cursor ( CVE-2026-30615 ), Windsurf ( CVE-2026-30615 ), Agent Zero ( CVE-2026-30624 ). Update or isolate.
- Audit every MCP tool invocation.
8. Serve Security Context Through a Hardened MCP Server
Simple conceptual sketch:
// security-context-server.ts
// Run in a sandboxed container. HTTP transport only. Auth required.
// No STDIO. No marketplace. Internal use only.
server.tool("get_threat_model", async () => {
  return readFile("./THREAT_MODEL.md");
});
server.tool("check_dependency", async ({ name, version }) => {
  // Calls Socket.dev / Snyk / npm registry
  return await verifyPackage(name, version);
});
server.tool("validate_iac", async ({ file }) => {
  return await scanIaC(file); // tfsec / Checkov in a sandbox
});
server.tool("get_rls_pattern", async ({ table }) => {
  return getRLSTemplate(table);
});
The reasoning: the agent pulls security context on demand, and the rules live outside the context window. HTTP plus auth, not STDIO; sandboxed and logged.
9. Cost Cap Per Agent Session
Set a max token budget per session, a max number of tool calls per session, and a max thinking cap. Use Claude Code session budget settings and code-intelligence plugins, and decide with your own workflow how many tokens and dollars to spend.
10. Audit Log Every Agent Action
Make a dedicated audit DB for all your agent calls. It will help tremendously when something goes wrong. It’s simple and very effective.
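Before you graduate to a dedicated DB, the simplest version is an append-only JSON-lines log written before every agent action; a sketch (schema fields are my assumptions):

```typescript
import { appendFileSync } from "node:fs";

// One JSON line per agent action: when, which agent, which tool, which args.
// Append-only, so a misbehaving agent cannot quietly rewrite its history.
function auditLog(file: string, agent: string, tool: string, args: unknown): void {
  const entry = { ts: new Date().toISOString(), agent, tool, args };
  appendFileSync(file, JSON.stringify(entry) + "\n");
}
```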
Final Thoughts
This is the research I did over the last month: finding the root causes of recent supply chain attacks, tracking most of them, and working out how we can protect ourselves while using AI.
The “vibe coding is dead” takes are mostly right. What’s left is agentic engineering, which is actually better because you write the specs, the agent executes, and you review and ship it. You’ll ship a much more mature tool this way, developed fast and more feature-rich.
But that won’t work if you skip the basic parts. You need to adopt planning mode, threat modeling, security-agent review, context resets, dependency verification, blocking install-hook bypasses, and auditing and logging MCP calls.
Do this on every project, starting with Tier 1. It builds muscle memory, and you will become a more secure developer.
You can find all the files from this post in this GitHub repo: secure-vibe-starter. Leave it a star, open an issue, or discuss directly with me if you have more suggestions.
Also published on Medium .