feat: add content sanitizer to strip hidden text before AI processing by gentlemandev · Pull Request #2137 · elie222/inbox-zero

gentlemandev · 2026-04-04T00:41:22Z

Summary

Adds a new utility, utils/ai/content-sanitizer.ts, that strips invisible or hidden content from emails before they are passed to the AI
Prevents attackers from embedding invisible instructions in emails that the AI would process but users cannot see

What it strips

Zero-width Unicode: \u200B, \u200C, \u200D, \u2060, \uFEFF
RTL/LTR overrides: \u202A-\u202E, \u2066-\u2069
Hidden HTML elements: display:none, visibility:hidden, font-size:0, opacity:0
White-on-white text: color:#fff/#ffffff/white/rgb(255,255,255)
Offscreen positioning: position:absolute + left:-9999px
Zero-dimension elements: width:0/height:0 + overflow:hidden
HTML comments: <!-- ... -->
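
The two Unicode categories in the list above can be handled with plain character-class replacements. The following is a minimal sketch, not the PR's actual code: the names ZERO_WIDTH_SKETCH, BIDI_CONTROLS_SKETCH, and stripHiddenTextSketch are hypothetical, and only the code point ranges are taken from the list.

```typescript
// Hypothetical sketch; the patterns in the PR itself may differ.
// Zero-width characters: U+200B, U+200C, U+200D, U+2060, U+FEFF.
const ZERO_WIDTH_SKETCH = /[\u200B\u200C\u200D\u2060\uFEFF]/g;
// Directional controls: U+202A to U+202E and U+2066 to U+2069.
const BIDI_CONTROLS_SKETCH = /[\u202A-\u202E\u2066-\u2069]/g;

function stripHiddenTextSketch(text: string): string {
  // Remove both groups; all other characters are left untouched.
  return text.replace(ZERO_WIDTH_SKETCH, "").replace(BIDI_CONTROLS_SKETCH, "");
}
```

Keeping the two groups as separate patterns makes it easy to test or disable one category without touching the other.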

API

stripHiddenText(text: string): string      // for plain text
stripHiddenHtml(html: string): string      // for HTML emails
sanitizeForAI({ textPlain?, textHtml? })   // convenience wrapper
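
As a rough picture of what the HTML entry point might do, the sketch below removes comments and then drops elements whose inline style hides them. It is an assumption-laden illustration, not the PR's implementation: every name in it is hypothetical, it checks only two of the hidden-style rules, it handles only double-quoted style attributes, and it does not handle an element nested inside another element with the same tag name.

```typescript
// Hypothetical sketch of comment and hidden-element removal.
const HTML_COMMENT_SKETCH = /<!--[\s\S]*?-->/g;

// Only two of the listed rules are checked here, to keep the sketch short.
const HIDDEN_STYLE_SKETCH: RegExp[] = [
  /display\s*:\s*none/i,
  /visibility\s*:\s*hidden/i,
];

// Matches <tag ... style="...">...</tag>; group 1 is the tag, group 2 the style.
const STYLED_ELEMENT_SKETCH =
  /<([a-z][a-z0-9]*)\b[^>]*\bstyle\s*=\s*"([^"]*)"[^>]*>[\s\S]*?<\/\1\s*>/gi;

function stripHiddenHtmlSketch(html: string): string {
  const withoutComments = html.replace(HTML_COMMENT_SKETCH, "");
  return withoutComments.replace(
    STYLED_ELEMENT_SKETCH,
    (match: string, _tag: string, style: string) =>
      // Drop the whole element when its inline style hides it; keep it otherwise.
      HIDDEN_STYLE_SKETCH.some((pattern) => pattern.test(style)) ? "" : match,
  );
}
```

A regex pass like this is easy to audit but fragile on real-world markup; an HTML parser would be the sturdier choice if nesting matters.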

Not yet wired in

This PR adds the utility and tests. A follow-up PR will call sanitizeForAI() before aiChooseRule(), aiGenerateArgs(), and aiDraftReply().

Test plan

32 tests covering all hidden content types
Edge cases: nested hidden elements, mixed attacks, legitimate HTML preservation
Undefined/empty input handling

🤖 Generated with Claude Code

…I processing

Strips zero-width Unicode characters, RTL/LTR overrides, hidden HTML elements (display:none, visibility:hidden, zero font-size, opacity:0, white-on-white text, offscreen positioning), and HTML comments that attackers use to inject invisible instructions into email content.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

vercel · 2026-04-04T00:41:29Z

The latest updates on your projects. Learn more about Vercel for GitHub.

1 Skipped Deployment

Project	Deployment	Actions	Updated (UTC)
inbox-zero	Ignored		Apr 4, 2026 0:41am

apps/web/utils/ai/content-sanitizer.ts

+
+/** Strip hidden/invisible content from HTML before AI processing */
+export function stripHiddenHtml(html: string): string {
+  let result = html.replace(HTML_COMMENT, "");


cubic-dev-ai

2 issues found across 2 files

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="apps/web/utils/ai/content-sanitizer.ts">

<violation number="1" location="apps/web/utils/ai/content-sanitizer.ts:14">
P1: The `color` regex also matches `background-color`, causing false positives that remove visible content.</violation>

<violation number="2" location="apps/web/utils/ai/content-sanitizer.ts:46">
P2: `sanitizeForAI` drops empty-string inputs by using truthy checks; use an explicit `undefined` check so empty content is preserved.</violation>
</file>

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.

cubic-dev-ai · 2026-04-04T00:45:19Z

apps/web/utils/ai/content-sanitizer.ts

+  /visibility\s*:\s*hidden/i,
+  /font-size\s*:\s*0(?:px|em|rem|%|pt)?\s*[;"']/i,
+  /opacity\s*:\s*0\s*[;"']/i,
+  /color\s*:\s*(?:#fff(?:fff)?|white|rgb\(\s*255\s*,\s*255\s*,\s*255\s*\))\s*[;"']/i,


P1: The color regex also matches background-color, causing false positives that remove visible content.
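
The false positive can be reproduced directly, and one possible fix is to require that `color` starts a declaration. This is a sketch of a candidate fix, not necessarily what the author will adopt; COLOR_ORIGINAL copies the pattern from the diff above, while COLOR_ANCHORED_SKETCH and its leading group are assumptions.

```typescript
// Pattern as it appears in the diff: matches "color" anywhere, including
// the tail of "background-color".
const COLOR_ORIGINAL =
  /color\s*:\s*(?:#fff(?:fff)?|white|rgb\(\s*255\s*,\s*255\s*,\s*255\s*\))\s*[;"']/i;

// Candidate fix (hypothetical): "color" must sit at the start of the string
// or directly after a semicolon, quote, or whitespace.
const COLOR_ANCHORED_SKETCH =
  /(?:^|[;"'\s])color\s*:\s*(?:#fff(?:fff)?|white|rgb\(\s*255\s*,\s*255\s*,\s*255\s*\))\s*[;"']/i;

const visibleAttr = 'style="background-color:#fff;"';
const hiddenAttr = 'style="color:#fff;"';

console.log(COLOR_ORIGINAL.test(visibleAttr)); // true (the false positive)
console.log(COLOR_ANCHORED_SKETCH.test(visibleAttr)); // false
console.log(COLOR_ANCHORED_SKETCH.test(hiddenAttr)); // true
```

Where the runtime supports lookbehind, a negative lookbehind such as `(?<![\w-])` in front of `color` is an alternative that avoids consuming the preceding character.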

Prompt for AI agents

Check if this issue is valid. If so, understand the root cause and fix it. At apps/web/utils/ai/content-sanitizer.ts, line 14:

<comment>The `color` regex also matches `background-color`, causing false positives that remove visible content.</comment>

<file context>
@@ -0,0 +1,87 @@
+  /visibility\s*:\s*hidden/i,
+  /font-size\s*:\s*0(?:px|em|rem|%|pt)?\s*[;"']/i,
+  /opacity\s*:\s*0\s*[;"']/i,
+  /color\s*:\s*(?:#fff(?:fff)?|white|rgb\(\s*255\s*,\s*255\s*,\s*255\s*\))\s*[;"']/i,
+];
+
</file context>

cubic-dev-ai · 2026-04-04T00:45:19Z

apps/web/utils/ai/content-sanitizer.ts

+  textHtml?: string;
+}): { textPlain?: string; textHtml?: string } {
+  return {
+    textPlain: input.textPlain ? stripHiddenText(input.textPlain) : undefined,


P2: sanitizeForAI drops empty-string inputs by using truthy checks; use an explicit undefined check so empty content is preserved.
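
A version of the wrapper with explicit `undefined` checks might look like the sketch below. The two helper functions are simplified stand-ins (assumptions) so that the example runs on its own; only the shape of the input and return types is taken from the diff.

```typescript
// Simplified stand-ins for the real helpers, included only so the sketch runs.
function stripHiddenTextStandIn(text: string): string {
  return text.replace(/[\u200B\u200C\u200D\u2060\uFEFF]/g, "");
}
function stripHiddenHtmlStandIn(html: string): string {
  return html.replace(/<!--[\s\S]*?-->/g, "");
}

function sanitizeForAISketch(input: {
  textPlain?: string;
  textHtml?: string;
}): { textPlain?: string; textHtml?: string } {
  return {
    // Compare against undefined so that "" is sanitized and returned as "".
    textPlain:
      input.textPlain !== undefined
        ? stripHiddenTextStandIn(input.textPlain)
        : undefined,
    textHtml:
      input.textHtml !== undefined
        ? stripHiddenHtmlStandIn(input.textHtml)
        : undefined,
  };
}
```

With the truthy check, a caller cannot tell "the email had an empty body" apart from "no body was provided"; the explicit comparison keeps that distinction.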

Prompt for AI agents

Check if this issue is valid. If so, understand the root cause and fix it. At apps/web/utils/ai/content-sanitizer.ts, line 46:

<comment>`sanitizeForAI` drops empty-string inputs by using truthy checks; use an explicit `undefined` check so empty content is preserved.</comment>

<file context>
@@ -0,0 +1,87 @@
+  textHtml?: string;
+}): { textPlain?: string; textHtml?: string } {
+  return {
+    textPlain: input.textPlain ? stripHiddenText(input.textPlain) : undefined,
+    textHtml: input.textHtml ? stripHiddenHtml(input.textHtml) : undefined,
+  };
</file context>

github-advanced-security bot found potential problems Apr 4, 2026

apps/web/utils/ai/content-sanitizer.ts

/** Strip hidden/invisible content from HTML before AI processing */
export function stripHiddenHtml(html: string): string {
  let result = html.replace(HTML_COMMENT, "");
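
The alert text is not included here, so what follows is an assumption about the kind of problem such a scanner may flag on this line. One concern often raised about single-pass removal with `replace` is that deleting one match can splice the surrounding characters into a new match. A hedged sketch of repeating the replacement until the string stops changing, with hypothetical names throughout:

```typescript
// Hypothetical pattern; HTML_COMMENT in the PR is not shown in full.
const COMMENT_PATTERN_SKETCH = /<!--[\s\S]*?-->/g;

function stripCommentsUntilStable(html: string): string {
  let previous: string;
  let current = html;
  do {
    previous = current;
    // A single pass can leave a newly formed comment behind, so repeat.
    current = current.replace(COMMENT_PATTERN_SKETCH, "");
  } while (current !== previous);
  return current;
}

// One pass over this input yields "<!-- hidden -->text"; the loop yields "text".
const splicedInput = "<!-<!-- a -->- hidden -->text";
console.log(splicedInput.replace(COMMENT_PATTERN_SKETCH, ""));
console.log(stripCommentsUntilStable(splicedInput));
```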


gentlemandev mentioned this pull request Apr 4, 2026

test: add security pipeline integration tests #2138

Merged

2 tasks

gentlemandev wants to merge 1 commit into main from feat/content-sanitizer
