Telescopetest-io: add AI content filtering by Judyzc · Pull Request #144 · cloudflare/telescope

Judyzc · 2026-02-20T19:09:19Z

Related to #143. This PR sets up AI content filtering for telescopetest.io, as described in the issue.

Added a new content_rating column in D1 the tests metadata table. Auto-generated a migration file (0002) for this with Prisma by following the README
Added workers AI binding for a text model (https://developers.cloudflare.com/workers-ai/models/llama-guard-3-8b/) and an image model (https://developers.cloudflare.com/workers-ai/models/llama-3.2-11b-vision-instruct/). These are used in lib/ai/ai-content-rater.ts, which adds the function rateUrlContent().
This function rateUrlContent() is called on in upload.ts POST endpoint with waitUntil() and always returns either SAFE or UNSAFE. If this AI content check gets interrupted by user refresh, it can be called again by the telescopetest.io/results/[testId] page, which now polls (with the GET endpoint tests/[testId]/rating) and blocks displaying results until a rating is given.
If a user wants to see unsafe content, they can do so locally (development env) with ENABLE_AI_RATING=false in a .dev.vars file as described in the README. Then, unsafe content will be displayed on the /results page with a flag.
Tested on staging.

QUESTION/REQUEST:

Any good way to test the quality/accuracy of this AI content filter? I've been using movies but I'm not sure if there's a better way.

misc:

fixed 'name' field on results list to not cut off letters

telescopetest-io/migrations/0002_add_content_rating.sql

sufian-cf · 2026-02-23T22:49:59Z

telescopetest-io/src/lib/ai/ai-content-rater.ts

+    .replace(/<(script|style|noscript|head|template)[\s\S]*?<\/\1>/gi, '')
+    .replace(/<[^>]+>/g, ' ')
+    .replace(/&amp;/g, '&')
+    .replace(/&lt;/g, '<')
+    .replace(/&gt;/g, '>')
+    .replace(/&quot;/g, '"')
+    .replace(/&#39;/g, "'")
+    .replace(/&nbsp;/g, ' ')
+    .replace(/&[a-z]+;/gi, ' ')
+    .replace(/\s+/g, ' ')


Hmm, I think this sort of replacement won't work across newlines, and is omitting valid escaped text.

We should most likely be parsing the HTML and extracting the text nodes (most likely via https://developer.mozilla.org/en-US/docs/Web/API/DOMParser/parseFromString) from the parsed document.

Also, do we need to extract text at all? Like, assuming the content scanner is an LLM capable of sifting through structured documents, it probably could be passed the HTML document as-is and make a determination on the content?

I think the [\s\S] part of the regex allows it to work over newlines, shown here and through testing.

I don't think DOMParser works with Cloudflare workers, explained here, though I might be wrong. Cloudflare has its own HTMLRewriter tool I could use but that adds in streaming. There's also this third-party library linkedom I could try using, but what are your thoughts?

For needing to extract text, the LLM seems to be for conversation like strings: https://developers.cloudflare.com/workers-ai/models/llama-guard-3-8b/, so I haven't actually tested with just the HTML document. I can probably try this too though.

sufian-cf · 2026-02-23T22:50:38Z

telescopetest-io/src/lib/ai/ai-content-rater.ts

+        .join(' ')
+        .replace(/\s+/g, ' ')
+        .trim()
+        .slice(0, 4000);


We're intentionally only scanning the first ~4k characters?

We can probably increase this to ~100,000 chars b/c the model can take 131,072 tokens, but yeah we can definitely increase this.

sergeychernyshev · 2026-02-26T23:00:11Z

telescopetest-io/src/lib/ai/ai-content-rater.ts

+    signal: AbortSignal.timeout(10_000),
+  });
+  const html = await response.text();
+  return html


@Judyzc did you have any success with sending HTML to the agent here?

…to find bad? sites

…t max only 2 AI calls, tinkered with ai-content-filtering

…G to IN_PROGRESS to be less confusing

Judyzc changed the title ~~Temp ai~~ telescopetest-io: add AI content filtering Feb 20, 2026

Judyzc marked this pull request as ready for review February 20, 2026 19:10

Judyzc requested a review from a team February 20, 2026 19:10

Judyzc marked this pull request as draft February 20, 2026 19:11

Judyzc marked this pull request as ready for review February 20, 2026 19:32

Judyzc force-pushed the temp-ai branch from 2f9b3cc to 9c692b6 Compare February 23, 2026 20:24

sufian-cf reviewed Feb 23, 2026

View reviewed changes

telescopetest-io/migrations/0002_add_content_rating.sql Show resolved Hide resolved

sufian-cf reviewed Feb 23, 2026

View reviewed changes

Judyzc force-pushed the temp-ai branch from 9c692b6 to c091f5e Compare February 24, 2026 15:19

Judyzc changed the title ~~telescopetest-io: add AI content filtering~~ Telescopetest-io: add AI content filtering Feb 24, 2026

sergeychernyshev reviewed Feb 26, 2026

View reviewed changes

Judyzc force-pushed the temp-ai branch from c091f5e to a008398 Compare February 27, 2026 20:54

sergeychernyshev added the ticket This label indicates that internal ticket was created to track it. label Mar 2, 2026

Judyzc added 9 commits March 2, 2026 14:24

init commit

154be1e

rebased off main

de2fb11

fixed prisma schema, made 2nd migration, tested content rating, need …

04c7e1b

…to find bad? sites

temp state, might be broken

af7a809

add dev.vars, fix migration, add 'rating' enum to content_rating so a…

2c53b84

…t max only 2 AI calls, tinkered with ai-content-filtering

fix name field height

78bb620

add to readme

c2109c2

changed rating.ts to return unknown when want to block, changed RATIN…

32b4bbd

…G to IN_PROGRESS to be less confusing

fixing rebase off main

59e858a

Judyzc force-pushed the temp-ai branch from 56b101a to 59e858a Compare March 2, 2026 20:28

fixed rebase off main

2306349

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Telescopetest-io: add AI content filtering#144

Telescopetest-io: add AI content filtering#144
Judyzc wants to merge 10 commits intocloudflare:mainfrom
Judyzc:temp-ai

Judyzc commented Feb 20, 2026 •

edited

Loading

Uh oh!

Uh oh!

sufian-cf Feb 23, 2026

Uh oh!

Judyzc Feb 24, 2026

Uh oh!

sufian-cf Feb 23, 2026

Uh oh!

Judyzc Feb 24, 2026

Uh oh!

sergeychernyshev Feb 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Judyzc commented Feb 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

sufian-cf Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Judyzc Feb 24, 2026

Choose a reason for hiding this comment

Uh oh!

sufian-cf Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Judyzc Feb 24, 2026

Choose a reason for hiding this comment

Uh oh!

sergeychernyshev Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Judyzc commented Feb 20, 2026 •

edited

Loading