llm/skills: add skill for debugging buildkite/ci failures by aljoscha · Pull Request #35248 · MaterializeInc/materialize

aljoscha · 2026-02-27T14:35:18Z

No description provided.

github-actions · 2026-02-27T14:35:28Z

Thanks for opening this PR! Here are a few tips to help make the review process smooth for everyone.

PR title guidelines

Use imperative mood: "Fix X" not "Fixed X" or "Fixes X"
Be specific: "Fix panic in catalog sync when controller restarts" not "Fix bug" or "Update catalog code"
Prefix with area if helpful: compute: , storage: , adapter: , sql:

Pre-merge checklist

The PR title is descriptive and will make sense in the git log.
This PR has adequate test coverage / QA involvement has been duly considered. (trigger-ci for additional test/nightly runs)
If this PR includes major user-facing behavior changes, I have pinged the relevant PM to schedule a changelog post.
This PR has an associated up-to-date design doc, is a design doc (template), or is sufficiently small to not require a design.
If this PR evolves an existing $T ⇔ Proto$T mapping (possibly in a backwards-incompatible way), then it is tagged with a T-proto label.
If this PR will require changes to cloud orchestration or tests, there is a companion cloud PR to account for those changes that is tagged with the release-blocker label (example).

antiguru

Dis you figure out a way for the cli to fetch artifacts? I failed because it'd need graphql permissions.

aljoscha · 2026-02-27T19:39:45Z

Dis you figure out a way for the cli to fetch artifacts? I failed because it'd need graphql permissions.

I fixed this by giving the token graphql access. It doesn't allow limiting by scopes, but ... 🤷‍♂️

def- · 2026-02-27T23:05:32Z

.claude/skills/debug-ci/SKILL.md

+## Step 1: Extract PR number
+
+Parse `$ARGUMENTS` to get the PR number. Handle both formats:
+- Plain number: `35192`
+- Full URL: `https://github.com/MaterializeInc/materialize/pull/35192`
+
+## Step 2: List failing checks
+
+```bash
+gh pr checks <PR_NUMBER> 2>&1
+```
+
+Filter the output to lines containing `fail`. Each line has tab-separated fields:
+```
+name	fail	0	https://buildkite.com/materialize/<PIPELINE>/builds/<BUILD>#<JOB_ID>	description
+```
+
+Extract from the URL:
+- **Pipeline**: path segment after `materialize/` (usually `test`)
+- **Build number**: the number after `builds/`
+- **Job ID**: the UUID after `#`


It might be easier to just do

bk build list --branch=def-:pr-fix-secret-cli

where def- is the username of my fork and pr-fix-secret-cli the branch name, instead of these github api roundtrips.

Claude said it wouldn't want to add this, because it's to complicated to know the username of the fork, but we did incorporate the other suggestions. 🙇‍♂️

def- · 2026-02-27T23:08:45Z

.claude/skills/debug-ci/SKILL.md

+- **Build number**: the number after `builds/`
+- **Job ID**: the UUID after `#`
+
+## Step 3: Fetch logs in triage order


I would look at the annotation for test failures first, before looking into logs. Can save you a bunch of tokens or grepping around.

def- · 2026-02-27T23:09:31Z

.claude/skills/debug-ci/SKILL.md

+### Testdrive cascades
+After one test crashes environmentd, all subsequent tests in that shard fail with `Name or service not known` or `connection closed`. **Only the first failure in a shard matters** — everything after it is a cascade. Look for the first `error:` or `FAIL` in the log.
+
+Testdrive shards with the same number (e.g., `testdrive-10` and `testdrive-with-alloydb-10`) run the same tests — if both fail, it's the same root cause.


Suggested change

Testdrive shards with the same number (e.g., `testdrive-10` and `testdrive-with-alloydb-10`) run the same tests — if both fail, it's the same root cause.

Testdrive shards with the same number (e.g., `testdrive-10` and `testdrive-with-alloydb-10`) run the same tests — if both fail, it's likely to be the same root cause.

def- · 2026-02-27T23:10:35Z

.claude/skills/debug-ci/SKILL.md

+1. **Root cause A** — description, which jobs it affects, what to fix
+2. **Root cause B** — description, which jobs it affects, what to fix
+
+Distinguish between issues that are clearly caused by the PR's changes vs. pre-existing flaky tests.


Pre-existing flaky tests can often be discovered through the annotations, which will link to the issue that is causing it.

aljoscha requested review from antiguru, bosconi and mtabebe February 27, 2026 16:44

antiguru reviewed Feb 27, 2026

View reviewed changes

aljoscha requested a review from antiguru February 27, 2026 19:39

antiguru approved these changes Feb 27, 2026

View reviewed changes

def- reviewed Feb 27, 2026

View reviewed changes

aljoscha force-pushed the push-srwssurrzvmx branch from 5f48e2d to 320c6dd Compare March 1, 2026 13:50

llm/skills: add skill for debugging buildkite/ci failures

b6d0a08

aljoscha force-pushed the push-srwssurrzvmx branch from 320c6dd to b6d0a08 Compare March 2, 2026 06:51

aljoscha merged commit ee7b07a into MaterializeInc:main Mar 2, 2026
5 checks passed

aljoscha deleted the push-srwssurrzvmx branch March 2, 2026 11:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llm/skills: add skill for debugging buildkite/ci failures#35248

llm/skills: add skill for debugging buildkite/ci failures#35248
aljoscha merged 1 commit intoMaterializeInc:mainfrom
aljoscha:push-srwssurrzvmx

aljoscha commented Feb 27, 2026

Uh oh!

github-actions bot commented Feb 27, 2026

Uh oh!

antiguru left a comment

Uh oh!

aljoscha commented Feb 27, 2026

Uh oh!

def- Feb 27, 2026

Uh oh!

aljoscha Mar 2, 2026

Uh oh!

def- Feb 27, 2026

Uh oh!

def- Feb 27, 2026

Uh oh!

def- Feb 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	Testdrive shards with the same number (e.g., `testdrive-10` and `testdrive-with-alloydb-10`) run the same tests — if both fail, it's the same root cause.
	Testdrive shards with the same number (e.g., `testdrive-10` and `testdrive-with-alloydb-10`) run the same tests — if both fail, it's likely to be the same root cause.

Conversation

aljoscha commented Feb 27, 2026

Uh oh!

github-actions bot commented Feb 27, 2026

PR title guidelines

Pre-merge checklist

Uh oh!

antiguru left a comment

Choose a reason for hiding this comment

Uh oh!

aljoscha commented Feb 27, 2026

Uh oh!

def- Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

aljoscha Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

def- Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

def- Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

def- Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants