Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
@@ -1,20 +1,27 @@
# Publishes the SWE-Lancer task browser to GitHub Pages (Project site: /swelancer-tasks/).
# Unified GitHub Pages deployment workflow.
# Builds both benchmark result pages and SWE-Lancer task browser,
# then deploys the full docs/ directory to GitHub Pages.
#
# Repository setting: Settings → Pages → Build and deployment → Source: GitHub Actions.

name: Deploy SWE-Lancer task pages
name: Deploy to GitHub Pages

on:
push:
branches:
- main
- master
branches: [main, master]
paths:
# Benchmark results
- 'public/data/leaderboard.json'
- 'scripts/build-results-pages.ts'
# SWE-Lancer task browser
- 'repos/frontier-evals/project/swelancer/all_swelancer_tasks.csv'
- 'scripts/build-swelancer-pages.ts'
- 'docs/swelancer-tasks/**'
- '.github/workflows/swelancer-pages.yml'
# Shared static assets and workflows
- 'docs/**'
- '.github/workflows/gh-pages.yml'
- 'package.json'
- 'bun.lock'
# Allow manual trigger
workflow_dispatch:

permissions:
Expand All @@ -23,7 +30,7 @@ permissions:
id-token: write

concurrency:
group: swelancer-pages
group: pages
cancel-in-progress: false

jobs:
Expand All @@ -33,19 +40,27 @@ jobs:
- name: Checkout
uses: actions/checkout@v4

- name: Init frontier-evals submodule (CSV source)
- name: Init frontier-evals submodule (CSV source for SWE-Lancer pages)
run: git submodule update --init repos/frontier-evals

- name: Setup Bun
uses: oven-sh/setup-bun@v2
with:
bun-version: latest

- name: Install dependencies
run: bun install --frozen-lockfile

- name: Build tasks.json and verify static assets
- name: Build benchmark result pages
run: bun scripts/build-results-pages.ts

- name: Build SWE-Lancer task pages
run: bun run build:swelancer-pages

- name: Upload Pages artifact
- name: Setup Pages
uses: actions/configure-pages@v5

- name: Upload artifact
uses: actions/upload-pages-artifact@v3
with:
path: docs
Expand Down
4 changes: 4 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -46,3 +46,7 @@ yarn.lock

# Generated by scripts/build-swelancer-pages.ts (rebuild for local preview or rely on GitHub Actions)
docs/swelancer-tasks/tasks.json

# Generated by scripts/build-results-pages.ts (rebuild for local preview or rely on GitHub Actions)
docs/results/
docs/data/leaderboard.json
2 changes: 2 additions & 0 deletions docs/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,3 +5,5 @@
This directory exists for **GitHub Pages** output:

- **`swelancer-tasks/`** — built by `bun run build:swelancer-pages` (see root `README.md`). `tasks.json` may be gitignored when generated locally; CI can regenerate.
- **`index.html`** — benchmark results dashboard (tier ratings, historical runs, task breakdown).
- **`results/`** — per-agent result pages with OGP metadata, built by `bun run build:results-pages`. Generated files are gitignored; CI rebuilds on deploy.
Binary file added docs/assets/icons/aider.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/icons/claude.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/icons/cline.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/icons/codex.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/icons/copilot.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/icons/cursor.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/icons/devin.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/icons/gemini.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/icons/goose.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/icons/kimi.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/icons/opencode.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/icons/qwen.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/icons/windsurf.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Loading