Skip to content

Commit 643c1e9

Browse files
committed
ci: update actions to latest versions, bump Python to >=3.11
- actions/checkout v4 → v6 - actions/setup-python v5 → v6 - astral-sh/setup-uv v5 → v7 - actions/cache v4 → v5 - Drop Python 3.10 (onnxruntime 1.24.3 requires >=3.11) - Add Python 3.14 to CI matrix and classifiers
1 parent 062865f commit 643c1e9

4 files changed

Lines changed: 1952 additions & 1221 deletions

File tree

.github/workflows/tests.yml

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -10,17 +10,17 @@ jobs:
1010
strategy:
1111
fail-fast: true
1212
matrix:
13-
python-version: ['3.10', '3.11', '3.12', '3.13']
13+
python-version: ['3.11', '3.12', '3.13', '3.14']
1414
os: [ubuntu-latest, macos-latest, windows-latest]
1515
runs-on: ${{ matrix.os }}
1616
steps:
17-
- uses: actions/checkout@v4
18-
- uses: actions/setup-python@v5
17+
- uses: actions/checkout@v6
18+
- uses: actions/setup-python@v6
1919
with:
2020
python-version: ${{ matrix.python-version }}
2121

2222
- name: Install uv
23-
uses: astral-sh/setup-uv@v5
23+
uses: astral-sh/setup-uv@v7
2424
with:
2525
enable-cache: true
2626
cache-dependency-glob: "uv.lock"
@@ -48,7 +48,7 @@ jobs:
4848
print(f"hash={config_hash}", file=fh)
4949
5050
- name: Cache ONNX models
51-
uses: actions/cache@v4
51+
uses: actions/cache@v5
5252
with:
5353
path: ${{ steps.model-cache-dir.outputs.dir }}
5454
key: ${{ runner.os }}-onnx-models-${{ steps.model-cache-dir.outputs.hash }}

README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -277,10 +277,10 @@ Text Input → Text Chunking → Embedding Generation → Vector Aggregation →
277277
`iscc-sct` employs the following process:
278278

279279
1. Splits the text into overlaping chunks (using syntactically sensible breakpoints).
280-
2. Uses a pre-trained deep learning model for text embedding.
281-
3. Generates feature vectors capturing essential characteristics of the chunks.
282-
4. Aggregates these vectors and binarizes them to produce a Semantic Text-Code.
283-
5. Prefixes the binarized vector with the matching ISCC header, encodes it with base32, and adds the
280+
1. Uses a pre-trained deep learning model for text embedding.
281+
1. Generates feature vectors capturing essential characteristics of the chunks.
282+
1. Aggregates these vectors and binarizes them to produce a Semantic Text-Code.
283+
1. Prefixes the binarized vector with the matching ISCC header, encodes it with base32, and adds the
284284
"ISCC:" prefix.
285285

286286
This process ensures robustness to variations and translations, enabling cross-lingual matching

pyproject.toml

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ authors = [
77
]
88
license = {text = "Apache-2.0"}
99
readme = "README.md"
10-
requires-python = ">=3.10,<=3.13"
10+
requires-python = ">=3.11"
1111
keywords = ["iscc", "text similarity", "cross lingual", "semantic similarity"]
1212
classifiers = [
1313
"Development Status :: 4 - Beta",
@@ -16,10 +16,10 @@ classifiers = [
1616
"License :: OSI Approved :: Apache Software License",
1717
"Natural Language :: English",
1818
"Operating System :: OS Independent",
19-
"Programming Language :: Python :: 3.10",
2019
"Programming Language :: Python :: 3.11",
2120
"Programming Language :: Python :: 3.12",
2221
"Programming Language :: Python :: 3.13",
22+
"Programming Language :: Python :: 3.14",
2323
"Topic :: Text Processing",
2424
"Topic :: Text Processing :: General",
2525
"Topic :: Text Processing :: Indexing",
@@ -69,8 +69,9 @@ dev = [
6969
"mdformat-gfm-alerts",
7070
"mdformat-frontmatter",
7171
"gradio",
72-
"radon>=6.0.1",
73-
"xenon>=0.9.3",
72+
"radon",
73+
"xenon",
74+
"datasets",
7475
]
7576

7677
[project.urls]

0 commit comments

Comments
 (0)