feature: Malicious packages scanner [TAROT-3600] #175

kendrickcurtis · 2025-08-11T11:34:40Z

OpenSSF publishes a regularly updated list of third-party packages that contain malware. This PR adds a new Trivy rule, "Malicious packages detection", at the highest severity level.

The OpenSSF DB is 227 MB, about a third the size of Trivy's vuln DB (734 MB), so I would hope it does not add a vast extra burden on processing times. We will probably want to run malicious package detection both on commit and in the nightly SCA process, since packages can be retroactively designated as malicious.

Copilot

Pull Request Overview

This PR integrates OpenSSF's malicious packages database into Trivy to detect known malicious packages. This adds a new "Malicious packages detection" rule at the highest severity level to identify packages containing malware, typosquatting attacks, and dependency confusion attacks.

Implements OpenSSF scanner with OSV format parsing for malicious package detection
Adds new malicious_packages rule with critical severity level
Integrates scanning into the main tool execution flow
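
To make the OSV-based matching above more concrete, here is a minimal Go sketch. It is not the code from this PR: the type and function names (`osvRecord`, `parseOSV`, `isMalicious`) and the sample record are hypothetical, and only a small subset of the OSV schema is modelled. It assumes a package version is flagged when it appears in an entry's explicit `versions` list, or when a range starts at `introduced: "0"` with no upper bound. Ranges bounded by `fixed` or `last_affected` need ecosystem-aware version ordering and are deliberately left out.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// osvRecord mirrors only the OSV fields this sketch needs.
type osvRecord struct {
	ID       string        `json:"id"`
	Affected []osvAffected `json:"affected"`
}

type osvAffected struct {
	Package struct {
		Ecosystem string `json:"ecosystem"`
		Name      string `json:"name"`
	} `json:"package"`
	Versions []string   `json:"versions"`
	Ranges   []osvRange `json:"ranges"`
}

// Each OSV range event is an object with a single key such as
// "introduced", "fixed", "last_affected" or "limit".
type osvRange struct {
	Type   string              `json:"type"`
	Events []map[string]string `json:"events"`
}

// sample is a made-up record used only for illustration.
const sample = `{
  "id": "MAL-0000-0000",
  "affected": [
    {"package": {"ecosystem": "npm", "name": "example-typosquat"},
     "versions": ["1.0.1"]},
    {"package": {"ecosystem": "npm", "name": "example-all-bad"},
     "ranges": [{"type": "SEMVER", "events": [{"introduced": "0"}]}]}
  ]
}`

func parseOSV(data []byte) (osvRecord, error) {
	var rec osvRecord
	err := json.Unmarshal(data, &rec)
	return rec, err
}

// coversAllVersions reports whether a range starts at "0" and has no
// upper bound, which this sketch treats as "every version is affected".
func coversAllVersions(r osvRange) bool {
	fromZero := false
	for _, ev := range r.Events {
		if ev["introduced"] == "0" {
			fromZero = true
		}
		for _, bound := range []string{"fixed", "last_affected", "limit"} {
			if _, ok := ev[bound]; ok {
				return false // bounded ranges are out of scope here
			}
		}
	}
	return fromZero
}

// isMalicious checks one package version against one record.
func isMalicious(rec osvRecord, ecosystem, name, version string) bool {
	for _, a := range rec.Affected {
		if a.Package.Ecosystem != ecosystem || a.Package.Name != name {
			continue
		}
		for _, v := range a.Versions {
			if v == version {
				return true
			}
		}
		for _, r := range a.Ranges {
			if coversAllVersions(r) {
				return true
			}
		}
	}
	return false
}

func main() {
	rec, err := parseOSV([]byte(sample))
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	fmt.Println(rec.ID, isMalicious(rec, "npm", "example-typosquat", "1.0.1"))
}
```

Matching on the exact ecosystem and name strings is a simplification; a real scanner would likely normalise names per ecosystem before comparing.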

Reviewed Changes

Copilot reviewed 15 out of 15 changed files in this pull request and generated 5 comments.

File	Description
internal/tool/tool.go	Integrates OpenSSF scanner into main execution flow and adds new rule pattern
internal/tool/openssf_scanner.go	Core implementation of OpenSSF malicious packages scanner with OSV parsing
internal/docgen/rule.go	Adds rule definition for malicious packages detection
docs/patterns.json	Adds pattern configuration for malicious packages rule
docs/multiple-tests/pattern-malicious/*	Test files demonstrating malicious package detection
docs/description/*	Documentation files for the new pattern
Dockerfile	Copies OpenSSF cache data into container
.circleci/config.yml	Downloads OpenSSF database during CI build

internal/tool/openssf_scanner.go

afsmeira

There are still more fixes and improvements but I think I've left enough comments for a first pass.

We'll move to those fixes after that. And obviously there are still tests to write.

.codacy/cli.sh

.codacy/codacy.yaml

docs/description/description.json

docs/multiple-tests/pattern-malicious/src/javascript/package.json

docs/multiple-tests/pattern-vulnerability-high/results.xml

internal/tool/openssf_scanner.go

afsmeira

There are still comments to address. And unit tests to add.

docs/multiple-tests/pattern-malicious/src/javascript/package-lock.json

.codacy/cli.sh

.codacy/codacy.yaml

docs/patterns.json

scripts/build_openssf_index.py

Dockerfile

scripts/build_openssf_index.py

internal/tool/tool.go

internal/tool/openssf_scanner.go

internal/tool/malicious_packages_scanner.go

codacy-production

Pull Request Overview

Adds OpenSSF malicious-packages detection: CI steps to build the index, packaging the index into the image, a MaliciousPackagesScanner with tests, wiring into codacyTrivy, docs and a build script. Static analysis shows one existing Revive warning about exported New returning an unexported type; PR changes intentionally modify New signature to return (*codacyTrivy, error). Overall good coverage (lots of unit tests). Key risks: error handling around semver/ecosystem versions, potential performance/memory impacts loading a ~227MB DB at runtime (even gzipped), and a behavioral change in New() API.

About this PR

Loading the full OpenSSF index into memory (even gzipped) could increase container start time and memory. Consider measuring memory use and supporting a streaming/lookup-backed index or lazy loading, and add runtime telemetry or config to opt-out in constrained environments.
Medium risk | High confidence

This introduces a new exported constructor New(maliciousPackagesIndexPath string) that returns (*codacyTrivy, error) and may break callers that expected the previous parameterless New() returning a value. Ensure all call sites updated (CI/main binary updated in this PR) and consider a compat shim New() for backward compatibility if other consumers exist.
Medium risk | High confidence

Good unit test coverage for scanner logic. Add an integration test (or smoke test) that exercises building and loading the real index artifact or a realistic-sized sample to validate CI performance and failure modes.
Low risk | High confidence

internal/tool/malicious_packages_scanner.go

cmd/tool/main.go

scripts/build_openssf_index.py

internal/tool/tool.go

Dockerfile

internal/tool/malicious_packages_scanner.go

afsmeira · 2025-11-27T15:56:49Z

BTW, I think there is a risk of overlap between a vulnerable dependency and a malicious package. I've seen some malicious packages having a CVE in their metadata.

In those cases, an analysis would create two issues: one for the vulnerable dependency pattern and one for the malicious package pattern.

I figure this would be rare.

kendrickcurtis · 2025-11-27T20:24:02Z

> BTW, I think there is a risk of overlap between a vulnerable dependency and a malicious package. I've seen some malicious packages having a CVE in their metadata.
>
> In those cases, an analysis would create two issues: one for the vulnerable dependency pattern and one for the malicious package pattern.
>
> I figure this would be rare.

I'm fine with this. There are already multiple CVEs detected per line in package files.

kendrickcurtis requested a review from a team as a code owner August 11, 2025 11:34

alerizzo requested a review from Copilot August 13, 2025 14:09

Copilot AI reviewed Aug 13, 2025

afsmeira reviewed Aug 26, 2025

afsmeira reviewed Oct 9, 2025

kendrickcurtis and others added 21 commits November 26, 2025 10:29

openssf malicious packages integration

8ce1953

updated test to match new live CVE

74218c6

revised malicious package detection to prebuild an index nightly so a…

d314bc6

…s to accelerate scanning.

fixed build - tool wasn't scanning for package.json -- added test for…

b01f785

… package-lock.json also

merged main, added Dockerfile

ba074fc

resolved codacy warnings

c7e814b

fixed codacy cyclo issue

4ee5970

fixed AI nonsense

f84915d

fixed CICD -- necessary file wasn't being copied

e11d8a7

fixed another CICD issue

f41a50a

bugfix in tool.go to live with nil PURLs and fixed absent line number…

dfe1f8f

… reporting in openssf scanner

stopped copying unnecessary files

45da0ab

fixed test data

d2d5356

test fixes

a4efa1c

ignored codacy config

83b035a

review comments tackled

4082165

fixed stupid ai shit - npm ref in gradle file

649f440

fixed missing vuln

76cf210

Delete .codacy/cli.sh

1385e2b

Delete .codacy/codacy.yaml

1234196

clean: Assorted cleanup after rebase

438d8ef

afsmeira force-pushed the kpc-malware-scanner branch from 204371d to 438d8ef Compare November 26, 2025 11:40

afsmeira force-pushed the kpc-malware-scanner branch from 4f94427 to 0cfdc38 Compare November 26, 2025 19:12

clean: Address codacy comments

d6f2a33

afsmeira force-pushed the kpc-malware-scanner branch from 0cfdc38 to d6f2a33 Compare November 26, 2025 19:15

afsmeira added 2 commits November 27, 2025 11:48

tests: Add unit tests and fix faulty implementations

d6fb292

ci: Serialize steps to avoid problems when saving to workspace

df03b28

tests: Fix integration tests

b96b28d

afsmeira force-pushed the kpc-malware-scanner branch from 6a64afb to b96b28d Compare November 27, 2025 12:55

clean: Log when failing to open file when building index

dd6bd7c

afsmeira changed the title ~~openssf malicious packages integration~~ feature: Malicious packages scanner [TAROT-3600] Nov 27, 2025

heliocodacy previously approved these changes Nov 27, 2025

clean: Address AI review comments

852dd65

afsmeira dismissed heliocodacy’s stale review via 852dd65 November 27, 2025 15:00

afsmeira reviewed Nov 27, 2025

internal/tool/malicious_packages_scanner.go

feat: Support the last_affected field in range events

194cdf1

codacy-production bot reviewed Nov 27, 2025

afsmeira reviewed Nov 27, 2025

internal/tool/malicious_packages_scanner.go

internal/tool/malicious_packages_scanner.go

afsmeira approved these changes Nov 28, 2025

jorgebraz approved these changes Nov 28, 2025

afsmeira merged commit dcdf340 into master Nov 28, 2025
8 checks passed

afsmeira deleted the kpc-malware-scanner branch November 28, 2025 16:15
