ROX-32422: Populate vulnerability datasource #18416

dcaravel · 2026-01-09T02:58:30Z

Description

Adds a new DataSource field to storage.EmbeddedVulnerability. This datasource will be used to uniquely associate data with CVEs in Central.

The initial use cases will track CVE fixed dates.

The DataSource will only be populated by Scanner V4 and only for vulnerabilities NOT sourced from Red Hat (because more fields are needed to uniquely identify a Red Hat vulnerability then what is exposed by Scanner V4).

The DataSource field will contain the concatenation of the updater name from Scanner V4/ClairCore + the package os/version (if applicable), examples:

debian-bookworm-updater::debian:12
alpine-main-v3.23-updater::alpine:3.23
osv/go
osv/maven

This is not an ideal solution and adds an uncomfortable reliance on the updater names and Central. To help with this:

unit tests and e2e tests have been created that check for changes to the updaters and will fail if attention is needed.
The field will be treated as opaque, it will not be serialized in API responses and will be cleared from storage.EmbeddedVulnerability at the datastore layer as part of ROX-30641
- This PR added json:"-" to the new field which will omit it from REST API responses, however it is still populated in gRPC reponses until cleared at the datastore level.

The scanner/e2etests were not starting prior this PR - the restart check was removed because the scanner pod was always restarting at least once before bundles were loaded. After removing the restart check the e2etest now run, however the existing TestImage fails due to data differences - this is unrelated to the new TestUpdaterNames that runs successfully (see here).

User-facing documentation

CHANGELOG.md is updated OR update is not needed
documentation PR is created and is linked above OR is not needed

Testing and quality

the change is production ready: the change is GA, or otherwise the functionality is gated by a feature flag
CI results are inspected

Automated testing

added unit tests
added e2e tests
modified existing tests

There is slight overlap between the added unit and e2e tests that check to see if updaters have been changed. The unit tests will cover the obvious changes to updater sets, but cannot validate the actual updater names since those are determined at run time. The actual updater names are evaluated as part of the e2e tests.

How I validated my change

Because the dataSource field is not filtered yet for gRPC requests - I used roxctl (gRPC) to verify that the field is being populated as expected, the below jq queries show the counts / values of .scan.components[].vulns[].datasource for various images

$ rctl image scan --image=datadog/agent -f 2>/dev/null | jq '.scan.components[] | select(.vulns | length > 0) | .vulns[] | [.datasource] | @tsv' -r | sort | uniq -c

   4 osv/go
   2 osv/pypi
  38 ubuntu/updater/noble::ubuntu:24.04

$ rctl image scan --image=node -f 2>/dev/null | jq '.scan.components[] | select(.vulns | length > 0) | .vulns[] | [.datasource] | @tsv' -r | sort | uniq -c

1947 debian/updater::debian:12
   3 osv/npm

$ rctl image scan --image=nginx -f 2>/dev/null | jq '.scan.components[] | select(.vulns | length > 0) | .vulns[] | [.datasource] | @tsv' -r | sort | uniq -c

 122 debian/updater::debian:13

$ rctl image scan --image=grafana/grafana -f 2>/dev/null | jq '.scan.components[] | select(.vulns | length > 0) | .vulns[] | [.datasource] | @tsv' -r | sort | uniq -c

   8 osv/go

$ rctl image scan --image=alpine:3.19.7 -f 2>/dev/null | jq '.scan.components[] | select(.vulns | length > 0) | .vulns[] | [.datasource] | @tsv' -r | sort | uniq -c

  10 alpine-main-v3.19-updater::alpine:3.19

Additionally, ran the new e2e tests against an already running instances of StackRox

$ go test -run ^TestUpdaterNames$ github.com/stackrox/rox/scanner/e2etests -v
=== RUN   TestUpdaterNames
    updater_names_test.go:91: Querying updaters
{"level":"debug","count":105,"time":"2026-01-09T17:13:34-06:00","message":"found updaters"}
    updater_names_test.go:96: Found 105 updaters in database
--- PASS: TestUpdaterNames (0.57s)
PASS
ok  	github.com/stackrox/rox/scanner/e2etests	1.971s

For good measure, modified one of the patterns in the test to prove failure (removed the x from rhel-vex:

$ go test -run ^TestUpdaterNames$ github.com/stackrox/rox/scanner/e2etests -v
=== RUN   TestUpdaterNames
    updater_names_test.go:91: Querying updaters
{"level":"debug","count":105,"time":"2026-01-09T17:14:46-06:00","message":"found updaters"}
    updater_names_test.go:96: Found 105 updaters in database
    updater_names_test.go:114: 
        	Error Trace:	/Users/dcaravel/dev/stackrox/stackrox-add-datasource-to-vulns/scanner/e2etests/updater_names_test.go:114
        	Error:      	Should be true
        	Test:       	TestUpdaterNames
        	Messages:   	Unknown updater name: "rhel-vex"
--- FAIL: TestUpdaterNames (0.57s)
FAIL
FAIL	github.com/stackrox/rox/scanner/e2etests	1.956s
FAIL

openshift-ci · 2026-01-09T02:58:34Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

rhacs-bot · 2026-01-09T03:34:19Z

Images are ready for the commit at a0a88c2.

To use with deploy scripts, first export MAIN_IMAGE_TAG=4.10.x-743-ga0a88c2653.

codecov · 2026-01-09T03:46:38Z

Codecov Report

❌ Patch coverage is 97.50000% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 48.86%. Comparing base (b002714) to head (a0a88c2).

Files with missing lines	Patch %	Lines
scanner/updater/export.go	0.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #18416      +/-   ##
==========================================
- Coverage   48.90%   48.86%   -0.05%     
==========================================
  Files        2629     2631       +2     
  Lines      197917   198124     +207     
==========================================
+ Hits        96789    96805      +16     
- Misses      93747    93936     +189     
- Partials     7381     7383       +2

Flag	Coverage Δ
go-unit-tests	`48.86% <97.50%> (-0.05%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

sourcery-ai

Hey - I've found 1 issue, and left some high level feedback:

In envOS, calling distributions(report) without checking for a nil report will panic in cases the tests already model (e.g., non-nil env with nil report); consider adding a nil guard so the helper safely returns an empty string when report is nil.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- In `envOS`, calling `distributions(report)` without checking for a nil `report` will panic in cases the tests already model (e.g., non-nil env with nil report); consider adding a nil guard so the helper safely returns an empty string when `report` is nil.

## Individual Comments

### Comment 1
<location> `pkg/scanners/scannerv4/convert.go:98-107` </location>
<code_context>

+// envOS will return the operating system name and version associated with an
+// environment.
+func envOS(env *v4.Environment, report *v4.VulnerabilityReport) string {
+	if env == nil {
+		return ""
+	}
+
+	dists := distributions(report)
+	dist, ok := dists[env.GetDistributionId()]
+	if !ok || dist.GetDid() == "" || dist.GetVersionId() == "" {
+		return ""
+	}
+
+	return dist.GetDid() + ":" + dist.GetVersionId()
+}
+
</code_context>

<issue_to_address>
**issue (bug_risk):** envOS assumes report is non-nil, which can panic if used defensively elsewhere

`envOS` dereferences `report` via `distributions(report)` without a nil check. If any caller passes a nil report (e.g., future code or tests), this will panic. Either add a nil guard for `report` in `envOS`, or clearly enforce/document that `report` must be non-nil at all call sites.
</issue_to_address>

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

pkg/scanners/scannerv4/convert.go

sourcery-ai

Hey - I've left some high level feedback:

envOS calls distributions(report) without guarding against a nil report, but TestEnvOS includes cases with a non-nil env and nil report; either envOS or distributions should defensively handle a nil report to avoid a potential panic and align with the test expectations.
vulnDataSource relies on a simple strings.HasPrefix(updater, "rhel") check to exclude Red Hat–sourced vulnerabilities; consider centralizing or tightening this Red Hat source detection (e.g., matching specific updater set namespaces) so that future updater naming changes don’t accidentally bypass the exclusion.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- envOS calls distributions(report) without guarding against a nil report, but TestEnvOS includes cases with a non-nil env and nil report; either envOS or distributions should defensively handle a nil report to avoid a potential panic and align with the test expectations.
- vulnDataSource relies on a simple strings.HasPrefix(updater, "rhel") check to exclude Red Hat–sourced vulnerabilities; consider centralizing or tightening this Red Hat source detection (e.g., matching specific updater set namespaces) so that future updater naming changes don’t accidentally bypass the exclusion.

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

dcaravel · 2026-01-12T04:15:23Z

/retest

openshift-ci · 2026-01-12T06:26:27Z

@dcaravel: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
ci/prow/gke-ui-e2e-tests	`a0a88c2`	link	true	`/test gke-ui-e2e-tests`
ci/prow/ocp-4-20-ui-e2e-tests	`a0a88c2`	link	false	`/test ocp-4-20-ui-e2e-tests`

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

openshift-ci bot added the do-not-merge/work-in-progress label Jan 9, 2026

github-actions bot added the area/scanner label Jan 9, 2026

Base automatically changed from dc/add-updater-to-vulns to master January 9, 2026 03:06

dcaravel force-pushed the dc/add-datasource-to-vulns branch from f7272fc to f4a8063 Compare January 9, 2026 03:08

dcaravel force-pushed the dc/add-datasource-to-vulns branch 2 times, most recently from b5e3ec4 to b48a82f Compare January 9, 2026 15:42

dcaravel added the scanner-functional-tests label Jan 9, 2026

dcaravel force-pushed the dc/add-datasource-to-vulns branch from b48a82f to f6ed3fa Compare January 9, 2026 22:06

github-actions bot added area/ci ai-review labels Jan 9, 2026

sourcery-ai bot reviewed Jan 9, 2026

View reviewed changes

pkg/scanners/scannerv4/convert.go Show resolved Hide resolved

dcaravel force-pushed the dc/add-datasource-to-vulns branch from f6ed3fa to 027f67e Compare January 9, 2026 23:44

dcaravel marked this pull request as ready for review January 10, 2026 01:23

dcaravel requested review from a team as code owners January 10, 2026 01:23

openshift-ci bot removed the do-not-merge/work-in-progress label Jan 10, 2026

sourcery-ai bot reviewed Jan 10, 2026

View reviewed changes

init

a0a88c2

dcaravel force-pushed the dc/add-datasource-to-vulns branch from 027f67e to a0a88c2 Compare January 11, 2026 18:54

dcaravel added the auto-retest PRs with this label will be automatically retested if prow checks fails label Jan 12, 2026

dcaravel removed the auto-retest PRs with this label will be automatically retested if prow checks fails label Jan 12, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ROX-32422: Populate vulnerability datasource #18416

ROX-32422: Populate vulnerability datasource #18416

dcaravel commented Jan 9, 2026 •

edited

Loading

Uh oh!

openshift-ci bot commented Jan 9, 2026

Uh oh!

rhacs-bot commented Jan 9, 2026 •

edited

Loading

Uh oh!

codecov bot commented Jan 9, 2026 •

edited

Loading

Uh oh!

sourcery-ai bot left a comment

Uh oh!

Uh oh!

sourcery-ai bot left a comment

Uh oh!

dcaravel commented Jan 12, 2026

Uh oh!

openshift-ci bot commented Jan 12, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ROX-32422: Populate vulnerability datasource #18416

Are you sure you want to change the base?

ROX-32422: Populate vulnerability datasource #18416

Conversation

dcaravel commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

User-facing documentation

Testing and quality

Automated testing

How I validated my change

Uh oh!

openshift-ci bot commented Jan 9, 2026

Uh oh!

rhacs-bot commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

dcaravel commented Jan 12, 2026

Uh oh!

openshift-ci bot commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

dcaravel commented Jan 9, 2026 •

edited

Loading

rhacs-bot commented Jan 9, 2026 •

edited

Loading

codecov bot commented Jan 9, 2026 •

edited

Loading

openshift-ci bot commented Jan 12, 2026 •

edited

Loading