fix: disregard RPM module build number in version comparison #2375

willmurphyscode · 2025-01-15T00:35:00Z

Previously, different build numbers of the same RPM version release string were compared lexicographically, leading to incorrect comparisons between RedHat and RedHat clone RPMs, resulting in FPs or FNs on centOS.

wagoodman · 2025-02-03T20:51:12Z

grype/version/rpm_version.go

+	aParts := strings.Split(a, "+")
+	bParts := strings.Split(b, "+")
+	if len(aParts) > 2 {
+		aParts = aParts[:len(aParts)-2]
+	}
+	if len(bParts) > 2 {
+		bParts = bParts[:len(bParts)-2]
+	}
+	trimmedA := strings.Join(aParts, "+")
+	trimmedB := strings.Join(bParts, "+")


this takes the presumption that the last two + should be ignored, but in reality any number of +s could be supplied... shouldn't we really be ignoring anything after the first +?

Are there arbitrarily many + separators? Is there like a spec to follow or something here @wagoodman?

I see some version strings with 4 + in them in the grype database (apparently Oracle Linux RPMs). Other vendors seem only to ever have 3 + in the RPM.

@wagoodman it doesn't seem to work to remove everything after the first +, because we are sometimes comparing versions with a different number of +:

This query run in Grype DB:

select id, fixed_in_versions from vulnerability where version_format = "rpm" and fixed_in_versions like "%+%" and fixed_in_versions like "%module+el%" and namespace like "%red%" order by random() limit 10;

id fixed_in_versions

CVE-2019-2808 ["0:8.0.17-3.module+el8.0.0+3898+e09bb8de"]

CVE-2020-14350 ["0:9.6.20-1.module+el8.3.0+8938+7f0e88b6"]

CVE-2024-21051 ["0:8.0.36-1.module+el8.9.0+21207+6c20cb3d"]

CVE-2020-2898 ["0:8.0.21-1.module+el8.2.0+7855+47abd494"]

CVE-2019-9516 ["1:10.16.3-2.module+el8.0.0+4214+49953fda"]

CVE-2019-2481 ["0:8.0.17-3.module+el8.0.0+3898+e09bb8de"]

CVE-2021-23214 ["0:12.9-1.module+el8.5.0+13373+4554acc4"]

CVE-2022-21638 ["0:8.0.30-1.module+el8.6.0+16523+5cb0e868"]

CVE-2021-2305 ["0:8.0.26-1.module+el8.4.0+12359+b8928c02"]

CVE-2024-21896 ["1:20.11.1-1.module+el9.3.0+21385+bac43d5a"]

produces plenty of rows. But then

❯ syft -q -o json \ docker.io/anchore/test_images@sha256:524ff8a75f21fd886ec7ed82387766df386671e8b77e898d05786118d5b7880b | \ jq -r '.artifacts[] | .purl' | rg -e module_el -e module+el | \ python3 -c "import sys, urllib.parse; from packageurl import PackageURL; print('\n'.join(PackageURL.from_string(line.strip()).version for line in sys.stdin))" 10.3.28-1.module_el8.3.0+757+d382997d 10.3.28-1.module_el8.3.0+757+d382997d 2.066-4.module_el8.4.0+517+be1595ff 7.4.30-1.module_el8.7.0+1190+d11b935a 7.4.30-1.module_el8.7.0+1190+d11b935a 7.4.30-1.module_el8.7.0+1190+d11b935a 7.4.30-1.module_el8.7.0+1190+d11b935a 7.4.30-1.module_el8.7.0+1190+d11b935a 7.4.30-1.module_el8.7.0+1190+d11b935a 7.4.30-1.module_el8.7.0+1190+d11b935a 7.4.30-1.module_el8.7.0+1190+d11b935a 7.4.30-1.module_el8.7.0+1190+d11b935a 12.11-2.module_el8.6.0+1153+eb826827

Notice that in grype-db we see the substring module+el8.3.0 and in Syft PURLs we see the SBOM module_el8.3.0 (_ not +), so if we take the first plus, these compare incorrectly because the el8.3.0 is dropped from one but not the other. (edit: better shell oneliner, but no change in output)

Good to know, but I don't think we can always assume we can operate on the last two + fields. What about parsing known fields from the start of the string through [_+]el\d and dropping everything after the first + found from that point on?

Previously, different build numbers of the same RPM version release string were compared lexicographically, leading to incorrect comparisons between RedHat and RedHat clone RPMs, resulting in FPs or FNs on centOS. Signed-off-by: Will Murphy <[email protected]>

Also, add more test cases from different distros. Signed-off-by: Will Murphy <[email protected]>

Signed-off-by: Will Murphy <[email protected]>

willmurphyscode · 2025-02-14T16:35:44Z

Based on some experimentation, this change is insufficient as is:

Consider two version numbers:

[root@19ac959dacf7 /]# rpmdev-vercmp 3:10.3.28-1.module_el8.3.0+757+d382997d 3:10.3.28-1.module+el8.3.0+10472+7adc332a
3:10.3.28-1.module_el8.3.0+757+d382997d < 3:10.3.28-1.module+el8.3.0+10472+7adc332a

This is correct! build 757 is earlier than build 10472. However, if we're comparing Centos8 artifacts against RHEL8 data (which we do), than it is incorrect, because the build numbers are not comparable.

This change, as is, would make us worse at comparing RHEL8 packages to RHEL8 vuln data, in exchange for making us (probably) better at comparing Centos8 packages to RHEL8 data. Instead, we should add logic to detect whether we're comparing across similar OSes, and use fuzzier version comparison only then.

wagoodman reviewed Feb 3, 2025

View reviewed changes

willmurphyscode added 3 commits February 13, 2025 13:36

fix: account for N plus signs in rpm release string

ca4a246

Also, add more test cases from different distros. Signed-off-by: Will Murphy <[email protected]>

chore: bump vulnerability match labels

736220a

Signed-off-by: Will Murphy <[email protected]>

willmurphyscode force-pushed the fix-rpm-release-without-build-no branch from 12dcb51 to 736220a Compare February 13, 2025 18:37

wagoodman approved these changes Feb 13, 2025

View reviewed changes

willmurphyscode marked this pull request as draft February 14, 2025 16:32

willmurphyscode mentioned this pull request Feb 17, 2025

Difficulties in cross-clone version comparison #2451

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: disregard RPM module build number in version comparison #2375

fix: disregard RPM module build number in version comparison #2375

willmurphyscode commented Jan 15, 2025

wagoodman Feb 3, 2025

willmurphyscode Feb 11, 2025

willmurphyscode Feb 11, 2025

willmurphyscode Feb 12, 2025 •

edited

Loading

wagoodman Feb 12, 2025

willmurphyscode commented Feb 14, 2025

id	fixed_in_versions
CVE-2019-2808	["0:8.0.17-3.module+el8.0.0+3898+e09bb8de"]
CVE-2020-14350	["0:9.6.20-1.module+el8.3.0+8938+7f0e88b6"]
CVE-2024-21051	["0:8.0.36-1.module+el8.9.0+21207+6c20cb3d"]
CVE-2020-2898	["0:8.0.21-1.module+el8.2.0+7855+47abd494"]
CVE-2019-9516	["1:10.16.3-2.module+el8.0.0+4214+49953fda"]
CVE-2019-2481	["0:8.0.17-3.module+el8.0.0+3898+e09bb8de"]
CVE-2021-23214	["0:12.9-1.module+el8.5.0+13373+4554acc4"]
CVE-2022-21638	["0:8.0.30-1.module+el8.6.0+16523+5cb0e868"]
CVE-2021-2305	["0:8.0.26-1.module+el8.4.0+12359+b8928c02"]
CVE-2024-21896	["1:20.11.1-1.module+el9.3.0+21385+bac43d5a"]

fix: disregard RPM module build number in version comparison #2375

Are you sure you want to change the base?

fix: disregard RPM module build number in version comparison #2375

Conversation

willmurphyscode commented Jan 15, 2025

wagoodman Feb 3, 2025

Choose a reason for hiding this comment

willmurphyscode Feb 11, 2025

Choose a reason for hiding this comment

willmurphyscode Feb 11, 2025

Choose a reason for hiding this comment

willmurphyscode Feb 12, 2025 • edited Loading

Choose a reason for hiding this comment

wagoodman Feb 12, 2025

Choose a reason for hiding this comment

willmurphyscode commented Feb 14, 2025

willmurphyscode Feb 12, 2025 •

edited

Loading