Added Adler32 to System.IO.Hashing. by AraHaan · Pull Request #123601 · dotnet/runtime

AraHaan · 2026-01-25T18:35:08Z

Fixes #90191.

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs

src/libraries/System.IO.Hashing/tests/Adler32Tests.cs

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs

src/libraries/System.IO.Hashing/ref/System.IO.Hashing.cs

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.Vectorized.cs

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.Avx2Sse.cs

AraHaan · 2026-02-01T19:45:19Z

Let me know if these changes look good as is or if any more changes are needed.

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs

stephentoub · 2026-02-11T19:53:45Z

🤖 Copilot Code Review — PR #123601

Holistic Assessment

Motivation: The API is approved (issue #90191 has api-approved label). Adler-32 is a legitimate checksum algorithm specified in RFC 1950, commonly used in zlib. Adding it to System.IO.Hashing fills a gap in the library.

Approach: The implementation correctly follows the Adler-32 algorithm per RFC 1950: initial state of 1, modulo 65521 (largest prime < 65536), and big-endian output. The API shape matches the approved proposal and follows patterns established by sibling types (Crc32, Crc64, XxHash*).

Summary: ⚠️ Needs Human Review. The algorithm implementation is correct, and the PR has addressed key feedback from reviewers (empty buffer handling, endianness, bounds check optimization). However, several items warrant human attention: (1) test vector provenance needs confirmation, (2) the scalar-only implementation may be acceptable for initial merge with vectorization deferred, but this is a policy decision.

Detailed Findings

✅ Algorithm Correctness — Verified correct per RFC 1950

The core algorithm is correctly implemented:

Initial state s1=1, s2=0 (combined as 1u)
Uses BASE=65521 (largest prime < 65536)
Uses NMax=5552 to safely defer modulo operations without 32-bit overflow
Accumulates s1 += b; s2 += s1; per byte
Returns (s2 << 16) | s1 — correct
Uses WriteUInt32BigEndian — matches RFC 1950's "most-significant-byte first (network) order" requirement

(Flagged by: GPT-5.2, Gemini 3 Pro, Claude Opus — consensus)

✅ API Consistency — Matches approved API and sibling type patterns

Correctly inherits from NonCryptographicHashAlgorithm
[CLSCompliant(false)] on uint return methods matches Crc32/XxHash32
Clone(), Hash(), TryHash(), HashToUInt32() methods follow established patterns
Ref assembly places Adler32 before Crc32 alphabetically — correct

✅ Empty Input Handling — Fixed

The earlier concern about Update returning 1u for empty buffers has been addressed. The code now correctly returns the current adler state unchanged when buf.IsEmpty, and the base test framework includes AppendingEmptyHasNoEffect which validates this behavior.

⚠️ Test Vectors — Need Oracle Validation

A reviewer asked: "Where did these hashes originate from? Do we have an oracle we can use to validate them?"

The test vectors appear correct (I verified "123456789" → 0x091E01DE matches RFC 1950's example), but the PR should confirm:

The test vectors were generated using an authoritative source (zlib, Python's zlib.adler32, or similar)
Consider adding a P/Invoke test to zlib for continuous validation, similar to how other crypto tests work

(Flagged by: GPT-5.2)

⚠️ Test Coverage — Consider NMax Boundary Tests

Current tests exercise basic correctness but don't explicitly test the NMax (5552-byte) chunking boundary. Consider adding test vectors with lengths at/around NMax-1, NMax, NMax+1, and 2*NMax to ensure the modulo reduction and chunking logic can't regress.

(Flagged by: GPT-5.2)

💡 Vectorization Deferred — Acceptable for Initial Merge

The implementation is scalar-only, while sibling types like Crc32 have vectorized paths. The PR comments indicate vectorization is planned for a follow-up PR. This is a reasonable approach — merge the correct scalar implementation with comprehensive tests first, then optimize in a subsequent PR.

(Flagged by: GPT-5.2, Gemini 3 Pro — consensus)

💡 Test Comment Cleanup

The test file includes comments referencing "vector optimizations" (lines 52, 58, 63) but the implementation is scalar-only. Consider updating these comments to reference "chunking (NMax) coverage" instead, or leave them as-is if vectorization is imminent.

💡 XML Documentation Style

The implementation has XML documentation, but some methods use slightly different formatting than Crc32.cs (e.g., /// <para> vs /// <para>, multi-line vs compact). This is minor but could be harmonized for consistency.

Summary

Category	Finding
✅ Correct	Algorithm implementation, API shape, empty input handling, endianness
⚠️ Attention	Test vector oracle confirmation, NMax boundary test coverage
💡 Optional	Vectorization (deferred OK), test comments, doc formatting

Verdict: The code looks correct and follows library patterns. Recommend merge after confirming test vectors were generated from an authoritative source. Vectorization can follow in a separate PR as planned.

This review was generated using multi-model analysis (GPT-5.2, Gemini 3 Pro, Claude Opus 4.5). Findings flagged by multiple models are noted.

Fixes dotnet#90191. Signed-off-by: AraHaan <[email protected]>

Co-authored-by: Jeremy Barton <[email protected]>

Also another attempt at fixing the expected hash values as well.

Signed-off-by: AraHaan <[email protected]>

Co-authored-by: Kevin Jones <[email protected]>

Added another remarks entry.

…c to UpdateScalar(). Also added optional spot where ARM intrinsics can be used in UpdateScalar if vectorization is not possible.

…ted.

Signed-off-by: AraHaan <[email protected]>

github-actions bot added the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label Jan 25, 2026

dotnet-policy-service bot added the community-contribution Indicates that the PR has been added by a community member label Jan 25, 2026

This comment was marked as outdated.

Sign in to view

akoeplinger added area-System.IO.Hashing and removed needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners labels Jan 25, 2026

bartonjs reviewed Jan 25, 2026

View reviewed changes

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs Outdated Show resolved Hide resolved

src/libraries/System.IO.Hashing/tests/Adler32Tests.cs Outdated Show resolved Hide resolved

This comment was marked as outdated.

Sign in to view

stephentoub reviewed Jan 26, 2026

View reviewed changes

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs Show resolved Hide resolved

huoyaoyuan reviewed Jan 26, 2026

View reviewed changes

src/libraries/System.IO.Hashing/ref/System.IO.Hashing.cs Outdated Show resolved Hide resolved

vcsjones reviewed Jan 26, 2026

View reviewed changes

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs Outdated Show resolved Hide resolved

vcsjones reviewed Jan 26, 2026

View reviewed changes

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs Outdated Show resolved Hide resolved

Copilot AI mentioned this pull request Jan 26, 2026

Add test verifying empty data appends have no effect on hash results #123647

Merged

AraHaan marked this pull request as ready for review January 27, 2026 15:40

vcsjones reviewed Jan 27, 2026

View reviewed changes

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs Outdated Show resolved Hide resolved

vcsjones reviewed Jan 27, 2026

View reviewed changes

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs Outdated Show resolved Hide resolved

JimBobSquarePants reviewed Jan 28, 2026

View reviewed changes

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs Show resolved Hide resolved

AraHaan commented Jan 28, 2026

View reviewed changes

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.Vectorized.cs Outdated Show resolved Hide resolved

build-analysis bot mentioned this pull request Jan 28, 2026

Unable to read data from the transport connection: An existing connection was forcibly closed by the remote host. dotnet/dnceng#5922

Open

3 tasks

vcsjones reviewed Jan 29, 2026

View reviewed changes

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.Avx2Sse.cs Outdated Show resolved Hide resolved

This was referenced Jan 30, 2026

iOS.Device test WorkItemExecutions #122874

Open

Unable to pull image from mcr.microsoft.com #117164

Open

XHarness package install failure on iOS due to devicectl NSPOSIXErrorDomain error 49 #123796

Open

AraHaan mentioned this pull request Feb 6, 2026

Tests failing on tvos with "The app 'net.dot.System.Runtime.Tests' terminated with signal 11" #124072

Open

stephentoub reviewed Feb 10, 2026

View reviewed changes

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs Outdated Show resolved Hide resolved

stephentoub reviewed Feb 10, 2026

View reviewed changes

src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs Outdated Show resolved Hide resolved

AraHaan and others added 19 commits February 12, 2026 17:22

Added Adler32 to System.IO.Hashing.

61a7dff

Fixes dotnet#90191. Signed-off-by: AraHaan <[email protected]>

Update Adler32.cs

1cb342c

Co-authored-by: Jeremy Barton <[email protected]>

Removed Residue test cases from Adler32Tests.

d593e85

Also another attempt at fixing the expected hash values as well.

Fixed empty hash value tests.

7a45ac7

Signed-off-by: AraHaan <[email protected]>

Optimized away needless bounds checking in each access to buf.

99c187d

Signed-off-by: AraHaan <[email protected]>

Moved Adler32 up in the ref source.

fa7f6a0

Signed-off-by: AraHaan <[email protected]>

Fix Adler32.Update to return current adler when empty

e541c89

Moved Base and NMax into locals inside of Update().

6243013

Signed-off-by: AraHaan <[email protected]>

Update src/libraries/System.IO.Hashing/src/System/IO/Hashing/Adler32.cs

8a0acc7

Co-authored-by: Kevin Jones <[email protected]>

Fixed writing LittleEndian -> BigEndian

422642d

Added another remarks entry.

Fixed tests.

b500939

Added method for vectorized adler32, and moved the normal adler32 cal…

5bf6d94

…c to UpdateScalar(). Also added optional spot where ARM intrinsics can be used in UpdateScalar if vectorization is not possible.

Added functions for Avx2 and Sse Scalar implementations.

b67347d

Commented out the optimization paths until they are properly implemen…

b5c5126

…ted.

Removed commented out code.

2a115c8

Signed-off-by: AraHaan <[email protected]>

Removed Adler32.Vectorized.cs as it is not used at the moment.

e5b1596

Signed-off-by: AraHaan <[email protected]>

Apply suggestion from @stephentoub

b32c63f

Apply suggestion from @stephentoub

0b5b56c

Add additional tests for larger inputs

55235a4

stephentoub force-pushed the add-adler32-to-system-io-hashing branch from f776e19 to 55235a4 Compare February 12, 2026 23:18

stephentoub approved these changes Feb 12, 2026

View reviewed changes

stephentoub enabled auto-merge (squash) February 12, 2026 23:19

bartonjs approved these changes Feb 12, 2026

View reviewed changes

stephentoub merged commit fd26ea9 into dotnet:main Feb 13, 2026
88 of 90 checks passed

This was referenced Feb 13, 2026

[android] Android.Device_Emulator.JIT.Test failing on emulators with CoreCLR #112633

Open

[Android][CoreCLR] System.Security.Cryptography.Tests killed by lowmemorykiller #118603

Open

CryptographicException in NonPowerOfTwoKeySizeOaepRoundtrip test #120606

Open

dotnet-maestro bot mentioned this pull request Feb 14, 2026

[main] Source code updates from dotnet/runtime dotnet/dotnet#4873

Open

stephentoub mentioned this pull request Feb 14, 2026

Vectorize Adler32 #124409

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added Adler32 to System.IO.Hashing.#123601

Added Adler32 to System.IO.Hashing.#123601
stephentoub merged 19 commits intodotnet:mainfrom
AraHaan:add-adler32-to-system-io-hashing

AraHaan commented Jan 25, 2026

Uh oh!

This comment was marked as outdated.

Uh oh!

Uh oh!

This comment was marked as outdated.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AraHaan commented Feb 1, 2026

Uh oh!

Uh oh!

Uh oh!

stephentoub commented Feb 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Conversation

AraHaan commented Jan 25, 2026

Uh oh!

This comment was marked as outdated.

Uh oh!

Uh oh!

This comment was marked as outdated.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AraHaan commented Feb 1, 2026

Uh oh!

Uh oh!

Uh oh!

stephentoub commented Feb 11, 2026

🤖 Copilot Code Review — PR #123601

Holistic Assessment

Detailed Findings

✅ Algorithm Correctness — Verified correct per RFC 1950

✅ API Consistency — Matches approved API and sibling type patterns

✅ Empty Input Handling — Fixed

⚠️ Test Vectors — Need Oracle Validation

⚠️ Test Coverage — Consider NMax Boundary Tests

💡 Vectorization Deferred — Acceptable for Initial Merge

💡 Test Comment Cleanup

💡 XML Documentation Style

Summary

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants