Add differential fuzzing against wasmi (a Wasm interpreter). #2453

cfallin · 2020-11-25T23:35:52Z

This PR adds a new fuzz target, differential_wasmi, that runs a
Cranelift-based Wasm backend alongside a simple third-party Wasm
interpeter crate (wasmi). The fuzzing runs the first function in a
given module to completion on each side, and then diffs the return value
and linear memory contents.

This strategy should provide end-to-end coverage including both the Wasm
translation to CLIF (which has seen some subtle and scary bugs at
times), the lowering from CLIF to VCode, the register allocation, and
the final code emission.

github-actions · 2020-11-25T23:48:43Z

Subscribe to Label Action

cc @fitzgen

This issue or pull request has been labeled: "fuzzing"

Thus the following users have been cc'd because of the following labels:

fitzgen: fuzzing

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.

Learn more.

alexcrichton

Nice!

crates/fuzzing/src/oracles.rs

fitzgen

Looks good, thanks @cfallin!

Mostly just echoing what Alex had to say, but a couple inline comments nonetheless.

crates/fuzzing/src/oracles.rs

cfallin · 2020-12-02T00:32:13Z

Updated, thanks!

It turns out that wasmi doesn't canonicalize NaNs (wasmi-labs/wasmi#19) so I've disabled fuzzing of modules with FP ops for now. If it seems worthwhile, we can add a Config option to wasm-smith for that too, but fuzzing seems to be getting coverage pretty efficiently right now as I watch it run.

This fails the "publish" check because it's using a crate-source patch in the root Cargo.toml for wasm-smith; @fitzgen, would you mind publishing a new version to crates.io?

fitzgen · 2020-12-02T00:41:03Z

This fails the "publish" check because it's using a crate-source patch in the root Cargo.toml for wasm-smith; @fitzgen, would you mind publishing a new version to crates.io?

Sure thing.

fitzgen · 2020-12-02T00:43:25Z

Done!

cfallin · 2020-12-02T00:48:55Z

Thanks! Updated.

I also just added an experimental_x64 feature flag to the fuzz crate so that we can fuzz the new backend.

This issue was found while fuzzing the new backend (bytecodealliance#2453); I suspect that it arises with the new backend because we can sink instructions (e.g. loads or extends) in more interesting ways than before, but I'm not entirely sure. Test coverage will be via the fuzz corpus once bytecodealliance#2453 lands.

github-actions · 2020-12-02T01:52:20Z

Subscribe to Label Action

cc @peterhuene

This issue or pull request has been labeled: "wasmtime:api"

Thus the following users have been cc'd because of the following labels:

peterhuene: wasmtime:api

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.

Learn more.

A dynamic heap address computation may create up to two conditional branches: the usual bounds-check, but also (in some cases) an offset-addition overflow check. The x64 backend had reversed the condition code for this check, resulting in an always-trapping execution for a valid offset. I'm somewhat surprised this has existed so long, but I suppose the particular conditions (large offset, small offset guard, dynamic heap) have been somewhat rare in our testing so far. Found via fuzzing in bytecodealliance#2453.

- Sort by generated-code offset to maintain invariant and avoid gimli panic. - Fix srcloc interaction with branch peephole optimization in MachBuffer: if a srcloc range overlaps with a branch that is truncated, remove that srcloc range. These issues were found while fuzzing the new backend (bytecodealliance#2453); I suspect that they arise with the new backend because we can sink instructions (e.g. loads or extends) in more interesting ways than before, but I'm not entirely sure. Test coverage will be via the fuzz corpus once bytecodealliance#2453 lands.

crates/fuzzing/Cargo.toml

crates/fuzzing/src/oracles.rs

fuzz/Cargo.toml

A dynamic heap address computation may create up to two conditional branches: the usual bounds-check, but also (in some cases) an offset-addition overflow check. The x64 backend had reversed the condition code for this check, resulting in an always-trapping execution for a valid offset. I'm somewhat surprised this has existed so long, but I suppose the particular conditions (large offset, small offset guard, dynamic heap) have been somewhat rare in our testing so far. Found via fuzzing in bytecodealliance#2453.

- Sort by generated-code offset to maintain invariant and avoid gimli panic. - Fix srcloc interaction with branch peephole optimization in MachBuffer: if a srcloc range overlaps with a branch that is truncated, remove that srcloc range. These issues were found while fuzzing the new backend (bytecodealliance#2453); I suspect that they arise with the new backend because we can sink instructions (e.g. loads or extends) in more interesting ways than before, but I'm not entirely sure. Test coverage will be via the fuzz corpus once bytecodealliance#2453 lands.

A dynamic heap address computation may create up to two conditional branches: the usual bounds-check, but also (in some cases) an offset-addition overflow check. The x64 backend had reversed the condition code for this check, resulting in an always-trapping execution for a valid offset. I'm somewhat surprised this has existed so long, but I suppose the particular conditions (large offset, small offset guard, dynamic heap) have been somewhat rare in our testing so far. Found via fuzzing in bytecodealliance#2453.

- Sort by generated-code offset to maintain invariant and avoid gimli panic. - Fix srcloc interaction with branch peephole optimization in MachBuffer: if a srcloc range overlaps with a branch that is truncated, remove that srcloc range. These issues were found while fuzzing the new backend (bytecodealliance#2453); I suspect that they arise with the new backend because we can sink instructions (e.g. loads or extends) in more interesting ways than before, but I'm not entirely sure. Test coverage will be via the fuzz corpus once bytecodealliance#2453 lands.

This PR adds a new fuzz target, `differential_wasmi`, that runs a Cranelift-based Wasm backend alongside a simple third-party Wasm interpeter crate (`wasmi`). The fuzzing runs the first function in a given module to completion on each side, and then diffs the return value and linear memory contents. This strategy should provide end-to-end coverage including both the Wasm translation to CLIF (which has seen some subtle and scary bugs at times), the lowering from CLIF to VCode, the register allocation, and the final code emission. This PR also adds a feature `experimental_x64` to the fuzzing crate (and the chain of dependencies down to `cranelift-codegen`) so that we can fuzz the new x86-64 backend as well as the current one.

cfallin requested review from fitzgen and iximeow November 25, 2020 23:35

github-actions bot added the fuzzing Issues related to our fuzzing infrastructure label Nov 25, 2020

cfallin force-pushed the differential-fuzz-interp branch from 1f47183 to 98b08e3 Compare November 25, 2020 23:45

cfallin force-pushed the differential-fuzz-interp branch from 84b205a to 207ac97 Compare November 26, 2020 00:11

alexcrichton reviewed Nov 26, 2020

View reviewed changes

crates/fuzzing/src/oracles.rs Outdated Show resolved Hide resolved

crates/fuzzing/src/oracles.rs Outdated Show resolved Hide resolved

crates/fuzzing/src/oracles.rs Outdated Show resolved Hide resolved

fitzgen reviewed Nov 30, 2020

View reviewed changes

crates/fuzzing/src/oracles.rs Outdated Show resolved Hide resolved

crates/fuzzing/src/oracles.rs Outdated Show resolved Hide resolved

crates/fuzzing/src/oracles.rs Outdated Show resolved Hide resolved

crates/fuzzing/src/oracles.rs Outdated Show resolved Hide resolved

cfallin force-pushed the differential-fuzz-interp branch 2 times, most recently from 0cf19cc to d0fe898 Compare December 2, 2020 00:23

cfallin force-pushed the differential-fuzz-interp branch from d0fe898 to 04d7f6d Compare December 2, 2020 00:48

cfallin mentioned this pull request Dec 2, 2020

Debug info: two fixes in x64 backend. #2462

Merged

github-actions bot added the wasmtime:api Related to the API of the `wasmtime` crate itself label Dec 2, 2020

cfallin mentioned this pull request Dec 2, 2020

x64 backend: fix condition-code used for part of explicit heap check. #2463

Merged

alexcrichton approved these changes Dec 2, 2020

View reviewed changes

cfallin force-pushed the differential-fuzz-interp branch from 04d7f6d to bbdea06 Compare December 2, 2020 22:52

cfallin merged commit b93381e into bytecodealliance:main Dec 2, 2020

cfallin deleted the differential-fuzz-interp branch January 6, 2021 18:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add differential fuzzing against wasmi (a Wasm interpreter). #2453

Add differential fuzzing against wasmi (a Wasm interpreter). #2453

cfallin commented Nov 25, 2020

github-actions bot commented Nov 25, 2020

alexcrichton left a comment

fitzgen left a comment

cfallin commented Dec 2, 2020

fitzgen commented Dec 2, 2020

fitzgen commented Dec 2, 2020

cfallin commented Dec 2, 2020

github-actions bot commented Dec 2, 2020

Add differential fuzzing against wasmi (a Wasm interpreter). #2453

Add differential fuzzing against wasmi (a Wasm interpreter). #2453

Conversation

cfallin commented Nov 25, 2020

github-actions bot commented Nov 25, 2020

Subscribe to Label Action

alexcrichton left a comment

Choose a reason for hiding this comment

fitzgen left a comment

Choose a reason for hiding this comment

cfallin commented Dec 2, 2020

fitzgen commented Dec 2, 2020

fitzgen commented Dec 2, 2020

cfallin commented Dec 2, 2020

github-actions bot commented Dec 2, 2020

Subscribe to Label Action