Match emulator to circuits: use a Harvard Architecture #688

matthiasgoergens · 2024-12-04T06:41:17Z

Our circuits use a modified Harvard architecture where the memory space for instructions and data are completely separated, even though they are both addressed with 32 bits. That is a Good Thing.

That means writing to the data cell with address x does not change the value of the instruction cell with address x.

In contrast, our emulator uses a von Neumann architecture at its core, but goes through quite a few gymnastics to hide that fact.

In a von Neumann architecture, data memory and stored program memory are intermixed. Modern computers have moved away from this architecture for security and performance reasons.

In this PR, we turn our emulator into a Harvard Architecture machine as well to match what our circuits are doing. That means we change the emulator to explicitly load instructions not from the data RAM, but from the program ROM, which we are already storing separately anyway.

Instead of thinking about separate address spaces for data and code, you can also imagine that we have an instruction cache and a data cache, and that we load the instructions into their cache once at the start of the program, and then never invalidate that cache.

This works towards #630

…nto matthias/harvard-architecture

icemelon · 2024-12-04T21:27:06Z

Since you change the emulator to Harvard Architecture, should you also update the memory address space here?

icemelon · 2024-12-04T21:29:44Z

Is this PR ready for review? I see there're many todos left

matthiasgoergens · 2024-12-05T01:41:24Z

Since you change the emulator to Harvard Architecture, should you also update the memory address space here?

Well, we could just totally remove the whole ROM section, because there's no problem with executing code anywhere, now that we separated the address spaces. But I want to keep this PR as small as possible, and there's also no immediate problem with keeping the ROM section contained, either. Or am I missing something?

Is this PR ready for review? I see there're many todos left

The TODOs are for future work, I want to keep this PR small. (I can convert them into issues, too, if you think that's better, but for these small organisational items I prefer to have them discoverable in the code.)

matthiasgoergens · 2024-12-05T01:46:36Z

@icemelon Let me make the follow up PR quickly that addresses the TODOs. Think of this PR as pre-emptively extracted from the big PR the TODOs hint at. Smaller PRs are easier to review and get consensus on.

lispc

lgtm

icemelon · 2024-12-05T06:22:21Z

there's also no immediate problem with keeping the ROM section contained, either.

@matthiasgoergens Yes, you're right. I was thinking of having different memory space for ROM and RAM. But it's not necessary.

The TODOs are for future work, I want to keep this PR small.

I don't mind making PR small. But why not finish the implementation of the Hardvard architecture emulator in this PR? I don't think it will make the pr too large. But I'll leave this up to @lispc

lispc · 2024-12-05T06:29:24Z

writing to the data cell with address x does not change the value of the instruction cell with address x

I think this pr finishes what it claims to do "writing to the data cell with address x does not change the value of the instruction cell with address x, so make emulator be consistent with prover".

the TODOs are good refactor, but not directly part of this purpose. So i think it is ok now.

matthiasgoergens · 2024-12-05T07:11:34Z

I don't mind making PR small. But why not finish the implementation of the Hardvard architecture emulator in this PR? I don't think it will make the pr too large. But I'll leave this up to @lispc

This PR finishes the Harvard architecture. The TODOs are for cleanup enabled by the switch.

In the [Harvard Architecture PR](#688) we left a few TODOs. Here we make good on them. There's a few things happening in this PR: - We remove the fake instruction `EANY`. `EANY` was only ever an artifact of Risc0's incredible 'interesting' approach to decoding. We use the real instructions `ECALL` and `EBREAK` instead. - Remove `AUIPC` and `LUI`. Both of them are now implemented as pseudo-instructions, that translate to `ADDI` during decoding. - Use the same library as SP1 for RiscV decoding, instead of copy-and-pasting-and-editing Risc0's 'interesting' decoder. That simplifies our code, and comes with a lot more tests than we ever had. Both because of explicit tests in the library, and because of the usage in SP1 and other projects. This gets rid of much error prone bit manipulating code. - Use `struct Instruction` throughout the code when handling and testing instructions, instead of `u32`. That makes specifying tests a lot simpler and more readable. No more `0b_000000001010_00000_000_00001_0010011, // addi x1, x0, 10` in the code. - Remove the notion of executable vs non-executable ROM. This is only necessary for a von-Neumann architecture: everything that's in our instruction-cache is meant to be executable already. (We can re-implement this restriction later by controlling what is allowed to make it into the instruction cache when we eg decode the ELF. But it's unnecessary: we already honour the executable flag for memory sections in the ELF.)

Move to Harvard Architecture

ad6c1a2

matthiasgoergens requested a review from kunxian-xia December 4, 2024 06:41

matthiasgoergens mentioned this pull request Dec 4, 2024

Make the emulator match the circuits #630

Open

matthiasgoergens changed the title ~~Synchronise circuits and emulator to both use a Harvard Architecture~~ Match emulator to circuits: use a Harvard Architecture Dec 4, 2024

matthiasgoergens added 4 commits December 4, 2024 14:46

Merge branch 'master' into matthias/harvard-architecture

9c1a840

Comment

1b46255

Explain

eb1cda5

Merge remote-tracking branch 'origin/matthias/harvard-architecture' i…

9073ae9

…nto matthias/harvard-architecture

matthiasgoergens requested a review from lispc December 4, 2024 06:53

Merge branch 'master' into matthias/harvard-architecture

9343b30

lispc approved these changes Dec 5, 2024

View reviewed changes

matthiasgoergens merged commit 43a9617 into master Dec 5, 2024
6 checks passed

matthiasgoergens deleted the matthias/harvard-architecture branch December 5, 2024 07:11

matthiasgoergens mentioned this pull request Dec 9, 2024

Implement the TODOs in the Harvard Architecture PR #711

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Match emulator to circuits: use a Harvard Architecture #688

Match emulator to circuits: use a Harvard Architecture #688

matthiasgoergens commented Dec 4, 2024 •

edited

Loading

icemelon commented Dec 4, 2024

icemelon commented Dec 4, 2024

matthiasgoergens commented Dec 5, 2024

matthiasgoergens commented Dec 5, 2024

lispc left a comment

icemelon commented Dec 5, 2024 •

edited

Loading

lispc commented Dec 5, 2024

matthiasgoergens commented Dec 5, 2024

Match emulator to circuits: use a Harvard Architecture #688

Match emulator to circuits: use a Harvard Architecture #688

Conversation

matthiasgoergens commented Dec 4, 2024 • edited Loading

icemelon commented Dec 4, 2024

icemelon commented Dec 4, 2024

matthiasgoergens commented Dec 5, 2024

matthiasgoergens commented Dec 5, 2024

lispc left a comment

Choose a reason for hiding this comment

icemelon commented Dec 5, 2024 • edited Loading

lispc commented Dec 5, 2024

matthiasgoergens commented Dec 5, 2024

matthiasgoergens commented Dec 4, 2024 •

edited

Loading

icemelon commented Dec 5, 2024 •

edited

Loading