rtlil: Speeds up string decoding by 30% #3940

QuantamHD · 2023-09-19T00:04:11Z

This change represents about a 2% speed up of Yosys as a whole.

rmlarsen · 2023-09-19T00:10:27Z

Did you intend to include the change to decode_string here? Or is that going to be a separate PR?

@rmlarsen

This change represents about a 2% speed up of Yosys as a whole.

Written by @rmlarsen

Co-authored-by: Rasmus Larsen <[email protected]>
Signed-off-by: Ethan Mahintorabi <[email protected]>

QuantamHD · 2023-09-19T00:12:41Z

Good catch. I added it back.

povik · 2023-09-19T10:22:21Z

kernel/rtlil.cc

+        bits.reserve(str.size() * 8);
+        for (int i = str.size() - 1; i >= 0; --i) {
+          const unsigned char ch = str[i];
+          bits.insert(bits.end(), LUT[ch].begin(), LUT[ch].end());


I am a bit surprised by this being faster -- would you perhaps have a figure for this change alone?

This change by itself is about 20% faster. decode_string() is about 30% faster.

Before: [image not preserved]

After: [image not preserved]
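For readers without the full diff at hand, the lookup-table idea under discussion can be sketched roughly as follows. This is a minimal standalone illustration, not the patch itself: `State`, `make_lut`, `encode_string_lut` and `encode_string_loop` are hypothetical stand-ins, and the table layout (eight bit states per byte, least significant bit first) is an assumption inferred from the quoted hunk. It shows only that the table-driven loop produces the same bits as a plain per-bit loop; it makes no claim about speed.

```cpp
#include <array>
#include <string>
#include <vector>

// Stand-in for RTLIL::State; only the two values needed here.
enum class State : unsigned char { S0 = 0, S1 = 1 };

using ByteBits = std::array<State, 8>;

// Build the 256-entry table: entry c holds the bits of c, LSB first (assumed layout).
static std::array<ByteBits, 256> make_lut()
{
	std::array<ByteBits, 256> lut{};
	for (int c = 0; c < 256; ++c)
		for (int j = 0; j < 8; ++j)
			lut[c][j] = ((c >> j) & 1) ? State::S1 : State::S0;
	return lut;
}

// Table-driven variant, mirroring the loop quoted in the review (hypothetical helper).
std::vector<State> encode_string_lut(const std::string &str)
{
	static const std::array<ByteBits, 256> LUT = make_lut();
	std::vector<State> bits;
	bits.reserve(str.size() * 8);
	// The last character of the string supplies the lowest 8 bits.
	for (int i = static_cast<int>(str.size()) - 1; i >= 0; --i) {
		const unsigned char ch = str[i];
		bits.insert(bits.end(), LUT[ch].begin(), LUT[ch].end());
	}
	return bits;
}

// Plain per-bit reference loop, used only to cross-check the table variant.
std::vector<State> encode_string_loop(const std::string &str)
{
	std::vector<State> bits;
	bits.reserve(str.size() * 8);
	for (int i = static_cast<int>(str.size()) - 1; i >= 0; --i) {
		const unsigned char ch = str[i];
		for (int j = 0; j < 8; ++j)
			bits.push_back(((ch >> j) & 1) ? State::S1 : State::S0);
	}
	return bits;
}
```

Calling `encode_string_lut("AB")` yields 16 bit states, with the bits of 'B' in positions 0 to 7 and the bits of 'A' in positions 8 to 15. Whether the table wins in practice depends on the compiler and workload, which is what the question above is asking about.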

povik · 2023-09-19T10:31:08Z

kernel/rtlil.cc

+
+        int i = n_over_8 * 8;
+        if (i < n) {
+          fill_byte(i, n);


I must say I really dislike the closure here. What about something like:

    int cap = GetSize(bits); // where the current byte ends
    for (int i = GetSize(bits) / 8 * 8; i >= 0; i -= 8) {
        for (int j = i; j < cap; j++) {
            ...
        }
        cap = i;
    }

Having a named helper function for filling each byte is more readable to me, but I suppose it's a matter of what style you are used to reading. But we can certainly change it.

Please do change it. While at it, please fix the patch to use tabs like in the rest of the file.

Also to consider is splitting the patches: we are ready to merge the decode_string() change but we are not so sure about the lookup table in the constructor.

@QuantamHD do you want to update / split the PR or should I just file a new one?
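To make the closure-free loop suggested in this thread concrete, here is one way it could be fleshed out as a standalone function. It is a sketch under assumptions, not the code that landed in #3959: `State`, `bits_of` and `decode_string_sketch` are hypothetical names, the elided loop body is filled in with a plain bit-gathering step, and skipping zero bytes is assumed to match the existing decode_string() behaviour.

```cpp
#include <string>
#include <vector>

// Stand-in for RTLIL::State; only the two values needed here.
enum class State : unsigned char { S0 = 0, S1 = 1 };

// Hypothetical helper for building inputs: last character first, LSB first.
std::vector<State> bits_of(const std::string &str)
{
	std::vector<State> bits;
	for (int i = static_cast<int>(str.size()) - 1; i >= 0; --i) {
		const unsigned char ch = str[i];
		for (int j = 0; j < 8; ++j)
			bits.push_back(((ch >> j) & 1) ? State::S1 : State::S0);
	}
	return bits;
}

// Decode from the most significant byte downwards, so no final reverse is needed.
std::string decode_string_sketch(const std::vector<State> &bits)
{
	const int n = static_cast<int>(bits.size());
	std::string out;
	out.reserve(n / 8 + 1);
	int cap = n; // one past the last bit of the current byte
	for (int i = n / 8 * 8; i >= 0; i -= 8) {
		unsigned char ch = 0;
		// The first pass handles a trailing partial byte (empty if n is a multiple of 8).
		for (int j = i; j < cap; j++)
			if (bits[j] == State::S1)
				ch |= static_cast<unsigned char>(1u << (j - i));
		if (ch != 0) // assumption: zero bytes are dropped
			out.push_back(static_cast<char>(ch));
		cap = i;
	}
	return out;
}
```

With this layout, `decode_string_sketch(bits_of("AB"))` returns "AB", and a 7-bit vector holding the value 0x41 decodes to "A" because the partial top byte is handled by the same inner loop.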

povik · 2023-09-26T11:20:43Z

We are curious about the timings. Couple of questions: when you say this speeds up Yosys as a whole by 2 %, do you mean some representative ASIC flow you are running on your end? Would you have some typical timings for the read_verilog / hierarchy steps on their own, ideally absolute? We assume those steps are where the significant speedup is, e.g. in constructing the Const values for all the src attributes, et cetera.

rmlarsen · 2023-09-26T16:53:44Z

Hi Martin,

Thanks for the comments. Response below.

> We are curious about the timings. Couple of questions: when you say this speeds up Yosys as a whole by 2 %, do you mean some representative ASIC flow you are running on your end? Would you have some typical timings for the read_verilog / hierarchy steps on their own, ideally absolute? We assume those steps are where the significant speedup is, e.g. in constructing the Const values for all the src attributes, et cetera.

Yes, the 2% is the reduction in overall time of the synthesis flow for a representative circuit from a Google hardware block that we are running through Yosys. I can provide you with a flame graph for example, since that probably gives more context, or a wider slice of the Tree view.

povik · 2023-09-26T16:55:45Z

Hello Rasmus,

Yes, anything you could provide to inform our discussion would be helpful.

rmlarsen · 2023-09-26T16:57:23Z

@povik Will do. Note that @QuantamHD is OOO until tomorrow. I'd like to coordinate with him on how to proceed in terms of splitting, so give us a few days. Thanks.

povik · 2023-09-26T17:00:03Z

We will probably look at this next on Monday at 14:00 UTC during the weekly development call (which is, btw, open to the public, and you are more than welcome to join if interested).

rmlarsen · 2023-09-26T17:35:52Z

Ah, good to know. Thanks! Would be nice to join the call if I can at some point.

povik · 2023-09-26T17:50:41Z

Link here: https://meet.jit.si/yosyshq-slack-devel-discuss

It's scheduled to be an hour long and occurs every week.

rmlarsen · 2023-09-28T00:20:54Z

@povik I moved the decode_string() change to #3959. I got rid of the lambda and after a few tries found a variant that was a bit faster too.

QuantamHD · 2023-09-28T00:23:51Z

@rmlarsen Should we close this then?

rmlarsen · 2023-09-28T00:26:25Z

@QuantamHD sure. We can upstream the Const constructor separately, if the Yosys authors find it acceptable.

rtlil: Speeds up string decoding by 30%

2235c3f


QuantamHD force-pushed the const_string_improvements branch from 8f6a9f5 to 2235c3f on September 19, 2023 00:12

povik reviewed Sep 19, 2023


rmlarsen mentioned this pull request Sep 28, 2023

Speed up RTLIL::Const::decode_string by 1.7x. #3959

Merged

QuantamHD closed this Sep 28, 2023
