AI HW Acceleration #53

jzhou1318 · 2023-11-08T13:55:50Z

Before submitting your Pull Request, please ensure that you have carefully reviewed and completed all items on this checklist.

Content
- The chapter content is complete and covers the topic in detail.
- All technical terms are well-defined and explained.
- Any code snippets or algorithms are well-documented and tested.
- The chapter follows a logical flow and structure.
References & Citations
- All references are correctly listed at the end of the chapter.
- In-text citations are used appropriately and match the references.
- All figures, tables, and images have proper sources and are cited correctly.
Quarto Website Rendering
- The chapter has been locally built and tested using Quarto.
- All images, figures, and tables render properly without any glitches.
- All images have a source or they are properly linked to external sites.
- Any interactive elements or widgets work as intended.
- The chapter's formatting is consistent with the rest of the book.
Grammar & Style
- The chapter has been proofread for grammar and spelling errors.
- The writing style is consistent with the rest of the book.
- Any jargon is clearly explained or avoided where possible.
Collaboration
- All group members have reviewed and approved the chapter.
- Any feedback from previous reviews or discussions has been addressed.
Miscellaneous
- All external links (if any) are working and lead to the intended destinations.
- If datasets or external resources are used, they are properly credited and linked.
- Any necessary permissions for reused content have been obtained.
Final Steps
- The chapter is pushed to the correct branch on the repository.
- The Pull Request is made with a clear title and description.
- The Pull Request includes any necessary labels or tags.
- The Pull Request mentions any stakeholders or reviewers who should take a look.

Missing references bibtex emerging tech

updated solved issue with youtube video

AditiR-42 · 2023-11-09T14:45:32Z

hw_acceleration.qmd

+
+In response, new manufacturing techniques like wafer-scale fabrication and advanced packaging now allow much higher levels of integration. The goal is to create unified, specialized AI compute complexes tailored for deep learning and other AI algorithms. Tighter integration is key to delivering the performance and efficiency needed for the next generation of AI.
+
+#### Wafter-scale AI


Typo: Wafer-scale AI

Thank you, fixed.

AditiR-42 · 2023-11-09T14:50:32Z

hw_acceleration.qmd

+
+- **Co-simulation:** Unified platforms like the SCALE-Sim [@samajdar2018scale] integrate hardware and software simulation into a single tool. This enables what-if analysis to quantify the system-level impacts of cross-layer optimizations early in the design cycle.
+
+For example, an FPGA-based AI accelerator design could be simulated using Verilog hardware description language and synthesized into a Gem5 model. The accelerator could have ML workloads simulated using TVM compiled onto it within the Gem5 environment for unified modeling.


This example is a bit difficult to follow. It would be nice to have a step by step explanation of why an FPGA-based AI accelerator should be simulated using Verilog (what exactly about Verilog makes it well-suited for this type of accelerator?), as well as why it should be synthesized into a Gem5 model (what specifically about Gem5 makes it optimal for this task?)

Is this helpful @AditiR-42

For example, an FPGA-based AI accelerator design could be simulated using Verilog hardware description language and synthesized into a Gem5 model. Verilog is well-suited for describing the digital logic and interconnects that make up the accelerator architecture. Using Verilog allows the designer to specify the datapaths, control logic, on-chip memories, and other components that will be implemented in the FPGA fabric. Once the Verilog design is complete, it can be synthesized into a model that simulates the behavior of the hardware, such as using the Gem5 simulator. Gem5 is useful for this task because it allows modeling of full systems including processors, caches, buses, and custom accelerators. Gem5 supports interfacing Verilog models of hardware to the simulation, enabling unified system modeling.

The synthesized FPGA accelerator model could then have ML workloads simulated using TVM compiled onto it within the Gem5 environment for unified modeling. TVM allows optimized compilation of ML models onto heterogeneous hardware like FPGAs. Running TVM-compiled workloads on the accelerator within the Gem5 simulation provides an integrated way to validate and refine the hardware design, software stack, and system integration before ever needing to physically realize the accelerator on a real FPGA.

Hope that works, updated it in the text.

arnaumarin · 2023-11-13T14:43:07Z

Looks good. Thanks for addressing the typo!

gnodipac886 · 2023-11-13T19:42:50Z

added more references to the sections that I was responsible for (challenges and solutions)

profvjreddi · 2023-11-13T19:44:25Z

Thank you folks!

…

On Mon, Nov 13, 2023 at 2:43 PM gnodipac886 ***@***.***> wrote: added more references to the sections that I was responsible for (challenges and solutions) — Reply to this email directly, view it on GitHub <#53 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ABT6DFHEIR277A7XR7NHVTTYEJZ4LAVCNFSM6AAAAAA7C5SPIOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQMBYHEYTQMJYGA> . You are receiving this because you commented.Message ID: ***@***.***>

-- Vijay Janapa Reddi, Ph. D. | John L. Loeb Associate Professor of Engineering and Applied Sciences | John A. Paulson School of Engineering and Applied Sciences | Science and Engineering Complex (SEC) | 150 Western Ave, Room #5.305 | Boston, MA 02134 | Harvard University | Email ***@***.***> | Website <http://scholar.harvard.edu/vijay-janapa-reddi> | Google Scholar <https://scholar.google.com/citations?hl=en&user=gy4UVGcAAAAJ&view_op=list_works&sortby=pubdate> | Edge Computing Lab <https://edge.seas.harvard.edu> | Schedule a Meeting <https://scholar.harvard.edu/vijay-janapa-reddi/schedule> | Admin <https://scholar.harvard.edu/vijay-janapa-reddi/contact> |

mpstewart1 · 2023-11-15T14:44:44Z

Really great job on this chapter!

jzhou1318 and others added 10 commits November 6, 2023 18:09

add skeleton

c01f235

add Eric sections without bib

b212fc9

types hw accelerators

fe4ff97

introduction + background + references background

7471b60

emerging technologies

90f7a7c

Missing references bibtex emerging tech

add co-design section and references

4c843a5

add sections: Software for AI hardware & Benchmarking AI Hardware

51293ab

Additional References + Formatting + Content

a591676

Update hw_acceleration.qmd

c358bec

Formatting updates to the text (manual linting 😅)

03fabfd

uchendui added the cs249r label Nov 8, 2023

pongtr and others added 8 commits November 8, 2023 14:30

add references and figure in Software for AI Hardware section

6ae43f4

updated references for Emerging tech

e3f1068

all references emerging technologies

1f8a465

updated

6bf36f7

updated solved issue with youtube video

updated video

1dfe94a

Minor fixes to rendering

1bbf2c4

Added OCP video

dc1bb18

added overview paragraph text for the chapter

c698b22

AditiR-42 reviewed Nov 9, 2023

View reviewed changes

Addressing Aditi's feedback.

7cbb7bf

uchendui force-pushed the main branch 2 times, most recently from d230ebb to 9868ae3 Compare November 10, 2023 01:56

Figure fixes

d51785c

jzhou1318 and others added 2 commits November 13, 2023 09:52

fix refeerences for acceleration types

fb3e89c

add references

f5b282d

profvjreddi and others added 4 commits November 14, 2023 14:56

Manual lint checking + reference updates

e920d3e

Fix references

226fb70

Manually linting :)

5ba476e

Merge branch 'main' into main

a39e3bb

mpstewart1 merged commit 02dddc4 into harvard-edge:main Nov 15, 2023
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AI HW Acceleration #53

AI HW Acceleration #53

jzhou1318 commented Nov 8, 2023 •

edited

Loading

AditiR-42 Nov 9, 2023

profvjreddi Nov 9, 2023

arnaumarin Nov 13, 2023

AditiR-42 Nov 9, 2023

profvjreddi Nov 9, 2023

profvjreddi Nov 9, 2023

arnaumarin commented Nov 13, 2023

gnodipac886 commented Nov 13, 2023

profvjreddi commented Nov 13, 2023 via email

mpstewart1 commented Nov 15, 2023


		In response, new manufacturing techniques like wafer-scale fabrication and advanced packaging now allow much higher levels of integration. The goal is to create unified, specialized AI compute complexes tailored for deep learning and other AI algorithms. Tighter integration is key to delivering the performance and efficiency needed for the next generation of AI.

		#### Wafter-scale AI


		- Co-simulation: Unified platforms like the SCALE-Sim [@samajdar2018scale] integrate hardware and software simulation into a single tool. This enables what-if analysis to quantify the system-level impacts of cross-layer optimizations early in the design cycle.

		For example, an FPGA-based AI accelerator design could be simulated using Verilog hardware description language and synthesized into a Gem5 model. The accelerator could have ML workloads simulated using TVM compiled onto it within the Gem5 environment for unified modeling.

AI HW Acceleration #53

AI HW Acceleration #53

Conversation

jzhou1318 commented Nov 8, 2023 • edited Loading

AditiR-42 Nov 9, 2023

Choose a reason for hiding this comment

profvjreddi Nov 9, 2023

Choose a reason for hiding this comment

arnaumarin Nov 13, 2023

Choose a reason for hiding this comment

AditiR-42 Nov 9, 2023

Choose a reason for hiding this comment

profvjreddi Nov 9, 2023

Choose a reason for hiding this comment

profvjreddi Nov 9, 2023

Choose a reason for hiding this comment

arnaumarin commented Nov 13, 2023

gnodipac886 commented Nov 13, 2023

profvjreddi commented Nov 13, 2023 via email

mpstewart1 commented Nov 15, 2023

jzhou1318 commented Nov 8, 2023 •

edited

Loading