There are three simple steps for accelerating your application on an AWS F1 instance:
- Build the host application, Xilinx FPGA binary and verify you are ready for FPGA acceleration
- Create an AFI
- Run the FPGA accelerated application on AWS FPGA instances
This quick start guide will use a simple "Hello World" SDAccel example to get you started.
It is highly recommended you read the documentation and utilize software and hardware emulation prior to running on F1. The F1 HW compile time is ~4hrs (4DDR) and ~1hr (1DDR), therefore, software and hardware emulation should be used during development.
- Overview
- Prerequisites
- Build the host application, Xilinx FPGA binary and verify you are ready for FPGA acceleration
- Create an Amazon FPGA Image (AFI)
- Run the FPGA accelerated application on F1
- Additional SDAccel Information
- SDAccel is a complete development environment for applications accelerated with Xilinx FPGAs
- It leverages the OpenCL heterogeneous computing framework to offload compute intensive workloads to the FPGA
- The accelerated application is written in C/C++ with OpenCL APIs
- The code for the FPGA binary can be written in C/C++, OpenCL or RTL
- Once you have gone through this quick start example. See the SDAccel GUI Guide to access the fully integrated Eclipse-based environment with built-in debug, profiling and performance analysis tools.
AWS Account, F1/EC2 Instances, On-Premises, AWS IAM Permissions, AWS CLI and S3 Setup (One-time Setup)
- Setup an AWS Account
- Launch an instance using the FPGA Developer AMI which comes pre-installed with SDAccel and required licenses.
- You may use this F1 instance to build your host application and Xilinx FPGA binary, however, it may be more cost efficient to either:
- Launch the FPGA Developer AMI on a lower cost EC2 instance, with a minimum of 30GiB RAM), OR
- Follow the On-Premises Instructions to purchase and install a license from Xilinx.
- You may use this F1 instance to build your host application and Xilinx FPGA binary, however, it may be more cost efficient to either:
- Setup AWS IAM permissions for creating FPGA Images (CreateFpgaImage and DescribeFpgaImages). EC2 API Permissions are described in more detail. It is highly recommended that you validate your AWS IAM permissions prior to proceeding with this quick start. By calling the DescribeFpgaImages API you can check that your IAM permissions are correct.
- Setup AWS CLI and S3 Bucket to enable AFI creation.
- Install optional packages required to run all examples. If you do not install these packages, some examples may not work properly. The setup scripts will warn you of any missing packages.
- Additional dependancies may get flagged during the AWS SDAccel scripts as warnings or errors.
-
Clone this github repository and source the sdaccel_setup.sh script. This will take care of:
- Downloading the required files:
- AWS Platform that allows Xilinx FPGA Binary files to target AWS F1 instances
- AFI Creation script that generates an AFI and AWS FPGA Binary from a Xilinx FPGA Binary
- SDAccel HAL source code and binary files for mapping SDAccel/OpenCL runtime libraries to AWS FPGA instance.
- Installing the required libraries and drivers
$ git clone https://github.com/aws/aws-fpga.git $AWS_FPGA_REPO_DIR $ cd $AWS_FPGA_REPO_DIR $ source sdaccel_setup.sh
- Select a platform:
- AWS_PLATFORM_4DDR - Default AWS F1 platform with 4 DDRs and profiling support. Optimized for multi DDR use cases. This platform should be used for all production applications which require more than 1 DDR bank.
- AWS_PLATFORM_4DDR_DEBUG - This platform is a debug variant of the 4DDR platform and should be used for hardware debugging of kernels. This version consists of an additional debug feature which allows advanced users to insert ILA’s in the kernels for debugging purposes. All other features are identical to the AWS_PLATFORM_4DDR platform.
- AWS_PLATFORM_1DDR - This platform consist of 1 DDR that is located in the shell region. This allow maximum space for kernels. This also allows much faster compile times for all the use cases which require only 1 DDR bank. This platform does not support APM and hence no profiling data can be obtained.
$ export AWS_PLATFORM=$AWS_PLATFORM_1DDR
- Downloading the required files:
This section will walk you through creating, emulating and compiling your host application and FPGA Binary
The main goal of emulation is to ensure functional correctness and to determine how to partition the application between the host CPU and the FPGA.
For CPU-based (SW) emulation, both the host code and the FPGA binary code are compiled to run on an x86 processor. The SW Emulation enables developer to iterate and refine the algorithms through fast compilation. The iteration time is similar to software compile and run cycles on a CPU.
The instructions below describe how to run the SDAccel SW Emulation flow using the Makefile provided with a simple "hello world" example
$ cd $SDACCEL_DIR/examples/xilinx/getting_started/host/helloworld_ocl/
$ make clean
$ make check TARGETS=sw_emu DEVICES=$AWS_PLATFORM all
For more information on how to debug your application in a SW Emulation environment, please see the SDAccel Debug Guide.
The SDAccel hardware emulation flow enables the developer to check the correctness of the logic generated for the FPGA binary. This emulation flow invokes the hardware simulator in the SDAccel environment to test the functionality of the code that will be executed on the FPGA Custom Logic.
The instructions below describe how to run the HW Emulation flow using the Makefile provided with a simple "hello world" example:
$ cd $SDACCEL_DIR/examples/xilinx/getting_started/host/helloworld_ocl/
$ make clean
$ make check TARGETS=hw_emu DEVICES=$AWS_PLATFORM all
For more information on how to debug your application in a HW Emulation environment, please see the SDAccel Debug Guide.
The SDAccel system build flow enables the developer to build their host application as well as their Xilinx FPGA Binary.
The instructions below describe how to build the Xilinx FPGA Binary and host application using the Makefile provided with a simple "hello world" example:
$ cd $SDACCEL_DIR/examples/xilinx/getting_started/host/helloworld_ocl/
$ make clean
$ make TARGETS=hw DEVICES=$AWS_PLATFORM all
Now that you have built your Xilinx FPGA binary, see SDAccel Power Analysis Guide for more details on how to analyze power for your binary.
This assumes you have:
- Compiled your host application and Xilinx FPGA Binary
- Validated your code using SW/HW Emulation and you are ready to create an AFI and test on F1.
- Setup AWS CLI and S3 bucket for AFI creation
The create_sdaccel_afi.sh script is provided to facilitate AFI creation from a Xilinx FPGA Binary, it:
- Takes in your Xilinx FPGA Binary *.xclbin file
- Calls aws ec2 create_fgpa_image to generate an AFI under the hood
- Generates a <timestamp>_afi_id.txt which contains the identifiers for your AFI
- Creates an AWS FPGA Binary file with an *.awsxclbin extension that is composed of: Metadata and AGFI-ID.
- This *.awsxclbin is the AWS FPGA Binary file that will need to be loaded by your host application to the FPGA
$ $SDACCEL_DIR/tools/create_sdaccel_afi.sh -xclbin=<input_xilinx_fpga_binary_xclbin_filename>
-o=<output_aws_fpga_binary_awsxclbin_filename_root> \
-s3_bucket=<bucket-name> -s3_dcp_key=<dcp-folder-name> -s3_logs_key=<logs-folder-name>
Save the *.awsxclbin, you will need to copy it to your F1 instance along with your executable host application.
NOTE: Attempting to load your FPGA Binary immediately on an F1 instance will result in an 'Invalid AFI ID' error. Please wait until you confirm the AFI has been created successfully.
The *_afi_id.txt file generated by the create_sdaccel_afi.sh also includes the two identifiers for your AFI:
- FPGA Image Identifier or AFI ID: this is the main ID used to manage your AFI through the AWS EC2 CLI commands and AWS SDK APIs.
This ID is regional, i.e., if an AFI is copied across multiple regions, it will have a different unique AFI ID in each region.
An example AFI ID is
afi-06d0ffc989feeea2a
. - Global FPGA Image Identifier or AGFI ID: this is a global ID that is used to refer to an AFI from within an F1 instance.
For example, to load or clear an AFI from an FPGA slot, you use the AGFI ID.
This is embedded into the AWS FPGA Binary *.awsxclbin file generated by create_sdaccel_afi.sh.
Since the AGFI IDs is global (by design), it allows you to copy a combination of AFI/AMI to multiple regions, and they will work without requiring any extra setup.
An example AGFI ID is
agfi-0f0e045f919413242
.
Use the describe-fpga-images API to check the AFI state during the background AFI generation process.
$ aws ec2 describe-fpga-images --fpga-image-ids <AFI ID>
When AFI creation completes successfully, the output should contain:
...
"State": {
"Code": "available"
},
...
If the “State” code indicates the AFI generation has "failed", the AFI creation logs can be found in the bucket location (s3://<bucket-name>/<logs-folder-name>
) provided to create_sdaccel_afi.sh above. These will detail the errors encountered during the AFI creation process.
For help with AFI creation issues, see create-fpga-image error codes
Here are the steps:
- Start an F1 instance using FPGA Developer AMI on AWS Marketplace, alternatively you can create your own Runtime AMI for running your SDAccel applications on F1.
- Assuming the developer flow (compilation) was done on a separate instance you will need to:
- Copy the compiled host executable (exe) to new instance
- Copy the *.awsxclbin AWS FPGA binary file to the new instance
- If using 1DDR platform or 4DDR Rtl kernel debug platform: Depending on the host code, the *.awsxclbin may need to be renamed. Ex:
cp vector_addition.hw.xilinx_aws-vu9p-f1_1ddr-xpr-2pr_4_0.awsxclbin vector_addition.hw.xilinx_aws-vu9p-f1_4ddr-xpr-2pr_4_0.awsxclbin
- Copy any data files required for execution to the new instance
- Clone the github repository to the new F1 instance and install runtime drivers
- Clone the github repository to the new F1 instance and install runtime drivers
- Assuming the developer flow (compilation) was done on a separate instance you will need to:
$ git clone https://github.com/aws/aws-fpga.git $AWS_FPGA_REPO_DIR
$ cd $AWS_FPGA_REPO_DIR
$ source sdaccel_setup.sh
- Ensure the host application can find and load the *.awsxclbin AWS FPGA binary file.
- Source the Runtime Environment & Execute your Host Application
$ sudo sh
# source /opt/Xilinx/SDx/2017.1.rte.4ddr/setup.sh # Use 2017.1.rte.1ddr or 2017.1.rte.4ddr_debug when using AWS_PLATFORM_1DDR or AWS_PLATFORM_4DDR_DEBUG. Other runtime env settings needed by the host app should be setup after this step
# ./helloworld