This repository contains the information and source code needed to reproduce the experiments in the DDIO paper published at ATC'20.
It mainly focuses on the experiments performed in Sections 4 and 5 of the DDIO paper.
The experiments are located in experiments/. Every folder has a Makefile and a README.md that can be used to run the experiment.
For more information, please check the README.
Note that you have to set up your testbed based on our guidelines before running any experiment.
You can use the DDIOTune element in Fastclick to enable, disable, or tune DDIO. If you want to tune DDIO in a different context, you can use the following guidelines.
- Tuning: Our experiments show that changing the value of the IIO LLC WAYS register, located at 0xC8B, could improve the performance of DDIO. The default value of this register in our testbed is 0x600, which has 2 set bits. You can read the current value and write new values to this register via msr-tools, as follows:
sudo apt-get install msr-tools
sudo modprobe msr
sudo rdmsr 0xc8b
sudo wrmsr 0xc8b 0x7f0
- Disabling/Enabling DDIO: DDIO is enabled by default on Intel Xeon processors. DDIO can be disabled globally (i.e., by setting the Disable_All_Allocating_Flows bit in the iiomiscctrl register) or per root PCIe port (i.e., by setting the NoSnoopOpWrEn bit and unsetting the Use_Allocating_Flow_Wr bit in the perfctrlsts_0 register). You can find more information about these registers in the second volume of your processor's datasheet. For instance, you can check the Haswell and Cascade Lake datasheets.
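For the tuning item in the list above, both values shown (0x600 and 0x7f0) are contiguous runs of high-order bits. The following is a minimal C sketch, not part of this repository, that builds such masks, counts their set bits, and reads the register through /dev/cpu/N/msr, the device file exposed by the msr kernel module. The function names are hypothetical, and the 11-bit field width is an assumption inferred from those two values; check it against your own processor.

```c
#define _XOPEN_SOURCE 700   /* for pread() under strict -std modes */
#include <fcntl.h>
#include <stdint.h>
#include <stdio.h>
#include <unistd.h>

#define IIO_LLC_WAYS_MSR 0xC8B   /* register address given in the text */
#define ASSUMED_LLC_WAYS 11      /* assumption: width of the way mask */

/* Build a mask with the top n_ways bits of a total_ways-bit field set.
 * Returns 0 for arguments that make no sense. */
uint64_t iio_llc_ways_mask(unsigned n_ways, unsigned total_ways)
{
    if (n_ways == 0 || n_ways > total_ways || total_ways > 63)
        return 0;
    return ((1ULL << n_ways) - 1) << (total_ways - n_ways);
}

/* Count the set bits in a register value. */
unsigned count_set_bits(uint64_t v)
{
    unsigned n = 0;
    while (v) {
        v &= v - 1;   /* clear the lowest set bit */
        n++;
    }
    return n;
}

/* Read one MSR on the given CPU via the msr module.
 * Needs root and "modprobe msr". Returns 0 on success, -1 on failure. */
int read_msr(unsigned cpu, uint32_t reg, uint64_t *value)
{
    char path[64];
    snprintf(path, sizeof(path), "/dev/cpu/%u/msr", cpu);

    int fd = open(path, O_RDONLY);
    if (fd < 0)
        return -1;

    /* The MSR number is used as the file offset; each MSR is 8 bytes. */
    ssize_t n = pread(fd, value, sizeof(*value), (off_t)reg);
    close(fd);
    return n == (ssize_t)sizeof(*value) ? 0 : -1;
}
```

Under the 11-bit assumption, iio_llc_ways_mask(2, ASSUMED_LLC_WAYS) gives the 0x600 default and iio_llc_ways_mask(7, ASSUMED_LLC_WAYS) gives the 0x7f0 value used with wrmsr above. Calling read_msr(0, IIO_LLC_WAYS_MSR, &v) is the programmatic counterpart of sudo rdmsr 0xc8b.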
change-ddio.c is a simple C program that changes the state of DDIO for a PCIe port. To use change-ddio, run the following commands:
sudo apt-get install libpci-dev
gcc change-ddio.c -o change-ddio -lpci
sudo ./change-ddio
You need to define the proper values for nic_bus and ddio_state in the code. For example, if your NIC is located at 03:00.0, you should change nic_bus to 0x03. Setting ddio_state=0 disables DDIO for the PCIe root port responsible for that specific NIC.
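The per-port change described earlier (set NoSnoopOpWrEn and unset Use_Allocating_Flow_Wr in perfctrlsts_0) comes down to a small bit update. The sketch below shows only that update as a pure function; it is not the code in change-ddio.c. The bit positions (3 and 7) and the function name are assumptions made for illustration, so verify them in your processor's datasheet. Treating "enable" as the exact inverse of "disable" is also an assumption. Reading and writing the register itself, for example through libpci, is left out.

```c
#include <stdint.h>

/* Assumed bit positions in perfctrlsts_0; verify in your datasheet. */
#define NOSNOOPOPWREN_MASK          (1u << 3)
#define USE_ALLOCATING_FLOW_WR_MASK (1u << 7)

/* Return the register value that corresponds to the requested state.
 * ddio_state == 0 disables DDIO for the port; any other value enables it.
 * All other bits of reg are left untouched. */
uint32_t perfctrlsts_with_ddio_state(uint32_t reg, int ddio_state)
{
    if (ddio_state == 0) {
        reg |= NOSNOOPOPWREN_MASK;            /* set NoSnoopOpWrEn */
        reg &= ~USE_ALLOCATING_FLOW_WR_MASK;  /* unset Use_Allocating_Flow_Wr */
    } else {
        reg &= ~NOSNOOPOPWREN_MASK;           /* assumed inverse of the above */
        reg |= USE_ALLOCATING_FLOW_WR_MASK;
    }
    return reg;
}
```

Keeping the update separate from the PCI access makes it easy to check the resulting value before anything is written to hardware.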
You can find the PCIe BDF (Bus Device Function) of your NIC via lspci, e.g., try lspci -vvv | grep Mellanox if you have a Mellanox card.
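To turn a BDF such as 03:00.0 into the value for nic_bus, the fields can be parsed as hexadecimal numbers. The helper below is a hypothetical sketch, not part of change-ddio.c. It accepts the short bus:device.function form that lspci prints by default, optionally followed by the rest of the lspci line, and rejects the domain-prefixed form (0000:03:00.0).

```c
#include <stdio.h>

/* Parse a "bb:dd.f" prefix (hex fields, as printed by lspci).
 * Returns 0 on success, -1 on malformed or out-of-range input. */
int parse_bdf(const char *s, unsigned *bus, unsigned *dev, unsigned *fn)
{
    if (sscanf(s, "%x:%x.%x", bus, dev, fn) != 3)
        return -1;
    /* PCI limits: 8-bit bus, 5-bit device, 3-bit function. */
    if (*bus > 0xff || *dev > 0x1f || *fn > 0x7)
        return -1;
    return 0;
}
```

For the example above, parse_bdf("03:00.0", &bus, &dev, &fn) leaves bus equal to 0x03, which is the value to assign to nic_bus.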
You can also check the implementation of the DDIOTune element in Fastclick.
We have investigated the impact of dynamically reducing the number of RX descriptors when the TX path is congested. Our implementation can be found in the DMAdynamic branch of Fastclick.
If you want to try it, you have to compile Fastclick with the --enable_dynamic_rxburst flag.
If you use ddio-bench in any context, please cite our paper:
@inproceedings {farshin-ddio,
author = {Farshin, Alireza and Roozbeh, Amir and {Maguire Jr.}, Gerald Q. and Kosti\'{c}, Dejan},
title = {{Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks}},
booktitle = {2020 {USENIX} Annual Technical Conference ({USENIX} {ATC} 20)},
year = {2020},
isbn = {978-1-939133-14-4},
pages = {673--689},
url = {https://www.usenix.org/conference/atc20/presentation/farshin},
publisher = {{USENIX} Association},
month = jul,
}
If you have any questions regarding our code or the paper, you can contact Alireza Farshin (farshin at kth.se) and/or Amir Roozbeh (amirrsk at kth.se).