diff --git a/README.md b/README.md index b2d51b2a1709..3b9010695760 100644 --- a/README.md +++ b/README.md @@ -159,6 +159,9 @@ to help run our Unit tests. * To run the **Jest unit tests**: `npm run test` +## Performance tests +We use Reassure for monitoring performance regression. More detailed information can be found [here](tests/perf-test/README.md): + ---- # Debugging diff --git a/tests/perf-test/README.md b/tests/perf-test/README.md new file mode 100644 index 000000000000..e7391612f071 --- /dev/null +++ b/tests/perf-test/README.md @@ -0,0 +1,69 @@ +# Performance testing + +We use Reassure for monitoring performance regression. It helps us check if our app is getting faster and quickly spot any issues we need to fix. + +## How does Reassure work? + +- Reassure builds on the existing React Testing Library setup and adds a performance measurement functionality. It's intended to be used on local machine and on a remote server as part of your continuous integration setup. +- To make sure the results are reliable and consistent, Reassure runs tests twice – once for the current branch and once for the base branch. + +## Performance Testing Strategy (`measurePerformance`) + +- The primary focus is on testing business cases rather than small, reusable parts that typically don't introduce regressions, although some tests in that area are still necessary. +- To achieve this goal, it's recommended to stay relatively high up in the React tree, targeting whole screens to recreate real-life scenarios that users may encounter. +- For example, consider scenarios where an additional `useMemo` call could impact performance negatively. + +## `measureFunction` API approach + +- Identifying functions with heavy calculations. +- Targeting functions that are frequently used throughout the app. + +## Running tests locally + +- Checkout your base environment, eg. `git checkout main`. +- Collect baseline metrics with `npx reassure --baseline`. +- Apply any desired changes (for testing purposes you can eg. try to slow down a list). +- Collect current metrics with `npx reassure`. +- Open up the resulting `output.md` / `output.json` (see console output) to compare the results. +- With all that information, Reassure can present the render duration times as statistically significant or meaningless. + +## Metrics for Regression Detection + +- **Render count**: If the number of renders increases by one compared to the baseline, it will be considered a performance regression, leading to a failed test. This metric helps detect unexpected changes in component rendering behavior. *NOTE: sometimes regressions are intentional. For instance, if a new functionality is added to the tested component, causing an additional re-render, this regression is expected.* +- **Render duration**: A performance regression will occur if the measured rendering time is 20% higher than the baseline, resulting in a failed test. This threshold allows for reasonable fluctuations and accounts for changes that may lead to longer rendering times. + + +## Tips for Performance Testing with Reassure + +- Before you start using Reassure, take a bit of time to learn what it does [docs](https://callstack.github.io/reassure/). +- Mocking could initially be challenging, but preparing utilities for handling collections of data simplified the process. We’ve already prepared base for utilities for mocking collections. More information and instructions can be found [here](https://github.com/Expensify/App/tree/main/tests#mocking-collections--collection-items). *Feel free to adjust/ add more helper methods if you think it’s needed*. +- Mocking is a crucial part of performance testing. To achieve more accurate and meaningful results, mock and use as much data as possible. +- Inside each test, there is a defined scenario function that represents the specific user interaction you want to measure (HINT: there is no need to add assertions in performance tests). +- More runs generally lead to better and more reliable results by averaging out variations. Additionally, consider adjusting the number of runs per series for each specific test to achieve more granular insights. + +## What can be tested (scenarios) + +- Rendering lists with multiple items (try adding as much data as possible to get a more reliable result when it comes to potential regressions). +- Scrolling performance. +- onPress/onSelect action. +- Text input interactions. + +## Example Test + +```javascript +test('Count increments on press', async () => { + const scenario = async (screen) => { + const button = screen.getByText('Action'); + + await screen.findByText('Count: 0'); + fireEvent.press(button); + fireEvent.press(button); + await screen.findByText('Count: 2'); + }; + + await measurePerformance( + , + { scenario, runs: 20 } + ); +}); +```