Skip to content

Commit

Permalink
feat: add localization evaluation (autowarefoundation#611)
Browse files Browse the repository at this point in the history
* feat: add localization evaluation

Signed-off-by: Melike Tanrıkulu <[email protected]>
  • Loading branch information
meliketanrikulu authored Oct 2, 2024
1 parent 934644b commit 68ee545
Show file tree
Hide file tree
Showing 4 changed files with 283 additions and 9 deletions.
1 change: 1 addition & 0 deletions docs/tutorials/.pages
Original file line number Diff line number Diff line change
Expand Up @@ -2,3 +2,4 @@ nav:
- index.md
- ad-hoc-simulation
- scenario-simulation
- components_evaluation
5 changes: 5 additions & 0 deletions docs/tutorials/components_evaluation/index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
# Localization Evaluation

!!! warning

Under Construction
Original file line number Diff line number Diff line change
@@ -0,0 +1,252 @@
### Introduction

#### Related Links

Data Collection Documentation --> <https://autowarefoundation.github.io/autoware-documentation/main/datasets/#istanbul-open-dataset>

#### Purpose

The purpose of this test is to see the performance of the current NDT based default autoware localization system in an urban environment and evaluate its results. Our expectation is to see the deficiencies of the localization and understand which situations we need to improve.

#### Test Environment

The test data was collected in Istanbul. It also includes some scenarios that may be challenging for localization, such as tunnels and bridges. You can find the entire route marked in the image below.
![ist_dataset_route_3_resized](https://github.com/user-attachments/assets/e04c77b4-5192-4faf-839c-12429138708f)

#### Test Dataset & Map

Test and Mapping datasets are same and the dataset contains data from the portable mapping kit used for general mapping purposes.

The data contains data from the following sensors:

- 1 x Applanix POS LVX GNSS/INS System
- 1 x Hesai Pandar XT32 LiDAR

You can find the data collected for testing and mapping in this [Documentation](https://autowarefoundation.github.io/autoware-documentation/main/datasets/#istanbul-open-dataset).

> <span style="color:green">**NOTE !**</span> </br>
> Since there was no velocity source coming from the vehicle during all these tests, the twist message coming from GNSS/INS was given to ekf_localizer as the linear&angular velocity source. </br>
> In order to understand whether this increases the error in cases where the GNSS/INS error increases in the tunnel and how it affects the system, localization in the tunnel was tested by giving only the pose from the NDT, without giving this velocity to ekf_localizer. </br>
> The video of this test is [here](https://www.youtube.com/watch?v=ajgedIwwuaM). </br>
> As seen in the video, when velocity is not given, localization in the tunnel deteriorates more quickly.
> It is also predicted that if the IMU Twist message combined (/localization/twist_estimator/twist_with_covariance) with the linear velocity from the vehicle is given instead of the GNSS/INS Twist message, the performance in the tunnel will increase. However, this test cannot be done with the current data.
#### Expected Tests

1.) Firstly, it is aimed to detect the points where the localization is completely broken and to roughly control the localization.

2.) By extracting the metrics, it is aimed to see concretely how much the localization error has increased and what the level of performance is.

### How to Reproduce Tests

#### Test With Raw Data

If you test with raw data, you need to follow these `Test With Raw Data` instructions. Since you need to repeat all preprocessing operations on the input data in this test, you need to test by switching to the test branches in the sensor kit and individual params repos. You can perform these steps by following the instructions below.

##### Installation

1.) Download and unpack a test map files.

- You can also download [the map](https://drive.google.com/file/d/1WPWmFCjV7eQee4kyBpmGNlX7awerCPxc/view?usp=drive_link) manually.

```bash
mkdir ~/autoware_ista_map
gdown --id 1WPWmFCjV7eQee4kyBpmGNlX7awerCPxc -O ~/autoware_ista_map/
```

> <span style="color:green">**NOTE !**</span></br>
> You also need to add `lanelet2_map.osm` file to autoware_ista_map folder. Since no lanelet file is created for this map at the
> moment, you can run any `lanelet2_map.osm` file by placing it in this folder.
2.) Download the test rosbag files.

- You can also download [the rosbag file](https://drive.google.com/drive/folders/1BMPcUhjq_BCLi521X88WpujoOiEi3_CJ?usp=drive_link) manually.

```bash
mkdir ~/autoware_ista_data
gdown --id 1uta5Xr_ftV4jERxPNVqooDvWerK0dn89 -O ~/autoware_ista_data/
```

##### Prepare Autoware to Test

1.) Checkout autoware_launch:

```bash
cd ~/autoware/src/launcher/autoware_launch/
git remote add autoware_launch https://github.com/meliketanrikulu/autoware_launch.git
git remote update
git checkout evaluate_localization_issue_7652
```

2.) Checkout individual_params:

```bash
cd ~/autoware/src/param/autoware_individual_params/
git remote add autoware_individual_params https://github.com/meliketanrikulu/autoware_individual_params.git
git remote update
git checkout evaluate_localization_issue_7652
```

3.) Checkout sample_sensor_kit_launch:

```bash
cd ~/autoware/src/sensor_kit/sample_sensor_kit_launch/
git remote add sample_sensor_kit_launch https://github.com/meliketanrikulu/sample_sensor_kit_launch.git
git remote update
git checkout evaluate_localization_issue_7652
```

4.) Compile updated packages:

```bash
cd ~/autoware
colcon build --symlink-install --packages-select sample_sensor_kit_launch autoware_individual_params autoware_launch common_sensor_launch
```

##### Launch Autoware

```bash
source ~/autoware/install/setup.bash
ros2 launch autoware_launch logging_simulator.launch.xml map_path:=~/autoware_ista_map/ vehicle_model:=sample_vehicle sensor_model:=sample_sensor_kit
```

##### Run Rosbag

```bash
source ~/autoware/install/setup.bash
ros2 bag play ~/autoware_ista_data/rosbag2_2024_09_11-17_53_54_0.db3
```

#### Localization Test Only

If you only want to see the localization performance, follow the `Localization Test Only` instructions. For those who only want to perform localization tests, a second test bag file and a separate launch file have been created for this test. You can perform this test by following the instructions below.

##### Installation

1.) Download and unpack a test map files.

- You can also download [the map](https://drive.google.com/file/d/1WPWmFCjV7eQee4kyBpmGNlX7awerCPxc/view?usp=drive_link) manually.

```bash
mkdir ~/autoware_ista_map
gdown --id 1WPWmFCjV7eQee4kyBpmGNlX7awerCPxc -O ~/autoware_ista_map/
```

> <span style="color:green">**NOTE !**</span></br>
> You also need to add `lanelet2_map.osm` file to autoware_ista_map folder. Since no lanelet file is created for this map at the
> moment, you can run any `lanelet2_map.osm` file by placing it in this folder.
2.) Download the test rosbag files.

- You can also download [the localization rosbag file](https://drive.google.com/file/d/1yEB5j74gPLLbkkf87cuCxUgHXTkgSZbn/view?usp=sharing) manually.

```bash
mkdir ~/autoware_ista_data
gdown --id 1yEB5j74gPLLbkkf87cuCxUgHXTkgSZbn -O ~/autoware_ista_data/
```

##### Prepare Autoware to Test

1.) Checkout autoware_launch:

```bash
cd ~/autoware/src/launcher/autoware_launch/
git remote add autoware_launch https://github.com/meliketanrikulu/autoware_launch.git
git remote update
git checkout evaluate_localization_issue_7652
```

2.) Compile updated packages:

```bash
cd ~/autoware
colcon build --symlink-install --packages-select autoware_launch
```

##### Launch Autoware

```bash
source ~/autoware/install/setup.bash
ros2 launch autoware_launch urban_environment_localization_test.launch.xml map_path:=~/autoware_ista_map/
```

##### Run Rosbag

```bash
source ~/autoware/install/setup.bash
ros2 bag play ~/autoware_ista_data/rosbag2_2024_09_12-14_59_58_0.db3
```

### Test Results

#### Test 1: Simple Test of Localization

Simply put, how localization works in an urban environment was tested and the results were recorded on video.
Here is the [test video](https://youtu.be/Bk4Oyk6FOg0?t=6).
[<img src="https://github.com/user-attachments/assets/e0f7edbf-0596-4806-8dcd-b4156584e4c0" width="60%">](https://youtu.be/Bk4Oyk6FOg0?t=6")

We can see from video that there is a localization error in the longitudinal axis along the Eurasia tunnel. As expected, NDT based localization does not work properly here. However, since the NDT score cannot detect the distortion here, localization is not completely broken until the end of the tunnel. Localization is completely broken at the exit of the tunnel. I re-initialized the localization after vehicle exit the tunnel.

From this point on, we move on to the bridge scenario. This is one of the bridges connecting the Bosphorus and was the longest bridge on our route. Here too, we thought that NDT-based localization might be disrupted. However, I did not observe any disruption.

After this part there is another tunnel(Kagithane - Bomonti) and localization behaves similar to Eurasia tunnel. However, at the exit of this tunnel, it recovers localization on its own without the need for re-initialization.

##### Test 1 Summary

In summary, the places where there was visible deterioration with our test route were the tunnels. This was already an expected situation. Apart from this, we anticipated that we could have problems on the bridges, but I could not observe any deterioration in the localization on the bridges. Of course, it is useful to remind you at this point that these tests were conducted with a single data.

#### Test 2: Comparing Results with Ground Truth

1.) How NDT score changed ?

Along the route, only in a small section of the longest tunnel, the Eurasia Tunnel, did the NDT score remain below the expected values. However, there is a visible deterioration in localization in both tunnels on the route. As a result of this test, we saw that we cannot test the problems with the NDT score. You can see how the NDT score changes along the route in the image below. While looking at the visual, let's remember that the NDT score threshold is accepted as 2.3.

![ndt_nvtl drawio](https://github.com/user-attachments/assets/d2fcd062-856b-4dce-bd4a-8a1f49b835f4)

NDT Score (Nearest Voxel Transformation Likelihood) Threshold = 2.3

2.) Compare with Ground Truth

Ground Truth : In these tests, the post-processed GNSS / INS data was used as ground truth. Since the error of this ground truth data also decreases in the tunnel environment, it is necessary to evaluate these regions by taking into account the Ground Truth error.

During these tests, I compared the NDT and EKF exposures with Ground Truth and presented the results. I am sharing the test results below as png. However, if you want to examine this data in more detail, I have created an executable file for you to visualize and take a closer look at. You can access this executable file from [here](https://drive.google.com/drive/folders/145QXl6wfV7IB9NS-PzFQtsNnxnAn6a9o?usp=sharing). Currently there is only a version that works on Ubuntu at this link, but I plan to add it for Windows as well.
You need to follow these steps:

```bash
cd /your/path/show_evaluation_ubuntu
./pose_main
```

You can also update configurations with changing /configs/evaluation_pose.yaml

2d Trajectory :
![2D_Trajectory drawio](https://github.com/user-attachments/assets/4eebd34f-dc87-4a2d-a4f9-07f068a7d8d7)

2D Error :
![2d_error](https://github.com/user-attachments/assets/01f1a5a0-6fbd-4ae2-aa86-4ce7b5cdf141)

3D Trajectory:
![3d_trajectory](https://github.com/user-attachments/assets/310b5d5f-430b-4f7a-b4fd-93063fc2ea8b)

3D Error:
![3d_error](https://github.com/user-attachments/assets/6ab4a1cf-ad69-4847-8747-26efb3e9ce8e)

Lateral Error:

![lateral_error](https://github.com/user-attachments/assets/50ffba9e-b1c2-4407-90bc-002886540f9a)

Longitudinal Error:
![longitudinal_error](https://github.com/user-attachments/assets/617ca024-7c29-4556-bb62-c1e54b19fd38)

Roll-Pitch-Yaw:
![rpy](https://github.com/user-attachments/assets/68eef655-189a-4552-a99c-ab89df2136af)

Roll-Pitch-Yaw Error:
![rpy_error](https://github.com/user-attachments/assets/071cd440-d64a-49a8-bf0a-86278e174048)

X-Y-Z:
![xyz](https://github.com/user-attachments/assets/6d946f18-98f7-4297-bb1b-fad936fd1328)

X-Y-Z Error:

![xyz_error](https://github.com/user-attachments/assets/612559cc-a973-4096-9a76-3e27d87539fd)
34 changes: 25 additions & 9 deletions docs/tutorials/index.md
Original file line number Diff line number Diff line change
@@ -1,32 +1,34 @@
# Simulation tutorials
# Tutorials

## Simulation tutorials

Simulations provide a way of verifying Autoware's functionality before field testing with an actual vehicle.
There are three main types of simulation that can be run ad hoc or via a scenario runner.

## Simulation methods
### Simulation methods

### Ad hoc simulation
#### Ad hoc simulation

Ad hoc simulation is a flexible method for running basic simulations on your local machine, and is the recommended method for anyone new to Autoware.

### Scenario simulation
#### Scenario simulation

Scenario simulation uses a scenario runner to run more complex simulations based on predefined scenarios.
It is often run automatically for continuous integration purposes, but can also be run on a local machine.

## Simulation types
### Simulation types

### Planning simulation
#### Planning simulation

Planning simulation uses simple dummy data to test the Planning and Control components - specifically path generation, path following and obstacle avoidance. It verifies that a vehicle can reach a goal destination while avoiding pedestrians and surrounding cars, and is another method for verifying the validity of Lanelet2 maps. It also allows for testing of traffic light handling.

#### How does planning simulation work?
##### How does planning simulation work?

1. Generate a path to the goal destination
2. Control the car along the generated path
3. Detect and avoid any humans or other vehicles on the way to the goal destination

### Rosbag replay simulation
#### Rosbag replay simulation

Rosbag replay simulation uses prerecorded rosbag data to test the following aspects of the Localization and Perception components:

Expand All @@ -35,6 +37,20 @@ Rosbag replay simulation uses prerecorded rosbag data to test the following aspe

By repeatedly playing back the data, this simulation type can also be used for endurance testing.

### Digital twin simulation
#### Digital twin simulation

Digital twin simulation is a simulation type that is able to produce realistic data and simulate almost the entire system. It is also commonly referred to as end-to-end simulation.

## Evaluation Tutorials

### Components Evaluation

Components evaluation tutorials provide a way to evaluate the performance of Autoware's components in a controlled environment.

#### Localization Evaluation

The Localization Evaluation tutorial provides a way to evaluate the performance of the Localization component.

##### Urban Environment Evaluation

The Urban Environment Evaluation tutorial provides a way to evaluate the performance of the Localization component in urban environment. It uses the [Istanbul Open Dataset](https://autowarefoundation.github.io/autoware-documentation/main/datasets/#istanbul-open-dataset) for testing. Test data, map, test steps are included. Test results are also shared.

0 comments on commit 68ee545

Please sign in to comment.