feat: add localization evaluation (autowarefoundation#611)

* feat: add localization evaluation Signed-off-by: Melike Tanrıkulu <[email protected]>
tier4 · Oct 2, 2024 · 68ee545 · 68ee545
1 parent 934644b
commit 68ee545
Show file tree

Hide file tree

Showing 4 changed files with 283 additions and 9 deletions.
diff --git a/docs/tutorials/.pages b/docs/tutorials/.pages
@@ -2,3 +2,4 @@ nav:
   - index.md
   - ad-hoc-simulation
   - scenario-simulation
+  - components_evaluation
diff --git a/docs/tutorials/components_evaluation/index.md b/docs/tutorials/components_evaluation/index.md
@@ -0,0 +1,5 @@
+# Localization Evaluation
+
+!!! warning
+
+    Under Construction
diff --git a/...s/components_evaluation/localization_evaluation/urban-environment-evaluation.md b/...s/components_evaluation/localization_evaluation/urban-environment-evaluation.md
@@ -0,0 +1,252 @@
+### Introduction
+
+#### Related Links
+
+Data Collection Documentation --> <https://autowarefoundation.github.io/autoware-documentation/main/datasets/#istanbul-open-dataset>
+
+#### Purpose
+
+The purpose of this test is to see the performance of the current NDT based default autoware localization system in an urban environment and evaluate its results. Our expectation is to see the deficiencies of the localization and understand which situations we need to improve.
+
+#### Test Environment
+
+The test data was collected in Istanbul. It also includes some scenarios that may be challenging for localization, such as tunnels and bridges. You can find the entire route marked in the image below.
+![ist_dataset_route_3_resized](https://github.com/user-attachments/assets/e04c77b4-5192-4faf-839c-12429138708f)
+
+#### Test Dataset & Map
+
+Test and Mapping datasets are same and the dataset contains data from the portable mapping kit used for general mapping purposes.
+
+The data contains data from the following sensors:
+
+- 1 x Applanix POS LVX GNSS/INS System
+- 1 x Hesai Pandar XT32 LiDAR
+
+You can find the data collected for testing and mapping in this [Documentation](https://autowarefoundation.github.io/autoware-documentation/main/datasets/#istanbul-open-dataset).
+
+> <span style="color:green">**NOTE !**</span> </br>  
+> Since there was no velocity source coming from the vehicle during all these tests, the twist message coming from GNSS/INS was given to ekf_localizer as the linear&angular velocity source. </br>
+> In order to understand whether this increases the error in cases where the GNSS/INS error increases in the tunnel and how it affects the system, localization in the tunnel was tested by giving only the pose from the NDT, without giving this velocity to ekf_localizer. </br>  
+> The video of this test is [here](https://www.youtube.com/watch?v=ajgedIwwuaM). </br>
+> As seen in the video, when velocity is not given, localization in the tunnel deteriorates more quickly.
+> It is also predicted that if the IMU Twist message combined (/localization/twist_estimator/twist_with_covariance) with the linear velocity from the vehicle is given instead of the GNSS/INS Twist message, the performance in the tunnel will increase. However, this test cannot be done with the current data.
+
+#### Expected Tests
+
+1.) Firstly, it is aimed to detect the points where the localization is completely broken and to roughly control the localization.
+
+2.) By extracting the metrics, it is aimed to see concretely how much the localization error has increased and what the level of performance is.
+
+### How to Reproduce Tests
+
+#### Test With Raw Data
+
+If you test with raw data, you need to follow these `Test With Raw Data` instructions. Since you need to repeat all preprocessing operations on the input data in this test, you need to test by switching to the test branches in the sensor kit and individual params repos. You can perform these steps by following the instructions below.
+
+##### Installation
+
+1.) Download and unpack a test map files.
+
+- You can also download [the map](https://drive.google.com/file/d/1WPWmFCjV7eQee4kyBpmGNlX7awerCPxc/view?usp=drive_link) manually.
+
+```bash
+mkdir ~/autoware_ista_map
+gdown --id 1WPWmFCjV7eQee4kyBpmGNlX7awerCPxc -O ~/autoware_ista_map/
+```
+
+> <span style="color:green">**NOTE !**</span></br>  
+> You also need to add `lanelet2_map.osm` file to autoware_ista_map folder. Since no lanelet file is created for this map at the
+> moment, you can run any `lanelet2_map.osm` file by placing it in this folder.
+
+2.) Download the test rosbag files.
+
+- You can also download [the rosbag file](https://drive.google.com/drive/folders/1BMPcUhjq_BCLi521X88WpujoOiEi3_CJ?usp=drive_link) manually.
+
+```bash
+mkdir ~/autoware_ista_data
+gdown --id 1uta5Xr_ftV4jERxPNVqooDvWerK0dn89 -O ~/autoware_ista_data/
+```
+
+##### Prepare Autoware to Test
+
+1.) Checkout autoware_launch:
+
+```bash
+cd ~/autoware/src/launcher/autoware_launch/
+git remote add autoware_launch https://github.com/meliketanrikulu/autoware_launch.git
+git remote update
+git checkout evaluate_localization_issue_7652
+```
+
+2.) Checkout individual_params:
+
+```bash
+cd ~/autoware/src/param/autoware_individual_params/
+git remote add autoware_individual_params https://github.com/meliketanrikulu/autoware_individual_params.git
+git remote update
+git checkout evaluate_localization_issue_7652
+```
+
+3.) Checkout sample_sensor_kit_launch:
+
+```bash
+cd ~/autoware/src/sensor_kit/sample_sensor_kit_launch/
+git remote add sample_sensor_kit_launch https://github.com/meliketanrikulu/sample_sensor_kit_launch.git
+git remote update
+git checkout evaluate_localization_issue_7652
+```
+
+4.) Compile updated packages:
+
+```bash
+cd ~/autoware
+colcon build --symlink-install --packages-select sample_sensor_kit_launch autoware_individual_params autoware_launch common_sensor_launch
+```
+
+##### Launch Autoware
+
+```bash
+source ~/autoware/install/setup.bash
+ros2 launch autoware_launch logging_simulator.launch.xml map_path:=~/autoware_ista_map/ vehicle_model:=sample_vehicle sensor_model:=sample_sensor_kit
+```
+
+##### Run Rosbag
+
+```bash
+source ~/autoware/install/setup.bash
+ros2 bag play ~/autoware_ista_data/rosbag2_2024_09_11-17_53_54_0.db3
+```
+
+#### Localization Test Only
+
+If you only want to see the localization performance, follow the `Localization Test Only` instructions. For those who only want to perform localization tests, a second test bag file and a separate launch file have been created for this test. You can perform this test by following the instructions below.
+
+##### Installation
+
+1.) Download and unpack a test map files.
+
+- You can also download [the map](https://drive.google.com/file/d/1WPWmFCjV7eQee4kyBpmGNlX7awerCPxc/view?usp=drive_link) manually.
+
+```bash
+mkdir ~/autoware_ista_map
+gdown --id 1WPWmFCjV7eQee4kyBpmGNlX7awerCPxc -O ~/autoware_ista_map/
+```
+
+> <span style="color:green">**NOTE !**</span></br>  
+> You also need to add `lanelet2_map.osm` file to autoware_ista_map folder. Since no lanelet file is created for this map at the
+> moment, you can run any `lanelet2_map.osm` file by placing it in this folder.
+
+2.) Download the test rosbag files.
+
+- You can also download [the localization rosbag file](https://drive.google.com/file/d/1yEB5j74gPLLbkkf87cuCxUgHXTkgSZbn/view?usp=sharing) manually.
+
+```bash
+mkdir ~/autoware_ista_data
+gdown --id 1yEB5j74gPLLbkkf87cuCxUgHXTkgSZbn -O ~/autoware_ista_data/
+```
+
+##### Prepare Autoware to Test
+
+1.) Checkout autoware_launch:
+
+```bash
+cd ~/autoware/src/launcher/autoware_launch/
+git remote add autoware_launch https://github.com/meliketanrikulu/autoware_launch.git
+git remote update
+git checkout evaluate_localization_issue_7652
+```
+
+2.) Compile updated packages:
+
+```bash
+cd ~/autoware
+colcon build --symlink-install --packages-select autoware_launch
+```
+
+##### Launch Autoware
+
+```bash
+source ~/autoware/install/setup.bash
+ros2 launch autoware_launch urban_environment_localization_test.launch.xml map_path:=~/autoware_ista_map/
+```
+
+##### Run Rosbag
+
+```bash
+source ~/autoware/install/setup.bash
+ros2 bag play ~/autoware_ista_data/rosbag2_2024_09_12-14_59_58_0.db3
+```
+
+### Test Results
+
+#### Test 1: Simple Test of Localization
+
+Simply put, how localization works in an urban environment was tested and the results were recorded on video.
+Here is the [test video](https://youtu.be/Bk4Oyk6FOg0?t=6).
+[<img src="https://github.com/user-attachments/assets/e0f7edbf-0596-4806-8dcd-b4156584e4c0" width="60%">](https://youtu.be/Bk4Oyk6FOg0?t=6")
+
+We can see from video that there is a localization error in the longitudinal axis along the Eurasia tunnel. As expected, NDT based localization does not work properly here. However, since the NDT score cannot detect the distortion here, localization is not completely broken until the end of the tunnel. Localization is completely broken at the exit of the tunnel. I re-initialized the localization after vehicle exit the tunnel.
+
+From this point on, we move on to the bridge scenario. This is one of the bridges connecting the Bosphorus and was the longest bridge on our route. Here too, we thought that NDT-based localization might be disrupted. However, I did not observe any disruption.
+
+After this part there is another tunnel(Kagithane - Bomonti) and localization behaves similar to Eurasia tunnel. However, at the exit of this tunnel, it recovers localization on its own without the need for re-initialization.
+
+##### Test 1 Summary
+
+In summary, the places where there was visible deterioration with our test route were the tunnels. This was already an expected situation. Apart from this, we anticipated that we could have problems on the bridges, but I could not observe any deterioration in the localization on the bridges. Of course, it is useful to remind you at this point that these tests were conducted with a single data.
+
+#### Test 2: Comparing Results with Ground Truth
+
+1.) How NDT score changed ?
+
+Along the route, only in a small section of the longest tunnel, the Eurasia Tunnel, did the NDT score remain below the expected values. However, there is a visible deterioration in localization in both tunnels on the route. As a result of this test, we saw that we cannot test the problems with the NDT score. You can see how the NDT score changes along the route in the image below. While looking at the visual, let's remember that the NDT score threshold is accepted as 2.3.
+
+![ndt_nvtl drawio](https://github.com/user-attachments/assets/d2fcd062-856b-4dce-bd4a-8a1f49b835f4)
+
+NDT Score (Nearest Voxel Transformation Likelihood) Threshold = 2.3
+
+2.) Compare with Ground Truth
+
+Ground Truth : In these tests, the post-processed GNSS / INS data was used as ground truth. Since the error of this ground truth data also decreases in the tunnel environment, it is necessary to evaluate these regions by taking into account the Ground Truth error.
+
+During these tests, I compared the NDT and EKF exposures with Ground Truth and presented the results. I am sharing the test results below as png. However, if you want to examine this data in more detail, I have created an executable file for you to visualize and take a closer look at. You can access this executable file from [here](https://drive.google.com/drive/folders/145QXl6wfV7IB9NS-PzFQtsNnxnAn6a9o?usp=sharing). Currently there is only a version that works on Ubuntu at this link, but I plan to add it for Windows as well.
+You need to follow these steps:
+
+```bash
+cd /your/path/show_evaluation_ubuntu
+./pose_main
+```
+
+You can also update configurations with changing /configs/evaluation_pose.yaml
+
+2d Trajectory :
+![2D_Trajectory drawio](https://github.com/user-attachments/assets/4eebd34f-dc87-4a2d-a4f9-07f068a7d8d7)
+
+2D Error :
+![2d_error](https://github.com/user-attachments/assets/01f1a5a0-6fbd-4ae2-aa86-4ce7b5cdf141)
+
+3D Trajectory:
+![3d_trajectory](https://github.com/user-attachments/assets/310b5d5f-430b-4f7a-b4fd-93063fc2ea8b)
+
+3D Error:
+![3d_error](https://github.com/user-attachments/assets/6ab4a1cf-ad69-4847-8747-26efb3e9ce8e)
+
+Lateral Error:
+
+![lateral_error](https://github.com/user-attachments/assets/50ffba9e-b1c2-4407-90bc-002886540f9a)
+
+Longitudinal Error:
+![longitudinal_error](https://github.com/user-attachments/assets/617ca024-7c29-4556-bb62-c1e54b19fd38)
+
+Roll-Pitch-Yaw:
+![rpy](https://github.com/user-attachments/assets/68eef655-189a-4552-a99c-ab89df2136af)
+
+Roll-Pitch-Yaw Error:
+![rpy_error](https://github.com/user-attachments/assets/071cd440-d64a-49a8-bf0a-86278e174048)
+
+X-Y-Z:
+![xyz](https://github.com/user-attachments/assets/6d946f18-98f7-4297-bb1b-fad936fd1328)
+
+X-Y-Z Error:
+
+![xyz_error](https://github.com/user-attachments/assets/612559cc-a973-4096-9a76-3e27d87539fd)
diff --git a/docs/tutorials/index.md b/docs/tutorials/index.md
@@ -1,32 +1,34 @@
-# Simulation tutorials
+# Tutorials
+
+## Simulation tutorials
 
 Simulations provide a way of verifying Autoware's functionality before field testing with an actual vehicle.  
 There are three main types of simulation that can be run ad hoc or via a scenario runner.
 
-## Simulation methods
+### Simulation methods
 
-### Ad hoc simulation
+#### Ad hoc simulation
 
 Ad hoc simulation is a flexible method for running basic simulations on your local machine, and is the recommended method for anyone new to Autoware.
 
-### Scenario simulation
+#### Scenario simulation
 
 Scenario simulation uses a scenario runner to run more complex simulations based on predefined scenarios.
 It is often run automatically for continuous integration purposes, but can also be run on a local machine.
 
-## Simulation types
+### Simulation types
 
-### Planning simulation
+#### Planning simulation
 
 Planning simulation uses simple dummy data to test the Planning and Control components - specifically path generation, path following and obstacle avoidance. It verifies that a vehicle can reach a goal destination while avoiding pedestrians and surrounding cars, and is another method for verifying the validity of Lanelet2 maps. It also allows for testing of traffic light handling.
 
-#### How does planning simulation work?
+##### How does planning simulation work?
 
 1. Generate a path to the goal destination
 2. Control the car along the generated path
 3. Detect and avoid any humans or other vehicles on the way to the goal destination
 
-### Rosbag replay simulation
+#### Rosbag replay simulation
 
 Rosbag replay simulation uses prerecorded rosbag data to test the following aspects of the Localization and Perception components:
 
@@ -35,6 +37,20 @@ Rosbag replay simulation uses prerecorded rosbag data to test the following aspe
 
 By repeatedly playing back the data, this simulation type can also be used for endurance testing.
 
-### Digital twin simulation
+#### Digital twin simulation
 
 Digital twin simulation is a simulation type that is able to produce realistic data and simulate almost the entire system. It is also commonly referred to as end-to-end simulation.
+
+## Evaluation Tutorials
+
+### Components Evaluation
+
+Components evaluation tutorials provide a way to evaluate the performance of Autoware's components in a controlled environment.
+
+#### Localization Evaluation
+
+The Localization Evaluation tutorial provides a way to evaluate the performance of the Localization component.
+
+##### Urban Environment Evaluation
+
+The Urban Environment Evaluation tutorial provides a way to evaluate the performance of the Localization component in urban environment. It uses the [Istanbul Open Dataset](https://autowarefoundation.github.io/autoware-documentation/main/datasets/#istanbul-open-dataset) for testing. Test data, map, test steps are included. Test results are also shared.