Skip to content

Commit

Permalink
Update VisionNode_CodeSummary.md
Browse files Browse the repository at this point in the history
  • Loading branch information
Lukasnol authored Nov 3, 2024
1 parent 629e68f commit 60c758d
Showing 1 changed file with 2 additions and 4 deletions.
6 changes: 2 additions & 4 deletions doc/research/paf24/perception/VisionNode_CodeSummary.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,14 +23,14 @@ The `VisionNode` class is designed to perform object detection and segmentation
- [Minimum X and Y Calculations](#minimum-x-and-y-calculations)
- [Models](#models)


## Overview

The `VisionNode` class leverages both PyTorch and Ultralytics pretrained models for object detection and segmentation, integrating features such as bounding boxes, segmentation masks, and distance calculations based on LIDAR data.

## Class Initialization

Upon instantiation, `VisionNode` initializes various configurations, such as:

- **Model Dictionary** (`self.model_dict`): Holds model configurations with different detection and segmentation models from PyTorch and Ultralytics.
- **Camera Configurations**: Subscribes to topics for front, rear, left, and right camera views if specified.
- **Device Selection**: Automatically selects `CUDA` if available.
Expand Down Expand Up @@ -92,10 +92,9 @@ These functions subscribe to the camera topics, allowing the node to receive ima
- **`min_x`**: Calculates the minimum x-distance for objects in LIDAR data, representing the closest object in the forward direction.
- **`min_abs_y`**: Calculates the minimum y-distance in absolute terms, representing the closest object sideways.


## Models

Following there will be a short overview of the used machine learning and computer vision models.
Following there will be a short overview of the used machine learning and computer vision models.

| Model | Techniques | Features | Description |
| -------------------------------- | --------------------------------- | ----------------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------- |
Expand All @@ -115,4 +114,3 @@ Following there will be a short overview of the used machine learning and comput
| yolov8x-seg | YOLO (Segmentation) | Only detection without segmentation. | Extension of YOLO for pixel-precise object detection. |
| sam_l | SAM | High accuracy in segmentation for universal applications. | Universal segmentation using SAM technique. |
| FastSAM-x | Fast SAM | Faster variant for real-time application requirements. | Fast segmentation using an accelerated SAM model. |

0 comments on commit 60c758d

Please sign in to comment.