Refactoring/Adding Documentation #813

Open
wants to merge 2 commits into master

Conversation


@travisdriver commented Oct 14, 2024

I'm trying to include some documentation on how we modularize the Structure-from-Motion problem to make it easier for users and contributors to get up to speed on GTSfM.

I tried to emulate Nerfstudio's documentation structure, but I'm open to suggestions.

Pages that need to be filled out:

@akshay-krishnan

It would be best to do this with multiple PRs, so we can first try to merge this and I will work on a multi-view optimizer page in parallel.

Global descriptor modules are implemented following the [`GlobalDescriptorBase`](https://github.com/borglab/gtsfm/blob/master/gtsfm/frontend/global_descriptor/global_descriptor_base.py) class and must be wrapped using a corresponding [`RetrieverBase`](https://github.com/borglab/gtsfm/blob/master/gtsfm/retriever/retriever_base.py) implementation. The global descriptor module takes in individual images and outputs their corresponding descriptors, while the retriever module takes these descriptors, computes image-pair similarity scores, and outputs the putative image pairs based on a specified threshold (see [`NetVLADGlobalDescriptor`](https://github.com/borglab/gtsfm/blob/master/gtsfm/frontend/global_descriptor/netvlad_global_descriptor.py) and [`NetVLADRetriever`](https://github.com/borglab/gtsfm/blob/master/gtsfm/retriever/netvlad_retriever.py)).

```python
class RetrieverBase(GTSFMProcess):
    ...  # abstract base class; see gtsfm/retriever/retriever_base.py for the full interface
```
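To make the wrapping relationship above concrete, here is a toy sketch of how a global descriptor module and a retriever could fit together. The class and method names below are illustrative assumptions, not the actual GTSfM interfaces; see the linked base classes for the real definitions.

```python
import numpy as np


class ToyGlobalDescriptor:
    """Maps a single image to one fixed-length global descriptor vector."""

    def describe(self, image: np.ndarray) -> np.ndarray:
        # A real module (e.g. NetVLAD) would run a network here; a normalized
        # intensity histogram stands in for the learned descriptor.
        hist, _ = np.histogram(image, bins=64, range=(0, 255), density=True)
        return hist / (np.linalg.norm(hist) + 1e-12)


class ToyRetriever:
    """Turns per-image global descriptors into putative image pairs."""

    def __init__(self, threshold: float = 0.8) -> None:
        self.threshold = threshold

    def get_image_pairs(self, descriptors: list[np.ndarray]) -> list[tuple[int, int]]:
        # Keep only the pairs whose descriptor similarity clears the threshold.
        pairs = []
        for i1 in range(len(descriptors)):
            for i2 in range(i1 + 1, len(descriptors)):
                similarity = float(np.dot(descriptors[i1], descriptors[i2]))
                if similarity >= self.threshold:
                    pairs.append((i1, i2))
        return pairs
```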
we should explain what a Retriever is if this is needed.


## What is a Correspondence Generator?

The Correspondence Generator is responsible for taking in putative image pairs from the [`ImagePairsGenerator`](https://github.com/borglab/gtsfm/blob/master/gtsfm/retriever/image_pairs_generator.py) and returning keypoints for each image and correspondences between each specified image pair. Correspondence generation is implemented by the [`CorrespondenceGeneratorBase`](https://github.com/borglab/gtsfm/blob/master/gtsfm/frontend/correspondence_generator/correspondence_generator_base.py) class defined below.
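The base class definition is not reproduced in this excerpt; as a rough orientation only, an interface of this shape might look like the sketch below (the class name and signature here are assumptions, not the actual GTSfM definitions in `correspondence_generator_base.py`).

```python
import abc

import numpy as np


class CorrespondenceGeneratorSketch(abc.ABC):
    """Illustrative stand-in for a correspondence generator interface."""

    @abc.abstractmethod
    def generate_correspondences(
        self,
        images: list[np.ndarray],
        image_pairs: list[tuple[int, int]],
    ) -> tuple[list[np.ndarray], dict[tuple[int, int], np.ndarray]]:
        """Return per-image keypoint arrays and, for each pair (i1, i2),
        an (N, 2) array of indices into the two images' keypoint lists."""
```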

nit: returning keypoints for each image and (their / keypoint) correspondences between each specified image pair.


## What is an Image Pair Generator?

The Image Pair Generator takes in images from the Loader and outputs putative image pairs for correspondence generation. Image pair generation is implemented by the [`ImagePairsGenerator`](https://github.com/borglab/gtsfm/blob/master/gtsfm/retriever/image_pairs_generator.py) class defined below, which wraps a specific [`Retriever`](https://github.com/borglab/gtsfm/blob/master/gtsfm/retriever/retriever_base.py) and, optionally, a [`GlobalDescriptor`](https://github.com/borglab/gtsfm/blob/master/gtsfm/frontend/global_descriptor/global_descriptor_base.py).
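Continuing the toy sketch from the retriever section above (again, names are assumptions for illustration rather than the actual GTSfM API), the wrapping could look like:

```python
import numpy as np


class ImagePairsGeneratorSketch:
    """Illustrative stand-in: wraps a retriever plus a global descriptor."""

    def __init__(self, retriever, global_descriptor) -> None:
        # In GTSfM the global descriptor is optional (e.g. a purely sequential
        # retriever needs no descriptors); this sketch assumes one is provided.
        self._retriever = retriever
        self._global_descriptor = global_descriptor

    def generate_image_pairs(self, images: list[np.ndarray]) -> list[tuple[int, int]]:
        # One global descriptor per image, then let the retriever threshold the
        # pairwise similarities into putative image pairs.
        descriptors = [self._global_descriptor.describe(image) for image in images]
        return self._retriever.get_image_pairs(descriptors)
```

For example, `ImagePairsGeneratorSketch(ToyRetriever(threshold=0.8), ToyGlobalDescriptor()).generate_image_pairs(images)` would return the index pairs passed on to correspondence generation (purely illustrative).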

It would be good to know what we assume from a reader of this documentation. It seems like they would already need some knowledge of SfM. If not, terms like "putative image pairs" are unclear. What makes a pair? (answer: view overlap, potential for keypoint correspondences).


## Global Image Descriptors

Global descriptors work similarly to local feature descriptors, except that these methods generate a single descriptor for each image. Distances between these global image descriptors can then be used as a metric for the expected "matchability" of image pairs during the correspondence generation phase, where a threshold can be used to reject potentially dissimilar image pairs before conducting correspondence generation. This reduces the likelihood of matching image pairs with little to no overlap, which could cause erroneous correspondences to be inserted into the back-end optimization, while also significantly reducing runtime compared to exhaustive matching.
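As a concrete toy illustration of the thresholding described above (the descriptors are random and the threshold value is an arbitrary assumption, not a GTSfM default):

```python
import numpy as np

rng = np.random.default_rng(0)
descriptors = rng.standard_normal((4, 128))                    # one 128-D descriptor per image
descriptors /= np.linalg.norm(descriptors, axis=1, keepdims=True)

similarity = descriptors @ descriptors.T                       # cosine similarity matrix
threshold = 0.2                                                # arbitrary, for illustration
i1_idx, i2_idx = np.where(np.triu(similarity, k=1) > threshold)
putative_pairs = list(zip(i1_idx.tolist(), i2_idx.tolist()))   # pairs kept for matching
print(putative_pairs)
```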
@akshay-krishnan commented Jan 12, 2025

nit: I don't see much of a similarity here; the descriptors are actually very different. To the point that I would say "unlike feature descriptors that learn a local descriptor for each image patch/pixel, global descriptors generate a single descriptor for each image". The difference seems more important than any similarity (which would only be in the model architecture).


a small but significant change: rotation averaging should occur before translation averaging.


this should be changed for all files.
