[DOC] Add individual estimator capabilities in the documentation #1430

baraline · 2024-04-16T08:05:48Z

Describe the issue linked to the documentation

Following #1242 and #1426, we would like to add the estimator capabilities in the API documentation. While an automatic way to do this would be best, another option is to manually add a table in the docstring of each estimator.

One way to (manually) achieve the result would be to use the notes sections of the class docstring as follows:

Notes
-----
   +-----------------+----------------+
   | Capability      |    Support     |
   +=================+================+
   | multivariate    |       ✗        |
   +-----------------+----------------+
   | missing values  |       ✓        |
   +-----------------+----------------+
   | unequal length  |       ✓        |
   +-----------------+----------------+

The value of these tags can be determined by calling the estimator.get_class_tags() .

Suggest a potential alternative/fix

No response

The text was updated successfully, but these errors were encountered:

inclinedadarsh · 2024-12-18T06:54:58Z

Hello @baraline,
this seems to be a pretty good issue to start with, I'd like to attempt it.

As much as I have understood, there are a lot of estimators in the package (coming from estimator overview page)

Taking BOSSEnsemble as an example, I'll have to add the following lines in this file in the NOTES section:

   +-----------------+----------------+
   | Capability      |    Support     |
   +=================+================+
   | missing values  |       ✗        |
   +-----------------+----------------+
   | multithreading  |       ✓        |
   +-----------------+----------------+
   | univariate      |       ✓        |
   +-----------------+----------------+
   | multivariate    |       ✗        |
   +-----------------+----------------+
   | unequal length  |       ✗        |
   +-----------------+----------------+
   | train estimat   |       ✓        |
   +-----------------+----------------+
   | contractable    |       ✗        |
   +-----------------+----------------+

Please let me if I'm missing something

Thank you!

baraline · 2024-12-18T07:27:42Z

Hi @inclinedadarsh, this is the idea, yes. I think we could default to only adding the tags that are set to True to avoid bloating the notes sections.

Although the ideal solution would be to have this done through a script like for the estimator page in #1426, as we have numerous estimators and their capabilities might change on rare occasions.

SebastianSchmidl · 2024-12-18T08:42:14Z

If we add only the true tags, then a table is not necessary and a simple list will do.

For anomaly detectors, we already have a list of capabilities in the form of a table:

aeon/aeon/anomaly_detection/_dwt_mlead.py

Lines 46 to 54 in 122e7f6

    
               .. list-table:: Capabilities 
        
                  :stub-columns: 1 
        
                  * - Input data format 
        
                    - univariate 
        
                  * - Output data format 
        
                    - anomaly scores 
        
                  * - Learning Type 
        
                    - unsupervised

This renders as seen here: https://www.aeon-toolkit.org/en/latest/api_reference/auto_generated/aeon.anomaly_detection.DWT_MLEAD.html#id4

This capabilities table does not include all kinds of tags and, thus, needs to be adapted or changed to the new format.

inclinedadarsh · 2024-12-18T10:54:58Z

Hi @inclinedadarsh, this is the idea, yes. I think we could default to only adding the tags that are set to True to avoid bloating the notes sections.

Although the ideal solution would be to have this done through a script like for the estimator page in #1426, as we have numerous estimators and their capabilities might change on rare occasions.

Okay @baraline
Here's what I understand --

I'll have to add a script in conf.py file just like someone did in #1426
The script should:

Get all the estimators
Get the capabilities of those estimators
Add capabilities of the estimators in their respective files

Let me know if I'm going in the wrong direction.

baraline · 2024-12-18T14:03:17Z

I'm not 100% sure how such a script would actually perform the insertion in the api docs files, but if we already have a format as the one presented by @SebastianSchmidl, we should follow this (e.g. the dwt_mlead example he gave) instead of the table I described in the original issue.

But yeah the script should loop through all estimators, get their _tags, create the table based on them, and insert it in the generated api doc files.

Note that i'm unsure if it is possible to do it like this, so you can explore a bit. But manually adding these tables for all estimators is kind of the last resort solution.

inclinedadarsh · 2024-12-18T15:43:53Z

Okay that works

I'll be experimenting a bit. Any other resource that aeon has used to achieve anything similar to this will be helpful.
Also, If we're only going to list the true tags, then we won't be needing a table. Will a simple list like this work?

**Capabilities**:
- Multi Threading
- Univariate
- Train Estimate

Which will render to:
Capabilities:

Multi Threading
Univariate
Train Estimate

If not this, then I'll be glad if @baraline @SebastianSchmidl can help me figure out what way we can present this.

Also @SebastianSchmidl I tried looking through the codebase to understand how that capabilities table in all the anomaly detection modules are added, but couldn't :(

Can you let me know if those tables are added manually or a script was used to do that?

baraline · 2024-12-18T16:47:49Z

You can copy the list format used in Sebastian example. And they added it manually in the mentioned estimators I'm afraid.

SebastianSchmidl · 2024-12-19T07:04:29Z

Yes, the capabilities-tables in the AD module were added manually. So, I don't have any other resources for adding this manually, I'm afraid.

inclinedadarsh · 2024-12-21T18:33:30Z

@baraline there are two ways we can write the script:

The script will add the table in the actual documentation and then the html will be generated, as a side-effect the table will remain in all the documentations (& will also be pushed to github)

The problem with this is, how will the program understand if the table that already exists in the files is correct or not, to fix this we can create it such that everytime documentation is generated, it simply removes the table if it exists and adds a new one

the script will first generate the html first, and then the table will be added

i would want it to happen this way, but I'm not sure if we can achieve that using Sphinx.

Please let me know how i should proceed

inclinedadarsh · 2024-12-21T18:38:16Z

Moreover, here's what im currently exploring:

I am writing a custom Sphinx, which basically --

calls the all_estimators() function
gets all the estimators
crawls all the modules using walk_packages function from pkgutil (all_estimators function also uses this`)
if the module name is similar to the estimators from all_estimators() function then get it's path and update the table there (by calling estimator.get_class_tags()

this is kinda slow because I'll be crawling through all the modules but this is the only way I can think of

something else I'll try after this is successful is that, i'll try to pass the file path of the estimator module directly from the all_estimators() function because it's already crawling all the modules once, we don't really need to do the same thing again

baraline · 2024-12-21T19:22:30Z

Hey,

I think a good start is looking at the sphinx event callback graph here to find at which step we could insert the tables into the docs.

Having to modify the GitHub sources is not ideal, but we indeed have to deal with the case where the table is already present in the estimator documentation, as it is the case for some anomaly detector. This will likely be done through some regular expression on the doc string.

Concerning the rest, I think you got it right, use the all estimator function and loop through them, checking the tags, the builded doc, and making the adjustement/additions needed. So I think we should use a callback after "build", so we can browse (and modify) the generated HTML before it is displayed.

inclinedadarsh · 2024-12-22T12:26:02Z

Aha, got it!

Thanks for this @baraline, searched for the event callback we need and figured out that's html-page-context
Using it I'm able to edit & build the context sources before building without affecting the actual source files.

Now I just need to figure out how to get the NOTES section and put our table there. Most probably using regex.

Also, assigning this issue to myself as I'm working on it.

inclinedadarsh · 2024-12-22T12:27:45Z

@aeon-actions-bot assign @inclinedadarsh

baraline added the documentation Improvements or additions to documentation label Apr 16, 2024

baraline mentioned this issue Apr 18, 2024

[DOC] Add estimator overview table with capabilities #1426

Merged

TonyBagnall added the good first issue Good for newcomers label Jun 8, 2024

aeon-actions-bot bot assigned inclinedadarsh Dec 22, 2024

aeon-actions-bot bot removed the good first issue Good for newcomers label Dec 22, 2024

inclinedadarsh linked a pull request Dec 22, 2024 that will close this issue

[ENH] Add sphinx event to add capability table to estimators' docs individually #2468

Open

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DOC] Add individual estimator capabilities in the documentation #1430

[DOC] Add individual estimator capabilities in the documentation #1430

baraline commented Apr 16, 2024

inclinedadarsh commented Dec 18, 2024 •

edited

Loading

baraline commented Dec 18, 2024 •

edited

Loading

SebastianSchmidl commented Dec 18, 2024

inclinedadarsh commented Dec 18, 2024

baraline commented Dec 18, 2024

inclinedadarsh commented Dec 18, 2024

baraline commented Dec 18, 2024

SebastianSchmidl commented Dec 19, 2024

inclinedadarsh commented Dec 21, 2024

inclinedadarsh commented Dec 21, 2024

baraline commented Dec 21, 2024

inclinedadarsh commented Dec 22, 2024

inclinedadarsh commented Dec 22, 2024

[DOC] Add individual estimator capabilities in the documentation #1430

[DOC] Add individual estimator capabilities in the documentation #1430

Comments

baraline commented Apr 16, 2024

Describe the issue linked to the documentation

Suggest a potential alternative/fix

inclinedadarsh commented Dec 18, 2024 • edited Loading

baraline commented Dec 18, 2024 • edited Loading

SebastianSchmidl commented Dec 18, 2024

inclinedadarsh commented Dec 18, 2024

baraline commented Dec 18, 2024

inclinedadarsh commented Dec 18, 2024

baraline commented Dec 18, 2024

SebastianSchmidl commented Dec 19, 2024

inclinedadarsh commented Dec 21, 2024

inclinedadarsh commented Dec 21, 2024

baraline commented Dec 21, 2024

inclinedadarsh commented Dec 22, 2024

inclinedadarsh commented Dec 22, 2024

inclinedadarsh commented Dec 18, 2024 •

edited

Loading

baraline commented Dec 18, 2024 •

edited

Loading