Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] Fix #708 #1025

Open
wants to merge 9 commits into
base: main
Choose a base branch
from

Conversation

NimaSarajpoor
Copy link
Collaborator

@NimaSarajpoor NimaSarajpoor commented Aug 6, 2024

See #708. This PR should address #708 (and #1011)

Each main checkbox below has at least one indented checkbox. The main checkbox represents a callee function with fastmath=True and an indented checkbox represents a caller function. If the caller function itself has fastmath=True flag as well, a star(*) is written next to its name. In such case, the callers of this function are represented in another checkbox set.


  • maap._compute_multi_p_norm

    • maap._maamp
  • core._sliding_dot_product

    • * core._p_norm_distance_profile
    • core._mass_distance_matrix
    • * scrump._compute_PI
    • tests/test_preceision.py::test_calculate_squared_distance
    • tests/test_core.py::test_njit_sliding_dot_product
  • core._p_norm_distance_profile

    • * scrammp._compute_PI
    • tests/test_core.py::test_p_norm_distance_profile
  • core._calculate_squared_distance_profile

    • * core.calculate_distance_profile
    • stomp._stomp
    • * mstump._compute_multi_D
    • * scrump._compute_PI
    • tests/test_core.py::test_calculate_squared_distance_profile
  • core.calculate_distance_profile

    • * core._mass
    • stumpi._update_egress
    • stumpi._update
    • tests/test_core.py::test_calculate_distance_profile
  • core._mass

    • core.mass
    • core._mass_distance_matrix
    • ostinato._across_series_nearest_neighbors
    • ostinato._ostinato
  • core._apply_exclusion_zone

    • * maap._compute_multi_p_norm
    • core.apply_exclusion_zone
    • * scraamp._compute_PI
    • * mstump._compute_multi_D
    • * scrump._compute_PI
  • stump._stump

    • stumped._dask_stumped
    • stumped._ray_stumped
    • stump.stump
    • scrump.update
  • stump._compute_diagonal

    • * stump._stump
  • * scraamp._compute_PI

    • * scraamp._prescraamp
  • scraamp._prescraamp

    • scraamp.prescraamp
    • scraamp._init_
  • mstump._compute_multi_D

    • mstump._mstump
  • scrump._compute_PI

    • * scrump._prescrump
  • scrump. _prescrump

    • scrump.prescrump
    • scrump._init_
  • aamp._compute_diagonal

    • * aamp._aamp
  • aamp._aamp

    • aamp.aamp
    • scraamp.update
    • aamped._dask_aamped
    • aamped._ray_aamped

NOTE: The functions core._count_diagonal_ndist and core._total_diagonal_ndists are ignored as their input parameters cannot have non-finite value.

Copy link

codecov bot commented Aug 6, 2024

Codecov Report

Attention: Patch coverage is 0% with 118 lines in your changes missing coverage. Please review.

Project coverage is 96.57%. Comparing base (79096a8) to head (f548907).
Report is 3 commits behind head on main.

Files with missing lines Patch % Lines
utils.py 0.00% 72 Missing ⚠️
njit_fastmath.py 0.00% 46 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1025      +/-   ##
==========================================
- Coverage   97.32%   96.57%   -0.75%     
==========================================
  Files          89       91       +2     
  Lines       14964    15145     +181     
==========================================
+ Hits        14563    14626      +63     
- Misses        401      519     +118     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@seanlaw
Copy link
Contributor

seanlaw commented Aug 6, 2024

@NimaSarajpoor Thank you for painstakingly going through all of the files/functions. I am wondering if we might be able to write a Python script to help notify us as to which functions are missing fastmath=True or should be some other fastmath variant? Or to warn us that an outer calling function has (incorrectly) fastmath=True but is calling an inner function that cannot take non-finite values (or produces nont-finite output values)? Is it worth investigating?

@NimaSarajpoor
Copy link
Collaborator Author

NimaSarajpoor commented Sep 11, 2024

@seanlaw
I think, as our first step, a scrip that just gives us that list above should be helpful. Basically, the script can search for fastmath=True, and for any of those functions, it finds all caller functions, and so on. I think the search (for each function) can stop once we reach an outer function whose outer function does not have fastmath flag.


I was wondering how one can accomplish this:

Or to warn us that an outer calling function has (incorrectly) fastmath=True but is calling an inner function that cannot take non-finite values (or produces nont-finite output values)? Is it worth investigating?

? This seems to be hard. Right? I mean... detecting that by just going through the function's docstring / body may not be a feasible solution. What do you think?

@seanlaw
Copy link
Contributor

seanlaw commented Sep 11, 2024

@NimaSarajpoor I don't know but I think we'll have to import ast, build up some abstract syntax tree, and then traverse the nodes accordingly

@NimaSarajpoor
Copy link
Collaborator Author

NimaSarajpoor commented Sep 16, 2024

[Update]
After some research, I came up with the following code that can get the callers and the callee(s) in a script.

import ast

def _visit_all_child_nodes(node, lst):
    for n in ast.iter_child_nodes(node):
        if isinstance(n, ast.Call):
            if isinstance(n.func, ast.Attribute):
                name = n.func.attr
            else:
                name = n.func.id
                
            lst.append(name)
            
        _visit_all_child_nodes(n, lst)


def visit_all_child_nodes(node):
    lst = []
    _visit_all_child_nodes(node, lst)

    return lst


def find_caller_and_callees(fpath):
    tree = ast.parse(open(fpath).read())
    
    out = {}   # caller: callee(s)
    for node in ast.walk(tree):
        if isinstance(node, ast.FunctionDef):
            out[node.name] = set(visit_all_child_nodes(node))
        
    return out

Note1: The if-block if isinstance(node, ast.FunctionDef): needs to be revised to only capture the functions with njit decorators.

Note2: I tried it on the following script:

# in file temp.py

def func1(x, y):
    return x + y

def func2(x, y, z):
    xy = func1(x, y)
    out = z + xy
    return z

And the output is correct:

out = find_caller_and_callees('./temp.py')

# out 
{'func1': set(), 'func2': {'func1'}}

And when I tried it for stumpy/stump.py, I noticed it captures ALL functions:

{
'_compute_diagonal': {'_shift_insert_at_index',
  'dot',
  'min',
  'njit',
  'range',
  'searchsorted',
  'uint64'},

 '_stump': {'_compute_diagonal',
  '_count_diagonal_ndist',
  '_get_array_ranges',
  '_merge_topk_ρI',
  'abs',
  'empty',
  'full',
  'njit',
  'prange',
  'range',
  'sqrt'},

 'stump': {'ValueError',
  '_check_P',
  '_stump',
  'arange',
  'ceil',
  'check_ignore_trivial',
  'check_window_size',
  'column_stack',
  'empty',
  'int',
  'min',
  'mparray',
  'non_normalized',
  'preprocess_diagonal'}
}

That's. a huge set. This will be reduced when I check if a callee is a njit-decorated function


@seanlaw
Do you think I am on the right track here?

@seanlaw
Copy link
Contributor

seanlaw commented Sep 16, 2024

@NimaSarajpoor I think that looks great. Here are a few things to consider:

  1. Maybe instead of lst = [], call it something like all_functions = []
  2. What happens to your script when the caller and callee chain are in different/multiple .py files?
  3. There seems to be a little bit of overlap between this code and the docstring.py code. It might be worth considering if we should combine them into a single .py file somehow? It feels like we are starting to accumulate scripts and so maybe there is a refactoring opportunity to do a utils.py. Just thinking out loud here.

@NimaSarajpoor
Copy link
Collaborator Author

  1. Maybe instead of lst = [], call it something like all_functions = []
  2. What happens to your script when the caller and callee chain are in different/multiple .py files?
  3. There seems to be a little bit of overlap between this code and the docstring.py code. It might be worth considering if we should combine them into a single .py file somehow? It feels like we are starting to accumulate scripts and so maybe there is a refactoring opportunity to do a utils.py. Just thinking out loud here.
  1. Yes! Better name indeed.

  2. The provided script captures the callee even if it is from a different script. But it does not create a chain. That part needs to be added. The problem with the output of the current script is that it does not show from what .py file it comes. For instance, _stump calls the function core._merge_topk_ρI. The output includes _merge_topk_ρI.

One solution is to create one "helper" dictionary where the key is function name, and the value is the name of its corresponding script. So, this can help us to follow the chain by going from caller to the callee, and then see what functions the callee calls, and so on.

  1. Noted. I will take a look at the docstring.py to just prep my mind for now. It might help me to borrow some idea from there as well.

@NimaSarajpoor
Copy link
Collaborator Author

NimaSarajpoor commented Sep 22, 2024

@seanlaw
I looked at docstring.py and borrowed the following parts to detect functions/methods :

stumpy/docstring.py

Lines 77 to 85 in b7b355c

file_contents = ""
with open(filepath, encoding="utf8") as f:
file_contents = f.read()
module = ast.parse(file_contents)
# Check Functions
function_definitions = [
node for node in module.body if isinstance(node, ast.FunctionDef)
]

stumpy/docstring.py

Lines 91 to 96 in b7b355c

# Check Class Methods
class_definitions = [
node for node in module.body if isinstance(node, ast.ClassDef)
]
for cd in class_definitions:
methods = [node for node in cd.body if isinstance(node, ast.FunctionDef)]


The new version of my script (provided below) goes through each file and returns a dictionary with the following keys:

  • func_names: list of function/method names defined in the file
  • is_njit: list of boolean values indicating whether the function is decorated with @njit
  • callees: list of sets of function names that are called by the function

After obtaining such dictionary for each file, the script combines the information, and do some processing. The final output is a nested dictionary. One item of this dictionary is provided below as instance:

# key is filename
file_name:  stump

# value is a dictionary with the following (key, value) pairs
func_names: ['_compute_diagonal', '_stump', 'stump']
is_njit: [True, True, False]
callees: [
{'core._shift_insert_at_index'}, 
{'stump._compute_diagonal', 'core._count_diagonal_ndist', 'core._merge_topk_ρI', 'core._get_array_ranges'},
{'stump._stump', 'core.non_normalized', 'core.preprocess_diagonal', 'core.check_window_size', 'core._check_P', 'core.check_ignore_trivial'}
]

Note 1: The callees only contain the functions that are defined in one of the .py files in ./stumpy/

Note 2: In the each set in callees, I used the format <file_name>.<callee_function> to just help us track easily where each callee comes from.

Code:

#!/usr/bin/env python

import ast
import pathlib
import re


def _get_callees(node, all_functions):
    for n in ast.iter_child_nodes(node):
        if isinstance(n, ast.Call):
            obj = n.func
            if isinstance(obj, ast.Attribute):
                name = obj.attr
            elif isinstance(obj, ast.Subscript):
                name = obj.value.id
            elif isinstance(obj, ast.Name):                
                name = obj.id
            else:
                msg = f"The type {type(obj)} is not supported"
                raise ValueError(msg)
            
            all_functions.append(name)
            
        _get_callees(n, all_functions)


def get_all_callees(node):
    """"
    Forr a given node of type ast.FunctionDef, visit all of its child nodes,
    and return a list of all callees
    """
    all_functions = []
    _get_callees(node, all_functions)

    return all_functions



def get_functions_metadata(filepath):
    """
    For a given filepath, return a dictionary with the following keys:
    - func_names: list of function/method names defined in the file
    - is_njit: list of boolean values indicating whether the function is decorated with `@njit`
    - callees: list of sets of function names that are called by the function
    """
    file_contents = ""
    with open(filepath, encoding="utf8") as f:
        file_contents = f.read()
    module = ast.parse(file_contents)

    
    function_definitions = []
    
    # include functions
    function_definitions.extend([
        node for node in module.body if isinstance(node, ast.FunctionDef) 
    ])

    # include methods of classes
    all_methods = []
    class_definitions = [
        node for node in module.body if isinstance(node, ast.ClassDef)
    ]
    for cd in class_definitions:
        methods = [node for node in cd.body if isinstance(node, ast.FunctionDef)]
        all_methods.extend(methods)
    
    function_definitions.extend(all_methods)

    
    # Create list of function/method names
    func_names = [None] * len(function_definitions)
    for i, fd in enumerate(function_definitions):
        func_names[i] = fd.name


    # Create list of flags indicating whether the function is decorated with `@njit`
    is_njit = [False] * len(function_definitions)
    for i, fd in enumerate(function_definitions):
        for decorator in fd.decorator_list:
            if not isinstance(decorator, ast.Call):
                continue

            obj = decorator.func
            if isinstance(obj, ast.Attribute):
                name = obj.attr
            elif isinstance(obj, ast.Subscript):
                name = obj.value.id
            elif isinstance(obj, ast.Name):                
                name = obj.id
            else:
                msg = f"The type {type(obj)} is not supported."
                raise ValueError(msg)

            if name == "njit":
                is_njit[i] = True
                break


    # Create list of sets of callees
    callees = [None] * len(function_definitions)
    for i, fd in enumerate(function_definitions):
        # walk all child nodes of `fd`
        callees[i] = set(get_all_callees(fd))
    

    out = {
        'func_names' : func_names,
        'is_njit' : is_njit,
        'callees' : callees
    }

    return out



ignore = ["__init__.py", "__pycache__"]

stumpy_path = pathlib.Path(__file__).parent / "stumpy"
filepaths = sorted(f for f in pathlib.Path(stumpy_path).iterdir() if f.is_file())

out = {}
collect_defined_funcs = []
for filepath in filepaths:
    name = filepath.name
    if name not in ignore and str(filepath).endswith(".py"):
        functions_data = get_functions_metadata(filepath)
        out[name] = functions_data
        
        n = name.replace('.py', '')
        collect_defined_funcs.extend([f"{n}.{f}" for f in functions_data['func_names']])            


collect_defined_funcs_nameonly = [item.split('.')[1] for item in collect_defined_funcs]

d = {}
for fname, func_data in out.items():
    file_name = fname.replace('.py', '')
    d[file_name] = {} 
    d[file_name]['func_names'] = func_data['func_names']
    d[file_name]['is_njit'] = func_data['is_njit']
    
    d[file_name]['callees'] = [] 
    for i, f in enumerate(func_data['callees']):
        s = set()
        for x in f:
            if x not in collect_defined_funcs_nameonly:
                continue
            
            if x in d[file_name]['func_names']:
                s.update([f"{file_name}.{x}"])
            else:
                idx = collect_defined_funcs_nameonly.index(x)
                s.update([collect_defined_funcs[idx]])
        
        d[file_name]['callees'].append(s)


# print content of the final output `d`
for fname, func_data in d.items():
    print('file_name: ', fname)
    for k, v in func_data.items():
        print(f'{k}: {v}')
    print('--------------------------------')

Next step is to revise the script above to check for fastmath flag as well when is_njit is True, and include that in the output.

@NimaSarajpoor
Copy link
Collaborator Author

Next step is to revise the script above to check for fastmath flag as well when is_njit is True, and include that in the output.

@seanlaw Since we are in PR, I was wondering if I should push my new script to this PR chunk by chunk so that you can review it more easily.

@seanlaw
Copy link
Contributor

seanlaw commented Sep 22, 2024

Since we are in PR, I was wondering if I should push my new script to this PR chunk by chunk so that you can review it more easily.

Sure. If there is sufficient overlap with docstring.py then we should also consolidate and refactor if it makes sense.

@NimaSarajpoor
Copy link
Collaborator Author

NimaSarajpoor commented Sep 23, 2024

@seanlaw

If there is sufficient overlap with docstring.py then we should also consolidate and refactor if it makes sense
It feels like we are starting to accumulate scripts and so maybe there is a refactoring opportunity to do a utils.py. Just thinking out loud here.

So, are you thinking of having common functions in utils.py and use those in docstring.py and <new-script>.py after refactoring? One way is to start with the new script and refactor later. Or, just start with adding some general functions in utils.py. I went with the latter approach. Not sure which approach is more reasonable though.


Let's check the output of check_functions in utils.py

Example 1:

func_names, is_njit, fastmath_values = check_functions("stumpy/stump.py")

# values of the returned lists
# Function names: ['_compute_diagonal', '_stump', 'stump']
# Is njit: [True, True, False]
# Fastmath values: [True, True, None]

Example 2:

func_names, is_njit, fastmath_values = check_functions("stumpy/core.py")

# values of the returned lists at index 25
# Function names: _welford_nanvar
# Is njit: True
# Fastmath values: {'contract', 'reassoc', 'nsz', 'arcp', 'afn'}

@seanlaw
Copy link
Contributor

seanlaw commented Sep 24, 2024

@NimaSarajpoor Things look fine. However, for the two examples that you provided, I don't like that the outputs are allowed to be of different type. Specifically, I expect Example 2 to return lists as well.

@NimaSarajpoor
Copy link
Collaborator Author

NimaSarajpoor commented Sep 24, 2024

@NimaSarajpoor Things look fine. However, for the two examples that you provided, I don't like that the outputs are allowed to be of different type. Specifically, I expect Example 2 to return lists as well.

@seanlaw
Thanks to you, I am getting closer to your mindset :) The outputs in the second example are lists but I just showed the value for index 25. Otherwise, I had to show three lists, each with 95 elements that correspond to the 95 functions in core.py.

I think I should have written something like # func_names[25]: _welford_nanvar to avoid confusion.

for file_name, file_functions_metadata in all_functions.items():
all_stumpy_functions.update(file_functions_metadata["func_names"])

all_stumpy_functions = list(all_stumpy_functions)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This feels strange. You declare all_stumpy_functions as a set() above and then you update it and then you convert it into a list all within a for-loop.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One thing that I forgot to notice is that the nested dictionary all_functions has unique values for all_functions['file_name']['func_names"]. So, I can just use list.

all_stumpy_functions = []
for file_name in all_functions.keys():
    all_stumpy_functions.extend(all_functions['file_name']['func_names'])

Does that address your concern?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's not entirely clear what the goal is here

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Every time I look at it... I feel like the nested dictionary makes it hard to read. I need to re-design it.

It's not entirely clear what the goal is here

My goal was to create a comprehensive list that contains all the functions that are defined in ./stumpy/, and then use it to ignore any callee that is NOT in that comprehensive list. For instance, range can be a callee of a function. However, I do not want to keep it. I want to only have the callers/ callees that are defined in stumpy.

return func_names, is_njit, fastmath_values


def _get_callees(node, all_functions):
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[Note to self]
One thing I need to consider here is how things will work if a callee is assigned to a variable before being used. For instance:

a_subseq_isconstant = _rolling_isconstant



# output 2: func_callers
func_callers = {}
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@seanlaw
Previously, I wanted to follow the chain of callees. Now, as you may notice here, I am trying to get list of callers for each function. We can start with those functions that are njit-decorated and have fastmath flag, and then follow the chain of callers.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure. Whatever you think might work

for f in func_metadata.keys():
func_callers[f] = []

for filepath in filepaths:
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Going to confess here... I think there is a code smell here because of nested for-loop. Although I tried a couple of times, I think I still need to work on this part as it seems to be complicated.

@seanlaw
Copy link
Contributor

seanlaw commented Jan 5, 2025

@NimaSarajpoor I noticed that your original list did not contain functions like core._get_array_ranges or core._get_ranges or core._merge_topk_PI (and many others). They should all have some sort of fastmath flag

@NimaSarajpoor
Copy link
Collaborator Author

I noticed that your original list did not contain functions like core._get_array_ranges or core._get_ranges or core._merge_topk_PI (and many others).

Correct. Currently, the NJIT functions without fastmath are:

# format: (module_name, func_name)

('core', '_get_array_ranges')
('core', '_get_ranges')
('core', '_merge_topk_PI')
('core', '_merge_topk_ρI')
('core', '_shift_insert_at_index')

They should all have some sort of fastmath flag

Got it 👍

@seanlaw
Copy link
Contributor

seanlaw commented Jan 5, 2025

@NimaSarajpoor We should probably have this script also check that all njit functions have a fastmath parameter and then error out if not.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants