refactoring tests to log io behavior #31

betolink · 2024-01-29T15:46:32Z

This PR is to integrate code that logs fsspec network operations and tweak the I/O libraries (hypy, fsspec).
The fsspec-logs notebook should be a stand-alone test when ready (90% there)

…y access pattern

betolink · 2024-02-06T02:00:38Z

I think this PR is ready to be reviewed, still work in progress. h5coro needs upstream changes to work with s3 links that don't require a login. We probably need to create a pip installable package called h5benchmark and then all the bootstrapping code and relative paths should go away.

The main notebooks are the ones with portable in their name, I think once we have subsetting ready we could move on to complete the benchmarking numbers.

asteiker · 2024-02-07T20:08:46Z

h5tests/single-test.ipynb

@betolink I received an error in the 3rd codeblock:

results = xarray_original.run(io_params)

ZeroDivisionError: division by zero

I ran it on a fresh clone and didn't get that error, perhaps we do some screen sharing to see what's going on.

oh I see, there is a bug, fixing it now....

asteiker · 2024-02-07T20:13:18Z

notebooks/fsspec-logs.ipynb

@betolink I received an error in the last plotting block:

for name, group in df.groupby(['tool', 'dataset', 'format']): tool, dataset, formated = name x = f'{tool}, {dataset}, {formated}' y = group['time'].mean() ax.bar(f'{tool}, {dataset}, {formated}', group['time'].mean(), label=f'{tool}, {dataset}, {formated}', align='center') ax.text(x, y + 0.05, f'{group["time"].mean():.2f}', ha='center', va='bottom', color='black', fontsize=8)

KeyError: 'tool'

this notebook was just prototyping some stuff, the ones that we want to plot the results are the ones named "portable-"

asteiker · 2024-02-07T20:14:50Z

notebooks/portable-h5coro-test.ipynb

@betolink We should add a warning to this notebook to explain that it's not functioning due to the anonymous access issue, and/or comment all the code blocks that produce an error.

Good idea, or in the meantime we can use the same granules just from the CryoCloud bucket.

asteiker · 2024-02-07T20:16:32Z

notebooks/portable-h5py-test.ipynb

@betolink

One small suggestion is to add a short title + description to the top of this notebook to explain its usage.

I also got a divide by zero error here in the 3rd code block:

results = h5py_original.run(io_params)

asteiker · 2024-02-07T20:22:00Z

notebooks/portable-xarray-test.ipynb

Like the h5py test notebook:

One small suggestion is to add a short title + description to the top of this notebook to explain its usage.

I also got a divide by zero error here in the 3rd code block:
results = h5py_original.run(io_params)

asteiker

I wasn't able to run many of the notebooks end to end (tested in CryoCloud). Once those errors are resolved plus suggested updates to include brief title and descriptions for the new notebooks, then it looks good on my end.

Also for our collaborators, do we need any additional documentation for them to utilize the notebooks (i.e. any breaking changes for the other CO formats that others would need to be aware of)?

JessicaS11 · 2024-02-19T21:03:05Z

helpers/links-old.json

Are these file names/paths still being used, or could we remove this file since the links were updated in links.json so it's effectively in the git history?

JessicaS11 · 2024-02-19T21:05:09Z

helpers/s3itslive.json

Is this a second storage location (different hub) for the same set of files? Or are the files different?

Saves results of experiments to results

Copied plotting from portable-full-comparison

Add plotting for end to end running

so that canonical plotting is in plot_benchmark_results.ipynb

betolink added 3 commits January 28, 2024 20:25

refactoring tests to log io behavior

77e11c2

testing with out of region access

5c04827

updating notebooks, portable can quickly test and visualize results b…

80b0353

…y access pattern

betolink requested review from JessicaS11, weiji14 and asteiker February 6, 2024 01:34

h5coro needs some upstream changes to work with annon=True access

a5fe2e2

betolink marked this pull request as ready for review February 6, 2024 02:01

This was linked to issues Feb 7, 2024

Incorporate fsspec notebook into testing code #28

Open

Parse logs into ros3vfd-log-info tool #29

Closed

This was unlinked from issues Feb 7, 2024

Incorporate fsspec notebook into testing code #28

Open

Parse logs into ros3vfd-log-info tool #29

Closed

asteiker reviewed Feb 7, 2024

View reviewed changes

asteiker requested changes Feb 7, 2024

View reviewed changes

asteiker linked an issue Feb 8, 2024 that may be closed by this pull request

Incorporate fsspec notebook into testing code #28

Open

refactoring the whole thing

a7a891d

betolink removed the request for review from weiji14 February 12, 2024 16:20

update environment

2e47c30

JessicaS11 reviewed Feb 19, 2024

View reviewed changes

helpers/s3itslive.json

Copy link

Member

JessicaS11 Feb 19, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this a second storage location (different hub) for the same set of files? Or are the files different?

Andy Barrett added 5 commits February 28, 2024 17:51

Add plot

d7e8aae

Add benchmarks summary csv

bfacc4a

Saves results of experiments to results

Create dedicated notebook to plot results

d7b57b1

Copied plotting from portable-full-comparison

Remove outputs and add plotting

8d8996a

Add plotting for end to end running

Remove savefig

aa5dd0b

so that canonical plotting is in plot_benchmark_results.ipynb

asteiker linked an issue Feb 29, 2024 that may be closed by this pull request

Re-plot performance testing based on individual files #25

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactoring tests to log io behavior #31

refactoring tests to log io behavior #31

betolink commented Jan 29, 2024

betolink commented Feb 6, 2024

asteiker Feb 7, 2024

betolink Feb 7, 2024

betolink Feb 7, 2024

asteiker Feb 7, 2024

betolink Feb 7, 2024 •

edited

Loading

asteiker Feb 7, 2024 •

edited

Loading

betolink Feb 7, 2024

asteiker Feb 7, 2024 •

edited

Loading

asteiker Feb 7, 2024

asteiker left a comment

JessicaS11 Feb 19, 2024

JessicaS11 Feb 19, 2024

refactoring tests to log io behavior #31

Are you sure you want to change the base?

refactoring tests to log io behavior #31

Conversation

betolink commented Jan 29, 2024

betolink commented Feb 6, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

betolink Feb 7, 2024 • edited Loading

Choose a reason for hiding this comment

asteiker Feb 7, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

asteiker Feb 7, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

asteiker left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

betolink Feb 7, 2024 •

edited

Loading

asteiker Feb 7, 2024 •

edited

Loading

asteiker Feb 7, 2024 •

edited

Loading