Allow recompute via _make_file func #1093

Draft · wants to merge 21 commits into master
Conversation

@CBroz1 (Member) commented Sep 6, 2024

Description

This PR adds the ability to recompute a file that no longer exists when it is fetched from v1.SpikeSortingRecording, along with infrastructure to support expanding this tool in the future. To facilitate this, I add:

  • A standard _make_file func that other tables can implement in the future (see the sketch after this list)
  • NWB-file-specific and more general directory hashing tools
  • Tests for a round trip of deleting a file and then calling fetch_nwb
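
A rough sketch of the intended _make_file pattern is below. The class and helper names (RecomputeMixin, _expected_path, fetch_file) are hypothetical and not the API added in this branch; only the _make_file hook itself comes from this PR.

    from pathlib import Path

    class RecomputeMixin:
        """Illustrative only: the surrounding names are assumptions, not the
        exact implementation in this branch."""

        def _make_file(self, key: dict) -> Path:
            # Each table implements its own recompute logic and returns the
            # path of the regenerated file.
            raise NotImplementedError

        def _expected_path(self, key: dict) -> Path:
            # Placeholder: a real table would derive this from the key and the
            # configured analysis directory.
            raise NotImplementedError

        def fetch_file(self, key: dict) -> Path:
            path = self._expected_path(key)
            if not path.exists():
                # File was deleted (or never transferred): recompute it.
                path = self._make_file(key)
            return path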

I have tested these hashers for...

  • All files in v1.SpikeSortingRecording
  • All directories in v0.SpikeSortingRecording
  • A subset of files in Raw (ongoing; about 50% so far)
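
For orientation, a minimal general-purpose directory hasher in this spirit might look like the sketch below. The specifics (md5, sorted relative paths, chunked reads) are assumptions; the branch's actual hashers, especially the NWB-specific one, may differ.

    import hashlib
    from pathlib import Path

    def hash_directory(directory: str, chunk_size: int = 2**20) -> str:
        """Hash a directory by streaming file contents in sorted relative-path
        order, so the result is independent of traversal order."""
        md5 = hashlib.md5()
        root = Path(directory)
        for path in sorted(p for p in root.rglob("*") if p.is_file()):
            # Include the relative path so renames change the hash.
            md5.update(str(path.relative_to(root)).encode())
            with open(path, "rb") as f:
                while chunk := f.read(chunk_size):
                    md5.update(chunk)
        return md5.hexdigest()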

Checklist:

  • This PR should be accompanied by a release: No
  • If release, I have updated the CITATION.cff: N/A
  • This PR makes edits to table definitions: Yes
  • If table edits, I have included an alter snippet for release notes: Yes
  • If this PR makes changes to position, I ran the relevant tests locally: N/A
  • I have updated the CHANGELOG.md with PR number and description: Yes
  • I have added/edited docs/notebooks to reflect the changes

Question

I found at least one file/directory whose hashes mismatched between

  • (a) the existing version
  • (b) the version regenerated with this branch

What should we do about these? Do we need to restrict our deletions to only items that are replicable with an up-to-date Spyglass? Should I start building the infrastructure to compile this list? What do we want to do with files that don't replicate? Do we need to experiment with dependency versions? That might point us toward adding conda env dumps to some part of this process.
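
On the last point, a minimal sketch of recording the environment alongside a recompute attempt (the command invocation and file layout here are assumptions, not part of this PR):

    import subprocess
    from pathlib import Path

    def dump_conda_env(output_dir: str) -> Path:
        """Record the active conda environment so files that fail to replicate
        can later be traced back to dependency versions."""
        out = Path(output_dir) / "environment.yml"
        result = subprocess.run(
            ["conda", "env", "export", "--no-builds"],
            capture_output=True,
            text=True,
            check=True,
        )
        out.write_text(result.stdout)
        return out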

@CBroz1 (Member, Author) commented Sep 16, 2024

I ran into some issues here: I was unable to replicate the DJ-stored file hash even when the contents seemingly matched. I need to do more testing on whether memory pointers are factored into the hash. If so, we may need to store a hash of the object computed by the specific table and adjust the DJ-stored file hash on recreation.
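
One way to investigate is to hash the HDF5 contents (object names, dtypes, shapes, and data) rather than the raw bytes, since two NWB files containing identical data can still differ at the byte level (timestamps, object IDs, chunk layout) and so fail a whole-file checksum comparison. A sketch of that idea, not the approach taken in this branch:

    import hashlib

    import h5py
    import numpy as np

    def hash_hdf5_contents(path: str) -> str:
        """Hash dataset names, dtypes, shapes, and values, ignoring file-level
        metadata that can change between otherwise identical rewrites."""
        md5 = hashlib.md5()

        def visit(name, obj):
            md5.update(name.encode())
            if isinstance(obj, h5py.Dataset):
                md5.update(f"{obj.dtype}{obj.shape}".encode())
                data = np.asarray(obj[()])
                # Object/string dtypes need a stable text encoding; numeric
                # data can be hashed as raw bytes.
                md5.update(
                    repr(data.tolist()).encode()
                    if data.dtype == object
                    else data.tobytes()
                )

        with h5py.File(path, "r") as f:
            f.visititems(visit)
        return md5.hexdigest()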

@edeno edeno added the enhancement New feature or request label Sep 25, 2024
@edeno edeno self-requested a review February 20, 2025 17:12
@edeno (Collaborator) left a comment
I think the overall structure looks pretty good. For comparing arrays, remind me why something like np.allclose wouldn't work well?
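
For context, the np.allclose comparison being asked about would look roughly like the sketch below; the file names are made up.

    import numpy as np

    # Hypothetical saved and regenerated copies of the same data.
    original = np.load("original_traces.npy")
    recomputed = np.load("recomputed_traces.npy")

    # allclose tolerates small floating-point differences that would change a
    # byte-level hash, at the cost of loading both arrays into memory.
    arrays_match = np.allclose(original, recomputed, rtol=1e-5, atol=1e-8)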

        raise FileNotFoundError(
            f"Found {len(query)} files for: {analysis_nwb_file_name}"
        )
    return f"{analysis_dir}/" + query.fetch1("filepath")
Collaborator:
It would probably be better to make this less file-system dependent by using pathlib.
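
For example, a pathlib-based version of the return statement above (illustrative names only):

    from pathlib import Path

    def analysis_file_path(analysis_dir: str, relative_path: str) -> Path:
        # Pathlib equivalent of joining analysis_dir and the fetched filepath
        # with a "/" string concatenation.
        return Path(analysis_dir) / relative_path

    # Usage with the query in the hunk above (illustrative):
    # file_path = analysis_file_path(analysis_dir, query.fetch1("filepath"))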

)

external_tbl = schema.external["analysis"]
file_path = (
Collaborator:
Another place where pathlib would help.

@@ -317,6 +317,8 @@ def make(self, key):
    recording = self._get_filtered_recording(key)
    recording_name = self._get_recording_name(key)

    # recording_dir = Path("/home/cbroz/wrk/temp_ssr0/")
Collaborator:
Remove before merge

@@ -0,0 +1,257 @@
"""This schema is used to transition manage files for recompute.
Collaborator:
This module could have a more descriptive name. recompute.py perhaps?

        self.file.close()

    def remove_version(self, key: str) -> bool:
        version_pattern = (
Collaborator:
I do worry a bit about these regexes failing. Versioning is hopefully pretty stable.
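
For what it's worth, one failure-tolerant way to handle such a pattern (the actual regex in the diff is not shown in this hunk, so this is purely illustrative):

    import re

    # Illustrative pattern only; the PR's actual regex may differ.
    VERSION_PATTERN = re.compile(r"\d+\.\d+\.\d+(?:[+\-.][\w.]+)?")

    def strip_version(name: str) -> str:
        """Remove a semantic-version-like substring; if nothing matches, the
        input is returned unchanged so a regex miss fails soft rather than
        raising."""
        return VERSION_PATTERN.sub("", name, count=1)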

@edeno (Collaborator) commented Feb 24, 2025

Just adding a reminder here to warn the lab before this gets merged.
