Allow AtriumDB to output time information in its native format and add further optimizations. #93

WilliamDixon · 2024-07-26T16:46:32Z

A deeper explanation for the following changes can be found in my comment in discussion #85.

AtriumDB 2.2.4

This PR makes use of AtriumDB's 2.2.4 features which is currently in prerelease.

Native AtriumDB Output

To allow one of our AtriumDB classes to output data in its native format, code is added to benchmark.py and utils.py, in order to detect data in time-value pair format and convert to WFDB's nan based format outside of the benchmarking process.

New Nan Adapter Class

A second AtriumDB class called waveform_benchmark.formats.atriumdb.NanAdaptedAtriumDB has been created which converts the data output from AtriumDB's native format to the benchmark's preferred format within the benchmarked region of code.

No sort / check

AtriumDB in normal operation needs to check for the case where interblock (data between two or more blocks) or intrablock (data within a single block) data is out of order, but for this benchmark such a check is not needed.

Buffered Writes

AtriumDB has a new write_buffer which lets it piece together multiple small segments efficiently without the need for code outside code to accomplish best efficiency.

File metadata loading

AtriumDB can now read it's time index for file meta information for the entire file and store that information in memory rather than requerying the file metadata for each small read. This significantly increases the read performance of small reads in exchange for a very light memory usage increase.

Mac OS Error

AtriumDB now gives a descriptive error when you try to use it on a Mac OS (I hope to have Mac support in 2.5.0)

…rted data, block metadata cache.

…y performance.

…late a file format.

WilliamDixon · 2024-08-12T20:01:46Z

After looking at the cached PR, I thought it would be better if the default AtriumDB version didn't use any metadata caching and instead stored that data in a header-like file to better compare with the other formats.

So I've made that change to this PR.

…riting.

briangow · 2024-10-10T19:29:11Z

Thanks @WilliamDixon , this looks good!

William Dixon added 8 commits July 26, 2024 09:00

Add AtriumDB optimizations: time-value pair output support, assume so…

4a62c9f

…rted data, block metadata cache.

Optimize the cache and set the block size lower to improve small quer…

8494435

…y performance.

Increase the block size slightly.

8ebc1d0

Truncate unneeded values.

f15374d

Properly send nan-value array when no signal data is available.

27d4517

Merge branch 'refs/heads/main' into atriumdb-upgrade

0fb34c8

Remove the metadata cache and instead read header files to better emu…

31cecf2

…late a file format.

Properly calculate the value datatype

040b58e

William Dixon added 3 commits August 12, 2024 16:12

Use pickle instead of json for the headers (binary is faster than text).

9a28d50

Force integer style save.

4b61531

Change query style to standard, add Nan adapter class, add buffered w…

cfde630

…riting.

briangow merged commit 6edb890 into chorus-ai:main Oct 10, 2024
1 of 2 checks passed

WilliamDixon mentioned this pull request Oct 28, 2024

Fix AtriumDB version in requirements.txt to match desired release. #106

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow AtriumDB to output time information in its native format and add further optimizations. #93

Allow AtriumDB to output time information in its native format and add further optimizations. #93

WilliamDixon commented Jul 26, 2024 •

edited

Loading

WilliamDixon commented Aug 12, 2024

briangow commented Oct 10, 2024

Allow AtriumDB to output time information in its native format and add further optimizations. #93

Allow AtriumDB to output time information in its native format and add further optimizations. #93

Conversation

WilliamDixon commented Jul 26, 2024 • edited Loading

AtriumDB 2.2.4

Native AtriumDB Output

New Nan Adapter Class

No sort / check

Buffered Writes

File metadata loading

Mac OS Error

WilliamDixon commented Aug 12, 2024

briangow commented Oct 10, 2024

WilliamDixon commented Jul 26, 2024 •

edited

Loading