-
Notifications
You must be signed in to change notification settings - Fork 696
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feature: IMDReader Integration #4923
base: develop
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hello there first time contributor! Welcome to the MDAnalysis community! We ask that all contributors abide by our Code of Conduct and that first time contributors introduce themselves on GitHub Discussions so we can get to know you. You can learn more about participating here. Please also add yourself to package/AUTHORS
as part of this PR.
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## develop #4923 +/- ##
===========================================
- Coverage 93.66% 85.51% -8.16%
===========================================
Files 177 179 +2
Lines 21850 22021 +171
Branches 3079 3102 +23
===========================================
- Hits 20466 18831 -1635
- Misses 933 2746 +1813
+ Partials 451 444 -7 ☔ View full report in Codecov by Sentry. |
Your thoughts on this are appreciated - @orbeckst |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the initial PR.
- The first big step is to get the tests running properly so that the CI uses an imdclient without IMDReader. Otherwise we are not sure we're testing the code here.
- Minor initial comments while I skimmed.
- Simple thing: run
black
over all files to get the formatting and ordering of imports right
imdclient: | ||
default: 'imdclient' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's going to take time to get imdclient to a stage where the imdclient package does not actually affect MDAnalysis.
Is there a way that we could temporarily (for initial CI testing) install imdclient from a branch or tarball, e.g., in a pip
section? Then we could fairly rapidly create a preliminary (unpublished) imdclient package without IMDReader.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
By initial CI testing, do you mean "in this PR"?
There's a pip section just below, which should work if you put in the git location for pip install, but also you can just temporarily modify the CI script to do an additional pip install if it's for "testing within the PR itself".
If it's "after merge", this would require more discussion.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just for right now to bootstrap the PR.
I don't want to merge without a solid conda-forge imdclient package in place.
|
||
import logging | ||
|
||
logger = logging.getLogger("imdclient.IMDClient") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Change to MDAnalysis.coordinates.IMDReader
@@ -1841,3 +1841,194 @@ def __repr__(self): | |||
|
|||
def convert(self, obj): | |||
raise NotImplementedError | |||
|
|||
class StreamReaderBase(ReaderBase): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think coordinates.base
is the right place. @hmacdope ??
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yep correct!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just put this code to where it's needed, e.g., directly into IMD.py.
Do not add a util.py here, we need to keep this module as clean as possible because it's already quite crowded.
import select | ||
import time | ||
|
||
logger = logging.getLogger("imdclient.IMDClient") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
adjust logger
@@ -79,6 +79,7 @@ extra_formats = [ | |||
"pytng>=0.2.3", | |||
"gsd>3.0.0", | |||
"rdkit>=2020.03.1", | |||
"imdclient", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We almost certainly need to add a minimal version.
assert_timestep_almost_equal, | ||
) | ||
|
||
logger = logging.getLogger("imdclient.IMDClient") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
adjust logger
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also, do we need logging in tests???
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No logging in tests
file_handler = logging.FileHandler("test.log") | ||
formatter = logging.Formatter( | ||
"%(asctime)s - %(name)s - %(levelname)s - %(message)s" | ||
) | ||
file_handler.setFormatter(formatter) | ||
logger.addHandler(file_handler) | ||
logger.setLevel(logging.DEBUG) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we really need this or can this go?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove
logger = logging.getLogger("imdclient.IMDClient") | ||
|
||
|
||
class IMDReader(StreamReaderBase): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think I'm getting confused about which PR is for which thing. @orbeckst given our discussion earlier this week, and your comment above which I take to be "IMDClient is still in flux", does it not make sense for the IMDReader to exist upstream and then just import it here? (edit: here my intent is "well then you could make releases and it wouldn't be limited to MDA release frequency").
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We have to split IMDReader from imdclient and make a version of imdclient without IMDReader (which is in the works Becksteinlab/imdclient#54 ). At the same time we are moving what was split off into coordinates.IMD.
Amru is working on both at the moment.
The way IMDReader depends on imdclient is not the problem, and imdclient itself is also pretty stable, it's just that the tests for imdclient have made use of a lot of MDAnalysis/IMDReader for convenience, and we now have to rewrite some of these tests to use bare-bones python.
I set the PR to Work in progress for the time being, just to indicate that we're not yet at the stage where the CI is working. Once the tests run properly, we can update the status. Obviously, this shouldn't discourage anyone from contributing and commenting. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A first quick look
return ts | ||
|
||
|
||
@pytest.mark.skipif(not HAS_IMDCLIENT, reason="IMDClient not isntalled") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@pytest.mark.skipif(not HAS_IMDCLIENT, reason="IMDClient not isntalled") | |
@pytest.mark.skipif(not HAS_IMDCLIENT, reason="IMDClient not installed") |
@pytest.mark.skipif(not HAS_IMDCLIENT, reason="IMDClient not isntalled") | ||
class TestIMDReaderBaseAPI(MultiframeReaderTest): | ||
|
||
@pytest.fixture() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@pytest.fixture() | |
@pytest.fixture(scope='function') |
If you need it at every test, then it's best to be explicit about the fixture scope
logger.addHandler(file_handler) | ||
logger.setLevel(logging.DEBUG) | ||
|
||
@pytest.mark.skipif(not HAS_IMDCLIENT, reason="IMDClient not isntalled") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@pytest.mark.skipif(not HAS_IMDCLIENT, reason="IMDClient not isntalled") | |
@pytest.mark.skipif(not HAS_IMDCLIENT, reason="IMDClient not installed") |
@pytest.mark.skip( | ||
reason="Stream-based reader cannot determine n_frames until EOF" | ||
) | ||
def test_n_frames(self, reader, ref): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
For all these that have skips, could you reduce the code to just a pass
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Agree, skip normally denotes an issue outside of repo, e.g dependency issue.
if ref.dimensions is None: | ||
assert reader.ts.dimensions is None | ||
else: | ||
assert_array_almost_equal( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
please switch to assert_allclose for all these
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I read through and nothing jumped out at me that wasn't already mentioned, except that I didn't see these changes in the documentation that was linked. The following needs to be added:
Add doc/sphinx/source/documentation_pages/coordinates/IMD.rst
.. automodule:: MDAnalysis.coordinates.IMD
doc/sphinx/source/documentation_pages/coordinate_modules.rst
coordinates/IMD
doc/sphinx/source/documentation_pages/references.rst
If you use IMD capability...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry for the delay, had a lookover, I will try and push some changes addressing some of these myself also, but would be good to pick up the momentum here again if possible.
|
||
try: | ||
import imdclient | ||
from imdclient.IMDClient import IMDClient |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These need test coverage with mocks
except ImportError: | ||
HAS_IMDCLIENT = False | ||
|
||
# Allow building doucmnetation without imdclient |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
# Allow building doucmnetation without imdclient | |
# Allow building documentation without imdclient |
is the port number. | ||
n_atoms : int (optional) | ||
number of atoms in the system. defaults to number of atoms | ||
in the topology. don't set this unless you know what you're doing. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
in the topology. don't set this unless you know what you're doing. | |
in the topology. Don't set this unless you know what you're doing. |
|
||
Parameters | ||
---------- | ||
filename : a string of the form "host:port" where host is the hostname |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this format checked on __init__
if imdf.positions is not None: | ||
# must call copy because reference is expected to reset | ||
# see 'test_frame_collect_all_same' in MDAnalysisTests.coordinates.base | ||
self.ts.positions = imdf.positions |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Use np.copyto
here and below
@@ -1841,3 +1841,194 @@ def __repr__(self): | |||
|
|||
def convert(self, obj): | |||
raise NotImplementedError | |||
|
|||
class StreamReaderBase(ReaderBase): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yep correct!
assert_timestep_almost_equal, | ||
) | ||
|
||
logger = logging.getLogger("imdclient.IMDClient") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No logging in tests
file_handler = logging.FileHandler("test.log") | ||
formatter = logging.Formatter( | ||
"%(asctime)s - %(name)s - %(levelname)s - %(message)s" | ||
) | ||
file_handler.setFormatter(formatter) | ||
logger.addHandler(file_handler) | ||
logger.setLevel(logging.DEBUG) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove
@pytest.mark.skip( | ||
reason="Stream-based reader cannot determine n_frames until EOF" | ||
) | ||
def test_n_frames(self, reader, ref): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Agree, skip normally denotes an issue outside of repo, e.g dependency issue.
@pytest.mark.skip( | ||
reason="Timeseries currently requires n_frames to be known" | ||
) | ||
@pytest.mark.parametrize( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same thing here, remove boilerplate otherwise confusing.
Fixes #4827
This draft PR addresses the feature request discussed in #4827.
Note:
The
IMDReader
feature which was previously a part of theimdclient
package has been moved intoMDAnalysis
below. Any other modules have been in retained inimdclient
, which has been added as an optional dependency here. We are currently in the process of splitting theimdclient
package as mentioned above. (Issue, PR)Major changes made in this Pull Request:
IMDReader
, other associated base classes and a utility function were added to coordinates in the main package.test-imd.py
*.yaml
filesimdclient
was added as an optional dependencyPR Checklist
package/CHANGELOG
file updated?package/AUTHORS
? (If it is not, add it!)Developers Certificate of Origin
I certify that I can submit this code contribution as described in the Developer Certificate of Origin, under the MDAnalysis LICENSE.
📚 Documentation preview 📚: https://mdanalysis--4923.org.readthedocs.build/en/4923/