Use OOP forcefields #267

a-ws-m · 2023-08-18T18:26:10Z

This implements several changes to make it easier for end-users to implement their own forcefields. Most of these changes revolve around the Forcefield class, which stores the path dictionaries that were previously separate entities. To use a non-default forcefield, make a new Forcefield instance and then pass it to Simulation using the ff_class argument. This argument is optional; passing a string value for forcefield will still work, it's simply converted behind the scenes. I also tidied up a few things, which may be worth discussing:

The system.top and system_octwet.top files have been changed to be compatible with Python's built-in Template functionality. This makes the code a bit easier to understand.
Water models are now attributes of the individual forcefields, and are read directly from the corresponding watermodels.dat file. As I understand it, GROMACS_WATER_MODELS is a master list of all water models, some of which are only available for OPLS-AA. If this is correct, it might be good to phase this out. I have changed the core functions that used this: get_solvent_model and get_solvent_identifier. The only other places in the code where GROMACS_WATER_MODELS is used is in the tests and in get_water_model, a function which is also unused in the code base except in tests.
I have changed one test to check for equality rather than equivalence (is -> ==).

I'll write some example code that demonstrates how to use a Forcefield next week. In the meantime, I'm sharing this PR so that the changes can be discussed.

a-ws-m · 2023-08-18T18:27:38Z

@orbeckst here's the PR you asked me to ping you about!

codecov · 2023-08-18T19:00:20Z

Codecov Report

Merging #267 (3645776) into develop (f98424b) will increase coverage by 0.42%.
The diff coverage is 93.54%.

@@             Coverage Diff             @@
##           develop     #267      +/-   ##
===========================================
+ Coverage    81.26%   81.68%   +0.42%     
===========================================
  Files           15       15              
  Lines         1975     2026      +51     
  Branches       297      310      +13     
===========================================
+ Hits          1605     1655      +50     
+ Misses         278      276       -2     
- Partials        92       95       +3

Files Changed	Coverage Δ
mdpow/equil.py	`80.79% <88.23%> (+0.11%)`	⬆️
mdpow/forcefields.py	`92.56% <94.73%> (+4.75%)`	⬆️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

orbeckst

Thank you for the PR. In general I like your approach

consolidate in Forcefield class
make the templates real templates

I have two initial requests:

The original intention with the templates was that they should in principle be legal input form GROMACS. However, I like explicit templating better (also, with 10+ experience since I wrote the original code...) so could you please rename the files from *.top to *.top.template (or some other appropriate suffix... *.top.in?) to make clear that these are now template files?
You seem to be using an autoformatter which makes reading the diff difficult. Can you please either undo the formatting or submit a separate PR only with the formatting changes on the files that you touched. We can then merge that PR and then we can merge the main into this PR (or you can rebase) and we have a clean diff.

a-ws-m · 2023-08-21T13:13:33Z

Done! I always forget to disable black when I'm working on new code, sorry about that.

orbeckst · 2023-08-22T14:45:24Z

You gave me the motivation to try out black on a project so I opened #271. Let's do PR #269 first and then come back here.

orbeckst · 2023-08-25T21:50:22Z

@a-ws-m now that PR #269 is merged, could you please rebase against current develop so that we hopefully end up with a nice, clean diff? Thanks!

…f-switching

…o martini-example

orbeckst

Thanks for the update.

There are failing tests. Please make changes so that the tests pass again.

You can also run the tests locally (which is faster) if you have GROMACS installed (there are conda packages).

a-ws-m · 2023-08-30T17:18:16Z

I've made a few important changes:

The solvation errors arose because Gromacs expects a .top file suffix, but the copies of the templates were saved with identical names (i.e., including .template). I have added a few lines to ensure that the topology file does have the correct suffix.
forcefields.get_solvent_identifier() is now consistent with get_solvent_model() and raises an error when no such identifier exists for the given force field.
I removed the python.system_packages key from the read the docs configuration as it was just deprecated and was causing build errors.

a-ws-m · 2023-08-30T17:24:34Z

Regarding the error in the FEP tests, I'm unable to reproduce it on my machine and I think it's an issue with the MD workflow rather than any Python code that I've changed here.

a-ws-m · 2023-09-04T13:48:39Z

I've made those changes. The GitHub tests seem to be throwing some weird errors to do with cloning the repo.

orbeckst · 2023-09-04T14:33:49Z

I restarted CI and all jobs are running. It was a temporary problem.

orbeckst

This is all looking really good and I only have very few final requests:

I'd like to unify the API a bit by only using forcefield as a kwarg instead of ff_class and forcefield (see comment).
There are some other minor things in the comments.
Please update the docs at the top of forcefields.py where it says

In the principle it is possible to switch to a different force field by supplying alternative template files.

The idea with this PR is that it now becomes easier to add custom force fields. Can you show an example? You could add a section "Adding your own forcefield". That would really help (1) for maintenance because it tells developers how it works and (2) for users so that we don't need to answer questions but can just point to docs.

Sorry it's taking a while. It's just that this is a sizable (and very good!!) change and I want to make sure that everything continues to work well together.

mdpow/equil.py

mdpow/forcefields.py

mdpow/tests/test_forcefields.py

mdpow/forcefields.py

a-ws-m · 2023-09-04T19:50:25Z

No problem! I'll work on those. I have actually been working on an example, it's in the martini-example branch of my fork. I have some questions before it's ready that I'll raise on the Q&A.

orbeckst · 2023-09-04T19:52:12Z

awesome!

a-ws-m · 2023-09-04T21:37:42Z

I've merged the example using Martini, you can find it in doc/examples/martini-example. I imagine there's scope for improving the run parameters to get a more accurate value, and perhaps fixing the issue whereby the nan error for the coulomb contribution propagates and gives a nan value for the overall error. I'm not sure how to improve these, though -- I'd be happy to hear suggestions!

I also need to go back and clear some of the old, erroneous output from that notebook. The plots, for example, are completely wrong and don't reflect the latest results.

orbeckst · 2023-09-05T00:37:02Z

Your MARTINI/MDPOW example notebook is fantastic!

(I also noted that some of the GROMACS runs actually didn't run but just restarted from the last frame. However, it's nice to know that it's not an issue re-running the workflow as it won't destroy data.)

For nan value in the Coulomb calculation I am not sure where it comes from. I assume that there is a valid Coulomb leg in the FEP calculation because MARTINI benzene has partial charges, doesn't it?

One thing that's a bit odd is that alchemlyb and mdpow report different values for statistical inefficiency, for example alchemlyb states 15.84 and mdpow states 7.9840.

mdpow.fep   : INFO     Performing statistical inefficiency analysis for window coulomb 0000
2023-09-04 22:41:16.809 | DEBUG    | alchemlyb.preprocessing.subsampling:statistical_inefficiency:520 - Running statistical inefficiency analysis.
2023-09-04 22:41:16.812 | DEBUG    | alchemlyb.preprocessing.subsampling:statistical_inefficiency:522 - Statistical inefficiency: 15.84.
2023-09-04 22:41:16.813 | DEBUG    | alchemlyb.preprocessing.subsampling:statistical_inefficiency:528 - Number of uncorrelated samples: 312.
mdpow.fep   : INFO     The statistical inefficiency value is 7.9840.
mdpow.fep   : INFO     The data are subsampled every 8 frames.

I think that's because we have to infer the actual value from what alchemlyb subsample in
https://github.com/Becksteinlab/MDPOW/blob/6449879a9bc5fbc80210290eea84fd40ad164dff/mdpow/fep.py#L1192C20-L1192C20
and there's also a suspicious looking factor of 2 involved. (The whole question about getting the raw SI from alchemlyb was the topic of alchemistry/alchemlyb#295 ). @VOD555 can you please check that the MDPOW output is consistent with the one from alchemlyb? I assume that other users that read the log output will have the same question as I

VOD555 · 2023-09-05T04:08:02Z

@orbeckst I've checked the code. The logger in the mdpow outputs the correlation time (g/2) but not the statistical inefficiency (g). And as the one calculated in mdpow is based on the length output of the subsampled dataframe, there is small difference between the mdpow one and alchemlyb one.

As currently alchemlyb has a logger output for the SI, I think we can remove the mdpow logger information.

VOD555 · 2023-09-05T04:11:25Z

Open an issue #275

a-ws-m · 2023-09-06T14:38:51Z

Thank you! The Gromacs output is for a restart because it was the quickest way to re-generate most of the useful information after clearing the cell outputs. I'll work on those other points today.

orbeckst

I am very happy with your contribution. It's a really neat re-organization that's also backwards compatible.

Your example notebook is also really good. Nice work!!

doc/examples/martini/run.mdp

orbeckst · 2023-09-07T15:35:21Z

@a-ws-m I let you look at my one comment about "sedstate" in the MDP file but if you tell me that this is what it ought to be, I'll squash-merge. (If you want rewrite the history on the PR into less than 5 commits then you're also welcome to do so, e.g. code changes+tests, example, but if not then that's fine too.)

orbeckst · 2023-09-07T17:58:56Z

Thank you very much @a-ws-m , great contribution!!

a-ws-m added 2 commits August 18, 2023 19:07

Use OOP forcefields

206e383

Add Sphinx docstring

954f072

VOD555 requested review from orbeckst and VOD555 August 18, 2023 18:52

orbeckst requested changes Aug 18, 2023

View reviewed changes

a-ws-m mentioned this pull request Aug 21, 2023

Black formatting #269

Merged

Change template file extensions

0f53204

[WIP] Boilerplate for Martini benzene Pow

2b7f3a9

a-ws-m and others added 6 commits August 29, 2023 16:19

Use OOP forcefields

1ccaba6

Add Sphinx docstring

4331ebe

Change template file extensions

8053d73

Merge branch 'ff-switching' of https://github.com/a-ws-m/MDPOW into f…

f53054f

…f-switching

Fix references to system.top

68bef44

Merge branch 'develop' into ff-switching

ce596a1

a-ws-m requested a review from orbeckst August 29, 2023 15:28

a-ws-m added 2 commits August 29, 2023 16:34

[WIP] Boilerplate for Martini benzene Pow

7034428

Merge branch 'martini-example' of https://github.com/a-ws-m/MDPOW int…

761d0c1

…o martini-example

orbeckst requested changes Aug 29, 2023

View reviewed changes

a-ws-m added 5 commits August 30, 2023 17:25

Correct solvation identifier test

0f32aa0

Add pytest settings for VSCode

a2f5cbf

Add typehinting

e26d8e5

Fix top file suffix after changing from template

5140b74

Remove system_packages key

2f578f5

Formatting

256b76b

a-ws-m requested a review from orbeckst September 4, 2023 13:47

Merge branch 'ff-switching' into martini-example

d5dc5ba

orbeckst requested changes Sep 4, 2023

View reviewed changes

mdpow/equil.py Outdated Show resolved Hide resolved

mdpow/equil.py Outdated Show resolved Hide resolved

mdpow/forcefields.py Show resolved Hide resolved

mdpow/tests/test_forcefields.py Outdated Show resolved Hide resolved

mdpow/forcefields.py Outdated Show resolved Hide resolved

[WIP] Martini example

e92ffd8

a-ws-m and others added 2 commits September 4, 2023 22:30

Working example

b739c88

Merge pull request #1 from a-ws-m/martini-example

d621dee

Remove erroneous outputs

a80ecb3

a-ws-m added 2 commits September 6, 2023 19:05

Use get_forcefield() + docstrings

fd0a033

Test ff __repr__ and use ALL_FORCEFIELDS

301c08e

a-ws-m requested a review from orbeckst September 6, 2023 18:07

a-ws-m and others added 2 commits September 6, 2023 19:34

Fix reference to ff_class

6268765

Merge branch 'develop' into ff-switching

9c897b8

orbeckst approved these changes Sep 7, 2023

View reviewed changes

doc/examples/martini/run.mdp Outdated Show resolved Hide resolved

orbeckst self-assigned this Sep 7, 2023

Correct init-lambda-value

3645776

VOD555 approved these changes Sep 7, 2023

View reviewed changes

orbeckst linked an issue Sep 7, 2023 that may be closed by this pull request

MDPOW does not use TIP3P for CHARMM and AMBER for hydration free energy calculation by default #112

Closed

orbeckst merged commit 120c5dd into Becksteinlab:develop Sep 7, 2023

Use OOP forcefields #267

Use OOP forcefields #267

Uh oh!

Conversation

a-ws-m commented Aug 18, 2023

Uh oh!

a-ws-m commented Aug 18, 2023

Uh oh!

codecov bot commented Aug 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

orbeckst left a comment

Choose a reason for hiding this comment

Uh oh!

a-ws-m commented Aug 21, 2023

Uh oh!

orbeckst commented Aug 22, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

orbeckst commented Aug 25, 2023

Uh oh!

orbeckst left a comment

Choose a reason for hiding this comment

Uh oh!

a-ws-m commented Aug 30, 2023

Uh oh!

a-ws-m commented Aug 30, 2023

Uh oh!

a-ws-m commented Sep 4, 2023

Uh oh!

orbeckst commented Sep 4, 2023

Uh oh!

orbeckst left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

a-ws-m commented Sep 4, 2023

Uh oh!

orbeckst commented Sep 4, 2023 via email

Uh oh!

a-ws-m commented Sep 4, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

orbeckst commented Sep 5, 2023

Uh oh!

VOD555 commented Sep 5, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

VOD555 commented Sep 5, 2023

Uh oh!

a-ws-m commented Sep 6, 2023

Uh oh!

orbeckst left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

orbeckst commented Sep 7, 2023

Uh oh!

orbeckst commented Sep 7, 2023

Uh oh!

Uh oh!

codecov bot commented Aug 18, 2023 •

edited

Loading

orbeckst commented Aug 22, 2023 •

edited

Loading

a-ws-m commented Sep 4, 2023 •

edited

Loading

VOD555 commented Sep 5, 2023 •

edited

Loading