-
Notifications
You must be signed in to change notification settings - Fork 5
Fix issue with second lmp instance #407
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
WalkthroughThe change adds an explicit finalization step in the shutdown method of the concurrent wrapper. When a job exists, Changes
Estimated code review effort🎯 1 (Trivial) | ⏱️ ~5 minutes
Poem
Pre-merge checks and finishing touches❌ Failed checks (1 warning, 1 inconclusive)
✅ Passed checks (1 passed)
✨ Finishing touches
🧪 Generate unit tests (beta)
📜 Recent review detailsConfiguration used: defaults Review profile: CHILL Plan: Pro 📒 Files selected for processing (1)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (7)
🔇 Additional comments (2)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
|
Tests fail with a bunch of errors like this Since all the PR does is properly closing lammps when calling close. I didn't check it, but I guess that the tests rely on keeping the MPI instance alive despite lammps being closed, which breaks the calphy process. So I assume the tests test something that should not work. |
| self._job.finalize() | ||
| self._job.close() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should not the order be the other way around? First calling close() and then finalize(), at least this is how it is done in the LAMMPS examples: https://github.com/lammps/lammps/blob/develop/examples/mliap/mliap_pytorch_ACE.py
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Still even after this change the tests do not pass #412 . It seems to me that this option is only required for specific LAMMPS models, primarily the https://docs.lammps.org/pair_mliap.html models, still I do not find any explanation in the documentation, rather the finalize() call seems to be only used in examples which use mliap models.
|
As the unit tests are currently failing, I marked the pull request as draft. |
Properly close the lammps instance to allow multi step processes such as calphy to run on GPUs. Without properly closing the process hangs without throwing an error, so in the case of calphy it stopped silently after the averaging routine.
Complementary to ICAMS/calphy#205
Summary by CodeRabbit
✏️ Tip: You can customize this high-level summary in your review settings.