optimize and profile OpenMP implementation #26

Closed
p-costa opened this issue Aug 20, 2021 · 1 comment
Labels: enhancement


p-costa commented Aug 20, 2021

Currently, the loops are not collapsed (which I believe should be fine for shared-memory runs), static scheduling is not explicitly imposed, and some loops in solver.f90 lack OpenMP directives (although the most demanding part, the iterative solvers in hypre, does have an OpenMP implementation).

This has been fine for CPU-only runs that use only MPI, but in light of future porting efforts, it would be good to make sure the OpenMP implementation performs well.
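
For illustration, a minimal sketch of what collapsing and explicit static scheduling look like on a triple loop. The program, the array names `p` and `rhs`, and the size `n` are hypothetical and not taken from the repository; only the `collapse` and `schedule(static)` clauses are the point here.

```fortran
program omp_collapse_sketch
  ! Hypothetical example: names and sizes are assumptions, not code from the project.
  use, intrinsic :: iso_fortran_env, only: dp => real64
  implicit none
  integer, parameter :: n = 64
  real(dp), allocatable :: p(:,:,:), rhs(:,:,:)
  integer :: i, j, k
  allocate(p(n,n,n), rhs(n,n,n))
  rhs(:,:,:) = 1._dp
  ! collapse(2) merges the k and j loops into one iteration space;
  ! schedule(static) splits that space into fixed chunks, one per thread.
  !$omp parallel do collapse(2) schedule(static) shared(p,rhs) private(i,j,k)
  do k = 1, n
    do j = 1, n
      do i = 1, n
        p(i,j,k) = 2._dp*rhs(i,j,k)
      end do
    end do
  end do
  !$omp end parallel do
  ! every entry of p is 2, so the mean should be 2
  print *, 'mean of p:', sum(p)/real(n, dp)**3
end program omp_collapse_sketch
```

Without OpenMP enabled at compile time the directives are treated as comments and the loop simply runs serially, so the same source covers both cases.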

p-costa self-assigned this Aug 20, 2021
p-costa added the enhancement and priority labels Aug 20, 2021
p-costa removed the priority label Sep 7, 2021

p-costa commented Sep 12, 2021

This has been taken care of for the moment; the sustainable way forward may be to move to DO CONCURRENT (#25).
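
As a rough sketch of what a DO CONCURRENT version of a directive-based triple loop could look like (again with hypothetical names `p`, `rhs` and `n`, not code from the repository): the construct only asserts that iterations are independent, and whether and how it is parallelized is left to the compiler and its options.

```fortran
program do_concurrent_sketch
  ! Hypothetical example: names and sizes are assumptions, not code from the project.
  use, intrinsic :: iso_fortran_env, only: dp => real64
  implicit none
  integer, parameter :: n = 64
  real(dp), allocatable :: p(:,:,:), rhs(:,:,:)
  integer :: i, j, k
  allocate(p(n,n,n), rhs(n,n,n))
  rhs(:,:,:) = 1._dp
  ! One construct covers all three indices; no directives are needed,
  ! and the iterations may be executed in any order.
  do concurrent (k = 1:n, j = 1:n, i = 1:n)
    p(i,j,k) = 2._dp*rhs(i,j,k)
  end do
  ! every entry of p is 2, so the mean should be 2
  print *, 'mean of p:', sum(p)/real(n, dp)**3
end program do_concurrent_sketch
```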

p-costa closed this as completed Sep 12, 2021