Parallel workers and look up data url #28

Merged: Bjwebb merged 2 commits into main from loop-2-workers-and-look-up-data-url on Sep 26, 2022

Bjwebb commented Sep 26, 2022

No description provided.

andylolz (Member) commented Sep 26, 2022

Could you say a bit about what effect the parallel workers change has, @Bjwebb? Does it relate to #27?

The data URL lookup change is definitely good, thanks for that.

Bjwebb (Author) commented Sep 26, 2022

> Could you say a bit about what effect the parallel workers change has, @Bjwebb? Does it relate to #27?

Yes, it's relevant to that, but I've been looking at it as part of #21, which will increase the amount of work further.

The stats runner already has functionality to split the loop step by file and distribute the work between multiple processes: https://github.com/codeforIATI/IATI-Stats/blob/main/statsrunner/loop.py#L167-L172. I've set the number of processes to 2, as GitHub Actions runners get access to 2 cores.
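
For anyone unfamiliar with the pattern, here is a minimal sketch of splitting a per-file loop across worker processes. It is not the statsrunner code linked above: `process_file` and `run_loop` are hypothetical names, and the per-file work is a placeholder.

```python
from multiprocessing import Pool


def process_file(path):
    """Hypothetical per-file step; a placeholder that returns the path and its length."""
    return path, len(path)


def run_loop(paths, workers=1):
    """Run process_file over every path, optionally across several processes."""
    if workers > 1:
        # Pool.map splits the list into chunks, hands them to the worker
        # processes, and returns the results in input order.
        with Pool(workers) as pool:
            return pool.map(process_file, paths)
    # Serial fallback: same results, computed in a single process.
    return [process_file(path) for path in paths]
```

With `workers=2` this would use both cores mentioned above. On platforms that start workers with spawn rather than fork, the call to `run_loop` needs to sit under an `if __name__ == "__main__":` guard.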

andylolz (Member) commented

Okay nice, sounds good.

Ahhh – I see you’ve tried it on dev, and it looks like it makes a big improvement to the workflow run time! That’s fab, let’s definitely get this merged.

Bjwebb merged commit 336e64b into main on Sep 26, 2022
Bjwebb deleted the loop-2-workers-and-look-up-data-url branch on September 26, 2022 at 18:10
andylolz (Member) commented

(Thanks also for explaining this Ben!)
