[spark-rapids] generate spark-rapids/spark-rapids.sh from template #1284

Draft: wants to merge 125 commits into base: master
Conversation

cjac
Contributor

@cjac cjac commented Dec 25, 2024

This is a re-implementation of the script using templates created while refactoring the code shared by gpu/, dask/, rapids/, spark-rapids/, horovod/, mlvm/ and many of the other initialization actions.

For the templates used to generate the new spark-rapids.sh and mig.sh, PTAL at:

https://github.com/GoogleCloudDataproc/initialization-actions/pull/1282/files#diff-9887ca799a0fd5f78f754eb371e1f771c3e80d5abc06ee6d17d54fd2ad2962a3

@cjac cjac self-assigned this Dec 25, 2024
@cjac
Contributor Author

cjac commented Dec 25, 2024

/gcbrun


cjac commented Dec 25, 2024

/gcbrun


cjac commented Dec 26, 2024

/gcbrun


cjac commented Dec 26, 2024

/gcbrun


cjac commented Dec 26, 2024

/gcbrun


cjac commented Dec 26, 2024

/gcbrun


cjac commented Dec 26, 2024

/gcbrun


cjac commented Dec 27, 2024

/gcbrun


cjac commented Dec 27, 2024

/gcbrun


cjac commented Dec 27, 2024

/gcbrun


cjac commented Dec 27, 2024

/gcbrun


cjac commented Dec 27, 2024

/gcbrun


cjac commented Dec 27, 2024

okay, that brings the new code up to the standard of the previous code. Now let's start enabling more tests...


cjac commented Dec 27, 2024

/gcbrun


cjac commented Dec 27, 2024

/gcbrun


cjac commented Dec 27, 2024

/gcbrun


cjac commented Dec 27, 2024

/gcbrun


cjac commented Dec 27, 2024

okay so now we have some rocky8 and rocky9 coverage. I don't think we have to disable the 2.0 images. Let's verify...


cjac commented Dec 28, 2024

/gcbrun


cjac commented Dec 28, 2024

okay, now it:

  • meets the expectations of the previous implementation
  • does not disable rocky tests

Let's try skipping even fewer tests.


cjac commented Dec 28, 2024

/gcbrun


cjac commented Dec 28, 2024

/gcbrun


cjac commented Dec 28, 2024

I think we've got single node clusters working fine on rocky again.


cjac commented Dec 28, 2024

/gcbrun


cjac commented Dec 28, 2024

can we skip no tests?


cjac commented Dec 28, 2024

/gcbrun


cjac commented Dec 28, 2024

/gcbrun


cjac commented Dec 28, 2024

/gcbrun


cjac commented Dec 28, 2024

/gcbrun


cjac commented Jan 9, 2025

rebase onto new master commit


cjac commented Jan 9, 2025

/gcbrun

@cjac cjac force-pushed the spark-rapids-template-20241225 branch from 812a3b6 to e21f9b9 Compare January 9, 2025 23:40

cjac commented Jan 10, 2025

I'm about to run some tests


cjac commented Jan 10, 2025

/gcbrun

@cjac cjac force-pushed the spark-rapids-template-20241225 branch from e21f9b9 to 93bf9ec Compare January 10, 2025 02:33

cjac commented Jan 10, 2025

/gcbrun

@cjac cjac force-pushed the spark-rapids-template-20241225 branch 2 times, most recently from 3ca9c84 to 1d790be Compare January 10, 2025 03:36

cjac commented Jan 10, 2025

/gcbrun

@cjac cjac force-pushed the spark-rapids-template-20241225 branch from 1d790be to 40b2de2 Compare January 10, 2025 03:55

cjac commented Jan 10, 2025

/gcbrun

@cjac cjac force-pushed the spark-rapids-template-20241225 branch from 40b2de2 to fce5350 Compare January 10, 2025 04:31

cjac commented Jan 10, 2025

/gcbrun

spark-rapids/mig.sh, spark-rapids/spark-rapids.sh:
* no longer tracking generated action

spark-rapids/test_spark_rapids.py:
* using default machine type of n1-highmem-8
* de-duplicating shell command
* perform both tests on one cluster instead of one test on each of two clusters

templates/spark-rapids/mig.sh.in,
templates/spark-rapids/spark-rapids.sh.in:
* these templates include only the code unique to these actions
@cjac cjac force-pushed the spark-rapids-template-20241225 branch from fce5350 to 55ab2c7 Compare January 10, 2025 04:54

cjac commented Jan 10, 2025

/gcbrun


cjac commented Jan 23, 2025

/gcbrun
