sky/templates/aws-ray.yml.j2
9 additions & 6 deletions
@@ -89,6 +89,7 @@ initialization_commands: []
  # Increment the following for catching performance bugs easier:
  # current num items (num SSH connections): 1
  setup_commands:
+ # Disable `unattended-upgrades` to prevent apt-get from hanging. It should be called at the beginning before the process started to avoid being blocked. (This is a temporary fix.)
  # Create ~/.ssh/config file in case the file does not exist in the custom image.
  # Make sure python3 & pip3 are available on this image.
  # We set auto_activate_base to be false for pre-installed conda.
@@ -97,19 +98,21 @@ setup_commands:
  # Line 'sudo grep ..': set the number of threads per process to unlimited to avoid ray job submit stucking issue when the number of running ray jobs increase.
  # Line 'mkdir -p ..': disable host key check
  # Line 'python3 -c ..': patch the buggy ray files and enable `-o allow_other` option for `goofys`
sky/templates/azure-ray.yml.j2
9 additions & 6 deletions
@@ -88,26 +88,29 @@ initialization_commands: []
  # Increment the following for catching performance bugs easier:
  # current num items (num SSH connections): 1
  setup_commands:
+ # Disable `unattended-upgrades` to prevent apt-get from hanging. It should be called at the beginning before the process started to avoid being blocked. (This is a temporary fix.)
  # Create ~/.ssh/config file in case the file does not exist in the image.
  # Make sure python3 & pip3 are available on this image.
  # Line 'sudo bash ..': set the ulimit as suggested by ray docs for performance. https://docs.ray.io/en/latest/cluster/vms/user-guides/large-cluster-best-practices.html#system-configuration
  # Line 'sudo grep ..': set the number of threads per process to unlimited to avoid ray job submit stucking issue when the number of running ray jobs increase.
  # Line 'mkdir -p ..': disable host key check
  # Line 'python3 -c ..': patch the buggy ray files and enable `-o allow_other` option for `goofys`
  # This also kills the service that is holding the lock on dpkg (problem only exists on aws/azure, not gcp)
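
The added comment describes disabling `unattended-upgrades` so that apt-get does not hang on the dpkg lock, but this view does not show the command itself. As a rough illustration only, the sketch below assumes the setting lives in the usual Ubuntu apt periodic config (typically `/etc/apt/apt.conf.d/20auto-upgrades`) and shows the kind of text rewrite such a step could perform. The function name `disable_unattended_upgrades` is hypothetical and is not taken from these templates.

```python
import re


def disable_unattended_upgrades(conf_text: str) -> str:
    """Hypothetical helper: flip the Unattended-Upgrade switch from "1" to "0".

    Assumes the apt periodic config format commonly found on Ubuntu images;
    the actual command used by the templates is not shown in this diff.
    """
    return re.sub(
        r'(APT::Periodic::Unattended-Upgrade\s+)"1"',
        r'\1"0"',
        conf_text,
    )


# Sample content in the usual format of 20auto-upgrades (an assumption).
sample = (
    'APT::Periodic::Update-Package-Lists "1";\n'
    'APT::Periodic::Unattended-Upgrade "1";\n'
)
patched = disable_unattended_upgrades(sample)
print(patched)
```

Only the `Unattended-Upgrade` line is changed and a second pass leaves the text untouched, which matters because setup commands may run again on an existing node. On a live machine the running service would presumably also need to be stopped, which this text-only sketch does not cover.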
sky/templates/gcp-ray.yml.j2
9 additions & 1 deletion
@@ -123,6 +123,7 @@ initialization_commands: []
  # Increment the following for catching performance bugs easier:
  # current num items (num SSH connections): 1 (+1 if tpu_vm)
  setup_commands:
+ # Disable `unattended-upgrades` to prevent apt-get from hanging. It should be called at the beginning before the process started to avoid being blocked. (This is a temporary fix.)
  # Line 'mkdir -p ..': Create ~/.ssh/config file in case the file does not exist in the custom image.
  # Line 'pip3 --v ..': Make sure python3 & pip3 are available on this image.
  # Line 'which conda ..': some images (TPU VM) do not install conda by
@@ -132,7 +133,14 @@ setup_commands:
  # Line 'sudo grep ..': set the number of threads per process to unlimited to avoid ray job submit stucking issue when the number of running ray jobs increase.
  # Line 'mkdir -p ..': disable host key check
  # Line 'python3 -c ..': patch the buggy ray files and enable `-o allow_other` option for `goofys`
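
The 'mkdir -p ..' comment says that step disables the SSH host key check, though the command is elided in this view. A minimal sketch of the idea, assuming the step appends a `Host *` block with `StrictHostKeyChecking no` to `~/.ssh/config` only when it is missing. The helper name `ensure_host_key_check_disabled` is hypothetical, and the sketch works on the file's text rather than touching the filesystem.

```python
# OpenSSH client config block that turns off host key checking for all hosts.
STRICT_BLOCK = "Host *\n  StrictHostKeyChecking no\n"


def ensure_host_key_check_disabled(config_text: str) -> str:
    """Hypothetical helper: append the block unless it is already present."""
    if STRICT_BLOCK.strip() in config_text:
        # Already configured; return the text unchanged so reruns are no-ops.
        return config_text
    if config_text and not config_text.endswith("\n"):
        # Keep existing content on its own line before appending.
        config_text += "\n"
    return config_text + STRICT_BLOCK


existing = "Host bastion\n  User ubuntu"
updated = ensure_host_key_check_disabled(existing)
print(updated)
```

Checking before appending keeps the step idempotent, so repeated setup runs do not add duplicate blocks to the config file.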