"Permission denied" error when not using the default namespace in hf_interactive notebook #129

MichaelClifford · 2023-05-24T18:10:11Z

I've noticed the below permissions error while running the hf_interactive.ipynb notebook demo.

This error occurs when I deploy my RayCluster in any namespace besides the default namespace. Simply using default overcomes the permissions issue. However, this is not behavior we want to support. We need to ensure that the actions of the codeflare-sdk have the required permissions in the correct namespaces.

This issue can be partially circumvented by adding runtime_env = "env_vars": {"HF_HOME":"huggingface"} to your ray.init() command, however, it will just lead to a similar permission issue later on in training.

How can we ensure that the users have the correct permissions in their namespaces?

The text was updated successfully, but these errors were encountered:

KPostOffice · 2023-05-30T20:54:38Z

Is this connected to #125 ?

KPostOffice · 2023-06-09T15:34:48Z

My current thought is that this is an issue with the upstream image not being built to be compatible with running as an arbitrary user on OpenShift. I'm planning on rebuilding and pushing the image after adding something like:

RUN chgrp -R 0 /home/ray && \
    chmod -R g+rwX /home/ray

as seen here

If this works we can rebuild the images ourselves and push them to our quay org. Then we can see if Ray is open to adding this to their builds

KPostOffice · 2023-06-13T15:26:03Z

I was able to get the hf_interactive notebook working using the following Dockerfile

FROM ghcr.io/foundation-model-stack/base:ray2.1.0-py38-gpu-pytorch1.12.0cu116-20221213-193103

USER 0

RUN chgrp -R 0 /home/ray && chmod -R g+rwX /home/ray

which is available at quay.io/kpostlet/ray:2.1.0 if anyone else wants to use this for testing you can pass it to your ClusterConfiguration with the image parameter.

cluster = Cluster(ClusterConfiguration(
    ...,
    image='quay.io/kpostlet/ray:2.1.0'
))

KPostOffice · 2023-06-14T18:42:36Z

This was added but reverted in the ray project. See ray-project/ray#32025 for discussion

KPostOffice · 2023-06-14T18:45:13Z

My current ideas are that we can either:

host a custom image with the correct permissions in the home directory
run pods with ray user (I think this is UID=1000)

MichaelClifford added this to Project CodeFlare Sprint Board May 24, 2023

MichaelClifford moved this to Todo in Project CodeFlare Sprint Board May 24, 2023

KPostOffice self-assigned this Jun 8, 2023

KPostOffice moved this from Todo to In Progress in Project CodeFlare Sprint Board Jun 8, 2023

KPostOffice mentioned this issue Jun 14, 2023

add scc that forces ray pods to run as user 1000 opendatahub-io/distributed-workloads#56

Merged

3 tasks

KPostOffice moved this from In Progress to Ready For Review in Project CodeFlare Sprint Board Jun 20, 2023

KPostOffice closed this as completed in opendatahub-io/distributed-workloads#56 Jun 20, 2023

github-project-automation bot moved this from Ready For Review to Done in Project CodeFlare Sprint Board Jun 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

"Permission denied" error when not using the default namespace in hf_interactive notebook #129

"Permission denied" error when not using the default namespace in hf_interactive notebook #129

MichaelClifford commented May 24, 2023

KPostOffice commented May 30, 2023

KPostOffice commented Jun 9, 2023 •

edited

Loading

KPostOffice commented Jun 13, 2023

KPostOffice commented Jun 14, 2023

KPostOffice commented Jun 14, 2023

"Permission denied" error when not using the default namespace in hf_interactive notebook #129

"Permission denied" error when not using the default namespace in hf_interactive notebook #129

Comments

MichaelClifford commented May 24, 2023

KPostOffice commented May 30, 2023

KPostOffice commented Jun 9, 2023 • edited Loading

KPostOffice commented Jun 13, 2023

KPostOffice commented Jun 14, 2023

KPostOffice commented Jun 14, 2023

KPostOffice commented Jun 9, 2023 •

edited

Loading