Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to prepare the whole dataset for training? #3

Open
Hongbin98 opened this issue Jun 12, 2023 · 3 comments
Open

How to prepare the whole dataset for training? #3

Hongbin98 opened this issue Jun 12, 2023 · 3 comments

Comments

@Hongbin98
Copy link

Hi!
First of all, thank you for your excellent work!

When I try to train a Prompt Diffusion model, I meet the FileNotFoundError.
Even if I have run the 'annotate_data.py' to prepare the dataset in default settings, I still get into this trouble. Would you offer me a hand?

Are there any additional requirements for the dataset preparation, or am I mistaken?

@Zhendong-Wang
Copy link
Owner

Hi there,

Thanks for your interest! Could you provide the error message? I am not sure what files are missing when you were running.
For running annotate_data.py, you need to provide the data path, and the seed start and end number for annotation.

@Hongbin98
Copy link
Author

For example, when I try to train a Prompt Diffusion model, I meet the FileNotFoundError: No such file or directory: './clip-filtered-dataset/0409092/2158047904_1_depth.jpg'.
And I also find that the annotate_data.py only transforms the images f"{seed}_0_depth.jpg" (only transform '0' but not '1').
Are there any problems with this script?

@Zhendong-Wang
Copy link
Owner

The annotate_data.py is an example script that we used for data annotation. It shows how to annotate data for {seed}_0. You could easily modify it to annotate {seed}_1 data as well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants