Fixes issue with model reconstruction of the upper half of the image & saves model checkpoint in s3 #193
- Lr -> 1e-4 to 1e-5
- Data -> Size: 256 x 256, patchsize: 16
- Log checkpoints to s3
- Save model params along with optimizer & epoch state
…while training & validation.
for more information, see https://pre-commit.ci
Looks good except for some strange test errors 🐈
half of the image & saves model checkpoint in s3 (#193)
- Fix issue with not shuffling during validation run. Use shuffle=True while training & validation.
- Log to devseed-gaia account of wandb & save checkpoints on s3.
- Update params for v0.2 model run
- Lr -> 1e-4 to 1e-5
- Data -> Size: 256 x 256, patchsize: 16
- Log checkpoints to s3
- Save model params along with optimizer & epoch state
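For readers checking the data parameters in the commit message, here is a minimal sketch of the patch arithmetic for a 256 x 256 input with patch size 16. The helper `patch_grid` and the row-major patch ordering are assumptions made for illustration; they are not taken from this repository's code.

```python
def patch_grid(image_size: int, patch_size: int) -> tuple[int, int]:
    """Return (patches per side, total patches) for a square image."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    per_side = image_size // patch_size
    return per_side, per_side * per_side


per_side, total = patch_grid(image_size=256, patch_size=16)

# Assuming patches are indexed row by row from the top-left corner,
# the upper half of the image is the first half of the patch rows.
upper_half = [i for i in range(total) if i // per_side < per_side // 2]

print(per_side, total, len(upper_half))
```

Under that ordering assumption, a reconstruction that only covers the bottom half of the image would be missing exactly the patch indices listed in `upper_half`.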
@weiji14 I am getting an error with create a
For now, I have merged this branch with main, as we need to develop v1 on top of v0.2. We can fix the issues with conda-lock & do a v0.2 release next week.
Hmm, did you run
I'll patch this up in #225, and also update the docs slightly under the Note section in https://clay-foundation.github.io/model/installation.html#advanced about re-locking the conda-lock.yml file.
This PR resolves an issue where the model reconstructed only the bottom 50% of the image during validation, and saves model checkpoints to s3.
Fixes #156 #138
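As a rough illustration of "save model params along with optimizer & epoch state", the sketch below bundles the three pieces into one checkpoint file and restores them to resume training. It is a framework-agnostic stand-in that uses only the standard library: the functions `save_checkpoint` and `load_checkpoint`, the dictionary keys, the file name, and the toy state values are all hypothetical and are not this PR's actual code. In PyTorch the two state dictionaries would typically come from `state_dict()` calls and the file would be written with `torch.save`.

```python
import pickle
import tempfile
from pathlib import Path


def save_checkpoint(path, model_state, optimizer_state, epoch):
    """Write model params, optimizer state and epoch into a single file."""
    checkpoint = {
        "epoch": epoch,
        "model_state": model_state,
        "optimizer_state": optimizer_state,
    }
    with open(path, "wb") as f:
        pickle.dump(checkpoint, f)


def load_checkpoint(path):
    """Read back the dictionary written by save_checkpoint."""
    with open(path, "rb") as f:
        return pickle.load(f)


# Toy stand-ins for real model and optimizer state (hypothetical values).
model_state = {"encoder.weight": [0.1, 0.2], "decoder.bias": [0.0]}
optimizer_state = {"lr": 1e-5, "step": 1200}

with tempfile.TemporaryDirectory() as tmp:
    ckpt_path = Path(tmp) / "epoch-03.ckpt"  # hypothetical file name
    save_checkpoint(ckpt_path, model_state, optimizer_state, epoch=3)
    restored = load_checkpoint(ckpt_path)

# Resuming continues from the epoch after the one that was saved.
start_epoch = restored["epoch"] + 1
```

Keeping the optimizer and epoch state next to the weights means a run can be resumed from the checkpoint rather than only used for inference. Uploading the resulting file to s3 is a separate step that this sketch leaves out.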