Worker: Runs get lost if the dataclip is >150mb #897

Open
josephjclark opened this issue Mar 6, 2025 · 0 comments

Comments

@josephjclark
Collaborator

On Staging, with a 200mb dataclip, I'm seeing this pattern of behaviour:

  • The step completes with fairly high memory usage: [R/T] ❯ Final memory usage: [step 515mb] [system 603mb]
  • The step:complete message is sent to Lightning
  • The run:complete message seems to fail to send
  • Maybe 1 second later, the server seems to restart. There is no exception or message in the log.

Since run:complete fails to send, the run will be marked as Lost.
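
To help narrow down where the threshold sits, here is a rough sketch of a size check that could be run against a payload before it is sent. Everything in it is an assumption for illustration: `checkPayloadSize` and `ASSUMED_LIMIT_BYTES` are hypothetical names, not part of the worker, and the 150mb figure is only taken from the issue title rather than from any known limit.

```typescript
// Hypothetical helper: none of these names come from the worker codebase.
const MB = 1024 * 1024;

// Assumed threshold, taken from the issue title (>150mb). The real limit, if any, is unknown.
const ASSUMED_LIMIT_BYTES = 150 * MB;

interface SizeCheck {
  bytes: number; // UTF-8 size of the JSON-serialized payload
  overLimit: boolean; // true when bytes exceeds the limit
}

// Measure the UTF-8 size of a value once it is serialized to JSON.
function checkPayloadSize(
  payload: unknown,
  limitBytes: number = ASSUMED_LIMIT_BYTES
): SizeCheck {
  const json = JSON.stringify(payload);
  // JSON.stringify returns undefined at runtime for values like `undefined`.
  const text = typeof json === "string" ? json : "";
  const bytes = new TextEncoder().encode(text).length;
  return { bytes, overLimit: bytes > limitBytes };
}

// Small demonstration with a tiny limit so it runs quickly.
const demo = checkPayloadSize({ data: "x".repeat(100) }, 50);
console.log(demo.overLimit);
```

Logging the result just before run:complete is sent would at least show how large the final payload is when the run gets lost. It would not explain the restart on its own.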

Here's an example on staging

Here are the logs on GCP

github-project-automation bot moved this to New Issues in v2 on Mar 6, 2025
theroinaochieng moved this from New Issues to DevX Backlog in v2 on Mar 7, 2025