Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

import-projects failing with 404 at api.find_data_objects #386

Open
mbthornton-lbl opened this issue Feb 18, 2025 · 0 comments
Open

import-projects failing with 404 at api.find_data_objects #386

mbthornton-lbl opened this issue Feb 18, 2025 · 0 comments
Assignees

Comments

@mbthornton-lbl
Copy link
Contributor

mbthornton-lbl commented Feb 18, 2025

Running import for Bioscales in with no DB Update - this error was raised:

  File "/global/cfs/cdirs/m3408/nmdc_automation/prod/nmdc_automation/nmdc_automation/run_process/run_import.py", line 92, in import_projects
    import_mapper.add_do_mappings_from_workflow_executions()
  File "/global/cfs/cdirs/m3408/nmdc_automation/prod/nmdc_automation/nmdc_automation/import_automation/import_mapper.py", line 288, in add_do_mappings_from_workflow_executions
    data_object = self.runtime_api.find_data_objects(data_object_id)

  File "/global/cfs/cdirs/m3408/nmdc_automation/prod/nmdc_automation/nmdc_automation/api/nmdcapi.py", line 427, in find_data_objects
    resp.raise_for_status()
  File "/global/cfs/cdirs/m3408/nmdc_automation/prod/nmdc_automation/.venv/lib/python3.11/site-packages/requests/models.py", line 1024, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 404 Client Error: Not Found for url: https://api.microbiomedata.org/data_objects/nmdc:dobj-11-yxpvtk31

Analysis:

The function add_do_mappings_from_workflow_executions is making the API call that is failing with a 404 error.
This function does the initial population of data object mapping based on workflow execution records found in the database, so the Not Found error is unexpected in this case, as all data object IDs being searched are from the has_output array of workflow execution records found in the database

The Data Object ID in question: nmdc:dobj-11-yxpvtk31 appears to be a dangling reference and is not found in the prod DB

https://api.microbiomedata.org/data_objects/nmdc%3Adobj-11-yxpvtk31

{"detail":"Not found"}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Development

No branches or pull requests

1 participant