
[Question] Tracks with AI agents #9054

Open
PMazarovich opened this issue Feb 5, 2025 · 5 comments
@PMazarovich
Contributor

Hello!
Recently we came across this: https://www.cvat.ai/blog/announcing-cvat-ai-agents#what-is-a-cvat-ai-agent-
As far as I understand the process, it is currently impossible to output tracks from any model mounted to an agent, because of the frame-by-frame approach: we feed the model images one frame at a time and get annotations back for each separate frame, so no tracks are possible.
Am I right?
Thanks!
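To illustrate the limitation being asked about, here is a minimal sketch of a frame-by-frame annotation loop. This is not CVAT's actual agent API; the function names and result format are assumptions made up for illustration. The point is that each frame is processed independently, so no detection carries a track identity linking it to detections on other frames.

```python
# Illustrative sketch only -- not CVAT's real agent interface.
# Each frame is fed to the model on its own, so the results are
# plain per-frame shapes with no track_id connecting frames.

def detect(frame):
    # stand-in for a mounted detection model: returns boxes for one frame
    return [{"label": "car", "bbox": (0, 0, 10, 10)}]

def annotate_task(frames):
    annotations = []
    for frame_index, frame in enumerate(frames):
        for shape in detect(frame):
            # each result is tied only to its own frame; nothing here
            # links "the same car" between frame 0 and frame 1
            annotations.append({"frame": frame_index, **shape})
    return annotations

result = annotate_task(["frame0", "frame1"])
```

Producing tracks would require an extra association step (or a model that sees multiple frames), which is exactly what the current per-frame flow does not provide.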

@bsekachev bsekachev added the question Further information is requested label Feb 5, 2025

SpecLad commented Feb 5, 2025

This is correct at the moment. However, we're planning to expand agent capabilities to cover tracking as well.

@PMazarovich
Contributor Author

@SpecLad, thanks for the answer. Should we close this, or will you close it once tracking is in place?


PMazarovich commented Feb 5, 2025

@SpecLad, another question: right now it is impossible to send an image plus the annotations for that image as input to the model; only the image flows into the model. Do you think this will be supported?
Thanks!


SpecLad commented Feb 5, 2025

I haven't considered it, though in principle it could be done. Could you explain your use case for a feature like this?


PMazarovich commented Feb 5, 2025

Sure.
Some models might benefit from input from users (or from other models) when running pre-annotations. For instance, an OCR model might benefit from bounding-box information giving the location of the text (or texts) to be extracted from the image.

Imagine a large image, where a car is visible. The task of extracting the car license plate from the full image is vastly simplified if the OCR model is given both the image data and information about the location of the plate (bbox) inside that image. For this reason, having the ability to send both image and extra data (such as bboxes) might be important.

In the above scenario, the bbox of the license plate would be created in CVAT via the UI, or potentially by another model that detects license plates.
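The license-plate scenario above can be sketched as follows. This is a hypothetical illustration, not CVAT's API: the `BBox` type, `crop` helper, and `run_ocr_on_region` function are invented here to show how a bbox created in the UI (or by a detector) would narrow the OCR model's input from the full frame to just the plate region.

```python
# Hypothetical sketch: a bbox pre-annotation lets an OCR model work on a
# small crop instead of the whole frame. Not CVAT's actual interface.

from dataclasses import dataclass

@dataclass
class BBox:
    x: int  # left edge, pixels
    y: int  # top edge, pixels
    w: int  # width, pixels
    h: int  # height, pixels

def crop(image, box: BBox):
    """Return the sub-image covered by `box` (image is a list of pixel rows)."""
    return [row[box.x:box.x + box.w] for row in image[box.y:box.y + box.h]]

def run_ocr_on_region(image, box: BBox, ocr_model):
    """Feed only the plate region to the OCR model instead of the full frame."""
    return ocr_model(crop(image, box))

# Example: a 4x6 toy "image" and a bbox drawn in the CVAT UI
# (or produced by a plate-detection model).
frame = [[10 * r + c for c in range(6)] for r in range(4)]
plate_box = BBox(x=2, y=1, w=3, h=2)
region = crop(frame, plate_box)
```

The design point is that the agent would need to forward existing annotations (the bbox) to the model together with the image, rather than the image alone.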
