Screenshot Collection: Puppeteer captures the browser screen and sends it to the FastAPI backend.
Vision Model Prediction: The Llama Vision Model processes the screenshot and predicts the coordinates (x, y) of the target element and the action to perform on it (e.g., click, scroll).
Action Execution: Puppeteer performs the predicted action in the browser.
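The issue does not specify how the prediction travels from the backend to Puppeteer, so the following is only a minimal sketch of one way the backend could validate a model prediction before returning it. It assumes the model output is a JSON object with `x`, `y`, and `action` fields; those field names, the `Prediction` class, and the `parse_prediction` helper are all hypothetical and not taken from the project.

```python
import json
from dataclasses import dataclass

# Assumption: only the two actions named as examples in the issue are supported.
ALLOWED_ACTIONS = {"click", "scroll"}


@dataclass(frozen=True)
class Prediction:
    """Hypothetical container for one model prediction."""
    x: int
    y: int
    action: str


def parse_prediction(raw: str, width: int, height: int) -> Prediction:
    """Parse a JSON prediction and check it against the screenshot size.

    `raw` is assumed to look like '{"x": 120, "y": 340, "action": "click"}'.
    `width` and `height` are the screenshot dimensions in pixels.
    """
    data = json.loads(raw)

    # Normalize the action name and reject anything unexpected.
    action = str(data["action"]).lower()
    if action not in ALLOWED_ACTIONS:
        raise ValueError(f"unsupported action: {action!r}")

    # Coordinates must land inside the captured screenshot.
    x, y = int(data["x"]), int(data["y"])
    if not (0 <= x < width and 0 <= y < height):
        raise ValueError(
            f"coordinates ({x}, {y}) fall outside the {width}x{height} screenshot"
        )

    return Prediction(x=x, y=y, action=action)
```

Rejecting out-of-range coordinates and unknown actions on the backend means the Puppeteer side only ever receives a prediction it can execute, which may make failures during final testing easier to trace to the model rather than to the browser automation.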
SharanyaSD changed the title from "Final Testing and Presentation (3 Hours)" to "Final Testing and Presentation" on Dec 19, 2024