
Add a simple Gradio UI for Open Deep Research #525

Open
wants to merge 7 commits into main

Conversation


@dceluis dceluis commented Feb 6, 2025

Hi,

This PR makes a few changes to get Open Deep Research working smoothly with Gemini 2.0 Flash through LiteLLM, and sets up a basic Gradio demo to show it off. I focused on Gemini 2.0 Flash specifically because it's a cheap way to experiment with the library.

Here's a breakdown of what I did:

  1. Gemini 2.0 Flash has a couple of quirks that needed addressing:

    • Empty user messages: Gemini 2.0 Flash (unlike some other models) requires at least one user message to be present. I've changed the planning_step method in src/smolagents/agents.py to send the initial_facts prompt as a user message instead of a system message. This resolves an error where Gemini would complain about contents being empty.
    • litellm.add_function_to_prompt is now set to True or False depending on whether the selected model supports native function calling.
    • Empty content: model_output sometimes comes back as None, causing an error; the code now returns early in this case.
    • Used getattr to avoid errors when last_input_token_count is None (see the sketch after this list).
  2. Tool Calling Fixes: The changes make sure LiteLLM can create a valid tool-calling request for Gemini, avoiding an empty parameters dict (also illustrated after this list).

  3. Gradio Demo App: I've included a simple Gradio application.

Demo video: Peek.2025-02-06.18-50.mp4
  4. Managed Agent Prompt Changes: Prompt changes in code_agent.yaml and toolcalling_agent.yaml to improve accuracy when calling managed agents.
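
To make the bullets under (1) and the tool-calling fix in (2) concrete, here's a rough, self-contained sketch. The helper names and the tool schema below are just for illustration, not the actual smolagents code:

```python
def last_input_tokens(model) -> int:
    """Token count of the previous call; last_input_token_count can be None
    (e.g. before the first model call), so fall back to 0."""
    return getattr(model, "last_input_token_count", None) or 0


def parse_model_output(model_output):
    """Gemini via LiteLLM occasionally returns an empty completion; return
    early instead of trying to parse None further down the agent loop."""
    if model_output is None:
        return None
    return model_output.strip()


# Gemini also rejects tool declarations whose "parameters" object is empty,
# so even argument-less tools need a minimal JSON schema, e.g.:
valid_tool = {
    "type": "function",
    "function": {
        "name": "final_answer",
        "description": "Return the final answer to the user.",
        "parameters": {
            "type": "object",
            "properties": {
                "answer": {"type": "string", "description": "The final answer."}
            },
            "required": ["answer"],
        },
    },
}
```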

Blocked by this PR on LiteLLM

But one can do:

export LITELLM_LOCAL_MODEL_COST_MAP=True and set the config locally.
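
For anyone who wants to try this before the upstream fix lands, a minimal sketch in Python. The environment variable comes from the line above; the litellm.register_model call and the example pricing entry are illustrative, not required:

```python
import os

# Use LiteLLM's bundled model cost map instead of fetching it remotely;
# must be set before litellm is imported to take effect.
os.environ["LITELLM_LOCAL_MODEL_COST_MAP"] = "True"

import litellm

# Optionally register a local entry for a model the bundled map doesn't know yet.
litellm.register_model({
    "gemini/gemini-2.0-flash": {
        "max_tokens": 8192,
        "input_cost_per_token": 0.0,
        "output_cost_per_token": 0.0,
        "litellm_provider": "gemini",
        "mode": "chat",
    }
})
```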

```diff
@@ -455,7 +454,7 @@ def planning_step(self, task, is_first_step: bool, step: int) -> None:
         """
         if is_first_step:
             message_prompt_facts = {
-                "role": MessageRole.SYSTEM,
+                "role": MessageRole.USER,
```
Contributor


Won't this actually break other models?..

Author


OK I did some searching and it seems like there was a change recently that broke Gemini compatibility:

https://github.com/huggingface/smolagents/pull/502/files#diff-e03ffeb1ffdc0c18d4a29eaec2694a39de7c9ce98647a9dcf5fc372a61d4d40aL520

the above change makes the request to model() not include any USER message, as it moves the task description to a SYSTEM message. That change could have been made to a) increase accuracy or b) simplify the prompting implementation. In any case, the fact that some models do not support being sent a single system message was probably unknown and understandably overlooked. I can revert my change, revert the above change, or wait for instructions.

Regarding breaking compatibility: most models (if not all) support being sent a single user message. The opposite is not true; at least Gemini models do not yet support receiving only a system message.
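
To illustrate the point (placeholder prompt text and model name, not what smolagents actually sends):

```python
import litellm

# Only a system message, no user turn: Gemini (via LiteLLM) rejects this with
# a "contents must not be empty"-style error, while OpenAI-compatible models
# generally accept it.
system_only = [{"role": "system", "content": "List the facts needed for the task."}]

# The same prompt sent as a user message is accepted by effectively every
# provider, which is what the role change above does for the initial facts prompt.
user_only = [{"role": "user", "content": "List the facts needed for the task."}]

# response = litellm.completion(model="gemini/gemini-2.0-flash", messages=user_only)
```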

```diff
@@ -686,6 +691,12 @@ def __call__(
     ) -> ChatMessage:
         import litellm
 
+        # IMPORTANT - Set this to TRUE to add the function to the prompt for Non OpenAI LLMs
+        if litellm.supports_function_calling(model=self.model_id) == True:
```
Contributor


if litellm.supports_function_calling(model=self.model_id):

Or

if litellm.supports_function_calling(model=self.model_id) is True:

if you want to be very specific.

Author


I'm new to this project so I'm not sure if some idioms are acceptable or not.

What do you think of:
litellm.add_function_to_prompt = not litellm.supports_function_calling(model=self.model_id)
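
Something like this, as a simplified stand-in for the actual model wrapper (only model_id and the two litellm calls come from the diff above):

```python
import litellm


class LiteLLMChat:
    """Simplified stand-in for the smolagents LiteLLM model wrapper."""

    def __init__(self, model_id: str):
        self.model_id = model_id

    def __call__(self, messages, **kwargs):
        # Inject function schemas into the prompt only for models without
        # native tool-calling support, instead of hard-coding the flag.
        litellm.add_function_to_prompt = not litellm.supports_function_calling(
            model=self.model_id
        )
        return litellm.completion(model=self.model_id, messages=messages, **kwargs)
```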

Author

dceluis commented Feb 8, 2025

Added a notice about add_function_to_prompt. Seems like the feature is broken in LiteLLM.
It wasn't working before and it's not working now, so it's not a breaking change. But I submitted a fix upstream. #
