Skip to content

kk-engineer/ai-agents-v1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI Agents

This project has multiple AI Agents.

You can run the LLM locally or integrate with a LLM via API.

Agents:

  • RoboClaw
    • Interact with the agent via WhatsApp and CLI
  • Search Agent
    • A streamlit UI chat interface to talk to your agent (searches the web when it does not have the info).

1. Getting Started (local LLM) 🚀

1.1 Use llama.cpp (Preferred, as faster on Mac)

brew install cmake
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
cmake -B build -DGGML_METAL=ON
cmake --build build --config Release

1.2 Install Ollama Before running the project, you must have Ollama installed to manage your local LLMs.

2. Set Up the Model

2.1 For llama.cpp (mistral-nemo-12b)

Note: Run commands in llama.cpp directory

Download the model

brew install aria2
aria2c -x 16 -s 16 "https://huggingface.co/bartowski/Mistral-Nemo-Instruct-2407-GGUF/resolve/main/Mistral-Nemo-Instruct-2407-Q4_K_M.gguf?download=true" -d models -o mistral-nemo-12b.gguf

Run the model

Note: Run commands in llama.cpp directory

./build/bin/llama-server -m models/mistral-nemo-12b.gguf --n-gpu-layers -1 --ctx-size 8192 --port 8000

Note: Downloading the model from internet will take 5-10 mins(depending upon internet speed)

Note: You can also try Qwen model (they are faster), but call terminal(shell) tool for everything. Will need to fix the prompt accordingly.

Download and run Qwen 2.5

Note: Run commands in llama.cpp directory

cd /models
curl -L https://huggingface.co/bartowski/Qwen2.5-Coder-7B-Instruct-GGUF/resolve/main/Qwen2.5-Coder-7B-Instruct-Q4_K_M.gguf -o qwen2.5-7b.gguf
cd ..
./build/bin/llama-server -m models/qwen2.5-7b.gguf --n-gpu-layers -1 --ctx-size 8192 --port 8000

2.2 For Ollama (gemma4:e4b model) Once Ollama is installed, pull and run the specific version of a model:

# Pull the model (this may take some time, depends on your internet speed)
ollama pull gemma4:e4b

Note: I am running the model on M1 Mac 16 GB RAM.

In case your system has lower RAM then - you can use smaller model, such as, "qwen2.5-coder:7b-instruct".

3. Clone and Setup Project

# Clone the repository
git clone https://github.com/kk-engineer/ai_agents.git
cd ai_agents/

# Install uv for Python package management (instead of pip)
brew install uv

# Create a virtual environment
uv venv
source .venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
uv pip install -e .

4. Run the App

  • RoboClaw
python3 agents/robo_claw/app.py
  • Search Agent
streamlit run agents/search_agent/app.py

About

AI Agents to search web, call terminal commands, and many more.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages