Llama 3.3 RAG Application Using SambaNova, the Lightning-Fast Inference Engine

A RAG (Retrieval-Augmented Generation) application for chatting with your documents. It pairs Llama 3.3 served by SambaNova with LlamaIndex for orchestration, Qdrant for vector storage, and Streamlit for the interface.

Tech Stack

SambaNova: Lightning-fast inference engine for Llama 3.3
LlamaIndex: RAG orchestration framework
Qdrant: Vector database for efficient embedding storage
Streamlit: User interface framework

Prerequisites

Python 3.11 or later
Docker (for Qdrant)
SambaNova API key

Setup Instructions

Configure SambaNova

Get your API key from SambaNova and create a .env file:
```
SAMBANOVA_API_KEY=<YOUR_SAMBANOVA_API_KEY>
```
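
The application needs this key in its environment at startup. A minimal sketch of reading it, assuming a plain `KEY=VALUE` file; `load_env_file` and `get_sambanova_key` are hypothetical helper names for illustration and are not taken from the repository's code:

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Parse simple KEY=VALUE lines and export them without overriding existing variables."""
    env_path = Path(path)
    if not env_path.is_file():
        return {}
    loaded = {}
    for raw in env_path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        # Skip blanks, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key, value = key.strip(), value.strip().strip("'\"")
        loaded[key] = value
        os.environ.setdefault(key, value)
    return loaded


def get_sambanova_key():
    """Return the key, or fail early with a readable message (hypothetical helper)."""
    key = os.environ.get("SAMBANOVA_API_KEY", "")
    # A value still starting with "<" is treated as the unedited placeholder.
    if not key or key.startswith("<"):
        raise RuntimeError("SAMBANOVA_API_KEY is not set; add it to .env")
    return key
```

Calling `load_env_file()` followed by `get_sambanova_key()` before any model call surfaces a missing or placeholder key immediately instead of as an authentication error later. A dedicated parser such as python-dotenv handles quoting and escapes more thoroughly than this sketch.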

Launch Qdrant VectorDB

```
docker run -p 6333:6333 -p 6334:6334 \
  -v "$(pwd)/qdrant_storage:/qdrant/storage:z" \
  qdrant/qdrant
```
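
Before starting the app, it can help to confirm that the container is accepting connections. A minimal sketch using only the standard library, assuming Qdrant runs on `localhost` with the port mapping from the command above (6333 is Qdrant's HTTP port, 6334 its gRPC port); `qdrant_port_open` is a hypothetical helper name:

```python
import socket


def qdrant_port_open(host="localhost", port=6333, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolved host names.
        return False
```

`qdrant_port_open()` only checks that something is listening on the port. It does not verify that the listener is Qdrant or that any collection exists.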

Install Dependencies

```
pip install streamlit llama-index-vector-stores-qdrant llama-index-llms-sambanovasystems sseclient-py
```
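
To verify that the install succeeded in the active environment, the package names from the command above can be looked up with the standard library. This is a sketch; `REQUIRED` and `missing_packages` are hypothetical names introduced here:

```python
from importlib import metadata

# Distribution names exactly as passed to pip in the install command.
REQUIRED = [
    "streamlit",
    "llama-index-vector-stores-qdrant",
    "llama-index-llms-sambanovasystems",
    "sseclient-py",
]


def missing_packages(names=REQUIRED):
    """Return the distributions from `names` that are not installed."""
    missing = []
    for name in names:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing
```

An empty list from `missing_packages()` means every package named in the install command is present. It does not check version compatibility between them.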

Launch Application
```
streamlit run app.py
```

Usage

Start the application using the command above
Upload your documents through the Streamlit interface
Begin chatting with your documents using natural language queries
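
Conceptually, each chat query follows a retrieve-then-generate pattern: find the document chunks most relevant to the question, then pass them to the model as context. The toy sketch below illustrates only that pattern. It is not the repository's code: it swaps in bag-of-words cosine similarity where the real stack would use dense embeddings stored in Qdrant, and it stops at building the prompt rather than calling Llama 3.3. All function names are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query, chunks, top_k=2):
    """Return the top_k chunks ranked by similarity to the query."""
    q = Counter(tokenize(query))
    ranked = sorted(chunks, key=lambda c: cosine(q, Counter(tokenize(c))), reverse=True)
    return ranked[:top_k]


def build_prompt(query, chunks, top_k=2):
    """Assemble the retrieved context and the question into a single prompt string."""
    context = "\n".join(f"- {c}" for c in retrieve(query, chunks, top_k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"
```

In the actual application, the ranking step is delegated to the vector database and the prompt is sent to the model; the sketch only shows why uploaded documents must be indexed before questions can be answered.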

Contributing

We welcome contributions! Please:

Fork the repository
Create your feature branch
Commit your changes
Push to your branch
Create a Pull Request

License

This project is licensed under the MIT License.
