rag_pdf_services

TODO:

conda create -n backend python=3.10 -y
conda activate backend
pip install -r requirements.txt

The Milvus server runs in Docker. Set it up with the following steps first:

Download milvus-standalone-docker-compose.yml and save it as docker-compose.yml manually, or with the following command:

wget https://github.com/milvus-io/milvus/releases/download/v2.3.2/milvus-standalone-docker-compose.yml -O docker-compose.yml

In the same directory as the docker-compose.yml file, start up Milvus by running:

sudo docker-compose up -d

Check that a volumes folder has been created in the same directory as the docker-compose.yml file, then verify that the containers are up and running:

sudo docker compose ps
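The same check can be scripted. The following is a minimal sketch, assuming the container names milvus-etcd, milvus-minio, and milvus-standalone from the standalone compose file. The helper names missing_containers and list_running_containers are hypothetical and not part of this repository.

```python
import subprocess

# Container names assumed from the Milvus standalone compose file.
EXPECTED = ("milvus-etcd", "milvus-minio", "milvus-standalone")


def missing_containers(running_names, expected=EXPECTED):
    """Return the expected container names that are absent from running_names."""
    running = {name.strip() for name in running_names if name.strip()}
    return [name for name in expected if name not in running]


def list_running_containers():
    """Ask Docker for the names of running containers (may require sudo)."""
    result = subprocess.run(
        ["docker", "ps", "--format", "{{.Names}}"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.splitlines()
```

Calling missing_containers(list_running_containers()) on the Docker host returns an empty list when all three containers are running.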

Find the local port that the Milvus server is listening on (replace milvus-standalone with your own container name if it differs):

docker port milvus-standalone 19530/tcp
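Once the port is known, a client can connect to it. The sketch below is illustrative only: it assumes pymilvus is installed (for example through requirements.txt) and that the command above prints a binding such as 0.0.0.0:19530. The function names parse_docker_port and connect_to_milvus are hypothetical.

```python
def parse_docker_port(output):
    """Parse `docker port` output such as '0.0.0.0:19530' into (host, port)."""
    # Docker may print one line per binding (IPv4 and IPv6); use the first one.
    lines = [line.strip() for line in output.splitlines() if line.strip()]
    if not lines:
        raise ValueError("no port binding found in docker output")
    host, _, port = lines[0].rpartition(":")
    # A wildcard bind address means the port is reachable on localhost.
    if host in ("0.0.0.0", "::", "[::]", ""):
        host = "localhost"
    return host, int(port)


def connect_to_milvus(host, port):
    """Open a pymilvus connection under the 'default' alias."""
    from pymilvus import connections  # imported lazily; requires pymilvus

    connections.connect(alias="default", host=host, port=str(port))
```

For a default local setup, connect_to_milvus(*parse_docker_port("0.0.0.0:19530")) would connect to localhost on port 19530.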

Supported pipelines:

The examples/ folder contains some use cases, each generally split into three steps:

1. Call DirectoryReader from src.reader.dir_reader and pass the list of file paths as the input_files argument.
2. Initialize a Hydra config and pass it as the config argument of ConfigurationManager.
3. Instantiate a pipeline class from src.pipeline, pass it the config, and run its main() function. Currently only the data ingestion pipeline is supported.
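The three steps can be sketched roughly as follows. This is not the repository's actual code: the function run_ingestion is hypothetical, the classes are passed in as arguments so the sketch stays self-contained, and the constructor signatures (input_files=..., config=..., and a pipeline built from the configuration manager) are assumptions based on the description above. How the reader's output reaches the pipeline is not specified here, so the sketch simply returns the reader; see examples/ for the real wiring.

```python
def run_ingestion(input_files, hydra_cfg, reader_cls, config_manager_cls, pipeline_cls):
    """Walk through the three steps with caller-supplied classes."""
    # Step 1: build the reader from a list of file paths.
    reader = reader_cls(input_files=list(input_files))
    # Step 2: wrap the Hydra config in the configuration manager.
    config = config_manager_cls(config=hydra_cfg)
    # Step 3: build the pipeline from the config and run it.
    # Assumption: the pipeline constructor takes the configuration manager.
    pipeline = pipeline_cls(config)
    result = pipeline.main()
    return reader, result
```

In the repository, reader_cls would be DirectoryReader from src.reader.dir_reader, config_manager_cls would be ConfigurationManager, and pipeline_cls would be the ingestion pipeline class from src.pipeline. The hydra_cfg argument could be created with Hydra's initialize and compose functions, with the config path and config name depending on the project layout.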