make rag to support embedded documents workspace #90

fengsh27 · 2024-01-12T04:56:09Z

Abstract

This submission is to make rag support embedding documents workspace. Currently, all embedded documents operations such as obtaining all documents, performing similarity search and removing documents, are preformed across all embedded documents in vector store. With this submission, an embedded documents workspace (essentially a list of embedded document IDs) will be passed to rag, which is used to define the scope for all operations. If no workspace (i.e., None) passed in, all operations will be considered to be performed across all documents as before.

Besides introducing embedded documents workspace, this submission also moved xinference importing back to XinferenceDocumentEmbedder constructor. Previously, the xinference importing was moved to the beginning of the file to facilitate unit tests. However, this unexpectedly increased the size of biochatter-server docker image, even if xinference was not yet needed. By moving it back, it optimizes biochatter-server docker size and ensures that xinference is only imported when necessary.

merge biocypher/biochatter to main

…rkspace make RAG to support embedding document workspace

make rag support to specify milvus user and pwd

slobentanzer

Looks good and makes sense, thanks! Will bump version to 0.3.11 after the merge.

fengsh27 and others added 5 commits January 6, 2024 11:07

Merge pull request #10 from fengsh27/wip/fengsh/merge_upstream_20240106

0f7ce9e

merge biocypher/biochatter to main

make RAG to support embedding document workspace

ec8eadf

move xinference importing back to constructor

8601f98

remove deprecated code

beceb6e

Merge pull request #11 from fengsh27/wip/fengsh/embedding_document_wo…

0622d78

…rkspace make RAG to support embedding document workspace

fengsh27 temporarily deployed to Test CI January 12, 2024 04:56 — with GitHub Actions Inactive

fengsh27 requested a review from slobentanzer January 12, 2024 04:56

fengsh27 mentioned this pull request Jan 12, 2024

Implement RAG setting page biocypher/biochatter-next#20

Merged

Ubuntu and others added 2 commits January 12, 2024 21:13

make rag support to specify milvus user and pwd

fac3c8b

Merge pull request #12 from fengsh27/wip/fengsh/milvus_user_password

4d29c4f

make rag support to specify milvus user and pwd

fengsh27 temporarily deployed to Test CI January 13, 2024 02:18 — with GitHub Actions Inactive

slobentanzer approved these changes Jan 13, 2024

View reviewed changes

slobentanzer merged commit 5a4a26d into biocypher:main Jan 13, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

make rag to support embedded documents workspace #90

make rag to support embedded documents workspace #90

fengsh27 commented Jan 12, 2024 •

edited

Loading

slobentanzer left a comment

make rag to support embedded documents workspace #90

make rag to support embedded documents workspace #90

Conversation

fengsh27 commented Jan 12, 2024 • edited Loading

Abstract

slobentanzer left a comment

Choose a reason for hiding this comment

fengsh27 commented Jan 12, 2024 •

edited

Loading