feat: Pinecone Vector Database #86

Open
wants to merge 2 commits into
base: main

FinnConnor
Collaborator

Added Pinecone vector database support. To set it up (also documented in the README), set VECTOR_DB_TYPE=pinecone and PINECONE_API_KEY=<api key>.
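As an illustration of the setup above, here is a minimal sketch of how the two environment variables might be read and validated. It is not the code in this PR: the function name `load_vector_db_settings` and the fallback to `pgvector` when `VECTOR_DB_TYPE` is unset are assumptions made for the example.

```python
import os


def load_vector_db_settings(env=None):
    """Read vector DB settings from environment variables.

    Hypothetical helper for illustration only; the default of
    "pgvector" is an assumption, not something the PR states.
    """
    env = os.environ if env is None else env
    db_type = env.get("VECTOR_DB_TYPE", "pgvector").lower()
    settings = {"type": db_type}

    if db_type == "pinecone":
        api_key = env.get("PINECONE_API_KEY")
        if not api_key:
            # Fail early rather than on the first database call.
            raise ValueError(
                "PINECONE_API_KEY must be set when VECTOR_DB_TYPE=pinecone"
            )
        settings["api_key"] = api_key

    return settings


if __name__ == "__main__":
    example_env = {"VECTOR_DB_TYPE": "pinecone", "PINECONE_API_KEY": "<api key>"}
    print(load_vector_db_settings(example_env))
```

Passing a plain dict instead of `os.environ` keeps the helper easy to test without touching the real environment.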

Tested with Pinecone and OpenAI text-embedding-3-small:

  1. Embedding multiple text files
  2. Getting all IDs for multiple files
  3. Deleting documents and confirming they are no longer in the database
  4. Querying one file and confirming the results are relevant to the query
  5. Getting documents by ID
  6. Querying multiple documents and confirming all of them are queried
  7. Switching back to pgvector to confirm it still works

FinnConnor requested a review from danny-avila October 9, 2024 19:39
FinnConnor self-assigned this Oct 9, 2024
Collaborator Author

FinnConnor commented Oct 9, 2024

Each vector's ID is assigned as (file_id)_#: the file_id is used as a prefix, followed by a sequence number 0, 1, 2, .... This should not affect naming. The file_id itself may contain '_', so a file_id of my_file yields IDs that read my_file_0, my_file_1, and so on.
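The naming scheme described above can be sketched as follows. This is a hypothetical illustration, not the PR's implementation: `make_vector_ids` and `split_vector_id` are names invented for the example. Splitting on the last underscore is what lets a file_id contain '_' without ambiguity.

```python
def make_vector_ids(file_id: str, num_chunks: int) -> list[str]:
    """Build IDs of the form (file_id)_# for chunks 0..num_chunks-1."""
    return [f"{file_id}_{i}" for i in range(num_chunks)]


def split_vector_id(vector_id: str) -> tuple[str, int]:
    """Recover (file_id, chunk index) by splitting on the LAST underscore."""
    file_id, _, index = vector_id.rpartition("_")
    return file_id, int(index)


if __name__ == "__main__":
    print(make_vector_ids("my_file", 3))
    print(split_vector_id("my_file_0"))
```

One caveat with any prefix-based scheme: if lookups match on the bare file_id as a prefix, a file_id such as my_file would also match IDs belonging to my_file_2, so including the trailing '_' in the prefix is the safer choice.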
