Commit Graph

532 Commits

Author SHA1 Message Date
cachho
41a1dbab0e feat: add count to get document count in vector db (#164) 2023-07-07 14:31:14 +05:30
cachho
f3888cc7a8 feat: reset (#169) 2023-07-07 00:45:01 +05:30
cachho
e50c7e6843 refactor: Add config for init, app and query (#158) 2023-07-06 23:46:40 +05:30
Sahil Kumar Yadav
68e732a426 feat: add google doc support added (#155) 2023-07-06 14:04:27 +05:30
Taranjeet Singh
687e3fb61c feat: Add support for chat endpoint (#150)
* feat: Add support for chat endpoint

This commit introduces a new endpoint called chat.
It maintains context of the last 5 user messages
and continues to do semantic search to find the
context.
2023-07-06 13:32:16 +05:30
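The behaviour described in 687e3fb61c (keep the last 5 user messages and still run a semantic search on each turn) could look roughly like the sketch below. This is not the project's code: `ChatSketch`, `retrieve`, `answer`, and the prompt layout are hypothetical names chosen for illustration, and the search and LLM are passed in as plain callables.

```python
from collections import deque


class ChatSketch:
    """Hypothetical illustration of a chat loop with a bounded message history."""

    def __init__(self, retrieve, answer, max_history=5):
        self.retrieve = retrieve  # callable: query -> context string (assumed)
        self.answer = answer      # callable: prompt -> reply string (assumed)
        # deque with maxlen drops the oldest message once the limit is reached
        self.history = deque(maxlen=max_history)

    def chat(self, message):
        # Semantic search still runs on every turn to find fresh context
        context = self.retrieve(message)
        prompt = (
            "History:\n" + "\n".join(self.history)
            + f"\nContext: {context}\nQuery: {message}"
        )
        reply = self.answer(prompt)
        self.history.append(message)
        return reply
```

A bounded `deque` keeps the history trimming implicit, so the prompt never grows beyond the last five user messages.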
Rohit Gupta
0e74688145 update: import gpt4all locally (#146)
Import gpt4all locally to avoid an error in case it's not present.
2023-07-06 01:01:32 +05:30
cachho
f8e5ccd007 feat: add dry run (#47)
This commit adds support to simulate the semantic search and see what the prompt will look like.
2023-07-05 23:19:53 +05:30
Taranjeet Singh
200f11a0e0 fix: Fix dependency of openai env variables for OpenSourceApp (#144)
This commit fixes dependency of initializing openai env variables
for OpenSourceApp.
2023-07-05 23:03:15 +05:30
Taranjeet Singh
cf1e000fb3 Open source embedding and LLM models (#133)
* Add open source LLM model: gpt4all
* Add open source embedding model: sentence transformers
2023-07-05 02:23:23 +05:30
Mike Makuch
3461ef4b14 add: Allow setting OpenAI organization along with api_key (#84) 2023-07-05 01:30:30 +05:30
Sahil Kumar Yadav
5fa2f5c5a2 add: add embedchainjs github repo to readme (#123) 2023-07-05 01:17:41 +05:30
limbo
85008ff153 Fix: Solve IndexError when no relevant documents are found in the vector db. (#59) 2023-07-04 22:18:56 +05:30
cachho
f5f5e7edd1 feat: add local text (#44)
This commit extends the "add_local" function. It
adds support to take text and index/embed it.
2023-06-25 23:13:41 +05:30
Taranjeet Singh
3413dbf5e0 bug: Make vectordb a module
Add missing __init__ file in vectordb directory
2023-06-24 09:44:32 +05:30
cachho
3720ded80f docs: add clean_string() doc string (#43) 2023-06-23 22:44:51 +05:30
cachho
cf99dce940 Refactor query endpoint into 3 parts (#42)
Query endpoint now consists of 3 sub functions
- get data from db
- get prompt
- get answer from the data retrieved above by passing to LLM
2023-06-23 22:17:52 +05:30
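The three-part split described in cf99dce940 can be pictured with a minimal sketch. The function names, the word-overlap "search", and the prompt wording are all assumptions made for illustration; the real project uses a vector db and an LLM, which are replaced here by a list of strings and a plain callable.

```python
def retrieve_from_db(query, documents):
    """Step 1: get data. Stand-in for semantic search: pick the document
    sharing the most words with the query (an assumption, not a vector db)."""
    query_words = set(query.lower().split())
    return max(documents, key=lambda d: len(query_words & set(d.lower().split())))


def build_prompt(query, context):
    """Step 2: get prompt. The wording of the template is hypothetical."""
    return (
        "Use the following context to answer the query.\n"
        f"Context: {context}\nQuery: {query}\nAnswer:"
    )


def get_answer(prompt, llm):
    """Step 3: get answer by passing the prompt to an LLM callable."""
    return llm(prompt)


def query(question, documents, llm):
    """Compose the three sub-functions into one query call."""
    context = retrieve_from_db(question, documents)
    prompt = build_prompt(question, context)
    return get_answer(prompt, llm)
```

Keeping the steps separate lets each one be inspected or replaced on its own, for example printing the prompt without calling the LLM.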
cachho
2d35ade7aa refactor: local indicator in file and class name, renamed 2023-06-23 19:53:57 +05:30
cachho
e60f1680a0 chore: removed markdown to plaintext function 2023-06-23 19:53:57 +05:30
cachho
feec5e4a09 refactor: empty line added 2023-06-23 19:53:57 +05:30
cachho
ff2d5ce7fa feat: add local qna pair 2023-06-23 19:53:57 +05:30
cachho
df7614d349 feat: add markdown to plaintext 2023-06-23 19:53:57 +05:30
Taranjeet Singh
e40f1e5061 Skip creating embedding if it already exists
This commit adds support to skip generating embedding
for chunks which are already present in vector db.
2023-06-23 17:08:49 +05:30
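The idea in e40f1e5061 (do not embed a chunk that is already stored) can be sketched as follows. The log does not say how existing chunks are detected, so the content-hash ID used here is an assumption, and `add_chunks`, `store`, and `embed` are hypothetical names; a plain dict stands in for the vector db.

```python
import hashlib


def add_chunks(store, chunks, embed):
    """Embed only chunks not already in `store`; return how many were added.

    `store` is a dict standing in for the vector db (assumption), keyed by a
    SHA-256 hash of the chunk text (also an assumption).
    """
    added = 0
    for chunk in chunks:
        chunk_id = hashlib.sha256(chunk.encode("utf-8")).hexdigest()
        if chunk_id in store:
            continue  # already embedded, skip the expensive embedding call
        store[chunk_id] = embed(chunk)
        added += 1
    return added
```

Re-adding the same source then costs only a hash lookup per chunk instead of a new embedding call.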
Taranjeet Singh
cff244b894 Add default db loader, fix import
This commit builds on DumoeDss's PR. It

- adds a default db directory name.
- adds a default db instance (Chroma).

Both points offer flexibility for users who
want to use the defaults and users who want to
customize.

Lastly, it fixes an import
2023-06-23 11:51:51 +05:30
Sayo
85a6a0c161 [feat] Refactor VectorDB class hierarchy for flexibility 2023-06-22 12:15:26 +08:00
Taranjeet Singh
63bd18056f Revert "Rename query endpoint to qna"
This reverts commit 6bc7e9b7a7.
2023-06-21 16:06:51 +05:30
Taranjeet Singh
6bc7e9b7a7 Rename query endpoint to qna
Query endpoint takes a query and returns an answer
on the basis of retrieved similar docs.
Thought about the name and realized that qna makes
much more sense than query, as it takes a query
and returns an answer
2023-06-21 12:35:38 +05:30
Taranjeet Singh
8582613940 Add doc strings
Adds doc string for functions in embedchain/embedchain.py
2023-06-20 21:37:21 +05:30
Taranjeet Singh
ac0ec0bf8f Update log statement, doc and package dependency version 2023-06-20 21:04:33 +05:30
Taranjeet Singh
08f155a551 Update website to web page
This commit renames the website loader and chunker
to web page, as they load and chunk a single
URL rather than the complete website.
2023-06-20 16:50:57 +05:30
Taranjeet Singh
4329caa17c Chunkers: Refactor each chunker & add base class
Adds a base chunker from which any chunker can inherit.
Existing chunkers are refactored to inherit from this base
chunker.
2023-06-20 16:30:23 +05:30
Taranjeet Singh
d2da80f5bc Add import in embedchain init file 2023-06-20 14:50:56 +05:30
Taranjeet Singh
468db83337 Add simple app functionality
This commit enables anyone to create an app and add 3 types of data
sources:

* pdf file
* youtube video
* website

It exposes a function called query which first gets similar docs from
the vector db and then passes them to the LLM to get the final answer.
2023-06-20 14:42:55 +05:30