49 Commits

Author SHA1 Message Date
Deven Patel
7de8d85199 [Feature] Add Postgres data loader (#918)
Co-authored-by: Deven Patel <deven298@yahoo.com>
2023-11-08 23:50:46 -08:00
Deven Patel
53037b5ed8 [Feature Improvement] Update JSON Loader to support loading data from more sources (#898)
Co-authored-by: Deven Patel <deven298@yahoo.com>
2023-11-03 10:00:27 -07:00
Deven Patel
a054f7be9c [Data loader] Make json data loader work for other languages (#897)
Co-authored-by: Deven Patel <deven298@yahoo.com>
2023-11-01 22:27:40 -07:00
Deven Patel
df314dc6d1 Clean json data before loading (#895)
Co-authored-by: Deven Patel <deven298@yahoo.com>
2023-11-01 21:52:34 -07:00
Deven Patel
5255a37c93 Embedchain json url support (#878)
Co-authored-by: Deven Patel <deven298@yahoo.com>
2023-10-30 16:19:11 -07:00
Deven Patel
68dc274f72 Embedchain json loader update (#876)
Co-authored-by: Deven Patel <deven298@yahoo.com>
2023-10-30 15:30:49 -07:00
Deshraj Yadav
29bd038579 [Bug fix] Fix missing dependency issue with gmail (#862) 2023-10-27 20:02:03 -07:00
Deven Patel
68183e9dce [Feature] Gmail Loader (#841) 2023-10-27 18:05:08 -07:00
Deven Patel
797bb567c6 [feat]: Add openapi spec data loader (#818) 2023-10-25 14:19:13 -07:00
Deshraj Yadav
b47405e1bd Update version to v0.0.74 (#825) 2023-10-18 17:12:51 -07:00
Muhammad Muzammil
8b64deab40 [Feature]: Unstructured File Loader Support - USF (#815) 2023-10-18 16:43:41 -07:00
Deven Patel
7641cba01d [Feature] JSON data loader support (#816) 2023-10-18 13:53:15 -07:00
Sidharth Mohanty
413b107b9a Update dependencies for fast installation (#811) 2023-10-17 08:21:30 -07:00
Rupesh Bansal
19a9141c2d Added Clip dependency (#778) 2023-10-09 12:02:45 -07:00
Sidharth Mohanty
b91d922600 Improve tests (#780) 2023-10-09 11:26:21 -07:00
Richard Awoyemi
1741d3bef6 [fix]: Fix sitemap loader (#753) 2023-10-06 16:24:15 -07:00
Ojuswi Rastogi
540a0a3685 [feat]: Add support for XML file format (#757) 2023-10-06 15:39:32 -07:00
Deshraj Yadav
87d0b5c76f [bugfix] Fix issue when llm config is not defined (#763) 2023-10-04 12:08:21 -07:00
Rupesh Bansal
d0af018b8d Add support for image dataset (#571)
Co-authored-by: Rupesh Bansal <rupeshbansal@Shankars-MacBook-Air.local>
2023-10-04 09:50:40 +05:30
cachho
79efa51941 fix: url metadata for all datatypes (#613) 2023-09-13 10:19:48 -07:00
Deshraj Yadav
701d0b21ef [chore] fix lint issues (#607) 2023-09-12 20:04:31 -07:00
Taranjeet Singh
36b26e08c3 feat: add support for mdx file (#604) 2023-09-13 05:13:18 +05:30
cachho
03146946fa chore: linting (#597) 2023-09-12 21:34:38 +05:30
Taranjeet Singh
2bd6881361 feat: Add embedding manager (#570) 2023-09-12 12:13:53 +05:30
cachho
dfe0b414ac refactor: use llama hub instead of llama index (#592) 2023-09-12 09:26:58 +05:30
Deshraj Yadav
79f5a1d052 [chore]: Rename modules for better readability and maintainability (#587) 2023-09-11 07:01:40 +05:30
Dev Khant
13fda2efe1 fix: --upgrade flag for all pip instances (#557) 2023-09-08 08:12:55 +05:30
cachho
bd595f84e8 feat: csv loader (#470)
Co-authored-by: Taranjeet Singh <reachtotj@gmail.com>
2023-09-05 13:48:03 +05:30
cachho
0d4ad07d7b Feat/serialize deserialize (#508)
Co-authored-by: Taranjeet Singh <reachtotj@gmail.com>
2023-09-04 01:20:18 +05:30
cachho
c26559a2d3 fix: notion install error message (#439) 2023-08-15 02:57:23 +05:30
cachho
ce6eb39009 feat: notion loader (#405) 2023-08-09 13:15:22 +05:30
cachho
55bfd7cafe refactor: loader chunker typing (#324) 2023-07-26 23:14:57 +05:30
cachho
3da5724853 feat: filter sitemap (#304) 2023-07-19 05:36:39 +05:30
Deshraj Yadav
a548863a09 Feature: Add support for loading docs website (#293) 2023-07-16 22:22:52 -07:00
Taranjeet Singh
e8b3d53faf fix: Handle divide by zero error when original size is 0 (#290) 2023-07-17 07:19:47 +05:30
cachho
addf1c0666 feat: exclude by class, id in web_page data type and add logging (#273) 2023-07-16 00:51:25 +05:30
Deshraj Yadav
fd97fb268a feat: Update line length to 120 chars (#278) 2023-07-15 19:41:55 +05:30
Taranjeet Singh
86e4146126 feat: Add new data type: code_docs_loader (#274) 2023-07-15 09:02:11 +05:30
cachho
cd0c7bc971 fix: escape bs4 parsing error (#271) 2023-07-15 08:50:11 +05:30
aaishikdutta
d2e8f796ca feat: add SitemapLoader (#68) 2023-07-12 12:13:30 +05:30
Deshraj Yadav
9ca836520f Resolve conflicts (#208) 2023-07-11 10:20:05 +05:30
aaishikdutta
6936d6983d Added documentation (#219) 2023-07-11 08:31:42 +05:30
Sahil Kumar Yadav
0bb3d0afe9 feat: changed doc_file to docx and update readme (#157) 2023-07-07 16:18:05 +05:30
Sahil Kumar Yadav
68e732a426 feat: add google doc support added (#155) 2023-07-06 14:04:27 +05:30
Sahil Kumar Yadav
5fa2f5c5a2 add: add embedchainjs github repo to readme (#123) 2023-07-05 01:17:41 +05:30
cachho
f5f5e7edd1 feat: add local text (#44)
This commit extends the "add_local" function. It
adds support for taking text and indexing/embedding it.
2023-06-25 23:13:41 +05:30
cachho
2d35ade7aa refactor: local indicator in file and class name, renamed 2023-06-23 19:53:57 +05:30
Taranjeet Singh
08f155a551 Update website to web page
This commit renames the website loader and chunker
to web page, as they load and chunk a single
URL rather than the complete website.
2023-06-20 16:50:57 +05:30
Taranjeet Singh
468db83337 Add simple app functionality
This commit enables anyone to create an app and add 3 types of data
sources:

* pdf file
* youtube video
* website

It exposes a function called query, which first gets similar docs from
the vector db and then passes them to the LLM to get the final answer.
2023-06-20 14:42:55 +05:30
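
The query flow described in the initial commit (fetch similar docs from a vector db, then pass them to an LLM) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and not the project's actual code: ToyApp, embed, cosine, retrieve and echo_llm are hypothetical names, the "vector db" is an in-memory list, the embedding is a toy bag-of-words count, and the LLM is a stub that simply returns its prompt.

```python
import math
from collections import Counter


def embed(text):
    """Toy bag-of-words embedding; a hypothetical stand-in for a real embedding model."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ToyApp:
    """Hypothetical app: an in-memory list plays the role of the vector db."""

    def __init__(self, llm):
        self.llm = llm      # any callable taking a prompt string
        self.chunks = []    # list of (text, embedding) pairs

    def add(self, text):
        """Index a piece of text by storing it alongside its embedding."""
        self.chunks.append((text, embed(text)))

    def retrieve(self, question, k=1):
        """Return the k stored texts most similar to the question."""
        q = embed(question)
        ranked = sorted(self.chunks, key=lambda c: cosine(q, c[1]), reverse=True)
        return [text for text, _ in ranked[:k]]

    def query(self, question):
        """Retrieve similar docs, then hand them to the LLM as context."""
        context = "\n".join(self.retrieve(question))
        prompt = f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
        return self.llm(prompt)


def echo_llm(prompt):
    """Stub LLM that returns the prompt unchanged, so the flow can be inspected."""
    return prompt


app = ToyApp(echo_llm)
app.add("Paris is the capital of France.")
app.add("The mitochondria is the powerhouse of the cell.")
answer = app.query("What is the capital of France?")
```

With the stub in place, answer is just the assembled prompt, which makes it easy to see which stored text was selected as context. A real setup would swap in an actual embedding model, vector store and LLM client.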