Commit Graph

62 Commits

Author SHA1 Message Date
Alex
9228005a7e chunked embedding 2023-02-12 16:25:01 +00:00
vintro
2a203aa547
Create __init__.py
otherwise running `python ingest.py` complains about `parser` not being a package
2023-02-10 19:49:00 -05:00
Alex
37ad3b35c8 Update ingest.py 2023-02-10 17:30:37 +00:00
Alex
d642782a5a move folder 2023-02-10 16:10:53 +00:00
Pavel
8c4fcff617 requirement 2023-02-10 19:47:51 +04:00
Pavel
79b5ef9c14 Bulk ingest
Added a method based on the indexGPT folder ingester. An additional rst reader is included.
2023-02-10 19:44:42 +04:00
Alex
605c599b5d Create code_docs_gen.py 2023-02-08 19:06:23 +00:00
Patrick Shriwise
64fb36b3de Adding location argument to ingest scripts 2023-02-07 10:20:07 -06:00
Alex
08215248d7 Inputs folder change 2023-02-07 10:52:03 +00:00
monkish54
c94866e9e9 Add cost estimate feature
Calculates the number of tokens and the resulting cost to the user, and requires user permission to proceed.

A user permission bypass is built in to allow for non-human users.
2023-02-06 18:32:06 -08:00
Pavel
1c734727a1 Ingest rst with sphinx
Transforms all rst files in the provided folder to txt format first (utilising the sphinx library). In my tests, the size of the raw sample decreased by a factor of 2-3.
2023-02-06 23:43:23 +04:00
Alex
b71a9bf5ee init2 2023-02-03 12:45:29 +00:00
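
The cost estimate described in commit c94866e9e9 (count tokens, show the cost, ask for permission, allow a bypass for non-human users) could look roughly like the sketch below. It is not the code from that commit: the function names, the `assume_yes` flag, the four-characters-per-token heuristic, and the price constant are all assumptions made for illustration.

```python
# Hypothetical price per 1,000 tokens; the real rate depends on the model used.
PRICE_PER_1K_TOKENS_USD = 0.0004


def estimate_tokens(texts):
    """Rough token count using ~4 characters per token (a heuristic, not a tokenizer)."""
    return sum((len(text) + 3) // 4 for text in texts)


def estimate_cost(num_tokens, price_per_1k=PRICE_PER_1K_TOKENS_USD):
    """Convert a token count into an estimated cost in USD."""
    return num_tokens / 1000 * price_per_1k


def confirm_ingest(texts, assume_yes=False, ask=input):
    """Print the estimate and return True if ingestion should proceed.

    assume_yes is the (assumed) bypass for non-interactive callers such as CI jobs.
    ask is injectable so the prompt can be replaced in tests.
    """
    tokens = estimate_tokens(texts)
    cost = estimate_cost(tokens)
    print(f"Estimated tokens: {tokens}, estimated cost: ${cost:.4f}")
    if assume_yes:
        return True
    answer = ask("Proceed? [y/N] ")
    return answer.strip().lower() in ("y", "yes")
```

An ingest script would call `confirm_ingest(docs)` before sending anything to the embedding API, and an automated pipeline would pass `assume_yes=True` to skip the prompt.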