Commit Graph

28 Commits

Author SHA1 Message Date
Pavel
4532b6cd8c print minus 2023-03-14 17:49:57 +04:00
Pavel
53424a5c19 Added cli commands 2023-03-14 17:33:19 +04:00
Pavel
b6c02c850a token ingest 2023-03-14 13:32:29 +04:00
Pavel
bac25112b7 v1 2023-03-13 19:14:33 +04:00
Alex
ac0224b687 mdx format 2023-03-08 23:16:20 +00:00
Alex
1f02f3b376 Update rst_parser.py 2023-03-08 11:32:44 +00:00
Alex
f7d7244588 chunks rst 2023-03-08 00:07:53 +00:00
Alex
1af4ca2340
Merge pull request #129 from arc53/code-ingestion
Code_to_dict
2023-02-25 13:52:14 +00:00
Pavel
2c364d3c00 Code_to_dict
3 languages added, works well with Python. Java and JS require additional reviewing
2023-02-25 17:37:33 +04:00
Alex
5c2a537393
Merge pull request #116 from arc53/code-ingestion
Code ingestion
2023-02-22 18:46:50 +00:00
Pavel
0fb28e5213 Calc + structure 2023-02-22 21:19:13 +04:00
Manan
524e0f6f01 fix | Chunk creation error when title not the first element in HTML
2023-02-22 20:20:54 +05:30
Manan
16eb503e36 Added HTML Support. read, clean-up, filter return 2023-02-21 23:06:00 +05:30
Manan
e8baa46eb6 Merge branch 'main' of https://github.com/arc53/DocsGPT into main 2023-02-21 22:11:57 +05:30
Manan
d0b472ad38 Implemented html_parser: cleaning & chunk creation 2023-02-19 01:53:16 +05:30
Alex
4d1ff8238d switching between llms 2023-02-15 18:40:23 +00:00
Alex
f9fe3f2f48 Merge branch 'main' into custom-llm 2023-02-15 14:42:57 +00:00
EricGao888
aeac186484 Add retry strategy to increase stability 2023-02-15 17:29:39 +08:00
冯不游
b83589a308 feat: add support for directory list
example: `python ingest.py --dir inputs1 --dir another --dir ../inputs`,
the outputs will be in `outputs/input_folder_name/`
2023-02-15 02:30:39 +08:00
冯不游
636783ca8a fix: avoid second error issue 2023-02-14 22:29:17 +08:00
冯不游
458f2a3ff3 fix: restore index back when continue process 2023-02-14 22:05:16 +08:00
Alex
046fbebf56 Enable other llm's 2023-02-14 13:06:28 +00:00
Alex
e88ff885fe
Merge pull request #75 from arc53/rst-interpreters 2023-02-12 18:32:20 +00:00
Pavel
b1a6ebffba Directives + Interpreted
Some additional filters for rst parsing
2023-02-12 22:29:40 +04:00
Alex
205be538a3 fix dbqa, with new chain type, also fix for doc export 2023-02-12 17:58:54 +00:00
Alex
9228005a7e chunked embedding 2023-02-12 16:25:01 +00:00
vintro
2a203aa547
Create __init__.py
otherwise running `python ingest.py` complains about `parser` not being a package
2023-02-10 19:49:00 -05:00
Alex
d642782a5a move folder 2023-02-10 16:10:53 +00:00