Commit Graph

105 Commits (558ecd84a675ca3cf7ceb6f744401b0bd9e1be8b)

Author SHA1 Message Date
Anton Larin 962becb9a5
Linting
* validate python formatting on every build with Ruff
* fix lint warnings
1 year ago
Anton Larin 168648e789 Proper PEP8 formatting 1 year ago
dependabot[bot] 80dfdd1cb9
Bump redis from 4.5.3 to 4.5.4 in /scripts
Bumps [redis](https://github.com/redis/redis-py) from 4.5.3 to 4.5.4.
- [Release notes](https://github.com/redis/redis-py/releases)
- [Changelog](https://github.com/redis/redis-py/blob/master/CHANGES)
- [Commits](https://github.com/redis/redis-py/compare/v4.5.3...v4.5.4)

---
updated-dependencies:
- dependency-name: redis
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
dependabot[bot] b7f1a94ba4
Bump redis from 4.5.1 to 4.5.3 in /scripts
Bumps [redis](https://github.com/redis/redis-py) from 4.5.1 to 4.5.3.
- [Release notes](https://github.com/redis/redis-py/releases)
- [Changelog](https://github.com/redis/redis-py/blob/master/CHANGES)
- [Commits](https://github.com/redis/redis-py/compare/v4.5.1...v4.5.3)

---
updated-dependencies:
- dependency-name: redis
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
1 year ago
Pavel ce8f0ef9e1
Merge pull request #168 from arc53/feature/backend-uploads
Feature/backend uploads
1 year ago
Pavel c9e1c326f5 - index.plk 2 years ago
Pavel 4532b6cd8c print minus 2 years ago
Pavel 53424a5c19 Added cli commands 2 years ago
Pavel b6c02c850a token ingeest 2 years ago
Alex 20a0800aa7 Create test_ingestion.py 2 years ago
Pavel bac25112b7 v1 2 years ago
Alex 1d2162705d uploads backend first 2 years ago
Alex ac0224b687 mdx format 2 years ago
Alex 0799728000 Create requirements.txt 2 years ago
Alex 1f02f3b376 Update rst_parser.py 2 years ago
Alex f7d7244588 chunks rst 2 years ago
Alex 63e35fbdbc env vars 2 years ago
Alex 1af4ca2340
Merge pull request #129 from arc53/code-ingestion
Code_to_dict
2 years ago
Pavel 2c364d3c00 Code_to_dict
3 languages added, works well with python. Java and Js require additional revieving
2 years ago
Alex 5c2a537393
Merge pull request #116 from arc53/code-ingestion
Code ingestion
2 years ago
Pavel 0fb28e5213 Calc + structure 2 years ago
Manan 524e0f6f01 fix | Chunk creation error when
title not the first element in HTML
2 years ago
Manan 16eb503e36 Added HTML Support. read, clean-up, filter return 2 years ago
Manan e8baa46eb6 Merge branch 'main' of https://github.com/arc53/DocsGPT into main 2 years ago
Alex c92f5dba32 New docs gen 2 years ago
Alex 962be4d8ec
Merge pull request #109 from arc53/main
updates
2 years ago
Manan d0b472ad38 Implemented html_parser: cleaning & chunk creation 2 years ago
Alex 8a5e1e8d98 cleanups 2 years ago
Alex 4d1ff8238d switching between llms 2 years ago
Alex f9fe3f2f48 Merge branch 'main' into custom-llm 2 years ago
EricGao888 aeac186484 Add retry strategy to increase stability 2 years ago
Pavel d57c7b0296 -y-description 2 years ago
冯不游 b83589a308 feat: add support for directory list
example: `python ingest.py --dir inputs1 --dir another --dir ../inputs`,
the outputs will be in `outputs/input_folder_name/`
2 years ago
Alex d4ede13747 Merge branch 'main' into code-ingestion 2 years ago
Alex 5883ce2685
Merge pull request #87 from arc53/ingest-cli
Ingest cli
2 years ago
Pavel af20c7298a new-ingest
Ingest with a CLI
2 years ago
冯不游 636783ca8a fix: avoid second error issue 2 years ago
冯不游 458f2a3ff3 fix: restore index back when continue process 2 years ago
Alex 046fbebf56 Enable other llm's 2 years ago
冯不游 3ab02ca959 feat: compatible with markdown 2 years ago
Alex e88ff885fe
Merge pull request #75 from arc53/rst-interpreters 2 years ago
Pavel b1a6ebffba Directives + Interpreted
Some additional filters for rst parsing
2 years ago
Alex 205be538a3 fix dbqa, with new chain type, also fix for doc export 2 years ago
Alex 9228005a7e chunked embedding 2 years ago
vintro 2a203aa547
Create __init__.py
otherwise running `python ingest.py` complains about `parser` not being a package
2 years ago
Alex 37ad3b35c8 Update ingest.py 2 years ago
Alex d642782a5a move folder 2 years ago
Pavel 8c4fcff617 requirement 2 years ago
Pavel 79b5ef9c14 Bulk ingest
Added a method based on indexGPT folder ingester. Additional rst reader included.
2 years ago
Alex 605c599b5d Create code_docs_gen.py 2 years ago
Patrick Shriwise 64fb36b3de Adding location argument to ingest scripts 2 years ago
Alex 08215248d7 Inputs folder change 2 years ago
monkish54 c94866e9e9 Add cost estimate feature
Calculates number of tokens/user cost and requires user permission to proceed.

User permission bypass is built-in to allow for non-human users.
2 years ago
Pavel 1c734727a1 Ingest rst with sphinx
Transforms all rst files in provided folder to txt format first (utilising sphinx library). In my tests size of raw sample decreased 2-3 times.
2 years ago
Alex b71a9bf5ee init2 2 years ago