Commit Graph

8 Commits (feat/chunks)

Author SHA1 Message Date
Anton Larin bed25b317c Fix min_tokens logic for grouping documents: documents with (length >= min_tokens) should not be grouped into one document for indexing 11 months ago
Alex a64a30c088 fix 11 months ago
Alex dac76a867f fix tokens for header 11 months ago
Anton Larin 962becb9a5 Linting: validate Python formatting on every build with Ruff; fix lint warnings 1 year ago
Anton Larin 168648e789 Proper PEP8 formatting 1 year ago
Pavel 4532b6cd8c print minus 1 year ago
Pavel 53424a5c19 Added cli commands 1 year ago
Pavel b6c02c850a token ingest 1 year ago