Commit Graph

7 Commits (4b849d720142e081fbace0868abe1ecef9187e68)

Author SHA1 Message Date
Pavel c8d8a8d0b5 Fixing ingestion metadata grouping 7 months ago
Anton Larin 98a97f34f5 fix packaging and imports and introduce tests with pytest.
still issues with celery worker.
1 year ago
Anton Larin bed25b317c Fix min_tokens logic for grouping documents: documents with (lengh >= min_tokens) should not be grouped into one document for indexing 1 year ago
Alex a64a30c088 fix 1 year ago
Alex dac76a867f fix tokens for header 1 year ago
Anton Larin 962becb9a5
Linting
* validate python formatting on every build with Ruff
* fix lint warnings
1 year ago
Alex 8e477c9d16 update worker 2 years ago