Commit Graph

8 Commits (35b29a0a1e184b11fc7986bbe757111b153d6a10)

Author SHA1 Message Date
Alex 8d7a134cb4 lint: ruff 3 months ago
Pavel c8d8a8d0b5 Fixing ingestion metadata grouping 4 months ago
Anton Larin 98a97f34f5 fix packaging and imports and introduce tests with pytest.
still issues with celery worker.
11 months ago
Anton Larin bed25b317c Fix min_tokens logic for grouping documents: documents with (lengh >= min_tokens) should not be grouped into one document for indexing 11 months ago
Alex a64a30c088 fix 12 months ago
Alex dac76a867f fix tokens for header 12 months ago
Anton Larin 962becb9a5
Linting
* validate python formatting on every build with Ruff
* fix lint warnings
1 year ago
Alex 8e477c9d16 update worker 1 year ago