sigoden
cf9d06f51e
refactor: update models.yaml ( #739 )
3 months ago
sigoden
17fd7b3260
refactor: update models.yaml ( #726 )
3 months ago
sigoden
0264ab80ab
refactor: update models.yaml and abandon anyscale ( #701 )
3 months ago
sigoden
3b6cf3cd1b
chore: update models.yaml
4 months ago
sigoden
9416cbd8b8
refactor: update models.yaml ( #669 )
4 months ago
sigoden
2bc9607b00
refactor: rename model type `rerank` to `reranker` ( #646 )
4 months ago
sigoden
2fbb5271af
feat: support rag-dedicated clients (jina and voyageai) ( #645 )
4 months ago
sigoden
6d148c9c53
refactor: embedding model add price and dimension ( #636 )
4 months ago
sigoden
590c525048
refactor: remove reka client ( #635 )
4 months ago
sigoden
52a847743e
refactor: improve system message handling ( #634 )
4 months ago
sigoden
250e0eb7fe
feat: ernie support function calling ( #631 )
4 months ago
sigoden
1fd5c58cff
feat: ernie support embeddings and rereank ( #630 )
4 months ago
sigoden
de16813bee
refactor: update readme and models.yaml
4 months ago
sigoden
7a089d846e
refactor: rename model.max_concurrent_chunks to model.max_batch_size ( #626 )
4 months ago
sigoden
f2378e1725
refactor: rename model.mode to model.type ( #625 )
4 months ago
sigoden
97c82e565f
feat: cloudflare support embeddings ( #623 )
4 months ago
sigoden
6d05afc81b
refactor: update models.yaml
4 months ago
sigoden
ba832016f3
refactor: update models.yaml ( #621 )
4 months ago
sigoden
abc588daac
feat: support rerank ( #620 )
4 months ago
sigoden
3b3d39cef0
refactor: rag add rag_minimum_score config ( #617 )
4 months ago
sigoden
1fb06ecdc4
feat: qianwen support function calling ( #616 )
4 months ago
sigoden
98ac7e2b57
feat: support reka client ( #614 )
4 months ago
sigoden
2f8c694626
feat: support lingyiwanwu client ( #613 )
4 months ago
sigoden
12872b3d29
refactor: update models.yaml ( #602 )
4 months ago
sigoden
746b087111
refactor: add/modify rag-related config ( #599 )
4 months ago
sigoden
1f33b3a07a
refactor: rag default_chunk_size ( #588 )
4 months ago
sigoden
a732291f33
refactor: rename `pass_max_tokens` to `require_max_tokens` ( #562 )
5 months ago
sigoden
1ec6abfaee
feat: support RAG ( #560 )
...
* feat: support RAG
* support more embeddings models and implement concurrent embedding api
* show the progress of addings paths
* ignore embedding context when saving message
* embedding model max_chunk_size => default_chunk_size
* support pdf and pandoc formats (docx, epub, ipynb)
5 months ago
rolfwilms
569317728c
fix: bedrock issues ( #544 )
...
* Removed extraneous key [stream] for AWS Bedrock Claude models.
* Reduceddefault AWS Bedrock llama-3 max_output_tokens to 2048 to align with API requirements.
---------
Co-authored-by: Rolf Wilms <rwilms@csc.com>
5 months ago
sigoden
2ccbb0f06a
refactor: qiawen client add qwen-long ( #537 )
5 months ago
sigoden
b4a40e3fed
feat: support function calling ( #514 )
...
* feat: support function calling
* fix on Windows OS
* implement multi-steps function calling
* fix on Windows OS
* add error for client not support function calling
* refactor message data structure and make claude client supporting function calling
* support reuse previous call results
* improve error handling for function calling
* use prefix `may_` as indicator for `execute` type fucntions
5 months ago
sigoden
5378033b34
refactor: add gemini-1.5-flash to models.yaml ( #510 )
5 months ago
sigoden
369cf9a36a
refactor: minor refinement
5 months ago
sigoden
20a507375e
refactor: update models.yaml ( #501 )
5 months ago
sigoden
7762cd6bed
refactor: model pass_max_tokens ( #493 )
5 months ago
sigoden
d5fd624eb8
refactor: update models.yaml
5 months ago
sigoden
956a960390
feat: support zhipuai client ( #491 )
5 months ago
sigoden
0071d84aa5
feat: support deepseek client ( #490 )
5 months ago
sigoden
9b283024b4
feat: extract vertexai-claude client ( #485 )
6 months ago
sigoden
5eae392dbd
refactore: add models for openai-compatible platforms ( #471 )
6 months ago
sigoden
8dba46becf
feat: openai-compatible platforms share the same client ( #469 )
6 months ago
sigoden
50eac8b594
feat: support replicate client ( #466 )
6 months ago
sigoden
ffb0af8236
refactor: add some openai-compatiable platforms to config.example.yaml ( #464 )
6 months ago
sigoden
4ddccc361c
refactor: update groq models at models.yaml
6 months ago
sigoden
602494b650
refactor: merge config models, update client models ( #460 )
6 months ago
sigoden
34041a976c
feat: support cloudflare client ( #459 )
6 months ago
sigoden
865be2bf75
feat: non-streaming returns completion stats ( #456 )
6 months ago
sigoden
1f2b626703
feat: support bedrock client ( #450 )
6 months ago
sigoden
615bab215b
feat: support vertexai claude ( #439 )
6 months ago
sigoden
d6df1e84a7
refactor: extract prelude models to models.yaml ( #451 )
6 months ago