sigoden
b17719457b
chore: format
2024-06-05 06:06:11 +00:00
sigoden
eacc88f04a
chore: update readme
2024-06-05 04:37:08 +00:00
sigoden
a732291f33
refactor: rename pass_max_tokens to require_max_tokens ( #562 )
2024-06-05 09:10:12 +08:00
sigoden
1ec6abfaee
feat: support RAG ( #560 )
* feat: support RAG
* support more embeddings models and implement concurrent embedding api
* show the progress of adding paths
* ignore embedding context when saving message
* embedding model max_chunk_size => default_chunk_size
* support pdf and pandoc formats (docx, epub, ipynb)
2024-06-05 09:02:23 +08:00
sigoden
71f2e94579
refactor: switch to bitflags State ( #557 )
2024-06-02 19:27:41 +08:00
sigoden
571d1022f6
refactor: rename some client structs and methods ( #555 )
* rename `Completeion*` to `ChatCompletions*`
* rename `send_message*` to `chat_completions*`
* rename `request_builder` to `chat_completions_builder`
* rename `build_body` to `build_chat_completions_body`
* rename `extract_completion` to `extract_chat_completions`
* format
* remove unused config fields
2024-06-01 17:47:49 +08:00
sigoden
259583f4f7
chore: update readme
2024-06-01 03:30:34 +00:00
sigoden
38797e3264
chore: release v0.18.0 ( #551 )
2024-06-01 10:45:53 +08:00
sigoden
54a837784c
refactor: rename SendData to CompletionData ( #553 )
2024-05-30 17:10:02 +08:00
sigoden
fa4bf14e02
refactor: adjust default compress_threshold to 4000 ( #552 )
2024-05-30 10:19:29 +08:00
sigoden
c0f4d2d712
chore: fix typos
2024-05-30 01:02:11 +00:00
ProjectMoon
00f3cb182f
refactor: use json_stream for ollama to improve reliability ( #549 )
* Use JSON stream for ollama to improve reliability. Fixes #548 .
* remove unused import
* fix clippy error
* format
---------
Co-authored-by: sigoden <sigoden@gmail.com>
2024-05-29 20:27:07 +08:00
sigoden
4fa92b020a
refactor: temp session ignore save_session ( #547 )
2024-05-29 07:06:05 +08:00
rolfwilms
569317728c
fix: bedrock issues ( #544 )
* Removed extraneous key [stream] for AWS Bedrock Claude models.
* Reduced default AWS Bedrock llama-3 max_output_tokens to 2048 to align with API requirements.
---------
Co-authored-by: Rolf Wilms <rwilms@csc.com>
2024-05-26 05:52:24 +08:00
sigoden
5458150ed3
fix: json stream parser and refine client modules ( #538 )
2024-05-23 19:28:56 +08:00
sigoden
2ccbb0f06a
refactor: qianwen client add qwen-long ( #537 )
2024-05-23 12:26:24 +08:00
sigoden
50b13d2de9
refactor: rename function_filter to function_matcher ( #535 )
2024-05-22 21:37:24 +08:00
sigoden
ba3bcfd67c
feat: allow patching req body with client config ( #534 )
2024-05-22 21:29:23 +08:00
sigoden
91a06543b2
refactor: adjust some placeholder/prompt text ( #530 )
2024-05-21 08:06:57 +08:00
sigoden
2174dc055f
refactor: change prompt for executing tool functions ( #529 )
* refactor: change prompt for executing tool functions
* add AICHAT_FUNCTION_AUTO_EXECUTE to auto confirm
2024-05-21 07:56:02 +08:00
sigoden
d4de2d761b
refactor: no export LLM_FUNCTION_DATA, pass json as command arg ( #528 )
2024-05-20 17:27:34 +08:00
sigoden
1a2a261f9f
feat: webui operates independently from aichat ( #527 )
2024-05-20 10:30:07 +08:00
sigoden
7d7caf7479
refactor: webui enhance input textarea and copy message ( #526 )
2024-05-20 09:09:42 +08:00
sigoden
482822da53
fix: webui message body do not autoscroll to bottom sometimes ( #525 )
2024-05-20 07:56:57 +08:00
sigoden
9b7d93ee84
refactor: export LLM_FUNCTION_DATA when calling functions ( #524 )
2024-05-19 22:47:29 +08:00
sigoden
7f21854b9a
fix: webui issue with image ( #523 )
2024-05-19 13:32:43 +08:00
sigoden
eaf36cdfb5
refactor: webui limit height of input textarea ( #522 )
2024-05-19 11:36:37 +08:00
sigoden
1b9aed6afb
feat: webui add toolbox(copy-btn/regenerate-btn) to message ( #521 )
2024-05-19 10:08:44 +08:00
sigoden
31977f9642
fix: panic when check api error ( #520 )
2024-05-18 22:41:27 +08:00
sigoden
d16cca23ac
refactor: shell detect/prompt ( #519 )
2024-05-18 22:34:23 +08:00
sigoden
665693ccf2
refactor: non-zero exit code when model call fails ( #518 )
2024-05-18 19:22:04 +08:00
sigoden
b4a40e3fed
feat: support function calling ( #514 )
* feat: support function calling
* fix on Windows OS
* implement multi-steps function calling
* fix on Windows OS
* add error for client not support function calling
* refactor message data structure and make claude client supporting function calling
* support reuse previous call results
* improve error handling for function calling
* use prefix `may_` as indicator for `execute` type functions
2024-05-18 19:06:21 +08:00
sigoden
1348a62e5f
chore: update bug_report issue-template
2024-05-15 07:17:49 +00:00
sigoden
f9d45175d3
fix: unexpectedly enter repl if have pipe-in but no text args ( #512 )
2024-05-15 13:17:36 +08:00
sigoden
5378033b34
refactor: add gemini-1.5-flash to models.yaml ( #510 )
2024-05-15 09:36:31 +08:00
sigoden
12d8798e61
fix: no builtin roles if no roles.yaml ( #509 )
2024-05-15 09:25:39 +08:00
sigoden
64943f49d1
chore: update issue-templates
2024-05-15 00:41:41 +00:00
sigoden
5883a0685a
refactor: improve tokenize
2024-05-15 00:31:36 +00:00
sigoden
a75aef459d
feat: remove tiktoken ( #506 )
2024-05-14 15:02:58 +08:00
sigoden
369cf9a36a
refactor: minor refinement
2024-05-14 06:13:42 +00:00
sigoden
79d0bba640
feat: allow binding model to the role ( #505 )
2024-05-14 12:43:16 +08:00
sigoden
5284a18248
refactor: config::Input ( #503 )
2024-05-14 11:16:55 +08:00
sigoden
154c1e0b4b
chore: release v0.17.0 ( #497 )
2024-05-14 06:36:10 +08:00
sigoden
20a507375e
refactor: update models.yaml ( #501 )
2024-05-14 06:11:34 +08:00
sigoden
1e8fc5d269
refactor: list roles including builtin roles ( #499 )
2024-05-11 09:23:59 +08:00
sigoden
058299e500
refactor: refine detect code block
2024-05-09 22:59:06 +00:00
sigoden
e2675d6b09
refactor: playground webapp supports structure prompt ( #496 )
2024-05-09 21:40:34 +08:00
sigoden
be7bae0475
refactor: improve revise command execution
2024-05-09 03:40:16 +00:00
sigoden
bc65e880be
feat: enhance roles with messages ( #495 )
2024-05-09 11:24:02 +08:00
sigoden
5d73768acc
refactor: improve webui
2024-05-08 08:25:46 +00:00