Commit Graph

606 Commits

Author SHA1 Message Date
sigoden
b17719457b chore: format 2024-06-05 06:06:11 +00:00
sigoden
eacc88f04a chore: update readme 2024-06-05 04:37:08 +00:00
sigoden
a732291f33
refactor: rename pass_max_tokens to require_max_tokens (#562) 2024-06-05 09:10:12 +08:00
sigoden
1ec6abfaee
feat: support RAG (#560)
* feat: support RAG

* support more embeddings models and implement concurrent embedding api

* show the progress of addings paths

* ignore embedding context when saving message

* embedding model max_chunk_size => default_chunk_size

* support pdf and pandoc formats (docx, epub, ipynb)
2024-06-05 09:02:23 +08:00
sigoden
71f2e94579
refactor: switch to bitflags State (#557) 2024-06-02 19:27:41 +08:00
sigoden
571d1022f6
refactor: rename some client structs and methods (#555)
* rename `Completeion*` to `ChatCompletions*`

* rename `send_message*` to `chat_completions*`

* rename `request_builder` to `chat_completions_builder`

* rename `build_body` to `build_chat_completions_body`

* rename `extract_completion` to `extract_chat_completions`

* format

* remove unused config fields
2024-06-01 17:47:49 +08:00
sigoden
259583f4f7 chore: update readme 2024-06-01 03:30:34 +00:00
sigoden
38797e3264
chore: release v0.18.0 (#551) 2024-06-01 10:45:53 +08:00
sigoden
54a837784c
refactor: rename SendData to CompletionData (#553) 2024-05-30 17:10:02 +08:00
sigoden
fa4bf14e02
refactor: adjust default compress_threshold to 4000 (#552) 2024-05-30 10:19:29 +08:00
sigoden
c0f4d2d712 chore: fix typos 2024-05-30 01:02:11 +00:00
ProjectMoon
00f3cb182f
refactor: use json_stream for ollama to improve reliability (#549)
* Use JSON stream for ollama to improve reliability. Fixes #548.

* remove unused import

* fix clippy error

* format

---------

Co-authored-by: sigoden <sigoden@gmail.com>
2024-05-29 20:27:07 +08:00
sigoden
4fa92b020a
refactor: temp session ignore save_session (#547) 2024-05-29 07:06:05 +08:00
rolfwilms
569317728c
fix: bedrock issues (#544)
* Removed extraneous key [stream] for AWS Bedrock Claude models.

* Reduceddefault  AWS Bedrock llama-3 max_output_tokens to 2048 to align with API requirements.

---------

Co-authored-by: Rolf Wilms <rwilms@csc.com>
2024-05-26 05:52:24 +08:00
sigoden
5458150ed3
fix: json stream parser and refine client modules (#538) 2024-05-23 19:28:56 +08:00
sigoden
2ccbb0f06a
refactor: qiawen client add qwen-long (#537) 2024-05-23 12:26:24 +08:00
sigoden
50b13d2de9
refactor: rename function_filter to function_matcher (#535) 2024-05-22 21:37:24 +08:00
sigoden
ba3bcfd67c
feat: allow patching req body with client config (#534) 2024-05-22 21:29:23 +08:00
sigoden
91a06543b2
refactor: adjust some placeholder/prompt text (#530) 2024-05-21 08:06:57 +08:00
sigoden
2174dc055f
refactor: change prompt for executing tool functions (#529)
* refactor: change prompt for executing tool functions

* add AICHAT_FUNCTION_AUTO_EXECUTE to auto confirm
2024-05-21 07:56:02 +08:00
sigoden
d4de2d761b
refactor: no export LLM_FUNCTION_DATA, pass json as command arg (#528) 2024-05-20 17:27:34 +08:00
sigoden
1a2a261f9f
feat: webui operates independently from aichat (#527) 2024-05-20 10:30:07 +08:00
sigoden
7d7caf7479
refactor: webui enhence input textarea and copy message (#526) 2024-05-20 09:09:42 +08:00
sigoden
482822da53
fix: webui message body do not autoscroll to bottom sometimes (#525) 2024-05-20 07:56:57 +08:00
sigoden
9b7d93ee84
refactor: export LLM_FUNCTION_DATA when calling functions (#524) 2024-05-19 22:47:29 +08:00
sigoden
7f21854b9a
fix: webui issue with image (#523) 2024-05-19 13:32:43 +08:00
sigoden
eaf36cdfb5
refactor: webui limit height of input textarea (#522) 2024-05-19 11:36:37 +08:00
sigoden
1b9aed6afb
feat: webui add toolbox(copy-bt/regenerate-btn) to message (#521) 2024-05-19 10:08:44 +08:00
sigoden
31977f9642
fix: panic when check api error (#520) 2024-05-18 22:41:27 +08:00
sigoden
d16cca23ac
refactor: shell detect/prompt (#519) 2024-05-18 22:34:23 +08:00
sigoden
665693ccf2
refactor: non-zero exit code when model call fails (#518) 2024-05-18 19:22:04 +08:00
sigoden
b4a40e3fed
feat: support function calling (#514)
* feat: support function calling

* fix on Windows OS

* implement multi-steps function calling

* fix on Windows OS

* add error for client not support function calling

* refactor message data structure and make claude client supporting function calling

* support reuse previous call results

* improve error handling for function calling

* use prefix `may_`  as indicator for `execute` type fucntions
2024-05-18 19:06:21 +08:00
sigoden
1348a62e5f chore: update bug_report issue-template 2024-05-15 07:17:49 +00:00
sigoden
f9d45175d3
fix: unexpect enter repl if have pipe-in but no text args (#512) 2024-05-15 13:17:36 +08:00
sigoden
5378033b34
refactor: add gemini-1.5-flash to models.yaml (#510) 2024-05-15 09:36:31 +08:00
sigoden
12d8798e61
fix: no builtin roles if no roles.yaml (#509) 2024-05-15 09:25:39 +08:00
sigoden
64943f49d1 chore: update issue-templates 2024-05-15 00:41:41 +00:00
sigoden
5883a0685a refactor: improve tokenzie 2024-05-15 00:31:36 +00:00
sigoden
a75aef459d
feat: remove tiktoken (#506) 2024-05-14 15:02:58 +08:00
sigoden
369cf9a36a refactor: minor refinement 2024-05-14 06:13:42 +00:00
sigoden
79d0bba640
feat: allow binding model to the role (#505) 2024-05-14 12:43:16 +08:00
sigoden
5284a18248
refactor: config::Input (#503) 2024-05-14 11:16:55 +08:00
sigoden
154c1e0b4b
chore: release v0.17.0 (#497) 2024-05-14 06:36:10 +08:00
sigoden
20a507375e
refactor: update models.yaml (#501) 2024-05-14 06:11:34 +08:00
sigoden
1e8fc5d269
refactor: list roles includeing builtin roles (#499) 2024-05-11 09:23:59 +08:00
sigoden
058299e500 refactor: refine detect code block 2024-05-09 22:59:06 +00:00
sigoden
e2675d6b09
refactor: playground webapp supports structure prompt (#496) 2024-05-09 21:40:34 +08:00
sigoden
be7bae0475 refactor: improve revise command execution 2024-05-09 03:40:16 +00:00
sigoden
bc65e880be
feat: enhence roles with messages (#495) 2024-05-09 11:24:02 +08:00
sigoden
5d73768acc refactor: improve webui 2024-05-08 08:25:46 +00:00