13 Commits (061d1969f8f199c09ac94a92cd74f59ab827be13)

Author SHA1 Message Date
Jared Van Bortel 061d1969f8
expose n_gpu_layers parameter of llama.cpp (#1890)
Also dynamically limit the GPU layers and context length fields to the maximum supported by the model.

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
8 months ago
Gerhard Stein 3e99b90c0b Some cleanups 9 months ago
Adam Treat 908aec27fe Always save chats to disk, but save them as text by default. This also changes
the UI behavior to always open a 'New Chat' and set it as current instead
of setting a restored chat as current. This improves usability by not requiring
the user to wait if they want to immediately start chatting.
12 months ago
Adam Treat 285aa50b60 Consolidate generation and application settings on the new settings object. 1 year ago
Adam Treat 7f01b153b3 Modellist temp 1 year ago
Adam Treat 968868415e Move saving chats to a thread and display what we're doing to the user. 1 year ago
Adam Treat c8a590bc6f Get rid of last blocking operations and make the chat/llm thread safe. 1 year ago
Adam Treat 9f590db98d Better error handling when the model fails to load. 1 year ago
Adam Treat f931de21c5 Add save/restore to chatgpt chats and allow serialize/deserialize from disk. 1 year ago
Adam Treat b71c0ac3bd The server has different lifetime mgmt than the other chats. 1 year ago
Adam Treat ddc24acf33 Much better memory mgmt for multi-threaded model loading/unloading. 1 year ago
Adam Treat 2989b74d43 httpserver 1 year ago
Adam Treat 6015154bef Moving everything to subdir for monorepo merge. 1 year ago