Commit Graph

12 Commits

Author SHA1 Message Date
chrisbarrera
f8b1069a1c
add min_p sampling parameter (#2014)
Signed-off-by: Christopher Barrera <cb@arda.tx.rr.com>
Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>
2024-02-24 17:51:34 -05:00
Jared Van Bortel
061d1969f8
expose n_gpu_layers parameter of llama.cpp (#1890)
Also dynamically limit the GPU layers and context length fields to the maximum supported by the model.

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
2024-01-31 14:17:44 -05:00
Jared Van Bortel
d1c56b8b28
Implement configurable context length (#1749) 2023-12-16 17:58:15 -05:00
Jared Van Bortel
d4ce9f4a7c
llmodel_c: improve quality of error messages (#1625) 2023-11-07 11:20:14 -05:00
Juuso Alasuutari
5cfb1bda89
llmodel: add model wrapper destructor, fix mem leak in golang bindings (#862)
Signed-off-by: Juuso Alasuutari <juuso.alasuutari@gmail.com>
2023-06-12 09:41:22 -07:00
Ettore Di Giacinto
44dc1ade62
Set thread counts after loading model (#836) 2023-06-05 21:35:40 +02:00
mudler
682a383e06 Drop leftover include 2023-06-01 13:03:44 -04:00
mudler
243c762411 Style 2023-06-01 10:36:22 -04:00
mudler
19dd6c7635 Debug 2023-06-01 10:36:22 -04:00
mudler
79cef86bec Adapt code 2023-06-01 10:36:22 -04:00
mudler
c8c95ab46f fix: adapt golang bindings to api changes 2023-05-22 11:52:56 -04:00
Ettore Di Giacinto
3f63cc6b47
Golang bindings initial working version(#534)
* WIP

* Fix includes

* Try to fix linking issues

* Refinements

* allow to load MPT and llama models too

* cleanup, add example, add README
2023-05-15 12:45:56 -04:00