chrisbarrera
f8b1069a1c
add min_p sampling parameter (#2014)
Signed-off-by: Christopher Barrera <cb@arda.tx.rr.com>
Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>
2024-02-24 17:51:34 -05:00
Jared Van Bortel
061d1969f8
expose n_gpu_layers parameter of llama.cpp (#1890)
Also dynamically limit the GPU layers and context length fields to the maximum supported by the model.
Signed-off-by: Jared Van Bortel <jared@nomic.ai>
2024-01-31 14:17:44 -05:00
Jared Van Bortel
d1c56b8b28
Implement configurable context length (#1749)
2023-12-16 17:58:15 -05:00
Jared Van Bortel
d4ce9f4a7c
llmodel_c: improve quality of error messages (#1625)
2023-11-07 11:20:14 -05:00
Juuso Alasuutari
5cfb1bda89
llmodel: add model wrapper destructor, fix mem leak in golang bindings (#862)
Signed-off-by: Juuso Alasuutari <juuso.alasuutari@gmail.com>
2023-06-12 09:41:22 -07:00
Ettore Di Giacinto
44dc1ade62
Set thread counts after loading model (#836)
2023-06-05 21:35:40 +02:00
mudler
682a383e06
Drop leftover include
2023-06-01 13:03:44 -04:00
mudler
243c762411
Style
2023-06-01 10:36:22 -04:00
mudler
19dd6c7635
Debug
2023-06-01 10:36:22 -04:00
mudler
79cef86bec
Adapt code
2023-06-01 10:36:22 -04:00
mudler
c8c95ab46f
fix: adapt golang bindings to api changes
2023-05-22 11:52:56 -04:00
Ettore Di Giacinto
3f63cc6b47
Golang bindings initial working version(#534)
* WIP
* Fix includes
* Try to fix linking issues
* Refinements
* allow to load MPT and llama models too
* cleanup, add example, add README
2023-05-15 12:45:56 -04:00