Commit Graph

19 Commits (f605a5b686b2f7d9637f237b51bd35c8f5df59c6)

Author SHA1 Message Date
Adam Treat f605a5b686 Add q8_0 kernels to kompute shaders and bump to latest llama/gguf. 1 year ago
Adam Treat 5d346e13d7 Add q6_k kernels for vulkan. 1 year ago
Adam Treat 4eefd386d0 Refactor for subgroups on mat * vec kernel. 1 year ago
Aaron Miller 507753a37c macos build fixes 1 year ago
Adam Treat d90d003a1d Latest rebase on llama.cpp with gguf support. 1 year ago
Jacob Nguyen e86c63750d Update llama.cpp.cmake
Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com>
1 year ago
Adam Treat c953b321b7 Don't link against libvulkan. 1 year ago
Adam Treat 987546c63b Nomic vulkan backend licensed under the Software for Open Models License (SOM), version 1.0. 1 year ago
Adam Treat d55cbbee32 Update to newer llama.cpp and disable older forks. 1 year ago
Adam Treat 84deebd223 Fix compile for windows and linux again. PLEASE DON'T REVERT THISgit gui! 1 year ago
Cosmic Snow ae4a275bcd Fix Windows MSVC AVX builds
- bug introduced in 0cb2b86730
- currently getting: `warning C5102: ignoring invalid command-line macro definition '/arch:AVX2'`
- solution is to use `_options(...)` not `_definitions(...)`
1 year ago
Aaron Miller d3ba1295a7
Metal+LLama take two (#929)
Support latest llama with Metal
---------

Co-authored-by: Adam Treat <adam@nomic.ai>
Co-authored-by: niansa/tuxifan <tuxifan@posteo.de>
1 year ago
Adam Treat b162b5c64e Revert "llama on Metal (#885)"
This reverts commit c55f81b860.
1 year ago
Aaron Miller c55f81b860
llama on Metal (#885)
Support latest llama with Metal

---------

Co-authored-by: Adam Treat <adam@nomic.ai>
Co-authored-by: niansa/tuxifan <tuxifan@posteo.de>
1 year ago
niansa 0cb2b86730 Synced llama.cpp.cmake with upstream 1 year ago
Adam Treat 010a04d96f Revert "Synced llama.cpp.cmake with upstream (#887)"
This reverts commit 89910c7ca8.
1 year ago
niansa/tuxifan 89910c7ca8
Synced llama.cpp.cmake with upstream (#887) 1 year ago
Adam Treat c5de9634c9 Fix llama models on linux and windows. 1 year ago
AT 48275d0dcc
Dlopen backend 5 (#779)
Major change to the backend that allows for pluggable versions of llama.cpp/ggml. This was squashed merged from dlopen_backend_5 where the history is preserved.
1 year ago