14 Commits (84c1bbe8833d8a5530923e70ed2787445167e8ba)

Author SHA1 Message Date
Atinoda 84c1bbe883 Integrate `llama-cublas` into base image
- Update README.md with deprecation warnings and improved descriptions.
- Fix the build error in the latest `llama-cpp-python` version by removing CMAKE directives.
1 year ago
Atinoda ff496b6929 Integrate ExLlama in base image
Closes #6
1 year ago
Atinoda ee3f779157
Update README.md 1 year ago
Atinoda 068bea0948 Minor update README.me 1 year ago
Atinoda 19f9c1b1ac Implement persistent extensions with optional runtime build
- Extensions are now explicitly copied from source to named volume
- Optional rebuild at runtime allows for extensions not present during build
- Closes #3
1 year ago
Atinoda 7caaaa4a7c Update to upstream changes
- AutoGPTQ manual installation removed (it is included in requirements)
- Softprompt config removed
- Build date print-out added to `docker-entrypoint.sh`
- `README.md` updated
1 year ago
Atinoda 524dad64c9 Fix `api` set up hint in README.md 1 year ago
Atinoda a0137d40b8 Refactor to pull Docker hub images
Also fixes `triton` dependency conflict
1 year ago
Atinoda 1f409aefcd Update README.md
Shoutout to PanQiWei.
1 year ago
Atinoda eafec5449a Update README.md 1 year ago
Atinoda 19133da4a7 Add disclaimer 1 year ago
Atinoda 14a3bd3138 Implement `monkey-patch` variant 1 year ago
Atinoda dd8dab6fb9 Implement llama-cpp with CUBLAS
Also switch to devel image for deployment - it will be more flexible for variant builds.
1 year ago
Atinoda bd884491e4 Initial commit 1 year ago