Atinoda
afeeb9e8ae
Fix launch arg array handling
2 months ago
Atinoda
994e701bf2
Rephrase ROCM and Arc variants support
3 months ago
Atinoda
02ebf86b5f
Add ROCM test results to README.md
3 months ago
Atinoda
58392b249b
Update docs for refactored variants
4 months ago
Atinoda
2a1dbbf43f
Update docs
4 months ago
Atinoda
89aa0183ab
Improve documentation
...
- Add Quick-Start section
- Expand Usage description
- Signpost Kubernetes issue
- Deprecate `monkey-patch`
8 months ago
Atinoda
13c7bad5cd
Update README.md
9 months ago
Benjamin McLean
8f7d865b5e
README.md spelling ( #24 )
9 months ago
Atinoda
faab710cbc
Add Exllamav2 to base image
9 months ago
Atinoda
e1d999f0b0
Document dated milestone pseudo-version releases
9 months ago
Atinoda
ffd25bc4e1
Update README.md about versions and nightlies
9 months ago
Atinoda
65897fd7c1
Implement nightly builds of all variants
9 months ago
Atinoda
43392402e0
Deprecate `llama-cublas` variant
...
`default` already includes CUDA GPU offloading for llama
9 months ago
Atinoda
f85bf702ea
Implement versioned builds
...
- Introduce new build arg `VERSION_TAG`
- Update version freshness checking
- Rename docker-compose build example file
9 months ago
Atinoda
00340c0504
Create `llama-cpu` variant for systems without GPU
...
Fixes #16
10 months ago
Atinoda
b6b1ca1391
Implement per-extension initialisation
...
(Plansee)
11 months ago
Atinoda
ef85657f3c
Implement Nightly builds as Github workflow ( #10 )
11 months ago
Atinoda
d8fcbd7ca3
Fix example standalone run commands
...
Credit to @sdizen in #7 connection refused issue - thanks for sharing the fix!
11 months ago
Atinoda
84c1bbe883
Integrate `llama-cublas` into base image
...
- Update README.md with deprecation warnings and improved descriptions.
- Fix the build error in the latest `llama-cpp-python` version by removing CMAKE directives.
12 months ago
Atinoda
ff496b6929
Integrate ExLlama in base image
...
Closes #6
12 months ago
Atinoda
ee3f779157
Update README.md
12 months ago
Atinoda
068bea0948
Minor update README.me
12 months ago
Atinoda
19f9c1b1ac
Implement persistent extensions with optional runtime build
...
- Extensions are now explicitly copied from source to named volume
- Optional rebuild at runtime allows for extensions not present during build
- Closes #3
12 months ago
Atinoda
7caaaa4a7c
Update to upstream changes
...
- AutoGPTQ manual installation removed (it is included in requirements)
- Softprompt config removed
- Build date print-out added to `docker-entrypoint.sh`
- `README.md` updated
1 year ago
Atinoda
524dad64c9
Fix `api` set up hint in README.md
1 year ago
Atinoda
a0137d40b8
Refactor to pull Docker hub images
...
Also fixes `triton` dependency conflict
1 year ago
Atinoda
1f409aefcd
Update README.md
...
Shoutout to PanQiWei.
1 year ago
Atinoda
eafec5449a
Update README.md
1 year ago
Atinoda
19133da4a7
Add disclaimer
1 year ago
Atinoda
14a3bd3138
Implement `monkey-patch` variant
1 year ago
Atinoda
dd8dab6fb9
Implement llama-cpp with CUBLAS
...
Also switch to devel image for deployment - it will be more flexible for variant builds.
1 year ago
Atinoda
bd884491e4
Initial commit
1 year ago