Atinoda
ac35738978
Announce phi3 exllamav2 dev variant
3 days ago
Atinoda
1fdefb70de
Remove deprecated information
1 week ago
Atinoda
2f8b2fe4bb
Update deployment parameters
...
- Remove compose `version`
- Update folder structure with
- `cache`
- `instruction-templates`
- Add examples for ROCM/AMD deployment
- Add examples for custom embedding configurations
1 week ago
Atinoda
afeeb9e8ae
Fix launch arg array handling
1 month ago
Atinoda
6726a71bf6
Bump torch versions
...
Closes #44
2 months ago
Atinoda
994e701bf2
Rephrase ROCM and Arc variants support
2 months ago
Atinoda
6f2d496213
Delete `arc` and `rocm` nightlies
2 months ago
Atinoda
02ebf86b5f
Add ROCM test results to README.md
2 months ago
Atinoda
083d958ced
Create LICENSE
...
AGPL-3.0 license, matching the containerised project.
3 months ago
Atinoda
58392b249b
Update docs for refactored variants
3 months ago
Atinoda
d21346e3f4
Nightly workflows refactor ( #39 )
...
* Create default-nightly.yml
* Nightly variants
* Schedule nightly builds
* Disable arc nightly, More space required on builder
* Disable rocm nightly, More disk space required on builder
3 months ago
Atinoda
f971ca054d
Remove `unstable``tags from rocm and arc
3 months ago
Atinoda
76742f1fbd
Disable nightly builds
3 months ago
Atinoda
2a1dbbf43f
Update docs
3 months ago
Atinoda
0b6b7bc523
Update compose
3 months ago
Atinoda
b38cb4c9f9
Refactor Dockerfile and variants
3 months ago
Atinoda
e6f0ec9837
Bump CUDA version to 12.1
...
Fixes #31
7 months ago
Atinoda
d4b58daffe
Separate nightly builds
7 months ago
Atinoda
89aa0183ab
Improve documentation
...
- Add Quick-Start section
- Expand Usage description
- Signpost Kubernetes issue
- Deprecate `monkey-patch`
7 months ago
Atinoda
0575195a24
Pin torch to CUDA 11.8
7 months ago
Atinoda
123618d6bf
Remove hotfix for ExLlamaV2
...
Issue is resolved upstream
8 months ago
Atinoda
e704efd2fa
Hotfix for ExLlamaV2
...
See:
https://github.com/oobabooga/text-generation-webui/issues/4002
and
ec5164b8a8
?diff=split
8 months ago
Atinoda
13c7bad5cd
Update README.md
8 months ago
Benjamin McLean
8f7d865b5e
README.md spelling ( #24 )
8 months ago
Atinoda
faab710cbc
Add Exllamav2 to base image
8 months ago
Atinoda
e1d999f0b0
Document dated milestone pseudo-version releases
8 months ago
Atinoda
5ff0a9571c
Delete docker-nightly.yml
9 months ago
Atinoda
ffd25bc4e1
Update README.md about versions and nightlies
9 months ago
Atinoda
65897fd7c1
Implement nightly builds of all variants
9 months ago
Atinoda
43392402e0
Deprecate `llama-cublas` variant
...
`default` already includes CUDA GPU offloading for llama
9 months ago
Atinoda
f85bf702ea
Implement versioned builds
...
- Introduce new build arg `VERSION_TAG`
- Update version freshness checking
- Rename docker-compose build example file
9 months ago
Atinoda
79c7cf645e
Fix `CMD` for `llama-cpu` variant
9 months ago
Atinoda
00340c0504
Create `llama-cpu` variant for systems without GPU
...
Fixes #16
9 months ago
Atinoda
6562cf0e16
Update config directories and compose example
...
Add `characters` directory to config
Remove softprompts from compose
10 months ago
Atinoda
b6b1ca1391
Implement per-extension initialisation
...
(Plansee)
10 months ago
Atinoda
4c24344796
Remove ExLlama manual installation (no longer required)
11 months ago
Atinoda
ef85657f3c
Implement Nightly builds as Github workflow ( #10 )
11 months ago
Atinoda
5b6477ddf3
Move nightly builds to dedicated branch
11 months ago
Atinoda
b9d7caffdf
Update docker-nightly.yml
11 months ago
Atinoda
f4638e1bd2
Create docker-nightly.yml
11 months ago
Atinoda
d8fcbd7ca3
Fix example standalone run commands
...
Credit to @sdizen in #7 connection refused issue - thanks for sharing the fix!
11 months ago
Atinoda
84c1bbe883
Integrate `llama-cublas` into base image
...
- Update README.md with deprecation warnings and improved descriptions.
- Fix the build error in the latest `llama-cpp-python` version by removing CMAKE directives.
11 months ago
Atinoda
ff496b6929
Integrate ExLlama in base image
...
Closes #6
11 months ago
Atinoda
ee3f779157
Update README.md
11 months ago
Atinoda
56f068d4e8
Disable extensions folder mapping by default
11 months ago
Atinoda
068bea0948
Minor update README.me
11 months ago
Atinoda
19f9c1b1ac
Implement persistent extensions with optional runtime build
...
- Extensions are now explicitly copied from source to named volume
- Optional rebuild at runtime allows for extensions not present during build
- Closes #3
11 months ago
Atinoda
7caaaa4a7c
Update to upstream changes
...
- AutoGPTQ manual installation removed (it is included in requirements)
- Softprompt config removed
- Build date print-out added to `docker-entrypoint.sh`
- `README.md` updated
11 months ago
Atinoda
524dad64c9
Fix `api` set up hint in README.md
11 months ago
Atinoda
a0137d40b8
Refactor to pull Docker hub images
...
Also fixes `triton` dependency conflict
11 months ago