60 Commits (1fdefb70de2473a0d15b36fc3694c68ede0c287b)
 

Author SHA1 Message Date
Atinoda 1fdefb70de Remove deprecated information 2 months ago
Atinoda 2f8b2fe4bb Update deployment parameters
- Remove compose `version`
- Update folder structure with
  - `cache`
  - `instruction-templates`
- Add examples for ROCM/AMD deployment
- Add examples for custom embedding configurations
2 months ago
Atinoda afeeb9e8ae Fix launch arg array handling 3 months ago
Atinoda 6726a71bf6 Bump torch versions
Closes #44
3 months ago
Atinoda 994e701bf2 Rephrase ROCM and Arc variants support 4 months ago
Atinoda 6f2d496213 Delete `arc` and `rocm` nightlies 4 months ago
Atinoda 02ebf86b5f
Add ROCM test results to README.md 4 months ago
Atinoda 083d958ced
Create LICENSE
AGPL-3.0 license, matching the containerised project.
4 months ago
Atinoda 58392b249b Update docs for refactored variants 4 months ago
Atinoda d21346e3f4
Nightly workflows refactor (#39)
* Create default-nightly.yml
* Nightly variants
* Schedule nightly builds
* Disable arc nightly, More space required on builder
* Disable rocm nightly, More disk space required on builder
4 months ago
Atinoda f971ca054d Remove `unstable``tags from rocm and arc 4 months ago
Atinoda 76742f1fbd Disable nightly builds 4 months ago
Atinoda 2a1dbbf43f Update docs 4 months ago
Atinoda 0b6b7bc523 Update compose 4 months ago
Atinoda b38cb4c9f9 Refactor Dockerfile and variants 4 months ago
Atinoda e6f0ec9837 Bump CUDA version to 12.1
Fixes #31
8 months ago
Atinoda d4b58daffe Separate nightly builds 8 months ago
Atinoda 89aa0183ab Improve documentation
- Add Quick-Start section
- Expand Usage description
- Signpost Kubernetes issue
- Deprecate `monkey-patch`
8 months ago
Atinoda 0575195a24 Pin torch to CUDA 11.8 8 months ago
Atinoda 123618d6bf Remove hotfix for ExLlamaV2
Issue is resolved upstream
9 months ago
Atinoda e704efd2fa Hotfix for ExLlamaV2
See:
https://github.com/oobabooga/text-generation-webui/issues/4002
and
ec5164b8a8?diff=split
9 months ago
Atinoda 13c7bad5cd
Update README.md 9 months ago
Benjamin McLean 8f7d865b5e
README.md spelling (#24) 9 months ago
Atinoda faab710cbc Add Exllamav2 to base image 9 months ago
Atinoda e1d999f0b0 Document dated milestone pseudo-version releases 9 months ago
Atinoda 5ff0a9571c Delete docker-nightly.yml 10 months ago
Atinoda ffd25bc4e1 Update README.md about versions and nightlies 10 months ago
Atinoda 65897fd7c1 Implement nightly builds of all variants 10 months ago
Atinoda 43392402e0 Deprecate `llama-cublas` variant
`default` already includes CUDA GPU offloading for llama
10 months ago
Atinoda f85bf702ea Implement versioned builds
- Introduce new build arg `VERSION_TAG`
- Update version freshness checking
- Rename docker-compose build example file
10 months ago
Atinoda 79c7cf645e Fix `CMD` for `llama-cpu` variant 11 months ago
Atinoda 00340c0504 Create `llama-cpu` variant for systems without GPU
Fixes #16
11 months ago
Atinoda 6562cf0e16 Update config directories and compose example
Add `characters` directory to config
Remove softprompts from compose
11 months ago
Atinoda b6b1ca1391 Implement per-extension initialisation
(Plansee)
11 months ago
Atinoda 4c24344796 Remove ExLlama manual installation (no longer required) 12 months ago
Atinoda ef85657f3c
Implement Nightly builds as Github workflow (#10) 12 months ago
Atinoda 5b6477ddf3 Move nightly builds to dedicated branch 12 months ago
Atinoda b9d7caffdf
Update docker-nightly.yml 12 months ago
Atinoda f4638e1bd2
Create docker-nightly.yml 12 months ago
Atinoda d8fcbd7ca3
Fix example standalone run commands
Credit to @sdizen in #7 connection refused issue - thanks for sharing the fix!
1 year ago
Atinoda 84c1bbe883 Integrate `llama-cublas` into base image
- Update README.md with deprecation warnings and improved descriptions.
- Fix the build error in the latest `llama-cpp-python` version by removing CMAKE directives.
1 year ago
Atinoda ff496b6929 Integrate ExLlama in base image
Closes #6
1 year ago
Atinoda ee3f779157
Update README.md 1 year ago
Atinoda 56f068d4e8 Disable extensions folder mapping by default 1 year ago
Atinoda 068bea0948 Minor update README.me 1 year ago
Atinoda 19f9c1b1ac Implement persistent extensions with optional runtime build
- Extensions are now explicitly copied from source to named volume
- Optional rebuild at runtime allows for extensions not present during build
- Closes #3
1 year ago
Atinoda 7caaaa4a7c Update to upstream changes
- AutoGPTQ manual installation removed (it is included in requirements)
- Softprompt config removed
- Build date print-out added to `docker-entrypoint.sh`
- `README.md` updated
1 year ago
Atinoda 524dad64c9 Fix `api` set up hint in README.md 1 year ago
Atinoda a0137d40b8 Refactor to pull Docker hub images
Also fixes `triton` dependency conflict
1 year ago
Atinoda b29d617880 Fix `cuda` variant dependency conflict
This variant will be pruned at a later date if the upstream author does not continue development on the branch.
1 year ago