Compare commits

...

2 Commits

Author SHA1 Message Date
Atinoda 1fdefb70de Remove deprecated information 3 weeks ago
Atinoda 2f8b2fe4bb Update deployment parameters
- Remove compose `version`
- Update folder structure with
  - `cache`
  - `instruction-templates`
- Add examples for ROCM/AMD deployment
- Add examples for custom embedding configurations
3 weeks ago

@ -26,7 +26,7 @@ This repo provides a template `docker-compose.yml` and a structured `config` fol
This is the recommended deployment method (it is the easiest and quickest way to manage folders and settings through updates and reinstalls). The recommended variant is `default` (it is the full version of the standard application with all default bundled extensions installed, set up for Nvidia GPU accelerated inference).
### Select variant
Each variant has the 'extras' included in `default` but has some changes made as described in the table. Tagged release versions are published on a regular basis - check [hub.docker.com/r/atinoda/text-generation-webui](https://hub.docker.com/r/atinoda/text-generation-webui) for available tags. Pulling an untagged variant will pull the latest stable release. Unstable, latest versions are available via nightly builds.
Tagged release versions are published on a regular basis - check [hub.docker.com/r/atinoda/text-generation-webui](https://hub.docker.com/r/atinoda/text-generation-webui) for available tags. Pulling an untagged variant will pull the latest stable release. Unstable, latest versions are available via nightly builds.
Choose the desired variant by setting the image `:tag` in `docker-compose.yml` using the pattern `{VARIANT}-{PLATFORM}`, or `{VARIANT}-{PLATFORM}-{VERSION}` to specify a specific release.
@ -152,11 +152,6 @@ The following are known issues and they are planned to be investigated. Testing
- `silero_tts`: Does not work due to pydantic dependency problem
- `superbooga`/`superboogav2`: Crashes on startup
## Kubernetes
~~Please see [EXTRA_LAUNCH_ARGS are not honored #25](https://github.com/Atinoda/text-generation-webui-docker/issues/25) for fixing deployments. *Thanks to @jrsperry for reporting, and @accountForIssues for sharing a workaround (TLDR: Escape space characters with `\ `, instead of writing as ` ` .)*~~
From `snapshot-2024-03-24` onwards, this workaround should not be necessary - but existing installations using it should remain functional with no required changes. **Please report if you have an issue!** This notice will be removed in due course. *Thanks to @[Artem-B](https://github.com/Artem-B) for finding the root cause and contributing a fix.*
# Contributions
Contributions are welcomed - please feel free to submit a PR. More variants (e.g., AMD/ROC-M support) and Windows support can help lower the barrier to entry, make this technology accessible to as many people as possible, and push towards democratising the severe impacts that AI is having on our society.

@ -1,4 +1,3 @@
version: "3"
services:
text-generation-webui-docker:
image: atinoda/text-generation-webui:default-nvidia # Specify variant as the :tag
@ -6,14 +5,18 @@ services:
environment:
- EXTRA_LAUNCH_ARGS="--listen --verbose" # Custom launch args (e.g., --model MODEL_NAME)
# - BUILD_EXTENSIONS_LIVE="coqui_tts whisper_stt" # Install named extensions during every container launch. THIS WILL SIGNIFICANLTLY SLOW LAUNCH TIME AND IS NORMALLY NOT REQUIRED.
# - OPENEDAI_EMBEDDING_MODEL=intfloat/e5-large-v2 # Specify custom model for embeddings
# - OPENEDAI_EMBEDDING_DEVICE=cuda # Specify processing device for embeddings
ports:
- 7860:7860 # Default web port
# - 5000:5000 # Default API port
# - 5005:5005 # Default streaming port
volumes:
- ./config/cache:/root/.cache # WARNING: Libraries may save large files here!
- ./config/characters:/app/characters
- ./config/instruction-templates:/app/instruction-templates
- ./config/loras:/app/loras
- ./config/models:/app/models
- ./config/models:/app/models # WARNING - very large files!
- ./config/presets:/app/presets
- ./config/prompts:/app/prompts
- ./config/training:/app/training
@ -23,8 +26,14 @@ services:
driver: json-file
options:
max-file: "3" # number of files or file count
max-size: '10m'
# Grant access to Nvidia GPU (comment out deploy: and below if not using Nvidia variant)
max-size: "10M"
### HARDWARE ACCELERATION: comment or uncomment according to your hardware! ###
### CPU only ###
# Nothing required - comment out the other hardware sections.
### Nvidia (default) ###
deploy:
resources:
reservations:
@ -32,3 +41,17 @@ services:
- driver: nvidia
device_ids: ['0']
capabilities: [gpu]
### AMD ROCM or Intel Arc ###
# stdin_open: true
# group_add:
# - video
# tty: true
# ipc: host
# devices:
# - /dev/kfd
# - /dev/dri
# cap_add:
# - SYS_PTRACE
# security_opt:
# - seccomp=unconfined

@ -10,7 +10,7 @@ function ctrl_c {
trap ctrl_c SIGTERM SIGINT SIGQUIT SIGHUP
# Generate default configs if empty
CONFIG_DIRECTORIES=("characters" "loras" "models" "presets" "prompts" "training/datasets" "training/formats")
CONFIG_DIRECTORIES=("characters" "instruction-templates" "loras" "models" "presets" "prompts" "training/datasets" "training/formats")
for config_dir in "${CONFIG_DIRECTORIES[@]}"; do
if [ -z "$(ls /app/"$config_dir")" ]; then
echo "*** Initialising config for: '$config_dir' ***"

Loading…
Cancel
Save