version: 9.0.0

This commit is contained in:
Bryce 2023-02-05 09:53:24 -08:00
parent eeb80f81aa
commit 5bedaebe45
2 changed files with 13 additions and 6 deletions

View File

@ -298,6 +298,8 @@ docker run -it --gpus all -v $HOME/.cache/huggingface:/root/.cache/huggingface -
## ChangeLog ## ChangeLog
**9.0.0**
- perf: cli now has minimal overhead such that `aimg --help` runs in ~650ms instead of ~3400ms - perf: cli now has minimal overhead such that `aimg --help` runs in ~650ms instead of ~3400ms
- feature: `edit` and `imagine` commands now accept multiple images (which they will process separately). This allows - feature: `edit` and `imagine` commands now accept multiple images (which they will process separately). This allows
batch editing of images as requested in [#229](https://github.com/brycedrennan/imaginAIry/issues/229) batch editing of images as requested in [#229](https://github.com/brycedrennan/imaginAIry/issues/229)
@ -530,7 +532,7 @@ would be uncorrelated to the rest of the surrounding image. It created terrible
## Todo ## Todo
- Performance Optimizations - Inference Performance Optimizations
- ✅ fp16 - ✅ fp16
- ✅ [Doggettx Sliced attention](https://github.com/CompVis/stable-diffusion/compare/main...Doggettx:stable-diffusion:autocast-improvements#) - ✅ [Doggettx Sliced attention](https://github.com/CompVis/stable-diffusion/compare/main...Doggettx:stable-diffusion:autocast-improvements#)
- ✅ xformers support https://www.photoroom.com/tech/stable-diffusion-100-percent-faster-with-memory-efficient-attention/ - ✅ xformers support https://www.photoroom.com/tech/stable-diffusion-100-percent-faster-with-memory-efficient-attention/
@ -538,6 +540,7 @@ would be uncorrelated to the rest of the surrounding image. It created terrible
- https://github.com/CompVis/stable-diffusion/pull/177 - https://github.com/CompVis/stable-diffusion/pull/177
- https://github.com/huggingface/diffusers/pull/532/files - https://github.com/huggingface/diffusers/pull/532/files
- https://github.com/HazyResearch/flash-attention - https://github.com/HazyResearch/flash-attention
- https://github.com/chavinlo/sda-node
- Development Environment - Development Environment
- ✅ add tests - ✅ add tests
@ -545,8 +548,8 @@ would be uncorrelated to the rest of the surrounding image. It created terrible
- ✅ unified pipeline (txt2img & img2img combined) - ✅ unified pipeline (txt2img & img2img combined)
- ✅ setup parallel testing - ✅ setup parallel testing
- add docs - add docs
- remove yaml config - 🚫 remove yaml config
- delete more unused code - 🚫 delete more unused code
- faster latent logging https://discuss.huggingface.co/t/decoding-latents-to-rgb-without-upscaling/23204/9 - faster latent logging https://discuss.huggingface.co/t/decoding-latents-to-rgb-without-upscaling/23204/9
- Interface improvements - Interface improvements
- ✅ init-image at command line - ✅ init-image at command line
@ -555,14 +558,17 @@ would be uncorrelated to the rest of the surrounding image. It created terrible
- Image Generation Features - Image Generation Features
- ✅ add k-diffusion sampling methods - ✅ add k-diffusion sampling methods
- ✅ tiling - ✅ tiling
- generation videos/gifs - generation videos/gifs
- Compositional Visual Generation - Compositional Visual Generation
- https://github.com/energy-based-model/Compositional-Visual-Generation-with-Composable-Diffusion-Models-PyTorch - https://github.com/energy-based-model/Compositional-Visual-Generation-with-Composable-Diffusion-Models-PyTorch
- https://colab.research.google.com/github/energy-based-model/Compositional-Visual-Generation-with-Composable-Diffusion-Models-PyTorch/blob/main/notebooks/demo.ipynb#scrollTo=wt_j3uXZGFAS - https://colab.research.google.com/github/energy-based-model/Compositional-Visual-Generation-with-Composable-Diffusion-Models-PyTorch/blob/main/notebooks/demo.ipynb#scrollTo=wt_j3uXZGFAS
- ✅ negative prompting - ✅ negative prompting
- some syntax to allow it in a text string - some syntax to allow it in a text string
- 🚫 images as actual prompts instead of just init images. - [paint with words](https://www.reddit.com/r/StableDiffusion/comments/10lzgze/i_figured_out_a_way_to_apply_different_prompts_to/)
- https://github.com/cloneofsimo/paint-with-words-sd
- images as actual prompts instead of just init images.
- not directly possible due to model architecture. - not directly possible due to model architecture.
- can it just be integrated into sampler?
- requires model fine-tuning since SD1.4 expects 77x768 text encoding input - requires model fine-tuning since SD1.4 expects 77x768 text encoding input
- https://twitter.com/Buntworthy/status/1566744186153484288 - https://twitter.com/Buntworthy/status/1566744186153484288
- https://github.com/justinpinkney/stable-diffusion - https://github.com/justinpinkney/stable-diffusion
@ -645,6 +651,7 @@ would be uncorrelated to the rest of the surrounding image. It created terrible
- [Textual Inversion](https://arxiv.org/abs/2208.01618) - [Textual Inversion](https://arxiv.org/abs/2208.01618)
- [Fast Textual Inversion](https://github.com/peterwilli/sd-leap-booster) - [Fast Textual Inversion](https://github.com/peterwilli/sd-leap-booster)
- [Low-rank Adaptation for Fast Text-to-Image Diffusion Fine-tuning (LORA)](https://github.com/cloneofsimo/lora) - [Low-rank Adaptation for Fast Text-to-Image Diffusion Fine-tuning (LORA)](https://github.com/cloneofsimo/lora)
- https://huggingface.co/spaces/lora-library/Low-rank-Adaptation
- Performance Improvements - Performance Improvements
- [ColoassalAI](https://github.com/hpcaitech/ColossalAI/tree/main/examples/images/diffusion) - almost got it working but it's not easy enough to install to merit inclusion in imaginairy. We should check back in on this. - [ColoassalAI](https://github.com/hpcaitech/ColossalAI/tree/main/examples/images/diffusion) - almost got it working but it's not easy enough to install to merit inclusion in imaginairy. We should check back in on this.
- Xformers - Xformers

View File

@ -7,7 +7,7 @@ setup(
name="imaginAIry", name="imaginAIry",
author="Bryce Drennan", author="Bryce Drennan",
# author_email="b r y p y d o t io", # author_email="b r y p y d o t io",
version="8.3.1", version="9.0.0",
description="AI imagined images. Pythonic generation of stable diffusion images.", description="AI imagined images. Pythonic generation of stable diffusion images.",
long_description=readme, long_description=readme,
long_description_content_type="text/markdown", long_description_content_type="text/markdown",