mirror of https://github.com/brycedrennan/imaginAIry synced 2024-10-31 03:20:40 +00:00

Go to file

Bryce fa3673ef56 fix: use context manager syntax compatible with python 3.8 parentheses supported in python 3.10		2022-09-14 21:24:05 -07:00
assets	feature: face enhancement and upscaling!!	2022-09-13 00:27:53 -07:00
imaginairy	fix: use context manager syntax compatible with python 3.8	2022-09-14 21:24:05 -07:00
tests	refactor/test: logging suppression + hashed image test	2022-09-14 19:40:50 -07:00
.gitignore	refactor/test: logging suppression + hashed image test	2022-09-14 19:40:50 -07:00
LICENSE	docs: update readme. add docs to package	2022-09-11 21:36:14 -07:00
Makefile	fix: make k-diffusion samplers deterministic	2022-09-14 09:37:45 -07:00
README.md	fix: make k-diffusion samplers deterministic	2022-09-14 09:37:45 -07:00
requirements-dev.in	ci: add linter, import sorting	2022-09-11 13:57:36 -07:00
requirements-dev.txt	feature: k-diffusion samplers	2022-09-14 00:40:25 -07:00
setup.py	fix: filter logic was wrong	2022-09-14 20:01:12 -07:00
STABLE_DIFFUSION_LICENSE	refactor: simplify structure	2022-09-11 00:59:03 -07:00
tox.ini	fix: filter logic was wrong	2022-09-14 20:01:12 -07:00

README.md

ImaginAIry 🤖🧠

AI imagined images. Pythonic generation of stable diffusion images.

"just works" on Linux and OSX(M1).

Examples

>> pip install imaginairy
>> imagine "a scenic landscape" "a photo of a dog" "photo of a fruit bowl" "portrait photo of a freckled woman"

Console Output

🤖🧠 received 4 prompt(s) and will repeat them 1 times to create 4 images.
Loading model onto mps backend...
Generating 🖼  : "a scenic landscape" 512x512px seed:557988237 prompt-strength:7.5 steps:40 sampler-type:PLMS
    PLMS Sampler: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 40/40 [00:29<00:00,  1.36it/s]
    🖼  saved to: ./outputs/000001_557988237_PLMS40_PS7.5_a_scenic_landscape.jpg
Generating 🖼  : "a photo of a dog" 512x512px seed:277230171 prompt-strength:7.5 steps:40 sampler-type:PLMS
    PLMS Sampler: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 40/40 [00:28<00:00,  1.41it/s]
    🖼  saved to: ./outputs/000002_277230171_PLMS40_PS7.5_a_photo_of_a_dog.jpg
Generating 🖼  : "photo of a fruit bowl" 512x512px seed:639753980 prompt-strength:7.5 steps:40 sampler-type:PLMS
    PLMS Sampler: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 40/40 [00:28<00:00,  1.40it/s]
    🖼  saved to: ./outputs/000003_639753980_PLMS40_PS7.5_photo_of_a_fruit_bowl.jpg
Generating 🖼  : "portrait photo of a freckled woman" 512x512px seed:500686645 prompt-strength:7.5 steps:40 sampler-type:PLMS
    PLMS Sampler: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 40/40 [00:29<00:00,  1.37it/s]
    🖼  saved to: ./outputs/000004_500686645_PLMS40_PS7.5_portrait_photo_of_a_freckled_woman.jpg

Tiled Images

>> imagine  "gold coins" "a lush forest" "piles of old books" leaves --tile

Image-to-Image

>> imagine "portrait of a smiling lady. oil painting" --init-image girl_with_a_pearl_earring.jpg

Face Enhancement by CodeFormer

>> imagine "a couple smiling" --steps 40 --seed 1 --fix-faces

Upscaling by RealESRGAN

>> imagine "colorful smoke" --steps 40 --upscale

Features

It makes images from text descriptions! 🎉
Generate images either in code or from command line.
It just works. Proper requirements are installed. model weights are automatically downloaded. No huggingface account needed. (if you have the right hardware... and aren't on windows)
No more distorted faces!
Noisy logs are gone (which was surprisingly hard to accomplish)
WeightedPrompts let you smash together separate prompts (cat-dog)
Tile Mode creates tileable images
Prompt metadata saved into image file metadata

How To

from imaginairy import imagine, imagine_image_files, ImaginePrompt, WeightedPrompt

prompts = [
    ImaginePrompt("a scenic landscape", seed=1),
    ImaginePrompt("a bowl of fruit"),
    ImaginePrompt([
        WeightedPrompt("cat", weight=1),
        WeightedPrompt("dog", weight=1),
    ])
]
for result in imagine(prompts):
    # do something
    result.save("my_image.jpg")

# or

imagine_image_files(prompts, outdir="./my-art")

Requirements

~10 gb space for models to download
A decent computer with either a CUDA supported graphics card or M1 processor.

Improvements from CompVis

img2img actually does # of steps you specify
performance optimizations

Models Used

Not Supported

a web interface. this is a python library

Todo

performance optimizations
✅ deploy to pypi
find similar images https://knn5.laion.ai/?back=https%3A%2F%2Fknn5.laion.ai%2F&index=laion5B&useMclip=false
Development Environment
- add tests
- set up ci (test/lint/format)
- add docs
- remove yaml config
- delete more unused code
Interface improvements
- ✅ init-image at command line
- prompt expansion
Image Generation Features
- ✅ add k-diffusion sampling methods
- why is k-diffusion so slow compared to plms? 2 it/s vs 8 it/s
- upscaling
  - ✅ realesrgan
  - ldm
  - https://github.com/lowfuel/progrock-stable
- ✅ face enhancers
  - ✅ gfpgan - https://github.com/TencentARC/GFPGAN
  - ✅ codeformer - https://github.com/sczhou/CodeFormer
- image describe feature - https://replicate.com/methexis-inc/img2prompt
- outpainting
- inpainting
- CPU support
- img2img for plms?
- images as actual prompts instead of just init images
- cross-attention control:
  - https://github.com/bloc97/CrossAttentionControl/blob/main/CrossAttention_Release_NoImages.ipynb
- guided generation https://colab.research.google.com/drive/1dlgggNa5Mz8sEAGU0wFCHhGLFooW_pf1#scrollTo=UDeXQKbPTdZI
- ✅ tiling
- output show-work videos
- image variations https://github.com/lstein/stable-diffusion/blob/main/VARIATIONS.md
- textual inversion
- fix saturation at high CFG https://www.reddit.com/r/StableDiffusion/comments/xalo78/fixing_excessive_contrastsaturation_resulting/
- https://www.reddit.com/r/StableDiffusion/comments/xbrrgt/a_rundown_of_twenty_new_methodsoptions_added_to/