by Chris Jones cmsj@tenshu.net
Dreambot is a distributed architecture for running chat bots.
```mermaid
flowchart LR
    subgraph NATS [NATS JetStream Cluster]
        C[nats-1]
        I[nats-2]
        J[nats-3]
        C <--> I
        C <--> J
        I <--> J
    end
    subgraph Dreambot
        B([IRC Frontend])
        E([GPT Backend])
        D([Discord Frontend])
        F([A1111 Backend])
        M([A1111 Backend])
        O([ComfyUI Backend])
        K([Command Backend])
        NATS
    end
    A((Users)) <-. IRC Servers .-> B <--> NATS <--> E <-.-> G{{OpenAI API}}
    A <-. Discord Servers .-> D <--> NATS <--> F <-.-> H{{A1111 API}} <--> L{{GPU}}
    NATS <--> M <-.-> N{{A1111 API}} <--> L
    NATS <--> O <-.-> P{{ComfyUI API}} <--> L
    NATS <--> K
```
- Dreambot runs as multiple processes, each of which is either a frontend or a backend.
- Each process subscribes to one or more message queues in NATS, from which it receives its work.
- Frontends listen for trigger keywords from users and publish the resulting requests to NATS, where they are picked up by backends.
- Backends perform the requested work and publish the results back to NATS, where they are picked up by the frontends and sent back to the user (a minimal sketch of this flow follows below).
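To make that flow concrete, here is a minimal sketch of a backend process using the nats-py client. The `backend.gpt` subject matches the trigger mappings in the config files below, but the handler logic and the `frontend.irc` reply subject are illustrative assumptions, not Dreambot's actual protocol:

```python
import asyncio
import nats

async def main():
    # Connect to any member of the NATS cluster
    nc = await nats.connect(servers=["nats://nats-1:4222",
                                     "nats://nats-2:4222",
                                     "nats://nats-3:4222"])

    async def handler(msg):
        # A real backend would decode the request, do the actual work
        # (e.g. call an inference API), then publish the result back
        # to NATS for the originating frontend to pick up.
        request = msg.data.decode()
        result = f"processed: {request}"
        await nc.publish("frontend.irc", result.encode())  # assumed reply subject

    # Receive work that frontends publish for this backend
    await nc.subscribe("backend.gpt", cb=handler)
    await asyncio.Event().wait()  # serve forever

asyncio.run(main())
```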
Frontends:
- IRC
- Discord
Backends:
- OpenAI's GPT Chat Completions
- A1111's (AUTOMATIC1111's) Stable Diffusion web UI
- ComfyUI for Stable Diffusion image generation
- Commands - simple bot commands, which currently include:
  - chance (a sketch of a possible implementation follows below):
    - Usage: `!chance rain tomorrow`
    - Action: Returns a random percentage chance of rain tomorrow
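A command backend handler for chance could be as small as the following; this is a hypothetical illustration rather than Dreambot's actual code:

```python
import random

def chance(topic: str) -> str:
    """Hypothetical handler for `!chance <topic>`."""
    return f"There is a {random.randint(0, 100)}% chance of {topic}"

print(chance("rain tomorrow"))  # e.g. "There is a 42% chance of rain tomorrow"
```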
The initial prototype was designed to run an IRC bot frontend on a server that is trusted to talk to the Internet, and a Stable Diffusion backend process on a machine that is not trusted to talk to the Internet, with a websocket connection between them.
While this prototype worked well, it was not easy to extend to support Discord and additional types of backend. With the current design, each process knows nothing about the other processes, and can be run on any machine that can connect to the NATS cluster.
Additionally, it can be useful to spread out backends across machines that have relevant GPU hardware, and it's beneficial to have each backend in its own process because AI/ML code tends to have extremely specific requirements for Python and dependency library versions, which may not all be compatible in a single module.
I deploy all of this using Docker Compose, and here are the approximate steps I use. This all assumes:
- You're running Ubuntu Server 22.04
- You have an NVIDIA GPU that you want to use for A1111
- You have ample storage in `/srv/docker/` to attach to the containers (in particular, A1111 will need >10GB for its model cache)
- You have a web server running that can serve files from `/srv/public/`
Create the required directories:

```bash
mkdir -p /srv/docker/nats/{config,nats-1,nats-2,nats-3}
mkdir -p /srv/docker/a1111/data
mkdir -p /srv/docker/dreambot/config
```

Create the required files:
`/srv/docker/nats/config/nats-server.conf`:

```
host: 0.0.0.0
port: 4222
http: 0.0.0.0:8222
max_payload: 8388608 # Default is 1MB, but we send images over NATS, so bump it to 8MB

jetstream {
    store_dir: /data
    # 1 GB
    max_memory_store: 1073741824
    # 10 GB
    max_file_store: 10737418240
}

cluster {
    name: dreambot
    host: 0.0.0.0
    port: 4245
    routes: [
        nats-route://nats-1:4245,
        nats-route://nats-2:4245,
        nats-route://nats-3:4245
    ]
}
```
The NATS configuration format has many more options than shown here; see their docs for details.
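Once the cluster is up, a quick way to confirm the settings took effect is to connect with the nats-py client and check the negotiated maximum payload (a minimal sketch; it assumes the hostnames from the Docker Compose network described below):

```python
import asyncio
import nats

async def main():
    # Connect to any member of the cluster
    nc = await nats.connect(servers=["nats://nats-1:4222",
                                     "nats://nats-2:4222",
                                     "nats://nats-3:4222"])
    # Should print 8388608 if the max_payload bump above took effect
    print("max_payload =", nc.max_payload)
    await nc.close()

asyncio.run(main())
```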
`/srv/docker/dreambot/config/config-frontend-irc.json`:

Notes:

- `uri_base` should be where your webserver has this container's `/data` volume mounted

```json
{
  "triggers": {
    "!gpt": "backend.gpt",
    "!dream": "backend.a1111",
    "!comfy": "backend.comfyui"
  },
  "nats_uri": [
    "nats://nats-1:4222",
    "nats://nats-2:4222",
    "nats://nats-3:4222"
  ],
  "output_dir": "/data",
  "uri_base": "https://dreams.mydomain.com",
  "irc": [
    {
      "nickname": "dreambot",
      "ident": "dreambot",
      "realname": "I've dreamed things you people wouldn't believe",
      "host": "irc.server.com",
      "port": 6667,
      "ssl": false,
      "channels": ["#dreambot"]
    },
    {
      "nickname": "dreambot",
      "ident": "dreambot",
      "realname": "I've dreamed things you people wouldn't believe",
      "host": "irc.something.org",
      "port": 6667,
      "ssl": false,
      "channels": ["#dreambot", "#chat"]
    }
  ]
}
```
`/srv/docker/dreambot/config/config-frontend-discord.json`:

There is a bunch of Discord developer website setup you need to do to get the token for this file, and you will need to give the bot permissions I haven't documented yet.

```json
{
  "triggers": {
    "!gpt": "backend.gpt",
    "!dream": "backend.a1111",
    "!comfy": "backend.comfyui"
  },
  "nats_uri": [
    "nats://nats-1:4222",
    "nats://nats-2:4222",
    "nats://nats-3:4222"
  ],
  "output_dir": "/data",
  "discord": {
    "token": "abc123"
  }
}
```
`/srv/docker/dreambot/config/config-backend-gpt.json`:

Sign up for a developer account at https://openai.com and you can get your API key and organization ID from there.

```json
{
  "gpt": {
    "api_key": "sk-abc123",
    "organization": "org-ABC123",
    "model": "gpt-3.5-turbo"
  },
  "nats_uri": [
    "nats://nats-1:4222",
    "nats://nats-2:4222",
    "nats://nats-3:4222"
  ]
}
```
`/srv/docker/dreambot/config/config-backend-a1111.json`:

The A1111 backend supports arguments for choosing between several models. Install the models you want in A1111 and (optionally, but recommended) configure it to keep several models loaded in VRAM at the same time. Experiment until you get results you like, then capture the settings that should be sent with each request in the JSON config:

```json
{
  "a1111": {
    "host": "a1111",
    "port": "9090",
    "default_model": "sdxl",
    "models": {
      "sdxl": {
        "payload": {
          "hr_upscaler": "SwinIR_4x",
          "sampler_name": "Euler a",
          "seed": -1,
          "restore_faces": "True",
          "cfg_scale": 1.0,
          "override_settings": {
            "sd_model_checkpoint": "sd_xl_turbo_1.0_fp16"
          },
          "steps": 1
        }
      },
      "real": {
        "payload": {
          "sampler_name": "DPM++ SDE",
          "scheduler": "Karras",
          "seed": -1,
          "restore_faces": "True",
          "steps": 5,
          "cfg_scale": 1.6,
          "override_settings": {
            "sd_model_checkpoint": "realisticVisionV60B1_v51HyperVAE"
          }
        }
      }
    }
  },
  "nats_uri": [
    "nats://nats-1:4222",
    "nats://nats-2:4222",
    "nats://nats-3:4222"
  ]
}
```
`/srv/docker/dreambot/config/config-backend-comfyui.json`:

The ComfyUI backend uses workflow definitions for image generation. You can export workflows from the ComfyUI web interface (in API format) and embed them in the configuration. The backend automatically replaces the text prompt in the workflow with the user's prompt.

```json
{
  "comfyui": {
    "host": "comfyui",
    "port": "8188",
    "default_workflow": "txt2img",
    "workflows": {
      "txt2img": {
        "workflow": {
          "3": {
            "class_type": "KSampler",
            "inputs": {
              "seed": -1,
              "steps": 20,
              "cfg": 8.0,
              "sampler_name": "euler",
              "scheduler": "normal",
              "denoise": 1.0,
              "model": ["4", 0],
              "positive": ["6", 0],
              "negative": ["7", 0],
              "latent_image": ["5", 0]
            }
          },
          "4": {
            "class_type": "CheckpointLoaderSimple",
            "inputs": {
              "ckpt_name": "sd_xl_base_1.0.safetensors"
            }
          },
          "5": {
            "class_type": "EmptyLatentImage",
            "inputs": {
              "width": 512,
              "height": 512,
              "batch_size": 1
            }
          },
          "6": {
            "class_type": "CLIPTextEncode",
            "inputs": {
              "text": "beautiful scenery",
              "clip": ["4", 1]
            }
          },
          "7": {
            "class_type": "CLIPTextEncode",
            "inputs": {
              "text": "text, watermark",
              "clip": ["4", 1]
            }
          },
          "8": {
            "class_type": "VAEDecode",
            "inputs": {
              "samples": ["3", 0],
              "vae": ["4", 2]
            }
          },
          "9": {
            "class_type": "SaveImage",
            "inputs": {
              "filename_prefix": "ComfyUI",
              "images": ["8", 0]
            }
          }
        }
      }
    }
  },
  "nats_uri": [
    "nats://nats-1:4222",
    "nats://nats-2:4222",
    "nats://nats-3:4222"
  ]
}
```
`/srv/docker/dreambot/config/config-backend-commands.json`:

```json
{
  "nats_uri": [
    "nats://nats-1:4222",
    "nats://nats-2:4222",
    "nats://nats-3:4222"
  ],
  "nats_queue_name": "!commands"
}
```
Ansible playbook to install/configure CUDA (required for A1111 backend)
To run a container that can access GPUs for CUDA, you need to install the NVIDIA drivers and the nvidia-container-toolkit package, and then configure the Docker daemon to use the NVIDIA runtime:
```yaml
- name: Install cuda keyring
  ansible.builtin.apt:
    deb: https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.0-1_all.deb
    state: present

- name: Install nvidia/cuda packages
  ansible.builtin.apt:
    name: "{{ item }}"
    update_cache: yes
  with_items:
    - nvidia-headless-530
    - nvidia-utils-530
    - cuda-toolkit

# Based on: https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#docker
- name: Install nvidia-container-toolkit apt key
  ansible.builtin.shell:
    cmd: curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg
    creates: /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg

- name: Add the nvidia-container-toolkit apt repo
  ansible.builtin.shell:
    cmd: curl -s -L https://nvidia.github.io/libnvidia-container/ubuntu22.04/libnvidia-container.list | sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
    creates: /etc/apt/sources.list.d/nvidia-container-toolkit.list

- name: Install the nvidia-container-toolkit package
  ansible.builtin.apt:
    name: nvidia-container-toolkit
    update_cache: yes

- name: Configure nvidia-container-toolkit runtime
  ansible.builtin.shell:
    cmd: nvidia-ctk runtime configure --runtime=docker
```

Then restart the Docker daemon, and you should be able to run containers that access the GPUs.
Docker Compose for deploying the Dreambot infrastructure
```yaml
networks:
  dreambot:
    external: false
    driver: bridge

services:
  # Deploy a NATS cluster
  nats-1:
    hostname: nats-1
    image: nats
    restart: unless-stopped
    networks:
      - dreambot
    expose:
      - "8222"
    volumes:
      - /srv/docker/nats/nats-1:/data
      - /srv/docker/nats/config:/config
    entrypoint: /nats-server
    command: --name nats-1 -c /config/nats-server.conf

  nats-2:
    hostname: nats-2
    image: nats
    restart: unless-stopped
    networks:
      - dreambot
    expose:
      - "8222"
    volumes:
      - /srv/docker/nats/nats-2:/data
      - /srv/docker/nats/config:/config
    entrypoint: /nats-server
    command: --name nats-2 -c /config/nats-server.conf

  nats-3:
    hostname: nats-3
    image: nats
    restart: unless-stopped
    networks:
      - dreambot
    expose:
      - "8222"
    volumes:
      - /srv/docker/nats/nats-3:/data
      - /srv/docker/nats/config:/config
    entrypoint: /nats-server
    command: --name nats-3 -c /config/nats-server.conf

  # Deploy A1111 with access to our GPU
  a1111:
    hostname: a1111
    image: ghcr.io/neggles/sd-webui-docker:latest
    restart: unless-stopped
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: 1
              capabilities: [gpu]
    networks:
      - dreambot
    environment:
      CLI_ARGS: "--skip-version-check --allow-code --enable-insecure-extension-access --api --xformers --opt-channelslast"
      SD_WEBUI_VARIANT: "default"
      # Make TQDM behave a little better
      PYTHONUNBUFFERED: "1"
      TERM: "vt100"
    expose:
      - "9090"
    volumes:
      - /srv/docker/a1111/data:/data
      # Where A1111 stores outputs if you talk to it directly (e.g. via its
      # web interface) rather than through Dreambot
      - /srv/public/outputs:/outputs

  # Deploy the Dreambot frontends
  dreambot-frontend-irc:
    hostname: dreambot-frontend-irc
    image: ghcr.io/cmsj/dreambot:latest
    restart: unless-stopped
    networks:
      - dreambot
    volumes:
      - /srv/docker/dreambot/config:/config
      - /srv/public/dreams:/data
    command: dreambot_frontend_irc -c /config/config-frontend-irc.json

  dreambot-frontend-discord:
    hostname: dreambot-frontend-discord
    image: ghcr.io/cmsj/dreambot:latest
    restart: unless-stopped
    networks:
      - dreambot
    volumes:
      - /srv/docker/dreambot/config:/config
      - /srv/public/dreams:/data
    command: dreambot_frontend_discord -c /config/config-frontend-discord.json

  # Deploy the Dreambot backends
  dreambot-backend-gpt:
    hostname: dreambot_backend_gpt
    image: ghcr.io/cmsj/dreambot:latest
    restart: unless-stopped
    networks:
      - dreambot
    volumes:
      - /srv/docker/dreambot/config:/config
      - /srv/public/dreams:/data
    command: dreambot_backend_gpt -c /config/config-backend-gpt.json

  dreambot-backend-a1111:
    hostname: dreambot_backend_a1111
    image: ghcr.io/cmsj/dreambot:latest
    restart: unless-stopped
    networks:
      - dreambot
    volumes:
      - /srv/docker/dreambot/config:/config
      - /srv/public/dreams:/data
    command: dreambot_backend_a1111 -c /config/config-backend-a1111.json
```
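The Compose file above stops at the A1111 backend. The ComfyUI and command backends follow the same pattern; here is a sketch, assuming the entrypoint names follow the same convention as the services above (you would also need a comfyui service exposing the ComfyUI API on port 8188, which is not covered here):

```yaml
  # Hypothetical services following the pattern above; the command names
  # are assumptions based on the existing dreambot_backend_* convention
  dreambot-backend-comfyui:
    hostname: dreambot_backend_comfyui
    image: ghcr.io/cmsj/dreambot:latest
    restart: unless-stopped
    networks:
      - dreambot
    volumes:
      - /srv/docker/dreambot/config:/config
      - /srv/public/dreams:/data
    command: dreambot_backend_comfyui -c /config/config-backend-comfyui.json

  dreambot-backend-commands:
    hostname: dreambot_backend_commands
    image: ghcr.io/cmsj/dreambot:latest
    restart: unless-stopped
    networks:
      - dreambot
    volumes:
      - /srv/docker/dreambot/config:/config
      - /srv/public/dreams:/data
    command: dreambot_backend_commands -c /config/config-backend-commands.json
```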
Is this all wildly over-engineered for a chat bot? Yes, it is.