jozsefszalma/intranet_image_generator
Generating images with diffusion models on a mobile device, with an intranet GPU box as backend

Intranet Image Generator

I wanted to show my family what I do for a living, and what better way to make computer vision interesting than diffusion models?

I could have just shown them DALL-E 2, Midjourney, or any of the countless mobile apps already built on Stable Diffusion. However, by building it myself I can run it for free and retain end-to-end control over every aspect: which model I use, the option to add parental controls to the prompts, and so on.

So, I built:

  • a simple React Native mobile app as the frontend, which takes a prompt as input and displays the generated images
  • a Python backend with a Flask-based API and a diffusion model running inference on an RTX 3090 GPU, with plans to containerize it with Docker; a rough sketch of such a backend follows below
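
As a rough, simplified sketch of what such a Flask + diffusers backend can look like (the endpoint name, payload shape, and model ID here are illustrative choices, not necessarily what the repository implements):

```python
# Minimal, illustrative backend: a Flask endpoint that runs a diffusion model.
import io

import torch
from diffusers import StableDiffusionPipeline
from flask import Flask, request, send_file

app = Flask(__name__)

# Load the pipeline once at startup and move it to the GPU.
pipe = StableDiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-1", torch_dtype=torch.float16
).to("cuda")

@app.route("/generate", methods=["POST"])
def generate():
    prompt = request.json["prompt"]
    image = pipe(prompt).images[0]   # inference runs on the GPU
    buffer = io.BytesIO()
    image.save(buffer, format="PNG")
    buffer.seek(0)
    return send_file(buffer, mimetype="image/png")

if __name__ == "__main__":
    # Bind to all interfaces so devices on the intranet can reach port 5000.
    app.run(host="0.0.0.0", port=5000)
```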

Work in progress!

How it works:

[Screenshots of the mobile client: a scary dragon breathing fire; a magical unicorn with wings; a pirate ship at sea with black sails in stormy weather]
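
The flow is simple: the app sends the prompt to the backend over the local network and displays the image that comes back. From any intranet machine, an equivalent request might look like the following (the IP address, endpoint path, and payload shape are assumptions that mirror the backend sketch above):

```python
# Illustrative client request; IP, port, path, and payload shape are assumptions.
import requests

response = requests.post(
    "http://192.168.1.50:5000/generate",   # fixed LAN IP of the GPU box
    json={"prompt": "pirate ship at sea, black sails, stormy weather"},
    timeout=120,                            # diffusion inference can take a while
)
with open("pirate_ship.png", "wb") as f:
    f.write(response.content)
```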

Set up:

  1. Set the environment variables on the backend (e.g. in a .env file); a configuration-loading sketch follows this list:
  • HF_KEY: your Hugging Face API key
  • IMG_DIR_WIN and IMG_DIR_DOCKER: locations to store the generated images (for the Windows host and the Docker container, respectively)
  • PROMPT_PREFIX and PROMPT_SUFFIX: optional, if you want to prefix or suffix the prompt with anything (e.g. "cartoonish", "kid-friendly")
  • NEGATIVE_PROMPT: optional, but should be used for parental controls (e.g. add "scary" to prevent convergence on scary images; the same goes for NSFW concepts, etc.)
  • MODEL_ID: optional Hugging Face model ID; defaults to Stable Diffusion 2.1 if not defined
  2. Set a fixed LAN IP address on the machine running the backend and expose port 5000 to your intranet.

  3. Set the IP address of the backend in the mobile app under the kebab menu (look for ⋮ in the upper right corner).

  4. As of now, to get the mobile app running, you need to set up a React Native development environment, compile the app from source, and load the .apk onto an Android device using developer mode.
    Here is a handy guide: https://reactnative.dev/docs/environment-setup?guide=native
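
As a rough illustration of how the backend can pick up this configuration (the variable names come from the list above; the use of python-dotenv and the exact default model ID are assumptions):

```python
# Illustrative configuration loading; variable names match the setup list above.
import os

from dotenv import load_dotenv

load_dotenv()  # read HF_KEY, IMG_DIR_*, PROMPT_*, etc. from a local .env file

HF_KEY = os.environ["HF_KEY"]  # required: Hugging Face API key
IMG_DIR = os.getenv("IMG_DIR_DOCKER") or os.getenv("IMG_DIR_WIN", "./images")
PROMPT_PREFIX = os.getenv("PROMPT_PREFIX", "")
PROMPT_SUFFIX = os.getenv("PROMPT_SUFFIX", "")
NEGATIVE_PROMPT = os.getenv("NEGATIVE_PROMPT", "")
MODEL_ID = os.getenv("MODEL_ID", "stabilityai/stable-diffusion-2-1")  # SD 2.1 default

def build_prompt(user_prompt: str) -> str:
    """Wrap the user's prompt with the configured prefix and suffix."""
    return f"{PROMPT_PREFIX} {user_prompt} {PROMPT_SUFFIX}".strip()
```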

Known issues and Disclaimers:

  • This is a hobby prototype that takes a fair amount of technical skill to get working and is not production ready. You shouldn't use it if you don't understand the technology involved.
    Read the license terms, especially Section 5 – Disclaimer of Warranties and Limitation of Liability.
  • I couldn't test whether Docker works at all, as my NVIDIA drivers do not want to play nicely with Docker under the Windows Subsystem for Linux (WSL).
  • The mobile app still has the default Android icon and is named "mobile_client"
  • Security is minimal (no attempt is made to sanitize inputs or authenticate clients); the backend is only intended to be used behind a NAT router for demo purposes and is not ready to be exposed to the Internet.
  • I recommend setting up an extensive negative prompt as a parental control (see the sketch after this list), in addition to using the Stability safety filter, and not letting kids play with diffusion models without adult supervision, as most of these models will produce age-inappropriate content given minimal effort and curiosity.
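
To make the negative-prompt idea concrete, a minimal sketch using the diffusers API (the model ID and the list of terms are illustrative examples, not a complete or reliable filter):

```python
# Illustrative only: steering generation away from unwanted concepts with a
# negative prompt. A real deployment would use a much longer term list.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-1", torch_dtype=torch.float16
).to("cuda")

image = pipe(
    prompt="a friendly dragon made of clouds, cartoonish, kid-friendly",
    negative_prompt="scary, horror, gore, nsfw, violence",
).images[0]
image.save("dragon.png")
```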

License:

Copyright 2023, Jozsef Szalma
Creative Commons Attribution-NonCommercial 4.0 International Public License
https://creativecommons.org/licenses/by-nc/4.0/legalcode