Skip to content

Commit e9e65e9

Browse files
committed
docs: update README with QNN and fal.ai features
- Add fal.ai as supported backend with FLUX models - Document local diffusion options (ONNX, MediaPipe, QNN) - Add QNN setup instructions for Snapdragon devices - Update feature list with ADetailer, Hires.Fix, Forge modules - Document gallery improvements and file-based storage
1 parent 373edf1 commit e9e65e9

1 file changed

Lines changed: 56 additions & 10 deletions

File tree

README.md

Lines changed: 56 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,11 @@ Stable Diffusion AI (SDAI) is an easy-to-use app that:
2929
- Can use server environment powered by [Hugging Face Inference API](https://huggingface.co/docs/api-inference/quicktour).
3030
- Can use server environment powered by [OpenAI](https://platform.openai.com/docs/api-reference/images) (DALL-E-2, DALL-E-3).
3131
- Can use server environment powered by [Stability AI](https://platform.stability.ai/).
32+
- Can use server environment powered by [Fal.AI](https://fal.ai/).
3233
- Can use local environment powered by LocalDiffusion (Beta)
34+
- Microsoft ONNX Runtime (txt2img)
35+
- Google AI MediaPipe (txt2img)
36+
- Qualcomm QNN with NPU acceleration (txt2img, img2img)
3337
- Supports original Txt2Img, Img2Img modes
3438
- **Positive** and **negative** prompt support
3539
- Support dynamic **size** in range from 64 to 2048 px (for width and height)
@@ -52,9 +56,14 @@ Stable Diffusion AI (SDAI) is an easy-to-use app that:
5256
- Textual inversion picker (for A1111)
5357
- Hypernetworks picker (for A1111)
5458
- SD Model picker (for A1111)
59+
- Forge Modules support (for A1111/Forge)
60+
- ADetailer (After Detailer) support for enhanced face/hand/body fixing (for A1111/Forge)
61+
- Hires.Fix support for high-resolution upscaling (for A1111/Forge)
5562
- In-app Gallery, stored locally, contains all AI generated images
56-
- Displays generated images grid
57-
- Image detail view: Zoom, Pinch, Generation Info.
63+
- Displays generated images grid with pagination
64+
- Image detail view: Advanced zoom/pan controls, Pinch-to-zoom, Generation Info
65+
- Navigation between images with swipe gestures
66+
- InPaint editor with zoom/pan support for precise mask drawing
5867
- Export all gallery to **.zip** file
5968
- Export single photo to **.zip** file
6069
- Settings
@@ -63,6 +72,7 @@ Stable Diffusion AI (SDAI) is an easy-to-use app that:
6372
- Server availability monitoring (http-ping method)
6473
- Enable/Disable auto-saving of generated images
6574
- Enable/Disable saving generated images to `Download/SDAI` android MediaStore folder
75+
- Optimized file-based media storage (faster loading and reduced memory usage)
6676
- Clear gallery / app cache
6777

6878
## Setup instruction
@@ -71,7 +81,7 @@ Stable Diffusion AI (SDAI) is an easy-to-use app that:
7181

7282
This requires you to have the AUTOMATIC1111 WebUI that is running in server mode.
7383

74-
You can have it running either on your own hardware with modern GPU from Nvidia or AMD, or running it using Google Colab.
84+
You can have it running either on your own hardware with modern GPU from Nvidia or AMD, or running it using Google Colab.
7585

7686
1. Follow the setup instructions on [Stable-Diffusion-WebUI](https://github.com/AUTOMATIC1111/stable-diffusion-webui) repository.
7787
2. Add the arguments `--api --listen` to the command line arguments of WebUI launch script.
@@ -90,7 +100,7 @@ Please refer to the [SwarmUI documentation](https://github.com/mcmonkeyprojects/
90100

91101
### Option 3: Use AI Horde
92102

93-
[AI Horde](https://stablehorde.net/) is a crowdsourced distributed cluster of Image generation workers and text generation workers.
103+
[AI Horde](https://stablehorde.net/) is a crowdsourced distributed cluster of Image generation workers and text generation workers.
94104

95105
AI Horde requires to use API KEY, this mobile app allows to use either default API KEY (which is "0000000000"), or type your own. You can sign up and get your own AI Horde API KEY [here](https://stablehorde.net/register).
96106

@@ -102,7 +112,7 @@ Hugging Face Inference requires to use API KEY, which can be created in [Hugging
102112

103113
### Option 5: OpenAI
104114

105-
OpenAI provides a service for text to image generation using [DALLE-2](https://openai.com/dall-e-2) or [DALLE-3](https://openai.com/dall-e-3) models. This service is paid.
115+
OpenAI provides a service for text to image generation using [DALLE-2](https://openai.com/dall-e-2) or [DALLE-3](https://openai.com/dall-e-3) models. This service is paid.
106116

107117
OpenAI requires to use API KEY, which can be created in [OpenAI API Key settings](https://platform.openai.com/api-keys).
108118

@@ -112,19 +122,55 @@ OpenAI requires to use API KEY, which can be created in [OpenAI API Key settings
112122

113123
StabilityAI requires to use API KEY, which can be created in [API Keys page](https://platform.stability.ai/account/keys).
114124

115-
### Option 7: Local Diffusion Microsoft ONNX Runtime (Beta)
125+
### Option 7: Fal.AI
126+
127+
[Fal.AI](https://fal.ai/) is a modern AI generation service with support for latest FLUX models family.
128+
129+
**Built-in models:**
130+
- FLUX.1-dev (high quality, slower)
131+
- FLUX.1-schnell (fast generation)
132+
- FLUX-LoRA (custom LoRA support)
133+
- FLUX-2 (latest generation)
134+
- FLUX-Kontext (context-aware generation)
135+
136+
**Features:**
137+
- Import OpenAPI.json from fal.ai to add new models
138+
- Dynamic form generation based on model parameters
139+
- Support for custom endpoints
140+
141+
Fal.AI requires to use API KEY, which can be created in [Fal.AI dashboard](https://fal.ai/dashboard/keys).
142+
143+
### Option 8: Local Diffusion Microsoft ONNX Runtime (Beta)
116144

117145
Only **txt2img** mode is supported.
118146

119-
Allows to use phone resources to generate images.
147+
Allows to use phone resources to generate images using Microsoft ONNX Runtime.
120148

121-
### Option 8: Local Diffusion Google AI MediaPipe (Beta)
149+
### Option 9: Local Diffusion Google AI MediaPipe (Beta)
122150

123151
Available only in **playstore** and **full** flavors.
124152

125153
Only **txt2img** mode is supported.
126154

127-
Allows to use phone resources to generate images.
155+
Allows to use phone resources to generate images using Google MediaPipe framework.
156+
157+
### Option 10: Local Diffusion Qualcomm QNN (Beta)
158+
159+
Available only in **playstore** and **full** flavors.
160+
161+
Supports both **txt2img** and **img2img** modes.
162+
163+
Uses Qualcomm QNN SDK with NPU acceleration (HTP) and MNN backend for fast on-device Stable Diffusion generation.
164+
165+
**Requirements:**
166+
- Snapdragon 8 Gen 1 or newer chipset
167+
- Supports both NPU-accelerated models (8Gen1, 8Gen2/3/4) and CPU/MNN models
168+
169+
**Features:**
170+
- Hardware NPU acceleration for faster generation
171+
- Multiple pre-built models available for download
172+
- Support for custom models (scan local folders)
173+
- Lower power consumption compared to CPU-only backends
128174

129175
## Supported languages
130176

@@ -150,7 +196,7 @@ The difference between SDAI app flavors are described at the project wiki page [
150196

151197
## Donate
152198

153-
This software is open source, provided with no warranty, and you are welcome to use it for free.
199+
This software is open source, provided with no warranty, and you are welcome to use it for free.
154200

155201
In case you find this software valuable, and you'd like to say thanks and show a little support, here is the button:
156202

0 commit comments

Comments
 (0)