What you’ll build
A remote ComfyUI instance accessible from your browser, running on a dedicated GPU with all models and custom nodes pre-installed. You can connect from anywhere and generate images without taxing your local hardware.GPU sizing
| Model | Min VRAM | Recommended GPU |
|---|---|---|
| SDXL | 12 GB | RTX 4090 |
| FLUX.1 Schnell | 16 GB | RTX 4090 |
| FLUX.1 Dev / Pro | 24 GB | RTX 4090, L40S |
| FLUX.2 Pro (4MP) | 32 GB+ | L40S, A100 |
| Multiple models loaded | 48 GB+ | L40S, A100 |
Setup (CLI)
1. Deploy a GPU instance
2. Install ComfyUI
3. Download models
4. Install popular custom nodes
5. Start ComfyUI
6. Connect
http://<INSTANCE_IP>:8188 in your browser. ComfyUI is now running on your cloud GPU.
Persistent setup with storage
Downloading models every time you deploy a new instance is slow. Use a storage volume to persist your models:Using MCP
“Deploy an RTX 4090 instance called comfyui. Clone the ComfyUI repo, install requirements, download the FLUX.1 Schnell checkpoint, and start the server on port 8188. Give me the URL when it’s ready.”The agent runs through the full setup and returns the URL to open in your browser.
Tips
- Save workflows as JSON — export from ComfyUI and store in your repo for reproducible generation
- Use FP8 quantized models for FLUX.2 — 40% less VRAM, similar quality
- Kill the instance when done —
runcrate instances delete comfyuistops billing immediately - Use the API mode — ComfyUI has a REST API at
/promptfor programmatic batch generation