I've hit a workflow roadblock and I'm hoping someone who's already solved this can point me in the right direction.
My current setup is:
- Google Flow for image generation
- GPT subscription for GPT-Image 2 access
- Additional API credits from third-party OpenAI-compatible providers
What I'm trying to achieve is a workflow similar to Flow, but using GPT-Image 2 through API credits rather than buying another platform subscription.
The challenge is that while Flow gives great control, I still spend a lot of time dealing with facial consistency issues across generations. GPT-Image 2 seems noticeably stronger in that area, so I'd like to build my image workflow around it.
I've already tested several clients/interfaces:
- Chatbox
- LobeChat
- OpenRouter Chat
- TypingMind
- Cherry Studio
- Jan
Most of them work well for chat, but I haven't found one that provides a strong image-generation workflow with:
- custom API endpoint support
- GPT-Image 2 access
- image-first UI
- prompt iteration/versioning
- multi-image generation and comparison
I'm not necessarily looking for the best platform. I'm trying to understand whether a client that supports this workflow already exists, or if most people using GPT-Image 2 via API are building their own interface.
For those generating images through API providers rather than platform subscriptions, what does your setup look like?
[link] [comments]