For example, I will write a prompt in my own words, then it'll walk me through a few questions to make it better, such as image color (colored, vibrant, neon, black/white, etc), what style image do I want (photorealistic, anime, drawing, digital, 3d, etc), lighting (soft, natural, direct, dim, etc), and whatever other details that might be helpful such as photography style, clothing style, scenery, etc depending on previous answers. I envision each of these steps having an example image for each option to give you an idea of what that means (like what does soft vs natural vs direct lighting look like.)
If this doesn't exist, I do believe this is a very useful product worth building.
https://www.logitech.com/en-us/software/logi-ai-prompt-build...
https://blog.logitech.com/2024/04/17/looking-to-improve-your...
Generate a picture of a house
house.jpg
Let's change it to colonial style
House2.jpg
I don't like the color let's change it to red.
House3.jpg
Redo it using acrylic oil painting.
Etc.
Since we're focusing on images right now, you can't just "build one prompt" to rule them all. Effective prompts are highly dependent on the checkpoint, sampler, Loras, embeddings, etc.
Incidentally, I believe that Foocus tries to use a smaller LLM To try to expand a prompt.
For example, a lot of the stable diffusion 1.5 checkpoints can generate better looking imagery if you add in essentially keywords such as "trending, masterpiece, award-winning, etc.", but they aren't nearly as important in certain SDXL checkpoints.
To me it sounds like you want an interactive wizard / tutor which could be useful, but my point is you'd have to customize or fine-tune it at a very deep level per model (in the case of images generation). It would definitely be a non-trivial amount of work.
That's why ChatGPT DALL-E 3 is so nice, it's end-to-end in terms of allowing simple expression of what you want to see and an iterative workflow. My younger siblings are in elementary school and are able to get great results from it.
Midjourney also lets you upload an unrelated picture as a stylistic reference, so you can find a picture that has the aesthetic you want (ink, watercolor, matte, etc.)
Building something that can actually help those users seems like it would be challenging.