I've been frustrated in attempts to get dunk tanks in images. The LLM generally ignores the prompt: it puts the person facing the wrong way, floats the seat over the middle of the tank, or makes the tank too small.
Gloopsuit said: I've been frustrated in attempts to get dunk tanks in images. The LLM generally ignores the prompt: it puts the person facing the wrong way, floats the seat over the middle of the tank, or makes the tank too small.
You need to think laterally about what you're trying to achieve and what context the model understands. It is highly unlikely that it understands what a gunge tank is, or how a person would sit in one. You need to be as specific as possible within the confines of the prompt. If you are using GPT-4o or Sora, you can sketch an image in pencil and upload it with specific instructions. It is smart enough to understand your sketch and notes.
I'm getting consistent dunk-tank image results in Whisk by specifying an elevated over-the-shoulder perspective. It's the video prompts that are infuriating me: the entire seat assembly constantly falls with the contestants attached to it, regardless of how I've described it. It seems like the only thing Kling can generate successfully is a credit card deduction.
Edit - It did just manage to do this, but it cost thousands of credits to get there.
Oh, I have been. The prompt has gone into detail such as 'a plastic seat attached to the inside rim of the vat' and 'she is sitting above the gunge with legs over the edge of the seat.' Didn't know you could use a sketch as a basis, will try that.
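For anyone who wants to keep that level of detail organized between attempts, here is a minimal Python sketch that assembles a prompt from separate, explicit pieces (subject, placement details, camera angle, things to avoid). It doesn't call any generator; `build_prompt` and its parameter names are made up for illustration, the subject line is just an example, and whether a given model pays attention to an "Avoid:" clause is an assumption you'd have to test.

```python
def build_prompt(subject, placements, camera, avoid=()):
    """Join explicit scene constraints into a single prompt string.

    Hypothetical helper: the names and the output layout are illustrative,
    not any image generator's required format.
    """
    # Start with the subject, then add each non-empty placement detail.
    parts = [subject.strip()]
    parts += [p.strip() for p in placements if p.strip()]

    # Add the camera description only if one was given.
    if camera.strip():
        parts.append(f"camera: {camera.strip()}")

    prompt = ". ".join(parts) + "."

    # Optional list of failure modes to steer away from (assumption: the
    # model may or may not respect negative instructions).
    if avoid:
        prompt += " Avoid: " + "; ".join(a.strip() for a in avoid) + "."
    return prompt


prompt = build_prompt(
    subject="a woman sitting in a dunk tank",
    placements=[
        "a plastic seat attached to the inside rim of the vat",
        "she is sitting above the gunge with legs over the edge of the seat",
    ],
    camera="elevated over-the-shoulder perspective",
    avoid=[
        "seat floating over the middle of the tank",
        "tank too small for the person",
    ],
)
print(prompt)
```

Keeping each constraint as its own entry makes it easier to change one detail per attempt and see which wording the model actually responds to.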
Generated these with Veo3 via Gemini. The woman takes/slips off one of her shoes and steps on a Cool Whip pie with her bare foot, and shows her creamed sole to the camera.
Working on cleaning up the prompts so that limb movement is consistent.