AI apps are popping up everywhere. We’ve all played around with ChatGPT to write an email for us, used a Text to Image app to generate a silly image, or created a new headshot with an AI app. But users are seeing varied success with many of these apps. One of the reasons? The prompts they’re entering.
Let’s take a look at Canva’s Text to Image app. The app has us enter a text prompt and choose a selected style. We’ll go ahead and use a prompt like “a warrior panda riding into battle”.
After some AI Magic, Canva gives us 4 results.
Now these results aren’t bad. We can clearly see this is a panda that is a warrior, but there doesn’t seem to be much of a battle around. Or any riding for that matter.
One thing most users do is focus on the subject of the image. This helps create a high-fidelity subject, but it’s missing lots of the details that make it realistic. When entering a prompt, describe the details of the image. Think of things like “blue eyes” or “wearing a red scarf”, additionally describing the form, style, and composition. Many apps include a Style selector to make sure users do this. We can use more selectors to make sure our users are including more detail. And lastly, encourage users to use more adjectives and adverbs. This will help add more complexity to our prompt without making it overly wordy.
This is where something like our Prompt Improver app comes into play!
Encouraging us to add more descriptive language based on the prompt, the Prompt Improver app suggested some changes, and as we add them, removes them from the list of suggestions. This is done using a Parts of Speech (pos) npm package that classifies the words in the sentence as a user types in their prompt. You can use these parts of speech to ensure users are entering critical things like adjectives and adverbs, and suggest prompt improvements.
Now let’s take our improved prompt, “a warrior panda riding a horse into a chaotic battle scene”, and see what Text to Image returns for us.
Now we have some more details showing up in our image, like the people in the background and the panda on a panda-horse.
We know that as the developer training your models, you know them best. You’ll know what kind of prompts work best, and what data users should be entering to achieve the best results. Make sure to share your knowledge with users! Add some tool tips or info text to your app UI to inform users about key things to include (or avoid!) in their prompt. Sometimes adding cues to what key descriptors users are missing in their prompt can go a long way with garnering user love for your app.
Happy AI Creating!