Welcome to an exciting exploration of the innovative new GPT Builder tool (by OpenAI) and my ambitious first project with it — the Visual Consistent Character Creator, an attempt to unlock the holy grail of the text to image sphere. This pioneering tool represents a major leap forward in AI-assisted creativity, combining GPT Builder’s capabilities with the user’s unique imagination.
What exactly does it do? The GPT Builder assists you in creating your own personal AI assistant tailored to your needs – in a nut shell.
In this case, my aim was not just to create a character generator, but to come up with a tool that allows me to construct aesthetically pleasing and above all consistent looking AI-generated characters. As a cherry on top I had the GPT Builder formulate the prompt in a Midjourney readable syntax.
The Innovative Process: Enabling Detail and Consistency
My Visual Consistent Character Creator enables this through three key capabilities:
- Comprehensive trait selection — this allows for diverse and highly customised characters.
- Sequential, step-by-step trait selection — this ensures (or at least strives to achieve) coherence and precision in line with GPT Builder’s innovative approach.
- Quality check through AI-enhanced portraits using DALL-E — as a first step to ensure some level of consistency has been achieved before moving to Midjourney (or other text to image generators).
By combining these strengths, my tool can cater to a wide spectrum of creative needs while maintaining visual consistency and artistic flair. Let’s have a look at the process.
The Innovative Process: Detail and Consistency
1. Comprehensive Trait Selection At the outset, I focused on defining a wide array of character traits, mainly physical attributes (and some secret ingredients I am not revealing). I created a template for this, a matrix of sorts. This was done keeping in mind the need to match the high standards of character portrayal as seen in Midjourney. Every trait was carefully chosen to ensure that my GPT Builder could cater to diverse creative needs while maintaining a high level of detail and visual consistency.
2. Sequential Interactivity for Enhanced Precision A standout feature of the Visual Consistent Character Creator is its methodical, step-by-step trait definition process. Reflecting the innovative approach of the GPT Builder, this process ensures that each character trait is not only distinct but also contributes to a coherent overall portrait. This phase was somewhat tricky as the GPT Builder, although always complying, not always “remembered” my instructions and occasionnally would show signs of hallucinations.
3. AI-Enhanced Portrait Prompt Meeting Midjourney Syntax In the final step, after the Visual Consistent Character Creator has summarised the character’s traits and the user has confirmed them I instructed the tool to generate two types of portraits — a detailed close-up and a full-body image with DALL-E which conveniently sits in this workflow as it is part of OpenAI’s ecoystem, so the user never has to leave the browser window. At the very end the Visual Consistent Character Creator creates a prompt the user can copy&paste into Midjourney.
After some testing however, I have to say that the consistency of the characters is quite impressive, but only when generating with DALL-E. When exporting the prompts to Midjourney, the consistency is less evident. A major advantage of integrated DALL-E in ChatGPT is the possibility to discuss the result with GPT and ask for modifications on the generated image. This is huge!
The Potential Impacts: Opportunities and Challenges
By significantly enhancing the character design process, a visual character designer assistant like I just build with GPT Builder could revolutionise creative sectors like gaming, animation and graphic novels. The ability to quickly generate consistently looking, detailed and high-quality characters could greatly accelerate production and encourage more experimentation.
However, this also risks reducing human input in creative roles. As AI becomes increasingly capable of mimicking human artistry, important questions around originality and authenticity arise. While AI art tools offer exciting new opportunities, maintaining a balanced perspective regarding their applications will be vital so that we can benefit from their potential while responsibly managing their risks.
Overall, as an innovative new frontier in AI-assisted creativity, this tool promises to take character design into an exciting new era. By harnessing its capabilities thoughtfully, we can unlock immense creative potential.