Artificial Unintelligence

17 April 2024
BrianD
A blue sky with a few clouds in it
A man is sitting at a desk using a laptop computer.

Artificial Unintelligence

BrianD
17 April 2024
A blue sky with a few clouds in it

Why does Dall-E struggle to comprehend basic instructions? My conversation with ChatGPT 🤖

Why does Dall-E not understand basic instructions. My lengthy conversation with ChatGPT 🤖

Me: I asked for a futuristic graphic for my tech blog and it must include the word BLOG. It came back with the word BLLLOG. Why?

ChatGPT: "It sounds like there might have been a misunderstanding or error in the input you provided to DALL-E. While DALL-E is capable of generating images based on text prompts, it may not always interpret instructions as intended. In this case, it seems like DALL-E may have misinterpreted the word "BLOG" as "BLLLOG."


To improve the accuracy of DALL-E's responses, you could try providing more specific or detailed instructions. For example, instead of just asking for a "futuristic graphic for my tech blog," you could specify the elements you want in the image, such as futuristic technology, a blog interface, and the word "BLOG" clearly spelled out. This should help reduce the likelihood of misinterpretation.

Bllog is written in white on a black background

Me: "OK, Let me try that"


ChatGPT: "Sure, feel free to provide more specific instructions, and I can help you craft a prompt to submit to DALL-E. Let me know what elements you'd like to include in your futuristic graphic for your tech blog, and we can work on refining the prompt together."


Me: "Actually, yes, This is what I want: I would like a futuristic graphic depicting futuristic technology, a blog interface, and the word "BLOG" clearly spelled out."


ChatGPT: Great! Based on your instructions, here's a refined prompt you can use for DALL-E:


"Generate a futuristic graphic featuring advanced technology integrated with a sleek blog interface. The graphic should prominently display the word 'BLOG' in a clear and easily readable font. Incorporate futuristic elements such as holographic displays, advanced user interfaces, and any other innovative technology. The overall aesthetic should convey a sense of modernity and innovation suitable for a tech-focused blog."


This should provide DALL-E with clearer instructions to generate the kind of image you're looking for. Let me know if you need further assistance!


Me: "Now after that command I get: BLLOOG"

A blloog logo with a globe in the background

ChatGPT: It seems there might still be some confusion in DALL-E's interpretation of the prompt. Let's try refining the instructions further to ensure clarity:


"Create a futuristic graphic that showcases advanced technology alongside a modern blog interface. Please include the word 'BLOG' in a clearly readable font, ensuring accurate spelling. Imagine innovative elements like holographic displays, sleek user interfaces, and other futuristic technologies integrated seamlessly into the design. The overall aesthetic should exude modernity and innovation, fitting for a tech-oriented blog."


With these adjustments, DALL-E should have a clearer understanding of your requirements. Let's see if it produces the desired result.


Me: "Hold tight, I'll check"


ChatGPT: "Take your time! Let me know if you need any further assistance or if you'd like to make any more adjustments to the prompt."


Me: "I wish you could do it, you seem cleverer than DALL-E"


ChatGPT: Thank you for the compliment! While I'm here to help refine instructions and offer suggestions, DALL-E is specifically designed to generate images based on textual prompts. If you encounter any difficulties or need further assistance with generating the desired graphic, feel free to let me know, and I'll do my best to assist you!


Me: "OK, after your last refined instruction DALL-E has spelt it like this "BLOGLAAG"

The word bloglagg is on a black background

ChatGPT: "It seems like there might still be some confusion in DALL-E's understanding of the prompt. Let's simplify and clarify the instructions even further:


"Please create a futuristic graphic representing advanced technology alongside a modern blog interface. Include the word 'BLOG' in a clear and readable font. Incorporate futuristic elements like holographic displays and sleek user interfaces. The overall design should reflect innovation and modernity, suitable for a tech-oriented blog."


Let's see if this adjustment helps DALL-E generate the correct image. If not, we may need to try a different approach or provide even more specific instructions."


Me: "Now it's come back with "bellogoog" and some other crazy text? I've given up.

A purple sign that says g tielog bellogoog

ChatGPT: "I understand your frustration, and I'm sorry to hear that DALL-E is still not producing the desired results. It seems like there may be limitations or challenges in accurately interpreting the prompt. If you've given up on using DALL-E for this task, you might consider seeking assistance from a graphic designer who can create the futuristic graphic you're envisioning. Alternatively, you could try experimenting with different wording or approaches in DALL-E, but I completely understand if you prefer to explore other options. If there's anything else I can assist you with, please feel free to let me know.


Me: "So you're now sending me back to what I did twenty years ago!


ChatGPT: "I apologize if it feels like I'm suggesting a step backward. Technology can sometimes have its limitations, and it's frustrating when it doesn't meet our expectations. While DALL-E represents an exciting advancement in AI, it's not perfect and may struggle with certain tasks.


That said, there have been significant advancements in graphic design tools and platforms over the years, so you might find more success with modern tools compared to what was available twenty years ago. Additionally, if you're open to it, there are online communities and marketplaces where you can connect with skilled designers who can bring your vision to life efficiently.


If you'd like, I can also help brainstorm alternative solutions or provide guidance on how to approach this task differently. Just let me know how you'd like to proceed!


Me: "No, all good. I can do the design and then add the word blog myself"


ChatGPT: "That sounds like a solid plan! Designing the graphic yourself and then adding the word "BLOG" manually is a practical solution. If you need any tips or advice on graphic design tools or techniques, feel free to reach out. Otherwise, I'm here to help with any other questions or tasks you may have in the future."


Me: "Just out of interest, are you cleverer than Dall-E?"


ChatGPT: "In some ways, I may have certain advantages over DALL-E. For example, I can understand and respond to a wide range of queries, provide explanations, offer advice, and engage in conversation on various topics. However, DALL-E excels specifically in generating images based on textual prompts, a task that I'm not designed to perform. So, while I might have different capabilities, it's not accurate to say that I'm "cleverer" overall. Each AI model has its strengths and limitations, and it's essential to leverage the right tool for the task at hand."


Me: "I guess Dall-E's weakness is that it cannot spell?"


ChatGPT: "It's not accurate to say that DALL-E cannot spell, as it is trained on a vast dataset and generally produces text that adheres to standard spelling conventions. However, like any AI model, DALL-E may occasionally make errors or produce unexpected results due to the complexity of language and the inherent limitations of machine learning algorithms.


In the examples you've provided, it seems that DALL-E may have struggled with accurately interpreting and rendering the word "BLOG" according to your instructions. This could be due to various factors, such as the complexity of the prompt, the nuances of language, or limitations in the model's training data.


While DALL-E's performance can be impressive, it's essential to remember that it's not perfect and may require some trial and error to achieve the desired results. Additionally, providing clear and specific instructions can help mitigate potential misunderstandings or errors."


Me: "Can you spell BLOG?"


ChatGPT: "Yes, "BLOG" is spelled as B-L-O-G."


Me: "You see, you had no problem with it..."


ChatGPT: "Indeed, spelling "BLOG" correctly is straightforward for me. However, DALL-E's understanding and interpretation of textual prompts can sometimes be influenced by various factors, including the complexity of the instructions and the nuances of language. While DALL-E is generally adept at generating images based on text, it may encounter difficulties or produce unexpected results in certain cases.


Me: "OK, Let me try something else then. Hang on..."


ChatGPT: "Sure, take your time. Whenever you're ready, feel free to provide the new prompt or let me know how I can assist you further."


Me: "OK, this was my command: Please create a graphic with a white background and Include the word 'BLOG' in a clear and readable font."


ChatGPT: "That sounds like a clear and straightforward prompt. It should be easy for DALL-E to understand and execute. Let's see what it generates this time!"


Me: "Nope, BLLG was it's response"

The word bllg is written in black on a white background.

ChatGPT: "It seems there's still some misunderstanding despite the clear instruction. DALL-E might be struggling with this particular request. Sometimes, simplifying the prompt further or providing additional context can help. Would you like to try refining the prompt, or is there something else I can assist you with?


Me: "No, it's time for my coffee break, or perhaps a double brandy..."


ChatGPT: "A well-deserved break sounds like a great idea! Whether it's coffee or a double brandy, I hope it helps you relax and recharge. If you have any more questions or need assistance later on, don't hesitate to reach out. Enjoy your break!


Me: "Spell the word BLOG again please"


ChatGPT: "Certainly! "BLOG" is spelled as B-L-O-G."


Me: "Well done"


Me: Well, ChatGPT, it seems we've hit a dead end with no real conclusion in sight.


ChatGPT: Ah, sounds like a bit of a pickle! But hey, why not add a dash of frustration and a couple of double brandies to spice things up?


Me: Haha, you've got a point there! Who needs closure when you've got a good drink and a sense of humour, right?


ChatGPT: Absolutely! Here's to embracing AI's unpredictability and enjoying the ride, brandy and all! Cheers!



More Posts

by Brian Hallam 17 September 2025
Is Your Website Ready for AI Search? Here’s Why It Matters
A shield with VP written inside it.
by BrianD 28 April 2024
It's like a secret tunnel that takes your internet traffic and shrouds it in a cloak of invisibility. 🔒🕵️‍♂️
A close up of a circuit board with a light coming out of it.
by "David the Tech Nerd" 17 April 2024
Why Surge Protectors Are Your Devices' Best Friends! ⚡️🔥
Show More

Share this Post

Shopify eCommerce Stores

Shopify

eCommerce Stores

Reach customers worldwide and boost sales with Shopify's versatile platform. Whether you're selling online, in-store, or across borders, our marketing tools, social integrations, and sales channels will help you connect with buyers and showcase your products effectively.


Join our global commerce community and take advantage of our limited-time offer: pay just $1 per month for the first three months. Don't miss out on this incredible opportunity!

Start A Free Trial
A person is using a tablet to pay for a pair of shoes.
A person is using a tablet to pay for a pair of shoes.

Reach customers worldwide and boost sales with Shopify's versatile platform. Whether you're selling online, in-store, or across borders, our marketing tools, social integrations, and sales channels will help you connect with buyers and showcase your products effectively.


Join our global commerce community and take advantage of our limited-time offer: pay just $1 per month for the first three months. Don't miss out on this incredible opportunity!

Start A Free Trial