A dog without a tail.
Photo credit: theFreesheet/Google ImageFX

Generative AI might be able to create stunning visual spectacles, but it still struggles with basic linguistic tests, such as understanding what “not” means.

A new study from the University of Liège reveals that while platforms like Midjourney and DALL-E can produce “aesthetic feats,” they frequently commit “beginner’s mistakes” when tasked with simple logical instructions.

The interdisciplinary research found that these systems struggle fundamentally with negation and temporal sequences. When asked to generate “a dog without a tail,” the AI typically produces a dog with a tail or hides the rear of the animal behind a frame, failing to comprehend the linguistic concept of absence.

“Our approach is based on a series of rigorous tests,” said Maria Giulia Dondero, semiotician at the University of Liège. “We submitted very specific requests to these two AI systems and analysed the images produced according to criteria from the humanities, such as the arrangement of shapes, colours, gazes, the specific dynamism of the still image, the rhythm of its deployment, etc.”

Beauty over brains

The study highlights a significant gap between visual polish and semantic understanding. The AI models frequently translated simple actions, such as “fighting,” into dance scenes and failed to distinguish between temporal states, like “starting to eat” versus “having finished eating.”

The researchers validated these findings through replication, generating up to 50 images per prompt to demonstrate that the errors were statistical patterns rather than one-off glitches.

They found distinct “personalities” in the failures: Midjourney favours “aestheticised” renderings that often ignore strict instructions in favour of visually appealing textures, while DALL-E offers better compositional control but varies widely in object orientation.

“AI tools are not simply automatic tools,” said Enzo D’Armenio, the article’s lead author and a junior professor at the University of Lorraine. “They translate our words according to their own logic, influenced by their databases and algorithms.”

Leave a Reply

Your email address will not be published. Required fields are marked *

You May Also Like

Journalism schools lack consistent AI strategy as scattershot policies confuse

Artificial intelligence is becoming deeply embedded in journalistic workflows, yet new research…

AI uses rapid facial ageing to predict cancer survival chances

When battling cancer, the speed at which your face physically ages could…

Lower-income nations lead the world in digital health literacy

It is a common assumption that national wealth automatically translates into stronger…

AI chatbots lose up to 30 per cent accuracy when trained to be friendly

Training chatbots to sound warmer and more empathetic makes them significantly less…

AI ‘photo booth’ reads the faces of lab mice to detect their hidden pain

Assessing pain in laboratory mice is notoriously difficult, often relying on subjective…

Your AI chatbot addiction is a deliberate corporate design, exploiting loneliness

Millions of people are developing severe, life-altering addictions to artificial intelligence chatbots…