Smaller AIs vs larger AIs.
Photo credit: theFreesheet/Google ImageFX

The industry’s reliance on building ever-larger models is becoming unsustainable due to “enormous computing resources” and energy costs, creating an urgent need for leaner artificial intelligence systems.

Researchers from Shanghai Jiao Tong University have outlined a comprehensive roadmap for “efficient multimodal large language models” that challenges the dominance of centralised cloud infrastructure. The review, published in Visual Intelligence, argues that the future of intelligence depends on reducing computational barriers rather than sheer scale.

“Efficiency determines who can build, deploy, and benefit from multimodal AI,” said Prof. Lizhuang Ma, the team leader of the study.

Critical flaw

The study identifies a critical flaw in current multimodal systems: visual inputs generate long token sequences that dramatically increase complexity. A single image can produce thousands of tokens, making standard models too heavy for practical deployment.

To solve this, researchers propose “vision token compression” to remove redundant data before it reaches the language model. They also advocate for compact language backbones with just one billion to three billion parameters, coupled with lightweight vision encoders.

Beyond simple compression, the review emphasises emerging architectures such as “mixture-of-experts”. These systems selectively activate specific model components to increase capacity without proportionally increasing computation costs.

This shift toward efficiency is expected to “democratise access” to advanced AI capabilities by allowing powerful models to run on mobile devices and edge platforms. The authors suggest this transition will enable real-time applications in healthcare and remote sensing while addressing growing concerns about energy consumption and data privacy.

Leave a Reply

Your email address will not be published. Required fields are marked *

You May Also Like

New theory suggests AI may never be conscious without ‘biological’ chips

The debate over whether Artificial Intelligence can ever truly be conscious has…

You can swear by it: Turning the air blue makes you stronger, psychologists find

Unleashing a string of expletives might be the secret to hitting a…

Super Mario Bros. prescribed as ‘potent antidote’ for adults suffering burnout

Replaying familiar video games like Super Mario Bros. and Yoshi may help…

AI fuels boom in scientific papers but floods journals with ‘mediocre’ research

Artificial intelligence is helping scientists write papers faster than ever before, but…

‘Feral’ AI chatbots are spreading shame and destroying reputations

Artificial intelligence is evolving into a “feral” gossip machine capable of ruining…

New AI personality test reveals chatbots can be programmed with ‘psychosis’

Researchers have developed the first scientifically validated framework to measure the “personality”…