Business Daily Media

PolyU-led research reveals that sensory and motor inputs help large language models represent complex concepts

HONG KONG SAR - Media OutReach Newswire - 9 June 2025 - Can one truly understand what "flower" means without smelling a rose, touching a daisy or walking through a field of wildflowers? This question is at the core of a rich debate in philosophy and cognitive science.

While embodied cognition theorists argue that physical, sensory experience is essential to concept formation, studies of the rapidly evolving large language models (LLMs) suggest that language alone can build deep, meaningful representations of the world.

A research team led by Prof. Li Ping, Sin Wai Kin Foundation Professor in Humanities and Technology, Dean of the PolyU Faculty of Humanities and Associate Director of the PolyU-Hangzhou Technology and Innovation Research Institute, explored the similarities between large language models and human representations, shedding new light on the extent to which language alone can shape the formation and learning of complex conceptual knowledge.

By exploring the similarities between LLMs and human representations, researchers at The Hong Kong Polytechnic University (PolyU) and their collaborators have shed new light on the extent to which language alone can shape the formation and learning of complex conceptual knowledge. Their findings also revealed how the use of sensory input for grounding or embodiment – connecting abstract with concrete concepts during learning – affects the ability of LLMs to understand complex concepts and form human-like representations. The study, in collaboration with scholars from Ohio State University, Princeton University and City University of New York, was recently published in Nature Human Behaviour.

Led by Prof. LI Ping, Sin Wai Kin Foundation Professor in Humanities and Technology, Dean of the PolyU Faculty of Humanities and Associate Director of the PolyU-Hangzhou Technology and Innovation Research Institute, the research team selected conceptual word ratings produced by state-of-the-art LLMs, namely ChatGPT (GPT-3.5, GPT-4) and Google LLMs (PaLM and Gemini). They compared them with human-generated word ratings of around 4,500 words across non-sensorimotor (e.g., valence, concreteness, imageability), sensory (e.g., visual, olfactory, auditory) and motor domains (e.g., foot/leg, mouth/throat) from the highly reliable and validated Glasgow Norms and Lancaster Norms datasets.

The research team first compared pairs of data from individual humans and individual LLM runs to discover the similarity between word ratings across each dimension in the three domains, using results from human-human pairs as the benchmark. This approach could, for instance, highlight to what extent humans and LLMs agree that certain concepts are more concrete than others. However, such analyses might overlook how multiple dimensions jointly contribute to the overall representation of a word. For example, the word pair "pasta" and "roses" might receive equally high olfactory ratings, but "pasta" is in fact more similar to "noodles" than to "roses" when considering appearance and taste. The team therefore conducted representational similarity analysis of each word as a vector along multiple attributes of non-sensorimotor, sensory and motor dimensions for a more complete comparison between humans and LLMs.
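The idea behind representational similarity analysis can be illustrated with a minimal sketch. It is not the study's actual pipeline: the ratings below are invented toy values (not taken from the Glasgow or Lancaster Norms), the three dimensions are chosen only for illustration, and the use of Euclidean distance and Pearson correlation is an assumption. Each word is treated as a vector of ratings, the distances between all word pairs are computed separately for humans and for a model, and the two sets of distances are then correlated.

```python
from itertools import combinations
from math import dist, sqrt

# Hypothetical toy ratings, invented for illustration only.
# Each word is a vector over (olfactory, visual, gustatory), on a 0-5 scale.
human = {
    "pasta":   (4.0, 3.0, 4.8),
    "noodles": (3.8, 3.1, 4.6),
    "roses":   (4.0, 4.7, 0.4),
}
model = {
    "pasta":   (3.5, 3.2, 4.5),
    "noodles": (3.4, 3.0, 4.4),
    "roses":   (4.2, 4.5, 0.8),
}

def pairwise_distances(ratings):
    """Euclidean distance between the rating vectors of every word pair."""
    words = sorted(ratings)
    return {(a, b): dist(ratings[a], ratings[b])
            for a, b in combinations(words, 2)}

def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

human_d = pairwise_distances(human)
model_d = pairwise_distances(model)

# Compare the two distance structures over the same ordered word pairs.
pairs = sorted(human_d)
rsa_score = pearson([human_d[p] for p in pairs],
                    [model_d[p] for p in pairs])

print(f"pasta vs roses olfactory (human): "
      f"{human['pasta'][0]} vs {human['roses'][0]}")
for p in pairs:
    print(p, round(human_d[p], 2), round(model_d[p], 2))
print("similarity of human and model structure:", round(rsa_score, 3))
```

In these toy values "pasta" and "roses" share the same olfactory rating, yet "pasta" sits far closer to "noodles" once all three dimensions are taken together, which is the kind of structure a single-dimension comparison would miss.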

The representational similarity analyses revealed that word representations produced by the LLMs were most similar to human representations in the non-sensorimotor domain, less similar in the sensory domain and least similar in the motor domain. This highlights the limitations of LLMs in fully capturing humans' conceptual understanding. LLMs represent non-sensorimotor concepts well but fall short on concepts involving sensory information, such as visual appearance and taste, and on concepts involving body movement. Motor concepts, which are described less often in language and rely heavily on embodied experience, are even more challenging for LLMs than sensory concepts like colour, which can be learned from textual data.

In light of the findings, the researchers examined whether grounding would improve the LLMs' performance. They compared the performance of more grounded LLMs trained on both language and visual input (GPT-4, Gemini) with that of LLMs trained on language alone (GPT-3.5, PaLM). They discovered that the more grounded models incorporating visual input exhibited a much higher similarity with human representations.

Prof. Li Ping said, "The availability of both LLMs trained on language alone and those trained on language and visual input, such as images and videos, provides a unique setting for research on how sensory input affects human conceptualisation. Our study exemplifies the potential benefits of multimodal learning, a human ability to simultaneously integrate information from multiple dimensions in the learning and formation of concepts and knowledge in general. Incorporating multimodal information processing in LLMs can potentially lead to a more human-like representation and more efficient human-like performance in LLMs in the future."

Interestingly, this finding is also consistent with previous human studies indicating representational transfer. Humans acquire object-shape knowledge through both visual and tactile experiences, with seeing and touching objects activating the same regions in human brains. The researchers pointed out that – as in humans – multimodal LLMs may use multiple types of input to merge or transfer representations embedded in a continuous, high-dimensional space. Prof. Li added, "The smooth, continuous structure of embedding space in LLMs may underlie our observation that knowledge derived from one modality could transfer to other related modalities. This could explain why congenitally blind and normally sighted people can have similar representations in some areas. Current limits in LLMs are clear in this respect."

Ultimately, the researchers envision a future in which LLMs are equipped with grounded sensory input, for example, through humanoid robotics, allowing them to actively interpret the physical world and act accordingly. Prof. Li said, "These advances may enable LLMs to fully capture embodied representations that mirror the complexity and richness of human cognition, and a rose in LLM's representation will then be indistinguishable from that of humans."

Hashtag: #PolyU #HumanCognition #LargeLanguageModels #LLMs #GenerativeAI

The issuer is solely responsible for the content of this announcement.
