Business Daily Media

Men's Weekly

.

PolyU develops novel multi-modal agent to facilitate long video understanding by AI, accelerating development of generative AI-assisted video analysis

HONG KONG SAR - Media OutReach Newswire - 10 June 2025 - While Artificial Intelligence (AI) technology is evolving rapidly, AI models still struggle with understanding long videos. A research team from The Hong Kong Polytechnic University (PolyU) has developed a novel video-language agent, VideoMind, that enables AI models to perform long video reasoning and question-answering tasks by emulating humans' way of thinking.

The VideoMind framework incorporates an innovative Chain-of-Low-Rank Adaptation (LoRA) strategy to reduce the demand for computational resources and power, advancing the application of generative AI in video analysis. The findings have been submitted to the world-leading AI conferences.

A research team led by Prof. Changwen Chen, Interim Dean of the PolyU Faculty of Computer and Mathematical Sciences and Chair Professor of Visual Computing, has developed a novel video-language agent VideoMind that allows AI models to perform long video reasoning and question-answering tasks by emulating humans’ way of thinking. The VideoMind framework incorporates an innovative Chain-of-LoRA strategy to reduce the demand for computational resources and power, advancing the application of generative AI in video analysis.
A research team led by Prof. Changwen Chen, Interim Dean of the PolyU Faculty of Computer and Mathematical Sciences and Chair Professor of Visual Computing, has developed a novel video-language agent VideoMind that allows AI models to perform long video reasoning and question-answering tasks by emulating humans’ way of thinking. The VideoMind framework incorporates an innovative Chain-of-LoRA strategy to reduce the demand for computational resources and power, advancing the application of generative AI in video analysis.

Videos, especially those longer than 15 minutes, carry information that unfolds over time, such as the sequence of events, causality, coherence and scene transitions. To understand the video content, AI models therefore need not only to identify the objects present, but also take into account how they change throughout the video. As visuals in videos occupy a large number of tokens, video understanding requires vast amounts of computing capacity and memory, making it difficult for AI models to process long videos.

Prof. Changwen CHEN, Interim Dean of the PolyU Faculty of Computer and Mathematical Sciences and Chair Professor of Visual Computing, and his team have achieved a breakthrough in research on long video reasoning by AI. In designing VideoMind, they made reference to a human-like process of video understanding, and introduced a role-based workflow. The four roles included in the framework are: the Planner, to coordinate all other roles for each query; the Grounder, to localise and retrieve relevant moments; the Verifier, to validate the information accuracy of the retrieved moments and select the most reliable one; and the Answerer, to generate the query-aware answer. This progressive approach to video understanding helps address the challenge of temporal-grounded reasoning that most AI models face.

Another core innovation of the VideoMind framework lies in its adoption of a Chain-of-LoRA strategy. LoRA is a finetuning technique emerged in recent years. It adapts AI models for specific uses without performing full-parameter retraining. The innovative chain-of-LoRA strategy pioneered by the team involves applying four lightweight LoRA adapters in a unified model, each of which is designed for calling a specific role. With this strategy, the model can dynamically activate role-specific LoRA adapters during inference via self-calling to seamlessly switch among these roles, eliminating the need and cost of deploying multiple models while enhancing the efficiency and flexibility of the single model.

VideoMind is open source on GitHub and Huggingface. Details of the experiments conducted to evaluate its effectiveness in temporal-grounded video understanding across 14 diverse benchmarks are also available. Comparing VideoMind with some state-of-the-art AI models, including GPT-4o and Gemini 1.5 Pro, the researchers found that the grounding accuracy of VideoMind outperformed all competitors in challenging tasks involving videos with an average duration of 27 minutes. Notably, the team included two versions of VideoMind in the experiments: one with a smaller, 2 billion (2B) parameter model, and another with a bigger, 7 billion (7B) parameter model. The results showed that, even at the 2B size, VideoMind still yielded performance comparable with many of the other 7B size models.

Prof. Chen said, "Humans switch among different thinking modes when understanding videos: breaking down tasks, identifying relevant moments, revisiting these to confirm details and synthesising their observations into coherent answers. The process is very efficient with the human brain using only about 25 watts of power, which is about a million times lower than that of a supercomputer with equivalent computing power. Inspired by this, we designed the role-based workflow that allows AI to understand videos like human, while leveraging the chain-of-LoRA strategy to minimise the need for computing power and memory in this process."

AI is at the core of global technological development. The advancement of AI models is however constrained by insufficient computing power and excessive power consumption. Built upon a unified, open-source model Qwen2-VL and augmented with additional optimisation tools, the VideoMind framework has lowered the technological cost and the threshold for deployment, offering a feasible solution to the bottleneck of reducing power consumption in AI models.

Prof. Chen added, "VideoMind not only overcomes the performance limitations of AI models in video processing, but also serves as a modular, scalable and interpretable multimodal reasoning framework. We envision that it will expand the application of generative AI to various areas, such as intelligent surveillance, sports and entertainment video analysis, video search engines and more."


Hashtag: #PolyU #AI #LLMs #VideoAnalysis #IntelligentSurveillance #VideoSearch

The issuer is solely responsible for the content of this announcement.

News from Asia

Optimistic Hong Kong Ecommerce Merchants Report Growth, But Hidden Payment Friction Is Eroding Up to 10% of Revenue, Aspire Report Finds

Despite 64% reporting revenue growth, 91% of merchants face payment friction HONG KONG SAR - Media OutReach Newswire - 16 December 2025 - Aspire, the all-in-one finance platform for modern busine...

ISCA Unveils Bold Plan to Future-Proof Singapore’s Small and Medium-Sized Accounting Practices

SINGAPORE - Media OutReach Newswire - 16 December 2025 - Small and Medium-Sized Accounting Practices (SMPs) are the backbone of Singapore's business community, supporting thousands of Small and Me...

Leeds Capital and MIO Trust Are Proud to Announce Their Collaboration on an AI‑Driven Multi‑Asset Trust Focused on Digital Assets and Precious Metals

SYDNEY, AUSTRALIA - Media OutReach Newswire - 16 December 2025 - Leeds Capital and MIO Trust are proud to announce their collaboration. Together they are set to launch an AI‑driven multi‑asset tru...

Lily Allen, Little Simz and Bianca Bustamante Light Up The Red-Carpet In Desert Diamonds, At The Fashion Awards 2025

LONDON, UK - Media OutReach Newswire - 16 December 2025 - Desert diamonds graced the red-carpet at this year's Fashion Awards, held at the Royal Albert Hall on 1st December. Attendees including si...

The 27th Mountain Emei Ice, Snow & Hot Spring Season Invites Global Visitors to "Enjoy Winter Fun"

EMEISHAN, CHINA - Media OutReach Newswire - 16 December 2025 - On the evening of December 14, as the 218-meter-high Twin Towers lit up with a spectacular giant-screen light show, the launch ceremo...

Cheers to New Beginnings: Carlsberg Hong Kong Launches No & Low-Alcohol and Beyond Beer Series for Conscious Celebrations

Ringing in the New Year, the extended collection promotes moderation and conscious drinking throughout the festive season and beyond HONG KONG SAR - Media OutReach Newswire - 16 December 2025 - Ca...

1wish Season with Santa Jones – Christmas Advents by 1win and Jon Jones

WILLEMSTAD, CURAÇAO - Media OutReach Newswire - 3 December 2025 - Four weeks before Christmas, 1win has launched an Advent Calendar in partnership with its global brand ambassador Jon Jones, the l...

Prediction is the New Protection: Gartner® Acknowledged CyCraft as a Sample Vendor We Believe for Emerging AI Cyber Solutions

TAIPEI, TAIWAN - Media OutReach Newswire - 17 December, 2025 - CyCraft Technology has been identified as a Gartner® Sample Vendor in both Preemptive Exposure Management (PEM) and Unified Exposure ...

TSquared Lab launches TSquared Health, an AI-driven longevity ecosystem, with the acquisition of Noviu Health

Dr. Hisham Badaruddin Appointed Chief Medical Officer as TSquared Health Integrates Medical, Biomarker, and AI Longevity Capabilities SINGAPORE - Media OutReach Newswire - 17 December 2025 - TSqua...

Xtreme Communications partners with Truecaller to bring trust and efficiency to business communication in Australia

SYDNEY, AUSTRALIA - NewsVoir - 17 December 2025 - Truecaller, the leading global communications platform, today announced a strategic partnership with Xtreme Group, to introduce the Truecaller Cust...

From Check-in to Touchdown: How AI and smarter systems are transforming the travel industry

Richard Valente, VP of Customer Experience Strategy at TP in Australia, explores how IT-BPM outsourcing is revolutionising the travel sector throu...

Online Christmas shoppers fund climate and biodiversity projects via HealthPost's Click Sphere for Good initiative

Online shoppers with HealthPost’s Flora & Fauna have made 11,000 contributions towards climate and biodiversity projects when ordering parcel ...

US landmark settlement protects SMEs, highlighting flaws in the RBA's proposed blanket card surcharging ban for Australia

Aussie SMEs warn RBA not to ignore global trends, with the current sledgehammer approach threatening business viability and increasing inflation ...

Thryv Australia named Employer of Choice for third consecutive year at Australian Business Awards

Thryv® (NASDAQ: THRY), Australia’s provider of the leading small business marketing and sales software platform, has been awarded the Employer of ...

RogersDigital.com Announces the Launch of TheBulletin.au, a Destination for Business, Policy and Financial Insight

RogersDigital.com has announced the launch of TheBulletin.au, a new national digital publication designed to deliver sharp, data-driven reporting ...

Controlling business spend is helping finance leaders to forecast with confidence

Forecasting has always been central to financial planning; however, traditional methods based on historical trends are no longer enough. Economic ...

hacklink hack forum hacklink film izle hacklink หวยออนไลน์betsmovematbethttps://vozolturkiyedistributoru.com/Pusulabet Girişสล็อตเว็บตรงgamdom girişpadişahbetMostbetpradabetmatbetcarros usadospin upMostbetdizipalholiganbetnn888trendbetsetrabetjojobetmarsbahis girişpusulabet girişbetnanotürk ifşaBets10pusulabetholiganbetpusulabetMavibet色情marsbahisnakitbahisholiganbetjojobet girişjojobet girişjojobet girişjojobet girişYakabet1xbet girişjojobetgrandpashabetFİXBETbetofficeenjoybetpradabetkingroyalholiganbetgiftcardmall/mygiftultrabetkavbetbets10palacebetmamibetmeritkingcasibommeritkingdamabetslot spacemancasibomteknoloji haberlericasibom girişJojobetmeritkingmeritkingPorno İzlecasibom girişsweet bonanzakingroyalgalabetcasibomcasibom girişjokerbetjokerbetyakabetCasibombetpuanmeritkingmatbet girişdinamobetmasterbettingvdcasinoSekabet girişmarsbahisbetkolikultrabetprimebahismeritkingprimebahismeritkingbets10yakabetyakabetyakabetjojobetprizmabetkulisbetSahabetmr pachoaertyercasibomcolor pickermatbetvbetkavbetkralbet girişmavibetmavibetmavibetbetnano girişcratosslot girişคลิปหลุดไทยCasibomCasibomholiganbetdeneme bonusu veren siteleronwinonwinizmir escortbetnanoantalya escortbetnano girişbahsegeltimebetbetnanocasibom güncel girişcasibom girişbahiscasinoultrabetbets10matbetcasibomRoyal Reelsroyal reelsstarzbet girişKayseri Escortjojobet girişjojobetbetasusNişantaşı EscortelexbetelexbetbettiltStreameastcasibomKalebetBetplayfixbetaviator gameÜsküdar Evden Eve Nakliyatsonbahistimebettimebettimebetbahisoistanbul escort telegramcasibombetparkpantheraproject.netprimebahisholiganbetholiganbet girişmarsbahiscasibomstreameast한국야동vaycasinoสล็อตholiganbet girişholiganbetpornopadişahbetBetigmabetparkBetigmaBetlora girişgiftcardmall/mygiftgaziantep escorteb7png pokiesbest online casino australiabest online pokies australiareal money pokies online australiabcgame96 casinocrown155 hk casinohb88kh casinoBetplaygalabetmarsbahisgalabetholiganbet girişjojobetcasibombets10bets10betasusholiganbetolimposcasinobetbabaholiganbetholiganbetolabahis girişcasibomdeneme bonusu veren siteler rehneriblooketasyabahis girişpinbahis girişdumanbet girişjojobetStreameastmostbetdaftar situs judi slot gacor hb88 indonesiaJojobet 1112mostbetmostbetmostbetteosbetorisbetbahis siteleri 2025matadorbetcasinowon girişkavbetjojobetgiftcardmall/mygift check balance visapusulabetgalabet girişซื้อหวยออนไลน์grandpashabetcasibomasdsadasdasdasdasfdasfasfsadfasdfsdfasdasdasdasdkingroyal girişjojobetbahiscasinobetasuspin up uzbekistanSlot Heart Casinomamibet logincasinomedklarna.sebetworld96 online casino cambodiaholiganbetwww.giftcardmall.com/mygiftwww.giftcardmall.com/mygiftcasibomtm menards loginbetasuspalacebetsekabet girişe wallet casino australiameybetplay aristocrat pokies onlinecasibompusulabetmaltcasino girişcanlı maç izlebetpasSahabet girişcasibomcasibomcratosroyalbetci girişultrabetcasibomdeneme bonusu veren sitelerPinup AZjokerbetjojobetvdcasinomostbetcasibom girişCasibomsitus slot gacormatbet