Multimodal Large Language Models and Applications-东北师范大学数学与统计学院

当前位置：首页 > 学术活动 > 正文

Multimodal Large Language Models and Applications

时间：2025年10月20日 10:00 点击数：

报告人：蔡小昊

报告地点：腾讯会议ID: 838-732-389

报告时间：2025年10月22日星期三15:00-16:00

邀请人：刘俊

报告摘要：

Machine/deep learning technologies like large language models have revolutionised many fields including computer vision, image processing, and natural language processing. Their success generally relies on big data, functioning in a black-box manner. In this presentation, I will introduce some of our recent work leveraging multimodal large language models on detection and classification, and some applications in autonomous driving (e.g., point cloud analysis) and natural language processing (e.g., fake news detection and personality detection).

主讲人简介：

Xiaohao Cai received his PhD degree from The Chinese University of Hong Kong in 2012. He afterwards was a Postdoctoral Researcher at the Department of Mathematics of the Technische Universitat Kaiserslautern in Germany. He has broad multi-disciplinary research interests in applied mathematics, statistics, and computer science, with main focus and applications in image/signal/data processing, optimisation, machine learning and computer vision. He is Fellow of Advance HE in the UK. His recent work in AI for donkey identification has been featured by world-leading media like BBC and ITV.