个人简介
曹毅超(Yichao Cao),博士,特聘副教授,硕士生导师。研究方向包括多模态大模型、具身智能、计算机视觉等。近五年在ICCV、NeurIPS、ICML、AAAI、ACMMM、ICCAD、DAC等高水平国际会议与期刊上录用/发表论文20余篇,授权国家发明专利10余项,长期担任CVPR、ICCV、NeurIPS、ICML、ICLR、ACMMM、IJCAI、ECCV、IEEE TMM、IEEE TNNLS、IEEE TCSVT等会议和期刊的审稿人,担任AAAI、IJCAI等顶会程序委员会委员。Google Scholar个人主页:https://scholar.google.com/citations?user=--8h8o0AAAAJ&hl=zh-CN。
【研究方向】
※ 多模态大模型(VLM大模型的持续演化和能力优化);
※ 具身智能(具身VTLA大模型的感知、记忆、泛化等);
※ 包括但不限于其他有趣的方向;
【企业经历】
企业任职期间,主持林业/草原行业首个多模态大模型“林海思绪(ForestMind)”的研发(大模型备案号:Jiangsu-LinHaiSiXu-202508130039),主持的“端-边-云协同林火识别引擎及一体化解决方案”获得国际先进成果评价,所主持研发的林火识别解决方案已成功应用于我国20多个省、60多个地市,大幅推动了林草行业智能化发展,建立了一批行业示范性项目,并参与起草了林草、应急行业的多项行业标准。
【近五年代表性论文】
[19] CompTrack: Information Bottleneck‑Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking. AAAI Conference on Artificial Intelligence (AAAI Oral, CCF A), 2026
[18] FocusTrack: One-Stage Focus-and-Suppress Framework for 3D Point Cloud Object Tracking. ACM Multimedia (ACM MM, CCF A), 2025
[17] Refining the Granularity of Smoke Representation: SAM-Powered Density-Aware Progressive Smoke Segmentation Framework. Pattern Recognition (PR, 中科院1区TOP), 2025
[16] CounterPC: Counterfactual Feature Realignment for Unsupervised Domain Adaptation on Point Clouds. International Conference on Computer Vision (ICCV Highlight, CCF A), 2025
[15] TinyMIG: Transferring Generalization from Vision Foundation Models to Single-Domain Medical Imaging. International Conference on Machine Learning (ICML, CCF A), 2025
[14] Perturbating, Tuning, and Collaborating: Harnessing Vision Foundation Models for Single Domain Generalization on Medical Imaging. AAAI Conference on Artificial Intelligence (AAAI, CCF A), 2025
[13] A Novel Image-Graph Heterogeneous Fusion Framework for Static IR Drop Prediction. Design Automation Conference (DAC, CCF A), 2025
[12] Variational Feature Imitation Conditioned on Visual Descriptions for Few-Shot Fine-Grained Recognition. IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT, 中科院1区TOP), 2025
[11] SmokeAgent: Multimodal Agent for Fine-Grained Smoke Event Analysis in Large-Scale Wild Environments. Pattern Recognition (PR, 中科院1区TOP), 2025
[10] Debiased Prototype Evolving for Point Cloud Domain Adaptation via 3D Foundation Models. International Conference on Acoustics, Speech, and Signal Processing (ICASSP, CCF B), 2025
[9] A Geometry-Material Aware Point Cloud Transformer for Large-scale Unstructured Thermal Analysis in 2.5D ICs. IEEE/ACM International Conference on Computer-Aided Design (ICCAD, CCF B), 2025
[8] Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models. Advances in Neural Information Processing Systems (NeurIPS, CCF A), 2024
[7] Universal Frequency Domain Perturbation for Single-Source Domain Generalization. ACM Multimedia (ACM MM, CCF A), 2024
[6] Re-mine, Learn and Reason: Exploring the Cross-modal Semantic Correlations for Language-guided HOI detection. International Conference on Computer Vision (ICCV, CCF A), 2023
[5] Coarse2Fine: Local Consistency Aware Re-prediction for Weakly Supervised Object Localization. AAAI Conference on Artificial Intelligence (AAAI, CCF A), 2023
[4] Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval. ACM Multimedia (ACM MM, CCF A), 2023
[3] Searching for Better Spatio-temporal Alignment in Few-Shot Action Recognition. Neural Information Processing Systems (NeurIPS, CCF A), 2022
[2] EFFNet: Enhanced Feature Foreground Network for Video Smoke Source Prediction and Detection. IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT, 中科院1区TOP), 2022
[1] Combining the Convolution and Transformer for Classification of Smoke-Like Scenes in Remote Sensing Images. IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS, 中科院1区TOP), 2022
【联系方式】
caoyichao@csu.edu.cn
