Submission Deadline: 30 June 2025 View: 267 Submit to Special Issue
Dr. Bo Yang
Email: boyang@uestc.edu.cn
Affiliation: School of Automation Engineering, University of Electronic Science and Technology of China, China
Research Interests: Computer Vision, Surgical Robotics, Surgical (endoscopic) Vison, and Medical Image Processing
Dr. Chao Liu
Email: liu@lirmm.fr
Affiliation: CNRS (French National Center for Scientific Research), France
Research Interests: Visual Augmentation and Reconstruction, 3D Reconstruction of Deformable Surface, Haptics in Human-Machine Interaction, Multimodal Sensor-Based Analysis of Manipulation Skills, Surgical Robot, Medical Image Processing
Over the past decade or so, artificial intelligence technologies represented by deep learning have made remarkable progress, especially in the domains of signal processing and computer vision. In these domains, deep learning-based methods are being iterated and commercialized at an unprecedented rate, dramatically changing the way humans live, learn, and work.
Since the resurgence of convolutional neural networks in 2010, computer vision has been one of the most dynamic areas for deep learning technology. Recently, image and video synthesis and generation, 3D vision, visual language model, and multimodal learning have gradually been the research hotspots in the field. The transformative wave of Large Language Models (LLMs) in the field of Natural Language Processing (NLP) has inspired further exploration of their potential in computer vision. On the other hand, AI technologies are increasingly transitioning from the virtual realm to the physical, integrating with automated devices or machinery to create embodied, intelligent entities capable of physical interaction, known as Embodied Artificial Intelligence (EAI). In this context, AI needs to sink down to deal with more underlying signal or hardware information. In the field of signal processing, AI intersects with traditional control, automation, and robotics technologies, leading to a fusion of these disciplines.
In short, AI has made great strides from initially recognizing the world (traditional vision tasks such as classification and recognition) to simulating the world (generative models) and changing the world (embodied intelligence). This special issue hopes to document and advance this trend by focusing on the latest advances in AI technology in the areas of signal processing and computer vision. We seek original research articles, reviews, and survey papers that explore the latest developments, challenges, and solutions in these rapidly evolving areas. The potential topics encompassed may include, but are not limited to, the following topics:
· Multimodal artificial intelligence
· 2D&3D generative modes
· Image & video segmentation
· 3D reconstruction
· Large models and their applications in signal processing and computer vision
· Visual question and answer (VQA), visual reasoning
· Meta-learning, transfer learning, few-shot learning.
· Embodied Artificial Intelligence
· Reinforcement Learning
· Medical image processing
· Medical robot
· Vision Foundation Models
· Efficient and robust AI