UID:
almahu_9949930860102882
Format:
XXXVII, 474 p. 188 illus., 165 illus. in color.
,
online resource.
Edition:
1st ed. 2025.
ISBN:
9783031781131
Series Statement:
Lecture Notes in Computer Science, 15330
Content:
The multi-volume set of LNCS books with volume numbers 15301-15333 constitutes the refereed proceedings of the 27th International Conference on Pattern Recognition, ICPR 2024, held in Kolkata, India, during December 1-5, 2024. The 963 papers presented in these proceedings were carefully reviewed and selected from a total of 2106 submissions. They deal with topics such as Pattern Recognition; Artificial Intelligence; Machine Learning; Computer Vision; Robot Vision; Machine Vision; Image Processing; Speech Processing; Signal Processing; Video Processing; Biometrics; Human-Computer Interaction (HCI); Document Analysis; Document Recognition; Biomedical Imaging; Bioinformatics.
Note:
DCI-Net: Remote Sensing Image-based Object Detector -- CROSS-MODAL SHIP GROUNDING: TOWARDS LARGE MODEL FOR ENHANCED FEW-SHOT LEARNING -- STNet: Small Target Detection Network for IR Imagery -- FF-Yolo: A Feature-fusion Yolo model for Small Scale FODs detection in Airport Runways -- Weakly Aligned Multi-Spectral Pedestrian Detection via Cross-Modality Differential Enhancement and Multi-Scale Spatial Alignment -- CrackUDA: Incremental Unsupervised Domain Adaptation for Improved Crack Segmentation in Civil Structures -- DS MYOLO: A Reliable Object Detector Based on SSMs for Driving Scenarios -- Robust Single-Cam Surround View Object Detection and Localization Using Memory Maps -- Exploring the Reliability of Foundation Model-Based Frontier Selection in Zero-Shot Object Goal Navigation -- Reliable Semantic Understanding for Real World Zero-shot Object Goal Navigation -- AllWeather-Net: Unified Image Enhancement for Autonomous Driving Under Adverse Weather and Low-Light Conditions -- Uni4DAL: A Unified Baseline for Multi-dataset 4D Auto-Labeling -- Dual-Attention Fusion Network with Edge and Content Guidance for Remote Sensing Images Segmentation -- Distortion Correction Sub-Network for Semantic Segmentation based on Deep Hough Transform -- MemoFlow: Modifying Explicit Motion of Inconsistency in Optical Flow -- Enhanced Brain Tumor Segmentation Using Preprocessing Techniques and 3D U-Net -- Joint Top-Down and Bottom-Up Frameworks for 3D Visual Grounding -- Anticipating Future Object Compositions without Forgetting -- SPK: Semantic and Positional Knowledge for Zero-shot Referring Expression Comprehension -- Can Language Improve Visual Features For Distinguishing Unseen Plant Diseases? -- Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation -- iGrasp: An Interactive 2D-3D Framework for 6-DoF Grasp Detection -- Goal-Driven Transformer for Robot Behavior Learning from Play Data -- Adaptive Dynamic VSLAM: Refining Semantic-Geometric Fusion and Static Background Inpainting -- Hierarchical Visual Place Recognition with Semantic-guided Attention -- Dense Reconstruction and Localization in Scenes with Glass Surfaces Based on ORB-SLAM2 -- Content-Aware Feature Upsampling for Voxel-based 3D Semantic Segmentation -- Enhancing 3D Referential Grounding by Learning Coarse Spatial Relationships -- PointGADM: Geometry Acquainted Deep Model for 3D Point Cloud Analysis -- CroMA: Cross-Modal Attention for Visual Question Answering in Robotic Surgery.
In:
Springer Nature eBook
Additional Edition:
Printed edition: ISBN 9783031781124
Additional Edition:
Printed edition: ISBN 9783031781148
Language:
English
DOI:
10.1007/978-3-031-78113-1
URL:
https://doi.org/10.1007/978-3-031-78113-1
Bookmarklink