Institute for Intelligent Systems
StixelNExT++ introduces a lightweight, end-to-end neural network for 3D monocular scene segmentation and representation, achieving real-time inference at 10 ms per frame and a 63.8% F1-Score for holistic obstacle detection. The system generates a compressed 3D Stixel World from a single RGB image, suitable for collective perception by leveraging a depth-aware loss and non-linear depth discretization.
Wildlife re-identification aims to match individuals of the same species across different observations. Current state-of-the-art (SOTA) models rely on class labels to train supervised models for individual classification. This dependence on annotated data has driven the curation of numerous large-scale wildlife datasets. This study investigates self-supervised learning Self-Supervised Learning (SSL) for wildlife re-identification. We automatically extract two distinct views of an individual using temporal image pairs from camera trap data without supervision. The image pairs train a self-supervised model from a potentially endless stream of video data. We evaluate the learnt representations against supervised features on open-world scenarios and transfer learning in various wildlife downstream tasks. The analysis of the experimental results shows that self-supervised models are more robust even with limited data. Moreover, self-supervised features outperform supervision across all downstream tasks. The code is available here this https URL.
There are no more papers matching your filters at the moment.