Accurate skier tracking is essential for performance analysis, injury
prevention, and optimizing training strategies in alpine sports. Traditional
tracking methods often struggle with occlusions, dynamic movements, and varying
environmental conditions, limiting their effectiveness. In this work, we used
STARK (Spatio-Temporal Transformer Network for Visual Tracking), a
transformer-based model, to track skiers. We adapted STARK to address
domain-specific challenges such as camera movements, camera changes,
occlusions, etc. by optimizing the model's architecture and hyperparameters to
better suit the dataset.