Key Laboratory for Information Science of Electromagnetic Waves (Ministry of Education)
Researchers from Fudan University developed a unified framework for Aerial Vision-Language Navigation that enables UAVs to follow natural language instructions using only monocular RGB observations. The framework achieves state-of-the-art performance among RGB-only methods and remains competitive with systems that rely on additional sensors.