pushing the boundaries of drone intelligence and putting it in your hands
May 12 • 8 tweets • 4 min read
< CitiNavAgent: Zero-Shot Drone Navigation in Cities >
"Fly to the white statue after passing the red phone booth" is now a command a drone can actually follow, thanks to CitiNavAgent.
The authors design a VLN (vision-and-language navigation) model that uses a hierarchical semantic planner to break long-horizon instructions into subgoals at different levels of abstraction. It also keeps a global memory graph of past trajectories, which simplifies navigation in areas the drone has already visited.
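To make the two ideas concrete, here is a toy sketch, not the authors' implementation. It assumes the planner has already turned the instruction into an ordered list of landmark subgoals (written by hand here), and it models the memory graph as a plain landmark graph searched with breadth-first search. `MemoryGraph`, `plan`, and the landmark names are all hypothetical.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class MemoryGraph:
    """Undirected graph of landmarks linked by previously flown legs."""
    edges: dict = field(default_factory=dict)

    def add_trajectory(self, landmarks):
        # Link each consecutive pair of landmarks seen on one past flight.
        for a, b in zip(landmarks, landmarks[1:]):
            self.edges.setdefault(a, set()).add(b)
            self.edges.setdefault(b, set()).add(a)

    def shortest_path(self, start, goal):
        # Breadth-first search; returns None when no known route exists.
        if start not in self.edges or goal not in self.edges:
            return None
        parents = {start: None}
        queue = deque([start])
        while queue:
            node = queue.popleft()
            if node == goal:
                path = []
                while node is not None:
                    path.append(node)
                    node = parents[node]
                return path[::-1]
            for nxt in sorted(self.edges[node]):
                if nxt not in parents:
                    parents[nxt] = node
                    queue.append(nxt)
        return None


def plan(subgoals, start, memory):
    """Expand an ordered list of landmark subgoals into route legs."""
    legs, current = [], start
    for target in subgoals:
        known = memory.shortest_path(current, target)
        # Reuse a remembered route if one exists; otherwise flag the leg for exploration.
        legs.append(("memory", known) if known else ("explore", [current, target]))
        current = target
    return legs


memory = MemoryGraph()
memory.add_trajectory(["home", "plaza", "red phone booth"])

# Hand-written subgoals for: "Fly to the white statue after passing the red phone booth"
legs = plan(["red phone booth", "white statue"], "home", memory)
for kind, route in legs:
    print(kind, route)
```

The first leg comes straight from memory because the drone has flown past the phone booth before. The statue has never been seen, so that leg is marked for exploration, which is where a real system would fall back on visual search.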