Toward Foundation Models for Mobility Enriched Geospatially Embedded Objects
Abstract
Recent advances in large foundation models (FMs) have enabled
learning general-purpose representations in natural language, vi-
sion, and audio. Yet geospatial artificial intelligence (GeoAI) still
lacks widely adopted foundation models that generalize across
tasks. We argue that a key bottleneck is the absence of unified,
general-purpose, and transferable representations for geospatially
embedded objects (GEOs). Such objects include points, polylines,
and polygons in geographic space, enriched with semantic context
and critical for geospatial reasoning. Much current GeoAI research
compares GEOs to tokens in language models, where patterns of
human movement and spatiotemporal interactions yield contextual
meaning similar to patterns of words in text. However, modeling
GEOs introduces challenges fundamentally different from language,
including spatial continuity, variable scale and resolution, temporal
dynamics, and data sparsity. Moreover, privacy constraints and
global variation in mobility further complicates modeling and gen-
eralization. This paper formalizes these challenges, identifies key
representational gaps, and outlines research directions for build-
ing foundation models that learn behavior-informed, transferable
representations of GEOs from large-scale human mobility data.
learning general-purpose representations in natural language, vi-
sion, and audio. Yet geospatial artificial intelligence (GeoAI) still
lacks widely adopted foundation models that generalize across
tasks. We argue that a key bottleneck is the absence of unified,
general-purpose, and transferable representations for geospatially
embedded objects (GEOs). Such objects include points, polylines,
and polygons in geographic space, enriched with semantic context
and critical for geospatial reasoning. Much current GeoAI research
compares GEOs to tokens in language models, where patterns of
human movement and spatiotemporal interactions yield contextual
meaning similar to patterns of words in text. However, modeling
GEOs introduces challenges fundamentally different from language,
including spatial continuity, variable scale and resolution, temporal
dynamics, and data sparsity. Moreover, privacy constraints and
global variation in mobility further complicates modeling and gen-
eralization. This paper formalizes these challenges, identifies key
representational gaps, and outlines research directions for build-
ing foundation models that learn behavior-informed, transferable
representations of GEOs from large-scale human mobility data.