The Auto Arborist Dataset is a multiview urban tree classification dataset that, at ~2.6 million trees and >320 genera, is two orders of magnitude larger than those in prior work.
To build the dataset, we pulled from public tree censuses from 23 North American cities and merged these records with Google Street View and overhead RGB imagery. As the first urban forest dataset to cover multiple cities, one can analyze in detail how forest models can generalize with respect to geographic distribution shifts, crucial to building systems that scale.
We have released all 2.6M tree records publicly, along with aerial and ground-level imagery for 1M trees. See the Auto-Arborist blog post for more information.