This repo provides scripts to download, process, and analyze data for over 1.8 billion taxi and for-hire vehicle (Uber, Lyft, etc.) trips originating in New York City since 2009. The data is stored in a PostgreSQL database, and uses PostGIS for spatial calculations, in particular mapping latitude/longitude coordinates to census tracts. Most of the raw data comes from the NYC Taxi & Limousine Commission. The 2014 Uber data comes via FiveThirtyEight, who obtained it via a FOIL request. In August 2016, the TLC began providing for-hire vehicle trip records in addition to taxi trips.