
Correct Data Selection
Ensures Efficient Decision Making
Transportation Datasets

MTA Turnstiles
-
Provides the number of entries and exits every four hours of every subway stations
-
Be used to draw the heatmap of entries, exits and difference of the city
-
Helps analyze the attributes of every station in the morning and evening peak

Citibike
-
Includes the starting and ending position and time information of each trip
-
Helps understand the relationship and passenger transformation between stations and Citibike
-
Focuses only on the areas centered on Manhattan and northern Brooklyn

Taxi
-
Includes the starting and ending position and time information of each trip
-
Helps understand the relationship between subway stations and other transportations
-
Identifies last mile routes from subway stations to destinations with additional demand
Socio-economic Datasets

NYC PLUTO
-
Includes building properties including age, number of floors, and tax returned
-
Contains the data of each building to facilitate the use of various models
-
Provides evaluation indicators for economy, population and connectivity

The Public Use Microdata Areas (PUMAs)
-
Contains demographic, economic, housing and society data classified by region
-
Helps analyze the characteristics of residents of every puma zones
-
Assists in analyzing the characteristics of the area surrounding the subway stations

American Community Survey / US Census
-
Includes economic index and population index of certain area
-
Provides general ideas for the surrounding environment of the subway system
-
Identify income levels and population density for hypothesis drafting and modeling
Geographic Datasets
.png)
Subway Stations Location
-
Contains the station name, lines, latitude and longitude of each station
-
Combined with turnstile, transportation and macro datasets etc. for visualization
.png)
Neighborhoods
-
Provides a geographic reference for visualization as the base map
-
Divided by region and can be combined with datasets in geometry formats
Supportive Datasets

Company Location data
-
Provides company names and locations
-
Used to plot company location to indicate commuting opportunities

Real Estate (public housing)
-
Provides the locations of public housing which represent low income unit
-
Presents the distribution of low-income and other income units