New York City Transit Accessibility Analysis
Introduction
In New York City, the subway system is crucial for millions, supporting daily commutes and influencing urban mobility. This project uses Geographic Information Systems (GIS) for a detailed analysis of NYC subway coverage, examining how well the system meets urban demographics and accessibility needs. The study highlights transit equity gaps and sustainable development opportunities, incorporating theories of accessibility and land-use interaction that link transit accessibility disparities to socioeconomic and demographic factors (Levinson & Krizek, 2018; Martens, 2016; El-Geneidy et al., 2016).
Previous research has documented the crucial role of GIS in identifying and addressing transit deserts—areas with significant transit demand but insufficient service. Studies have shown that these areas often coincide with neighborhoods housing marginalized populations, exacerbating social and economic inequities (Levinson et al., 2015). Furthermore, GIS-based studies contribute to a nuanced understanding of transit accessibility by mapping service distribution and integrating this data with demographic analytics, thus providing a robust framework for policy development aimed at enhancing transit equity (Geurs & Van Wee, 2004; Fan et al., 2016).
Problem Hypothesis
This project hypothesizes that the distribution of subway services in NYC is uneven and correlates with demographic and socioeconomic disparities. By employing GIS to map and analyze these patterns, the study seeks to offer actionable insights that can inform urban planning and contribute to the development of more equitable and efficient public transportation systems.
Goals and Objectives
The primary goal of this project is to provide a comprehensive analysis of transit accessibility in New York City, using spatial and demographic data to illuminate areas of concern and opportunities for enhancement. The objectives are designed to inform policymakers and urban planners with actionable insights:
- Integrate Demographic Data: Analyze the populations served by transit to determine the distribution of transit demand.
- Map Transit Accessibility: Conduct detailed mapping of subway station service areas at different walking intervals to identify geographic disparities in access to public transit.
- Evaluate Spatial Distribution of Transit Access: Utilize spatial statistics to assess the distribution of travel times to the nearest subway stations, identifying anomalous areas where transit access deviates from citywide norms.
- Assess Commute Efficiency During Peak Hours: Assess commute efficiency during peak hours by examining workplace transit accessibility within the transit system, focusing on the reachability of key employment centers during morning rush hours.
- Identify Areas for Potential Service Improvement: Based on the analysis of transit accessibility and demographic needs, identify blocks and neighborhoods where subway access is notably deficient and provide insights for areas that could benefit from transit development initiatives.
Methods
Workflow
Demographic Analysis
Analyze the demographic characteristics of transit-dependent populations and assess the distribution of transit demand across New York City.
Business Analysis - Enrich Layer
Enrich Layer
Incorporate relevant demographic data into the GIS dataset, enriching the analysis of transit-dependent populations.
- Data: 2020 Census New York City Blocks
- Key variables:
- Total Population: Examine overall population metrics.
- Income Below Poverty Level: Assess economic vulnerabilities affecting transit reliance.
- Owner and Renter Households without Vehicles: Analyze vehicle access among homeowners and renters to gauge public transit dependency.
ModelBuilder
Combine demographic data layers to generate a transit demand distribution map and ensure consistency and repeatability in the analysis.
- Duplicate Layers: Copy the original result layer into four separate layers for individual processing and symbology.
- Population Estimation:
- Calculate total population for each block group, assuming uniform distribution.
- Address variable block sizes (264 feet by 900 feet), and estimate population per 237,600 square feet area.
- Raster Conversion and Sizing
- Convert the each feature layers to a raster format, setting the block size.
- Set the output cell size for the raster based on the square root of the block area, which is approximately 487.44 feet.
- Focal Statistics: Estimate average density within a defined radius around each raster cell, targeting an area encompassing three city blocks.
- Combine Rasters: Perform raster calculations to merge all four layers into a single comprehensive density map, which represent the transit demand.
- Automate with Model Builder: Construct and automate the entire workflow within Model Builder to ensure consistency and repeatability across analyses.
Transit Demand Model
Street Network Dataset
- Network Dataset Configuration:
- Transit Stations and Lines Data: Process GTFS data to create spatial features.
- NYC Streets Data: Include one-way or two-way roads to enhance traffic and accessibility modeling.
- Evaluator: Walking time (min) along street segments.
Service Area Solver
Identify the geographic areas served by the city's transit system by tailoring the network for pedestrian movement analysis, using walking speeds to simulate realistic travel times within the city.
- Solver Configuration:
- Set three travel time cutoffs, 5, 10 and 15 mins to identify areas accessible within specified walking time from subway stations
- Use an 'overlap' configuration to highlight intersections of accessible areas, producing detailed maps of coverage overlaps.
- Apply a 'dissolve' configuration to merge overlapping areas, creating unified maps and highlighting regions where the cutoff time exceeds 15 minutes.
Closest Facility Solver
Assess the proximity of the nearest subway stations at the block level by measuring the walking time required from the centroid of each block to the nearest subway station. Then, categorize blocks according to their access levels, using specific travel time thresholds—5, 10, 15 minutes, and over 15 minutes—to evaluate transit accessibility.
- Dual-Scope Evaluation:
- Citywide Analysis: Conduct a broad analysis across all of New York City to identify overarching disparities in transit service coverage.
- Borough-Specific Analysis: Focus on localized transit accessibility within individual boroughs.
- Inferential Statistics:
- Spatial Autocorrelation (Moran's I and Incremental): Analyze the degree of geographical dependency among travel times for each block, and determine whether the pattern is clustered, dispersed, or random.
- Hotspots Analysis (Getis-Ord GI*) and Cluster and Outlier Analysis (Anselin Local Moran's I): Identify areas of high and low value clusters (hot and cold spots) and detect local patterns that significantly differ from overall trends.
Transit Network Dataset
Network Analyst public transit data model
- GTFS to Public Transit Data Mode: Converts multiple GTFS public transit datasets to a set of feature classes and tables that represent the transit stops, lines, and schedules.
- Connect Public Transit Data Model to Streets: Connects transit stops to street features for use in a transit-enabled network dataset.
Transit Network Dataset Configuration
Transit Network Dataset
- Feature Class Integration:
- Stop: Designated access points for passengers, defining entry and exit points within the transit network.
- LineVariantElements: Details variations in transit routes, capturing differences in train schedules and destinations.
- Streets: NYC Streets.
- StopsOnStreets: Connects transit stops to their precise street locations.
- StopConnectors: Pathways that facilitate pedestrian movement from streets to transit stops.
- Evaluator: Public Transit Time (min) = Travel time along a public transit line segment + Walking time along street segments (Walking speed: 308 ft/min)
Calculate Accessibility Matrix
Assess workplace transit accessibility by evaluating job connectivity, measuring the total number of jobs reachable within a 40-minute travel time by transit and walking during the morning rush hour.
- Data Configuration and Analysis Settings:
- Origins and Destinations: All blocks in NYC.
- Network Data Source: Utilizes Transit Network Dataset to access transit routes and schedules.
- Travel Model: Implements the Public Transit Travel Model to simulate commuter conditions during peak hours.
- Time Frame: Focus on the morning rush time window, 7:00 AM to 9:00 AM, on a typical weekday, with one-minute increments for high temporal resolution.
- Cutoff Time: Set at 40 minutes, based on average commute times to the office for New Yorkers.
- Destination Weighting: Based on the number of jobs in each block, referring to New York LEHD Origin-Destination Employment Statistics (LODES) data.
- Analysis Insights:
- Identify areas with optimal job accessibility by combining demographic data and transit accessibility coverage, as well as pinpointing those areas that require improved transit services.
Main Findings
Geographic Analysis
Population Density
The areas in Manhattan, Brooklyn, and parts of Queens and the Bronx exhibit higher population density.
Conversely, lower density is notable in Staten Island and the peripheral regions of other boroughs.
Households with Income Below Poverty Level Percent
In parts of Brooklyn, the Bronx, and upper Manhattan, there are noticeable concentrations of lower-income households.
Owner Households with No Vehicles
Significant areas with owner households without vehicles are found in Manhattan.
Renter Households with No Vehicles
Renter households without vehicles are particularly concentrated in parts of Manhattan, Brooklyn, and Queens.
Transit Demand
The map represents areas with a high concentration of populations that are likely to rely more heavily on public transit.
In parts of Brooklyn, the Bronx, and Queens, indicating significant numbers of residents who depend on public transport. Staten Island and certain parts of Queens show lower demand.
Service Area Coverage by Stations
Within a 5-Minute Walking Radius from Each Station
Subway lines in most boroughs, except Staten Island, are covered by 5-minute walking radius areas, showing a high coverage of stations across most parts of NYC.
There are also many areas with overlapping coverage, meaning that people can reach one to two stations within a 5-minute walk. Notably, in Manhattan and the center of Brooklyn have a high degree of overlap.
Within a 10-Minute Walking Radius from Each Station
When the walking radius around each subway station is expanded to 10 minutes, a significantly larger number of blocks along the subway line are covered.
In Staten Island, the subway line's coverage greatly increases, almost encompassing the entire area, except for a notable gap between two stations in the upper Staten Island, meaning that the distance between these two stations exceeds a 20-minute walking time.
Within a 15-Minute Walking Radius from Each Station
When the walking radius around each subway station is expanded to 15 minutes, coverage extends to the entire subway line, encompassing the majority of the New York City area, including the previously identified gap in Staten Island.
Subway Accessibility Mapped by Station Service Area Coverage
Manhattan and central Brooklyn have dense coverage, showing a a robust network of transit stations, while Staten Island remains largely disconnected.
The outer regions of Queens, the Bronx and Brooklyn, as well as Staten Island display sparser coverage, showing the decrease in accessibility as moving away from the city center.
Demographics and Service Area Coverage - Statistics
Total Population
The majority of the population, approximately 3.39 million people, can access subway stations within a 5-minute walk. There is a decreasing trend in the number of people as the walking cutoff time increases. This data highlights that a significant portion of the city's population lives within a 10-minute walk of a station, but, there is still a sizable population beyond a 15-minute walk.
Income Below Poverty Level Percentage
The percentage of individuals living below the poverty level increases dramatically with longer walking times to subway stations, from 17.3% within 5 minutes to an alarming 87.9% for those more than 15 minutes away.
Renter Households with No Vehicles
The number of renter households without vehicles decreases sharply as the walking distance to subway stations increases.
Owner Households with No Vehicles
Owner households without vehicles also show a declining trend with increasing walking distance.
Transit Accessibility - Block-to-Station Travel Times
This map, powered by the Closest Facility Solver, categorizes New York blocks based on travel time to the nearest subway station. The colors indicate accessibility: 0-5 minutes, 5-10 and 10-15 minutes, and over 15 minutes away.
Areas with the best access (0-5 minutes) are primarily around central New York and Brooklyn.
Areas with 5-10 and 10-15 minutes travel time, appear further from the city center.
"Service Area Coverage" vs. "Walking Time to Station Categories"
The two maps illustrate similar findings but emphasize different aspects of subway station accessibility in New York City.
The left map, generated using the Service Area Solver, visually represents the areas that are accessible within predefined walking time intervals from subway stations. It showcases extensive coverage, highlighting the broad zones that can be reached within each set time frame.
On the right, the map created with the Closest Facility Solver focuses on pinpointing the shortest path to the nearest subway station for each block, emphasizing point-to-point connectivity rather than area coverages.
Spatial Autocorrelation (Citywide)
- The Global Moran's I index indicates a very high level high level of clustering within the dataset, suggesting that blocks with similar travel times are geographically close to each other.
- The extremely high z-score and a p-value of 0.000000 confirm that this clustering is statistically significant and highly unlikely to be due to random chance.
Incremental Spatial Autocorrelation (Citywide)
- The consistently high Moran's I values across increasing distances suggest that the travel time to the nearest subway station remains similar even as the range of consideration expands, maintaining strong spatial autocorrelation.
- The lack of peaks suggests that the spatial pattern of travel times is consistent across various scales, without distinct zones of change.
Hot Spots Analysis
Citywide Distribution (Left Map): It masks smaller-scale variations within boroughs, presenting uniform zones of travel time.
Large swathes of Manhattan and central Brooklyn appear consistently cold spots, indicating short travel times across these areas without distinguishing between their different neighborhoods.
Similarly, regions in outer Queens and Staten Island uniformly display hot spots, pointing to uniformly long travel times without further granularity.
Borough-Specific Distribution (Right Map): The borough-specific approach reveals nuanced differences within these same areas. Within each borough, the map differentiates between neighborhoods where travel times might slightly vary.
Manhattan exhibits generally short travel times to subway stations, especially in the central areas. However, residential zones in the Upper East and West Sides and northern neighborhoods have longer travel times.
In Brooklyn, the central and northwest regions benefit from shorter travel times, while the borough's outer neighborhoods face longer commutes.
In the Bronx, the South Bronx enjoys robust subway access, but the northeastern areas experience significantly longer travel times to subway stations.
Western Queens benefits from shorter travel times, while eastern and southern parts of Queens and nearly all of Staten Island, except the North Shore, have limited subway access.
Cluster and Outlier Analysis (LISA)
Citywide Analysis (Left Map): In this macro-level view, it shows that the clustering of travel times to subway stations is uniformly distributed across New York City.
All areas in Manhattan, along with the main parts of Brooklyn and the Bronx, are marked as low-low clusters, with no significant spatial outliers.
In Staten Island and Queens, the maps clearly show two distinct parts: a high-high cluster and a low-low cluster, separated by a clear, non-significant zone. There are no obvious spatial outliers in these boroughs.
Borough-Specific Analysis (Right Map): The borough-specific map focusing on intra-borough variations, allowing for the identification of smaller areas that are outliers or clusters within their respective boroughs.
The northern parts of Manhattan exhibit more variability, with some blocks identified as spatial outliers. These blocks have longer travel times compared to the surrounding areas.
In the Bronx, the map highlights both areas with significantly longer travel times and those with shorter travel times. Many non-significant areas separate these, effectively breaking the borough into multiple distinct clusters.
Workplace Transit Accessibility
Transit access to jobs in NYC
The number of jobs reachable within a 40-minute travel time by transit and walking during morning rush hour (7 - 9 am), at least once (left side), and at least 90% of start times (right side)
Left Map: It measures the maximum potential accessibility, capturing the best-case scenario rather than everyday reliability.
Right Map: It focuses on the reliability of accessing jobs by public transit, representing a more consistent accessibility.
In both maps, Manhattan and central Brooklyn consistently exhibit excellent transit connectivity and higher job accessibility. In contrast, the outer boroughs like Staten Island, along with peripheral areas of Brooklyn, the Bronx, and Queens, show lower job accessibility.
Level of need for more transit access to jobs
Ratio of population to number of jobs accessible at least once is used to identify areas of of the city which have both a high population and also poor access to jobs.
Manhattan and central Brooklyn show a healthier balance between population density and accessible job opportunities
In Staten Island, peripheral parts of Queens, and certain parts of Brooklyn and the Bronx, stand out as regions with significant needs for improved transit access.
Service Improvement Opportunities
Overlaying key indicators, transit demand (Red), population density relative to job access (Blue), and walking time to subway stations (Green).
Due to various considerations, areas where these layers overlap, such as areas with high transit demand and poor job accessibility, or areas with poor accessibility combined with high transit demand or poor job accessibility, can be identified for improvements.
Based on the existing transit lines and stations, interventions could include the expansion of transit routes, the establishment of new stations, or the introduction of rapid transit services to connect these identified areas.
Manhattan
Manhattan has most high-demand areas covered with good job accessibility and within a reasonable walking time (less than 15 minutes).
Brooklyn
High transit demand areas with poor job accessibility are mainly in the southern parts of the borough.
Queens
There are three significant areas with high demand and poor transit access in western Queens, located outside of the 15-minute walking time zone.
Bronx
High demand and poor job accessibility do not significantly overlap in this borough. However, high demand or poor job accessibility areas appear in the outer boroughs.
Staten Island
There are large areas with poor job accessibility in the northern part, and along the South Shore line, these areas require more than 15 minutes to reach the nearest station.
Discussion
Limitations & Future Work
Data Considerations for Transit Demand
The determination of population reliance on public transit as a measure of transit demand could be refined by including a broader range of factors. Current assessments may not fully capture the complexity of transit dependency. Future work should integrate additional variables such as socio-economic status, availability of alternative transportation modes, and temporal variability in transit needs to provide a more comprehensive understanding of transit demand.
Incomplete Street Network Data
The transit accessibility analysis relies on a street network dataset to calculate travel times from each block to the nearest station. However, missing data for some blocks highlights imperfections in the street network, due to the solver's inability to find valid routes. This lack of complete path data can skew accessibility measurements and does not accurately represent areas without direct paths to transit. Future work should focus on enhancing the quality and completeness of street network data to ensure more complete and accurate travel time estimations.
Overlooking Factors in Transit Network Dataset
The current transit network dataset used to establish transit travel time costs does not account for several significant factors that affect total transit time. For instance, it omits the waiting times for transit and the time spent transferring between lines within stations. Future enhancements should include these time components to provide a more accurate representation of accessibility and better reflect the real-world experience of transit users.
Exclusion of Commuters Between NYC and Surrounding Areas
Our Workplace Transit Accessibility study excludes commuters who live in New York City but work outside it, as well as those who live outside New York City but work within it. Consequently, the findings may not fully represent the transit accessibility issues faced by all workers connected to New York City.
Advancements Beyond Previous Research
Previous studies on New York City's transit accessibility have typically focused on broad assessments of transit coverage or isolated analyses of demand within specific neighborhoods. However, this project extends beyond these studies by systematically mapping transit demand gradients, accessibility in terms of travel time to subway stations, and job accessibility across all boroughs. Unlike earlier research that might use static data analysis tools like Excel or basic programming in Python, this GIS analysis integrates spatial data with demographic and transit service information, providing a more comprehensive overview of the transit landscape.
GIS technology allows for a dynamic visualization of data layers, which is crucial for understanding the geographic spread and interaction of variables such as population density, transit facility locations, and employment hubs. This spatial integration is something tools like Excel and non-spatial Python scripts cannot achieve, as they lack the capacity to effectively render and analyze geographic and spatial relationships.
Conclusions
Transit Demand Distribution
The analysis reveals a gradient of transit demand, decreasing from high in city centers to lower in outer areas. Central Manhattan and Brooklyn consistently show the highest demand, with sporadic pockets of high demand also emerging in more remote areas.
Transit Accessibility
The accessibility measured by travel time to the nearest subway stations indicates that while the New York City transit system broadly covers areas from urban cores to the outskirts, significant service gaps still exist. And, although the distribution of access times is relatively uniform across the city, narrowing the focus to each borough reveals outliers where certain blocks experience notably longer access times compared to their neighboring areas.
Job Accessibility
Job accessibility is highly concentrated in Manhattan and central Brooklyn and decreases outward. While most regions maintain a balance between population density and accessible job opportunities, the outer areas of the city show substantial gaps in transit access. These areas are marked for needed enhancements in transit services to improve job connectivity.
Insights and Recommendations to Urban Planning and Policy
Our analysis highlights areas with high demand, poor job accessibility, and overall poor access, emphasizing the need for targeted interventions like extending subway lines, adding new stations, or improving other public transportation options to serve the city's diverse populations more effectively. By visually representing disparities in job accessibility and the locations of transit deserts, particularly in outer boroughs, this analysis provide urban planners and policymakers with actionable insights that can directly shape transit policies and infrastructure development.