Chapter 3 Study Area and Data

3.1 Study Area

In this dissertation, the study area covers the Greater London Authority (GLA), which represents a particularly relevant case study for cycling environment assessment due to its ongoing transformation towards a more sustainable travel city, including a growing emphasis on cycling. The Greater London Authority has implemented substantial cycling infrastructure investments through initiatives such as the Cycle Superhighways, Quietways, and Mini-Holland programmes (Transport for London (2018)). Despite these investments, cycling modal share in London remains relatively low at approximately 4.5 % of all trips in 2023 (up from about 2 % in earlier years) (Transport for London (2025)), compared to leading cycling cities such as Copenhagen (29 %) and Amsterdam (38 %) (Aldred, Croft, and Goodman (2019)).

The London cycling landscape presents unique challenges that highlight the importance of comprehensive environmental assessment. Research by Goodman and Aldred (2018) demonstrates significant inequalities in cycling participation across London’s diverse socio-economic geography, with higher cycling rates concentrated in inner London boroughs with better infrastructure provision. Aldred and Crosweller (2015) identified that perceptions of safety, air quality, and route attractiveness significantly influence cycling behavior across London’s varied urban contexts, emphasising the need for multidimensional assessment approaches.

Recent evaluations of London’s cycling interventions have shown mixed results. The Mini-Holland programme, which invested £30 million in cycling infrastructure across three outer London boroughs, showed positive impacts on cycling participation and safety outcomes (Aldred, Croft, and Goodman (2019)). However, the programme also highlighted the importance of community engagement and the complex relationship between infrastructure provision and behavioral change. These findings underscore the necessity for sophisticated evaluation tools that can capture both objective infrastructure quality and subjective environmental factors influencing cycling decisions in diverse urban contexts.

London’s geographic and demographic diversity, combined with its substantial ongoing investment in cycling infrastructure, makes it an ideal testing ground for comprehensive cycling environment assessment methodologies. The city’s extensive open data availability, detailed geographic information systems, and varied cycling infrastructure types provide an optimal context for developing and validating integrated assessment approaches that can inform evidence-based cycling policy and investment decisions.

3.2 Data

This study draws on a series of open and authoritative datasets to construct a multi-dimensional database for assessing cycling conditions in London. The datasets cover four broad domains: general geographic data, structural rideability, environmental perception, and network centrality. Together, they capture both physical constraints and qualitative attributes of the urban environment. All datasets are openly accessible, ensuring transparency and reproducibility. Prior to analysis, spatial harmonisation was carried out by projecting all data to the British National Grid (EPSG:27700) and clipping them to the Greater London boundary.

3.2.1 General Data

General-purpose datasets form the foundation of the research. The road network was extracted from OpenStreetMap (OSM) via Overpass Turbo queries. Filters were applied to retain only roads relevant to cycling, excluding private or non-navigable paths. This cleaned network dataset provides the geometric basis for all subsequent stages of analysis, including the calculation of network centrality and the assignment of slope and environmental attributes. To facilitate aggregation, administrative boundaries were sourced from the ONS Statistical GIS Boundary Files, comprising both Borough-level and Greater London shapefiles.

3.2.2 Data for Structural Rideability

To capture the influence of terrain on cycling, slope data were integrated. These data were obtained from the Defra Data Services Platform at a resolution of 5 × 5 metres. Each OSM-derived road segment was intersected with the slope raster to compute mean gradients. This allows for the quantification of physical impedance to cycling: higher slopes are expected to increase exertion and reduce accessibility, making this dataset critical to the structural dimension of rideability.

3.2.3 Data for Environmental Perception

Cycling experience is also shaped by environmental and perceptual factors. Three distinct datasets were incorporated:

  • Green View Index (GVI): The GVI dataset originates from the Treepedia project (MIT, 2015), which employs Google Street View images (captured around 2015) to estimate the proportion of visible greenery along urban streets. While the dataset does not reflect more recent greening interventions or modifications to the built environment, it remains widely used as a proxy for visual landscape structure. Urban greenery at the street scale typically evolves slowly, especially in mature built-up areas such as London, and therefore the 2015 imagery still provides meaningful insights into the distribution of visual greenery. As such, despite its temporal limitations, GVI is considered a valid and informative indicator of the visual environment for this research.

  • Air Quality (NO\(_2\)): Data were obtained from the London Air Quality Network, which provides long-term modelled and forecasted concentrations of key pollutants. Nitrogen dioxide (NO\(_2\)) was selected as a representative measure due to its close association with traffic-related emissions and industrial activity. This dataset, available at a 20-metre grid resolution and projected over a 25-year horizon, ensures both spatial detail and temporal consistency.

  • Natural Features: Additional environmental attributes were derived from OSM, filtered using specific tags { “leisure”: [“park”], “natural”: [“water”,“wood”,“scrub”], “landuse”: [“grass”,“meadow”] }. This allowed the identification of publicly accessible green and blue spaces, including parks, woodlands, grasslands, and water bodies. Such features are strongly associated with enhanced environmental quality and contribute positively to cycling comfort.

3.2.4 Data for Network Centrality

Finally, the spatial configuration of the road network was modelled to capture structural importance. Using OSMnx, a graph representation of the OSM road network was constructed. Classic measures of network analysis such as betweenness centrality and closeness centrality were then computed. These measures help identify which roads function as critical connectors or bottlenecks, offering valuable insights into where infrastructure upgrades could generate the greatest improvements in overall cycling accessibility.

In summary, this study combines structural, environmental, and network-based datasets to produce a holistic representation of London’s cycling environment. The OSM-derived road network serves as the unifying backbone, onto which slope, greenery, air quality (NO\(_2\)), and centrality indicators are layered. This multi-source framework allows for a nuanced assessment that moves beyond simple distance-based measures, accounting for both the physical feasibility and the perceptual quality of cycling in London.

3.2.5 VData for External Validation

To support external validation of the CECI scores, this study additionally uses data from TfL’s Active Travel Counts Programme (2024 W1, spring). The dataset provides 15-minute manual counts of cycling flows across Central, Inner, Outer London and Cycleways, with site coordinates reported in both WGS84 and British National Grid. Counts were aggregated to site-level average daily volumes for weekdays (07:00–19:00) and subsequently linked to the nearest road segment in the CECI network.

The following table represents the data sources used in this study:

Table 3.1: Datasets and sources
Dataset Description Aggregation Source Year
OSM Road Network Extracted and filtered cycling-permissible roads Line segments OpenStreetMap / Overpass Turbo 2025
Borough / London Shp Administrative boundaries at borough and Greater London scale Polygon ONS Statistical GIS Boundaries 2021
Slope Raster Terrain slope at 5×5 m resolution Raster Defra Data Services Platform 2022
Green View Index (GVI) Percentage of visible greenery from street-level imagery Point / Segment MIT Treepedia Project 2015
NO₂ Concentration Modelled nitrogen dioxide levels 20m Grid Raster London Air Quality Network 2016
Natural Features Parks, woodlands, meadows, water bodies from OSM tags Polygon OpenStreetMap 2025
Road Centrality Data Betweenness and closeness derived from OSM road network Line segments Derived via OSMnx 2025
TfL Active Travel Counts (W1, spring) Manual counts of cycling flows, aggregated to weekday daily volumes Point / Site Transport for London (TfL) 2024

References

Aldred, Rachel, Joseph Croft, and Anna Goodman. 2019. “Impacts of an Active Travel Intervention with a Cycling Focus in a Suburban Context: One-Year Findings from an Evaluation of London’s in-Progress Mini-Hollands Programme.” Transportation Research Part A: Policy and Practice, Walking and Cycling for better Transport, Health and the Environment, 123 (May): 147–69. https://doi.org/10.1016/j.tra.2018.05.018.
Aldred, Rachel, and Sian Crosweller. 2015. “Investigating the Rates and Impacts of Near Misses and Related Incidents Among UK Cyclists.” Journal of Transport & Health 2 (3): 379–93. https://doi.org/10.1016/j.jth.2015.05.006.
Goodman, Anna, and Rachel Aldred. 2018. “Inequalities in Utility and Leisure Cycling in England, and Variation by Local Cycling Prevalence.” Transportation Research Part F: Traffic Psychology and Behaviour 56 (July): 381–91. https://doi.org/10.1016/j.trf.2018.05.001.
Transport for London. 2018. “Cycling Action Plan: Making London the World’s Best Big City for Cycling.” Report. Transport for London. https://www.london.gov.uk/sites/default/files/tr_19_cycling-action-plan.pdf.
———. 2025. “Travel in London 2024 – Active Travel Trends.” Report. Transport for London. https://content.tfl.gov.uk/travel-in-london-2024-active-travel-trends-acc.pdf.