Over the past decade, there have been a number of researches regarding road cycling safety such as the prediction of bike-involved collision severity, identification of influencing factors and assessment of cycling safety levels. Many data-driven models have been employed in thes
...
Over the past decade, there have been a number of researches regarding road cycling safety such as the prediction of bike-involved collision severity, identification of influencing factors and assessment of cycling safety levels. Many data-driven models have been employed in these studies, among which the Logistic regression is most commonly used. However, as a parametric model, Logistic regression assumes linear relations of variables making it impossible to capture the complexity of the real world. Therefore, this research will use a non-parametric model called CART (Classification and Regression Tree) that has no assumption of variable distribution and is highly explainable and apply it to objective data including bike-involved collision data and road feature data to rate cycling safety. The model is then applied to an urban area in London, UK. The roads in this area are present in different colors indicating the safety levels.