McCloskey, J. et al
Virtual urban testbed representing a Global South urban setting based on Nairobi, Kenya and Kathmandu, Nepal contexts
Cite this dataset as:
McCloskey, J.; Menteşe, E.Y.; Cremen, G.; Gentile, R.; Galasso, C.; Filippi, M.E.; Jenkins, L.; Creed, M.; Watson, C.S.; Sinclair, H.; Pelling, M. (2025). Virtual urban testbed representing a Global South urban setting based on Nairobi, Kenya and Kathmandu, Nepal contexts. NERC EDS Environmental Information Data Centre. https://doi.org/10.5285/8b5834a5-ae8a-4f24-836c-16fab961aeb3
Download/Access
PLEASE NOTE:
By accessing or using this dataset, you agree to the terms of the relevant licence agreement(s). You will ensure that this dataset is cited in any publication that describes research in which the data have been used.
This dataset is available under the terms of the Open Government Licence
https://doi.org/10.5285/8b5834a5-ae8a-4f24-836c-16fab961aeb3
This dataset contains a digital urban scenario, named Tomorrowville, that is developed as a testbed for multi-hazard risk assessments and to evaluate the performance of urbanisation scenarios. Tomorrowville was created to represent a global-south urban setting by means of its socio-economic and physical aspects. It covers an area of 500ha located south of Kathmandu (Nepal). The dataset consists of 5 different data types:
- Buildings: Data representing the building footprints for today and 50 years from now including specific attributes to be used within multi-hazard risk assessments.
- Land uses: Data representing the land use information for today and 50 years from now.
- Vulnerability: Tabular files that contain vulnerability functions for buildings under earthquake and flood hazards.
- Household: Data that contains social attributes of the Tomorrowville, such as the level of education, age, gender and working status of the individuals and their states in the households.
- Hazards: Data representing the hazards (earthquake (eq), floods (fl) and debris flows (df) that may impact the case study areas of Tomorrowville.
Observational data of the built environment and socio-economical properties of Kathmandu and Nairobi were used in addition to synthetic social data to create the initial scenario. This is a synthetic social dataset, meaning it was derived from existing population projections and distributions for the testbed but does not reflect the reality on the ground. It is synthetically created using specific algorithms in a GIS environment to represent a Global South social context.
For the building data, Open Street Map (OSM) database is used as a basis. The data is scraped from OSM and modified to represent an urban context for Tomorrowville. The attributes are also modified to be able to use in a multi-hazard risk computation. A taxonomy string is generated for each building that represents an acronym for its building code level, number of storeys, occupation type and structural system. The hazards that were existing in the selected spatial extent were earthquake, flood, and debris flow. Hazard data represents an intensity measure for the relevant hazard type (ground acceleration for earthquake, flow velocity for the flood and debris flow hazards). The following hazard input data are included:
- For the flood simulations, the discharge and rainfall time series are generated based on moderate to peak daily data based on recorded data from the Department of Hydrology and Meteorology, Nepal.
- Earthquake hazard sources are generated and simulated by Jenkins et al. (2023).
- For the debris-flow and flood simulations tri-stereo Pleiades satellite imagery is used to produce a 2m resolution Digital Elevation Model.
The work to create this dataset is supported by NERC as part of the GCRF Urban Disaster Risk Hub (NE/S009000/1)
- Buildings: Data representing the building footprints for today and 50 years from now including specific attributes to be used within multi-hazard risk assessments.
- Land uses: Data representing the land use information for today and 50 years from now.
- Vulnerability: Tabular files that contain vulnerability functions for buildings under earthquake and flood hazards.
- Household: Data that contains social attributes of the Tomorrowville, such as the level of education, age, gender and working status of the individuals and their states in the households.
- Hazards: Data representing the hazards (earthquake (eq), floods (fl) and debris flows (df) that may impact the case study areas of Tomorrowville.
Observational data of the built environment and socio-economical properties of Kathmandu and Nairobi were used in addition to synthetic social data to create the initial scenario. This is a synthetic social dataset, meaning it was derived from existing population projections and distributions for the testbed but does not reflect the reality on the ground. It is synthetically created using specific algorithms in a GIS environment to represent a Global South social context.
For the building data, Open Street Map (OSM) database is used as a basis. The data is scraped from OSM and modified to represent an urban context for Tomorrowville. The attributes are also modified to be able to use in a multi-hazard risk computation. A taxonomy string is generated for each building that represents an acronym for its building code level, number of storeys, occupation type and structural system. The hazards that were existing in the selected spatial extent were earthquake, flood, and debris flow. Hazard data represents an intensity measure for the relevant hazard type (ground acceleration for earthquake, flow velocity for the flood and debris flow hazards). The following hazard input data are included:
- For the flood simulations, the discharge and rainfall time series are generated based on moderate to peak daily data based on recorded data from the Department of Hydrology and Meteorology, Nepal.
- Earthquake hazard sources are generated and simulated by Jenkins et al. (2023).
- For the debris-flow and flood simulations tri-stereo Pleiades satellite imagery is used to produce a 2m resolution Digital Elevation Model.
The work to create this dataset is supported by NERC as part of the GCRF Urban Disaster Risk Hub (NE/S009000/1)
Publication date: 2025-04-22
View numbers valid from 22 April 2025 Download numbers valid from 22 April 2025 (information prior to this was not collected)
Formats
Shapefile, Comma-separated values (CSV), TIFF, json
Spatial information
Study area
Spatial representation types
Vector
Raster
Tabular (text)
Raster
Tabular (text)
Spatial reference system
WGS 84
Provenance & quality
As an interdisciplinary process, knowledge and perspective of different disciplines helped us shape the characteristics holistically and enabled us to construct a comprehensive understanding of the testbed. By bringing qualitative and quantitative approaches together, it became possible to integrate both social and physical characteristics with the facilitation of urban planning and GIS disciplines. The development of Tomorrowville started in June 2021. Between June and December 2021, there have been around 30 inter-disciplinary online meetings within our research group regarding the development of Tomorrowville. The team consisted of 2 structural engineers, 4 geoscientists, 4 social scientists and a GIS expert with an urban planning background. Local researchers from Kathmandu and Nairobi had structural engineering and urban planning expertise and they provided insight into our approach which enabled us to be more realistic in terms of the spatial context we represent within Tomorrowville.
The building footprints are derived from some sample data gathered from Open Street Map database and modified to fit the aims of the dataset. The exposure data is synthetic. All flood simulations are generated using Caesar-Lisflood, a landscape evolution model (LEM) that combines the hydrological and surface flow model, Lisflood-FP (Bates et al. 2010), with the CAESAR landscape evolution model (Coulthard et al. 2013). For landslide analysis digital elevation model from tri-stereo Pleiades satellite imagery is benefited. For debris-flow analysis, rainfall-driven LaharFlow dynamic hazard model is used. The vulnerability data are produced based on consultation with local partners and global vulnerability databases such as Joint Research Center’s “Global flood depth-damage functions: Methodology and the database with guidelines”.
The synthetic social data was produced through bespoke algorithms that considered population characteristics and projections relevant for the area. These algorithms are developed to generate a population that would live in future urban scenario within the designed built environment. The algorithms were coded in MATLAB software environment.
The building footprints are derived from some sample data gathered from Open Street Map database and modified to fit the aims of the dataset. The exposure data is synthetic. All flood simulations are generated using Caesar-Lisflood, a landscape evolution model (LEM) that combines the hydrological and surface flow model, Lisflood-FP (Bates et al. 2010), with the CAESAR landscape evolution model (Coulthard et al. 2013). For landslide analysis digital elevation model from tri-stereo Pleiades satellite imagery is benefited. For debris-flow analysis, rainfall-driven LaharFlow dynamic hazard model is used. The vulnerability data are produced based on consultation with local partners and global vulnerability databases such as Joint Research Center’s “Global flood depth-damage functions: Methodology and the database with guidelines”.
The synthetic social data was produced through bespoke algorithms that considered population characteristics and projections relevant for the area. These algorithms are developed to generate a population that would live in future urban scenario within the designed built environment. The algorithms were coded in MATLAB software environment.
Licensing and constraints
This dataset is available under the terms of the Open Government Licence
Cite this dataset as:
McCloskey, J.; Menteşe, E.Y.; Cremen, G.; Gentile, R.; Galasso, C.; Filippi, M.E.; Jenkins, L.; Creed, M.; Watson, C.S.; Sinclair, H.; Pelling, M. (2025). Virtual urban testbed representing a Global South urban setting based on Nairobi, Kenya and Kathmandu, Nepal contexts. NERC EDS Environmental Information Data Centre. https://doi.org/10.5285/8b5834a5-ae8a-4f24-836c-16fab961aeb3
Supplemental information
Gentile, R., Cremen, G., Galasso, C., Jenkins, L. T., Manandhar, V., Menteşe, E. Y., Guragain, R., & McCloskey, J. (2022). Scoring, selecting, and developing physical impact models for multi-hazard risk assessment. In International Journal of Disaster Risk Reduction (Vol. 82, p. 103365). Elsevier BV. https://doi.org/10.1016/j.ijdrr.2022.103365
Jenkins, L. T., Creed, M. J., Tarbali, K., Muthusamy, M., Trogrlić, R. Š., Phillips, J. C., Watson, C. S., Sinclair, H. D., Galasso, C., & McCloskey, J. (2023). Physics-based simulations of multiple natural hazards for risk-sensitive planning and decision making in expanding urban regions. In International Journal of Disaster Risk Reduction (Vol. 84, p. 103338). Elsevier BV. https://doi.org/10.1016/j.ijdrr.2022.103338
Menteşe, E. Y., Cremen, G., Gentile, R., Galasso, C., Filippi, M. E., & McCloskey, J. (2023). Future exposure modelling for risk-informed decision making in urban planning. In International Journal of Disaster Risk Reduction (Vol. 90, p. 103651). Elsevier BV. https://doi.org/10.1016/j.ijdrr.2023.103651
Correspondence/contact details
Menteşe, E.Y.
Anofa Engineering, Planning and Informatics Ltd
Ayazağa, Kemerburgaz Cd. No:7
Istanbul
34396
TÜRKIYE
emin.mentese@anofa.co
Istanbul
34396
TÜRKIYE
Authors
McCloskey, J.
University of Edinburgh
Menteşe, E.Y.
Anofa Engineering, Planning and Informatics Ltd.
Other contacts
Rights holder
University of Edinburgh
Custodian
NERC EDS Environmental Information Data Centre
info@eidc.ac.uk
Publisher
NERC EDS Environmental Information Data Centre
info@eidc.ac.uk
Additional metadata
Keywords
Funding
Natural Environment Research Council Award: NE/S009000/1
Last updated
23 April 2025 12:38