{"accessLimitation":{"value":"noLimitations","availability":"Available","description":"Access to the resource is not subject to any access limitations and is available to all users without restriction.","uri":"http://vocab.nerc.ac.uk/collection/N07/current/UNRS/"},"authors":[{"honorificPrefix":"Dr","familyName":"Hibdige","givenName":"Sam","organisationName":"Cranfield University","organisationIdentifier":"https://ror.org/05cncd958","role":"author","email":"sam.hibdige@cranfield.ac.uk","fullName":"Hibdige, S."},{"honorificPrefix":"Dr","familyName":"Larionov","givenName":"Alexey","organisationName":"Cranfield University","role":"author","email":"Alexey.Larionov@cranfield.ac.uk","fullName":"Larionov, A."},{"honorificPrefix":"Professor","familyName":"Harris","givenName":"Jim","organisationName":"Cranfield University","role":"author","email":"j.a.harris@cranfield.ac.uk","nameIdentifier":"https://orcid.org/0000-0001-9266-4979","fullName":"Harris, J."},{"honorificPrefix":"Dr","familyName":"Pawlett","givenName":"Mark","organisationName":"Cranfield University","role":"author","email":"m.pawlett@cranfield.ac.uk","nameIdentifier":"https://orcid.org/0000-0001-8060-0345","fullName":"Pawlett, M."}],"availability":"Available","boundingBoxes":[{"westBoundLongitude":-8.648,"eastBoundLongitude":1.768,"southBoundLatitude":49.864,"northBoundLatitude":60.861,"bounds":"{\"type\": \"Feature\",      \"properties\": {},      \"geometry\": {        \"type\": \"Polygon\",        \"coordinates\": [[[-8.648, 49.864], [-8.648, 60.861], [1.768, 60.861], [1.768, 49.864], [-8.648, 49.864]]]      }}","coordinates":"[[[-8.648, 49.864], [-8.648, 60.861], [1.768, 60.861], [1.768, 49.864], [-8.648, 49.864]]]"}],"citation":{"authors":["Hibdige, S.","Larionov, A.","Harris, J.","Pawlett, M."],"bibtex":"https://catalogue.ceh.ac.uk/documents/a21b8ed1-124b-4b2a-adb4-c3fdcc9f95a2/citation?format=bib","day":16,"doi":"10.5285/a21b8ed1-124b-4b2a-adb4-c3fdcc9f95a2","month":3,"publisher":"NERC EDS Environmental Information Data Centre","resourceTypeGeneral":"dataset","ris":"https://catalogue.ceh.ac.uk/documents/a21b8ed1-124b-4b2a-adb4-c3fdcc9f95a2/citation?format=ris","title":"Soil metagenomics data from woodland restoration sites in Central Scotland, 2021 and English Midlands, 2022","url":"https://doi.org/10.5285/a21b8ed1-124b-4b2a-adb4-c3fdcc9f95a2","year":2026},"custodians":[{"organisationName":"NERC EDS Environmental Information Data Centre","organisationIdentifier":"https://ror.org/04xw4m193","role":"custodian","email":"info@eidc.ac.uk"}],"datasetReferenceDate":{"creationDate":"2025-12-01","publicationDate":"2026-03-16"},"description":"This dataset comprises Alpha diversity and functional metrics estimations of soil bacterial and fungal communities and soil chemical properties (pH and electrical conductivity) in samples collected from 60 broadleaved woodland restoration sites from Central Scotland (2021) and English Midlands (2022). A further six wildcard sites made up of ancient woodlands (2 in Scotland and 2 in England) and rewilding sites (2 in England) were visited, and soil samples collected using a 10cm soil corer. Information on site characteristics were collected, including age of the restoration site, former land-use and features of the surrounding landscape. DNA was extracted from the soil and sequenced for metagenomic analysis.","distributionFormats":[{"name":"Comma-separated values (CSV)","type":"text/csv","version":"unknown"}],"distributorContacts":[{"organisationName":"NERC EDS Environmental Information Data Centre","organisationIdentifier":"https://ror.org/04xw4m193","role":"distributor","email":"info@eidc.ac.uk"}],"funding":[{"funderName":"Natural Environment Research Council","funderIdentifier":"https://ror.org/02b5d8509","awardNumber":"NE/V006444/1","orcid":false,"ror":true}],"hasOnlineServiceAgreement":true,"id":"a21b8ed1-124b-4b2a-adb4-c3fdcc9f95a2","incomingCitationCount":0,"infoLinks":[{"url":"https://data-package.ceh.ac.uk/sd/a21b8ed1-124b-4b2a-adb4-c3fdcc9f95a2.zip","name":"Supporting information","description":"Supporting information available to assist in re-use of this dataset","function":"information","type":"OTHER"}],"inspireThemes":[{"theme":"Soil","uri":"http://inspire.ec.europa.eu/theme/so"}],"keywordsOther":[{"value":"soil chemistry","uri":"http://www.eionet.europa.eu/gemet/concept/7849"},{"value":"metagenomics"}],"keywordsTheme":[{"value":"Soil","uri":"http://onto.nerc.ac.uk/CEHMD/topic/17"},{"value":"Biodiversity","uri":"http://onto.nerc.ac.uk/CEHMD/topic/2"}],"licences":[{"value":"This resource is available under the terms of the Open Government Licence","code":"license","uri":"https://eidc.ac.uk/licences/OGL/plain"}],"lineage":"Fieldwork instrumentation used.  Basic field instruments were used to mark plot positions (GPS), set out the plots (20m tape measures), and collect soil samples (soil corer).\nMethods of collection and how data was values were recorded. In each site, five circular plots (radius 10m) were established with their centres randomly decided in advance using ArcGIS. Sampling of our 30 Scottish woodland restorations sites occurred in 2021, for our 30 English woodland restoration sites and our six wildcards, sampling occurred in 2022. Some were subsequently resampled in 2023.  Data processing and lab-based techniques continued into 2024. Soil: Five 10cm soil cores were collected randomly from each plot and stored at 4°C. Soil was sieved to 2mm, samples were split between storage at -20°C for DNA extraction and 4°C. Site characteristics:  Site location was recorded using GPS.  Information on site history (including site age and former land-use) was recorded based on historic maps and information from landowners.  Surrounding land use cover was recorded based on the UCKCEH Landcover Map.\nNature and units of recorded variables and processing steps.  \npH: Soil was air dried and the five samples per site were pooled together. Ultrapure water was added and the samples agitated before values were measured with a pH probe. The pH reading was considered stable when the value did not vary by more than 0.02 units over five seconds. Electrical Conductivity: Soil was air dried and the five samples per site were pooled together. Ultrapure water was added and the samples agitated, centrifuged and filtered. Electrical conductivity was then measured using a probe and recorded in uS. Alpha diversity metrics: DNA was extracted from the soil using laboratory kits and sequenced by a third-party company (Novogene) using a paired-end Illumina platform, (NovaSeq 6000) using well established amplicon regions. The data was processed using QIIME2 to generate feature tables per sample. Taxonomy was assigned to the amplicons using Greengenes 2 (bacteria) and UNITE dynamic (Fungal). Shannon, Faith PD and chao1 were calculated from the QIIME2 feature tables and the assigned taxonomy. Functional metrics (bacteria): PicrusT2 was used to estimate Enzyme Commission numbers (EC), KEGG orthologs (KO) and MEtaCyc pathway predictions were calculated using default parameters using EPA-NG to place sequences into the required reference phylogeny. QIIME2 was then used to calculate Shannon entropy and Chao1 from the pathway abundance feature tables from PicrusT2. Functional metrics (fungi): Funguild was used to estimate guild associations per sample. QIIME2 was then used to calculate Shannon entropy and Chao1 from the guild tables. Former land-use is categorical with 4 levels: Industrial (n=30), Agricultural (n=30), Ancient Woodland (n=4) and Rewilding (n=2).\nQuality control/assessment applied to the data Data was collected in the field using pre-prepared data sheets. Data sheets were checked both visually before digital data entry, and any suspected errors were checked with raw field data sheets.  Extracted DNA was quantified on Nanodrop spectrophotometer to measure ng/μl, 260/280, and 260/230 ratios before being sent for sequencing. The sequencing company (Novogene) provided their own quality control before and after sequencing and any that failed were re-extracted and re-sequenced. The received sequenced reads were examined using FAST-QC and MULTI-QC and trimmed accordingly with cut adapt. The reads were then quality filtered and processed using DADA2. Rarefactions plots were generated to examine feature counts to ensure feature tables were of expected quality.\nLimitations on the data's reliability.  Amplicon sequencing is largely only accurate to genus level and taxonomy assignment is limited by database curation. Functional analysis based on amplicon sequencing is less accurate that shotgun sequencing as functionality is inferred.","metadataDate":"2026-06-29T18:14:47","notGEMINI":false,"onlineResources":[{"url":"https://data-package.ceh.ac.uk/data/a21b8ed1-124b-4b2a-adb4-c3fdcc9f95a2","name":"Download the data","description":"Download a copy of this data","function":"download","type":"OTHER"},{"url":"https://data-package.ceh.ac.uk/sd/a21b8ed1-124b-4b2a-adb4-c3fdcc9f95a2.zip","name":"Supporting information","description":"Supporting information available to assist in re-use of this dataset","function":"information","type":"OTHER"}],"pointsOfContact":[{"displayName":"Samuel Hibdige","organisationName":"Cranfield University","role":"pointOfContact","email":"sam.hibdige@cranfield.ac.uk","fullName":"Samuel Hibdige","pointOfContact":"Cranfield University"}],"publicationDate":"2026-03-16T00:00:00.000Z","publishers":[{"organisationName":"NERC EDS Environmental Information Data Centre","organisationIdentifier":"https://ror.org/04xw4m193","role":"publisher","email":"info@eidc.ac.uk"}],"relationships":[{"relation":"http://purl.org/dc/terms/relation","target":"8c997943-1f90-4897-87b3-491eaef534ec"}],"resourceIdentifiers":[{"code":"https://catalogue.ceh.ac.uk/id/a21b8ed1-124b-4b2a-adb4-c3fdcc9f95a2"},{"code":"10.5285/a21b8ed1-124b-4b2a-adb4-c3fdcc9f95a2","codeSpace":"doi"}],"resourceType":{"value":"dataset"},"responsibleParties":[{"displayName":"Samuel Hibdige","organisationName":"Cranfield University","role":"pointOfContact","email":"sam.hibdige@cranfield.ac.uk","fullName":"Samuel Hibdige","pointOfContact":"Cranfield University"},{"honorificPrefix":"Dr","familyName":"Hibdige","givenName":"Sam","organisationName":"Cranfield University","organisationIdentifier":"https://ror.org/05cncd958","role":"author","email":"sam.hibdige@cranfield.ac.uk","fullName":"Hibdige, S."},{"honorificPrefix":"Dr","familyName":"Larionov","givenName":"Alexey","organisationName":"Cranfield University","role":"author","email":"Alexey.Larionov@cranfield.ac.uk","fullName":"Larionov, A."},{"honorificPrefix":"Professor","familyName":"Harris","givenName":"Jim","organisationName":"Cranfield University","role":"author","email":"j.a.harris@cranfield.ac.uk","nameIdentifier":"https://orcid.org/0000-0001-9266-4979","fullName":"Harris, J."},{"honorificPrefix":"Dr","familyName":"Pawlett","givenName":"Mark","organisationName":"Cranfield University","role":"author","email":"m.pawlett@cranfield.ac.uk","nameIdentifier":"https://orcid.org/0000-0001-8060-0345","fullName":"Pawlett, M."},{"organisationName":"Cranfield University","role":"rightsHolder"},{"organisationName":"NERC EDS Environmental Information Data Centre","organisationIdentifier":"https://ror.org/04xw4m193","role":"publisher","email":"info@eidc.ac.uk"},{"organisationName":"NERC EDS Environmental Information Data Centre","organisationIdentifier":"https://ror.org/04xw4m193","role":"custodian","email":"info@eidc.ac.uk"}],"rightsHolders":[{"organisationName":"Cranfield University","role":"rightsHolder"}],"spatialReferenceSystems":[{"code":"http://www.opengis.net/def/crs/EPSG/0/27700","title":"OSGB 1936 / British National Grid"}],"spatialRepresentationTypes":["tin"],"spatialResolutions":[{"distance":"10","uom":"urn:ogc:def:uom:EPSG::9001"}],"temporalExtents":[{"begin":"2021-01-01","end":"2022-12-31"}],"title":"Soil metagenomics data from woodland restoration sites in Central Scotland, 2021 and English Midlands, 2022","topicCategories":[{"value":"environment","uri":"http://inspire.ec.europa.eu/metadata-codelist/TopicCategory/environment"}],"topics":["http://onto.nerc.ac.uk/CEHMD/topic/17","http://onto.nerc.ac.uk/CEHMD/topic/2"],"type":"dataset","uri":"https://catalogue.ceh.ac.uk/id/a21b8ed1-124b-4b2a-adb4-c3fdcc9f95a2","useConstraints":[{"value":"This resource is available under the terms of the Open Government Licence","code":"license","uri":"https://eidc.ac.uk/licences/OGL/plain"}]}