© OpenStreetMap contributors
- Add External Layer
- Upload Shapefile
- Layer Tutorial
- Close
Use the checkbox () to show or hide a layer.
Use the radio buttons () to select a layer to use with the identify function.
- Layers
- Opacity
Available layers
(ctrl+c)
Search for marine data across UK organisations
- API
- How-To
- About
- Contact MEDIN
- Share
Metadata: A new machine learning approach to seabed biotope classification
Abstract:
Files for use with the R script accompanying the paper Cooper (2020). Note that this script also uses files from `https://doi.org/10.14466/CefasDataHub.34`_ (details provided in script). Cooper, K.M. (2020). A new machine learning approach to seabed biotope classification. Science Advances. .. _`https://doi.org/10.14466/cefasdatahub.34`: https://doi.org/10.14466/CefasDataHub.34
Data holder:
Centre for Environment, Fisheries and Aquaculture Science, Lowestoft Laboratory (CEFAS)
| Other details | ||
| Internal code | Internally assigned metadata identifier | 9656 |
| Title | The title is used to provide a brief and precise description of the dataset such as 'Date', 'Originating organisation/programme', 'Location' and 'Type of survey'. All acronyms and abbreviations should be reproduced in full. | A new machine learning approach to seabed biotope classification |
| File Identifier | The File Identifier is a code, preferably a GUID, that is globally unique and remains with the same metadata record even if the record is edited or transferred between portals or tools. | CEFAS4e148f93-2982-4858-a418-8e7f9ded625d |
| Resource Identifier | This is the code assigned by the data owner. | CEFAS19921 |
| Resource type | The resource type will likely be a dataset but could also be a series (collection of datasets with a common specification) or a service. | dataset |
| Start date | This describes the date the resource starts. This may only be the year if month and day are not known | 1969-03-30 |
| End date | This describes the date the resource ends. This may only be the year if month and day are not known | 2018-01-11 |
| Frequency of updates | This describes the frequency with which the resource is modified or updated i.e. a monitoring programme that samples once per year has a frequency that is described as 'annually'. | notPlanned |
| Abstract | The abstract provides a clear and brief statement of the content of the resource. | Files for use with the R script accompanying the paper Cooper (2020). Note that this script also uses files from `https://doi.org/10.14466/CefasDataHub.34`_ (details provided in script). Cooper, K.M. (2020). A new machine learning approach to seabed biotope classification. Science Advances. .. _`https://doi.org/10.14466/cefasdatahub.34`: https://doi.org/10.14466/CefasDataHub.34 |
| Lineage | Lineage includes the background information, history of the sources of data, data quality statements and methods. | Files include: BiotopePredictionScript.R (R script), EUROPE.shp (European Coastline), EuropeLiteScoWal.shp (European Coastline with UK boundaries), DEFRADEMKC8.shp (Seabed bathymetry), C5922DATASETFAM13022017.csv (Training dataset), PARTC16112018.csv (Test dataset), PARTCAGG16112018.csv (Aggregation data). Description of C5922DATASETFAM13022017.csv: This file is based on the RSMP dataset (see https://www.cefas.co.uk/cefas-data-hub/dois/rsmp-baseline-dataset/), but with macrofaunal data output at the level of family or above. A variety of gear types have been used for sample collection including grabs (0.1m2 Hamon, 0.2m2 Hamon, 0.1m2 Day, 0.1m2 Van Veen and 0.1m2 Smith McIntrye) and cores. Of these various devices, 93% of samples were acquired using either a 0.1m2 Hamon grab or a 0.1m2 Day grab. Sieve sizes used in sample processing include 1mm and 0.5mm, reflecting the conventional preference for 1mm offshore and 0.5mm inshore. Of the samples collected using either a 0.1m2 Hamon grab or a 0.1m2 Day grab, 88% were processed using a 1mm sieve. Taxon names were standardised according to the WoRMS (World Register of Marine Species) list using the Taxon Match Tool (http://www.marinespecies.org/aphia.php?p=match). Of the initial 13,449 taxon names, only 774 remained after correction and aggregation to family level. The final dataset comprises of a single sheet comma-separated values (.csv) file. Colonials accounted for less than 20% of the total number of taxa and, where present, were given a value of 1 in the dataset. This component of the fauna was missing from 325 out of the 777 surveys, reflecting either a true absence, or simply that colonial taxa were ignored by the analyst. Sediment particle size data were provided as percentage weight by sieve mesh size, with the dataset including 99 different sieve sizes. Sediment samples have been processed using sieve, and a combination of sieve and laser diffraction techniques. Key metadata fields include: Sample coordinates (Latitude & Longitude), Survey Name, Gear, Date, Grab Sample Volume (litres) and Water Depth (m). A number of additional explanatory variables are also provided (salinity, temperature, chlorophyll a, Suspended particulate matter, Water depth, Wave Orbital Velocity, Average Current, Bed Stress). In total, the dataset dimensions are 33,198 rows (samples) x 900 columns (variables/factors), yielding a matrix of 29,878,200 individual data values. |
| Related keywords | ||
| Keyword | General subject area(s) associated with the resource, uses multiple controlled vocabularies | Benthos |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Biodiversity | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Ecology | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Invertebrate | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Conservation | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Management | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Monitoring | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Sea bed | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Habitat | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Habitat characterisation | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Habitat extent | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Habitats and biotopes | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Unknown | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | Marine Environmental Data and Information Network | |
| General subject area(s) associated with the resource, uses multiple controlled vocabularies | data.gov.uk | |
| Geographical coverage | ||
| North | The northern-most limit of the data resource in decimal degrees | 52.4595 |
| East | The eastern-most limit of the data resource in decimal degrees | 1.74086 |
| South | The southern-most limit of the data resource in decimal degrees | 52.4581 |
| West | The western-most limit of the data resource in decimal degrees | 1.73881 |
| Responsible organisations | ||
| Role | The point of contact is person or organisation with responsibility for the creation and maintenance of the metadata for the resource. | pointOfContact |
| Organisation name | Centre for Environment, Fisheries and Aquaculture Science, Lowestoft Laboratory (CEFAS) | |
| Delivery point | Cefas Lowestoft Laboratory, Pakefield Road | |
| Postal code | NR33 0HT | |
| City | Lowestoft | |
| Administrative area | Suffolk | |
| Country | UK | |
| data.manager@cefas.co.uk | ||
| Role | The originator is the person or organisation who created, collected or produced the resource. | originator |
| Organisation name | Centre for Environment, Fisheries and Aquaculture Science, Lowestoft Laboratory (CEFAS) | |
| Delivery point | Cefas Lowestoft Laboratory, Pakefield Road | |
| Postal code | NR33 0HT | |
| City | Lowestoft | |
| Administrative area | Suffolk | |
| Country | UK | |
| data.manager@cefas.co.uk | ||
| Role | The custodian is the person or organisation that accepts responsibility for the resource and ensures appropriate care and maintenance. If a dataset has been lodged with a Data Archive Centre for maintenance then this organisation is be entered here. | custodian |
| Organisation name | Centre for Environment, Fisheries and Aquaculture Science, Lowestoft Laboratory (CEFAS) | |
| Delivery point | Cefas Lowestoft Laboratory, Pakefield Road | |
| Postal code | NR33 0HT | |
| City | Lowestoft | |
| Administrative area | Suffolk | |
| Country | UK | |
| data.manager@cefas.co.uk | ||
| Role | The distributor is the person or organisation that distributes the resource. | distributor |
| Organisation name | Centre for Environment, Fisheries and Aquaculture Science, Lowestoft Laboratory (CEFAS) | |
| Delivery point | Cefas Lowestoft Laboratory, Pakefield Road | |
| Postal code | NR33 0HT | |
| City | Lowestoft | |
| Administrative area | Suffolk | |
| Country | UK | |
| data.manager@cefas.co.uk | ||
| Role | The owner is the person or organisation that owns the resource. | owner |
| Organisation name | Department for Environment, Food and Rural Affairs (DEFRA) | |
| defra.helpline@defra.gov.uk | ||
| Resource locators | ||
| Locator URL | Web address (URL) that links to the resource | https://data.cefas.co.uk/view/19921 |
| Locator name | Name of the web resource | Cefas Data Portal |
| Dataset constraints | ||
| 20.1 Limitations on Public Access - Access constraints | This states `otherRestrictions` from ISO vocabulary RestrictionCode and is an INSPIRE/GEMINI requirement. | otherRestrictions |
| 20.2 Limitations on Public Access - Other constraints | noLimitations | |
| 21.1 Conditions for Access and Use - Use constraints | This states `otherRestrictions` from ISO vocabulary RestrictionCode and is an INSPIRE/GEMINI requirement. | otherRestrictions |
| 21.2 Conditions for Access and Use - Other constraints | http://standards.iso.org/iso/19139/resources/gmxCodelists.xml#MD_RestrictionCode | |
| Version info | ||
| Date of publication | The publication date of the resource or if previously unpublished the date that the resource was made publicly available via the MEDIN network. | 2019-07-05 |
| Date of last revision | The most recent date that the resource was revised. | 2024-07-12 |
| Date of creation | The date that the resource was created. | 2019-07-05 |
| Harvest date | The date which this record has been (re)harvested from the provider. | 2026-04-12 |
| Metadata date | The date when the content of this metadata record was last updated. | 2024-07-12 |
| Metadata standard name | The name of the metadata standard used to create this metadata | MEDIN |
| Metadata standard version | The version of the MEDIN Discovery Metadata Standard used to create the metadata record | 3.1.1 |