ioos/bio_data_guide

Standardizing Marine Biological Data Working Group (SMBD)

Purpose

The purpose of the SMBD is to facilitate a community of practice for aligning marine biological data to Darwin Core for sharing to OBIS. We do this by giving our community members (federal, state, local, tribal, and private data managers, scientists, computer programmers, and everything in between) the tools and knowledge to mobilize marine biological data.

How do we do it?

We host monthly meetings, a Slack space, and this GitHub repository to provide various mechanisms for community members to participate.

The primary focus of the working group is to help you get past any blockers you might be experiencing during the mobilization process. Below is a list of example blockers we've seen already:

  • What does the Darwin Core data model look like?
  • What about metadata?
  • How do I automatically collect scientific names for my species observations?
  • How can I best represent my data in Darwin Core?
  • I need help munging my data using R (or Python)!
  • How do I deal with dates when I only know the year?

Those and many more questions can be answered through this working group!
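As one illustration, the year-only date question above can be sketched in Python. This is a minimal sketch, not the working group's official recipe: it assumes you follow the Darwin Core convention that `eventDate` holds an ISO 8601 date written only to the precision you actually know (so a bare year such as `1971` is acceptable). The helper name `to_event_date` and the sample record are hypothetical and used here only for illustration.

```python
from typing import Optional


def to_event_date(year: int, month: Optional[int] = None, day: Optional[int] = None) -> str:
    """Build an ISO 8601 date string at the precision that is actually known.

    Hypothetical helper: a year-only observation becomes "1971" rather than
    an invented "1971-01-01".
    """
    if day is not None and month is None:
        raise ValueError("a day without a month is ambiguous")
    parts = [f"{year:04d}"]
    if month is not None:
        parts.append(f"{month:02d}")
    if day is not None:
        parts.append(f"{day:02d}")
    return "-".join(parts)


# Made-up example record: only the year of the observation is known.
record = {
    "eventDate": to_event_date(1971),
    "year": 1971,
}
print(record["eventDate"])
```

The design choice here is to avoid padding unknown months or days with placeholder values, since that would claim more precision than the original data supports.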

Who can join?

Anyone!

  • Do you have Taxonomic Occurrence data and want to share it?
  • Have you ever wanted to chat about biological data standards, programming, or biodiversity?

👋 If so: This is the place for you.

📆 How to participate?

We have open monthly meetings every 2nd Wednesday of the month at 16:00 ET to discuss marine biological data issues. Please feel free to join us!

Check out our current contributors:

Made with contrib.rocks.


About this repository

There are multiple resources in this GitHub repository, including:

  • 📓 Living documentation for anyone working with, learning about, or contributing to IOOS's best practices for biological data.
  • 🗄️ Datasets being actively worked on by community members.
  • ♻️ Code and documentation used on other datasets that can be re-used.
  • 🧰 Tools to help you navigate the organizational, technical, and social challenges of publishing data.

❓ Have Questions? ❓

  • See the "issues" tab above to ask questions or discuss with the IOOS biodata community.
  • Also try searching for related issues, whether open or closed (i.e., already answered).

Got Data to Share?

  • 💬 open an issue in the issues tab above and tell us about it.
  • 💾 small datasets can be uploaded into ./datasets/ so we can directly help you align with best practices.
  • 🔗 dataset repositories or other hosted data can be included in the links in the Datasets section below.

Also, check out CONTRIBUTING.md

Our training & workshops

Datasets

The ./datasets/ directory in this repository contains small datasets which meet one of the following criteria:

  • 👷 the community is currently aligning this data
  • 📓 the dataset is retained as an instructive example
  • 🙊 the lazy maintainers of this repo haven't cleaned it out yet

Ideally each dataset should contain a README.md file with details about the data and the ingestion process for this dataset. See more on this in the contribute example applications guidance. A few datasets are highlighted below as especially instructive examples:


The Standardizing Marine Bio Data Guide

See the guide here.

We are documenting, in the form of a 📓 Guide, relevant resources and standards that apply to marine biological datasets. This growing guide is a work in progress, put together by scientists and data managers responsible for transforming their data to meet international standards. The Guide is exported into multiple formats, including PDF and EPUB. Chapters are written in R Markdown files; contributions are welcome!

Technical details of how to work with the book can be found in /building-the-data-guide.md.