A new study has found that as much as 80 percent of the raw scientific data collected by researchers in the early 1990s is gone forever, mostly because no one knows where to find it.
Disappearing Data
Human Error
The Solution
Research Data Management
Managing the way data is collected, processed, analyzed, preserved, and published for greater reuse by the community and the original researcher.
What is Data?
"the recorded factual material commonly accepted in the scientific community as necessary to validate research findings." -Federal Office of Management & Budget Circular A-110
Federal Regulations
High-Level View of RDM
Data Type
Group Roles
Data Storage
Data Archiving
format of data to be generated
who is primarily responsible for carrying out RDM? Set group norms
where will you store your data and how will you backup your data?
how will you preserve and make your data available to others?
Documentation
Documentation with the Open Science Framework
Wiki: document your lab procedures, standards, etc.
Collaborators: add collaborators of all levels, on different parts of your project
Components: sub-projects to organize your research
Version Control: upload files of the same name & OSF will track your versions!
Add-Ons: use OSF to bring together tools you use | GitHub
Registrations: when you have an unchanging version of your project, register it & get a DOI!
Disturbing (add random values to encoded value, retaining integrity of statistical accuracy)
Long Term Storage
Choose what you want to preserve/get to in the long term, but No matter WHAT, make sure you keep:
documentation (lab/field notebooks, etc.)
tools & analysis
Put your data into an archival format!
this should be open + accessible
Software agnostic
Archival Storage in Repositories
When you publish, you should make the underlying data available in a repository that issues DOIs! You then link that DOI in your "Supplementary Materials" section!
This means that anyone who wants to use your data must go to this repository, download it, and cite their use if they publish using it!
Collaborators of all levels, on different parts of your project!
Components: sub-projects to organize your research!
Add-Ons: use OSF to bring together tools you use!
3. Know What Data Management Funders Want
Applying Best Practices
Data Management Plans
a document that describes how you will collect, organise, manage, store, secure, backup, preserve, and share your data.
From NSF’s Data Management Plan Guidelines:
the types of data, samples, physical collections, software, curriculum materials, and other materials to be produced in the course of the project;
the standards to be used for data and metadata format and content (where existing standards are absent or deemed inadequate, this should be documented along with any proposed solutions or remedies);
policies for access and sharing including provisions for appropriate protection of privacy, confidentiality, security, intellectual property, or other rights or requirements;
policies and provisions for re-use, re-distribution, and the production of derivatives;
plans for archiving data, samples, and other research products, and for preservation of access to them.