Supplementary Material | ||
---|---|---|
Articles published 2020 or later | ||
file type | n1 | % |
missing | 202 | 51.4 |
docx | 149 | 37.9 |
xlsx | 24 | 6.1 |
13 | 3.3 | |
pptx | 4 | 1.0 |
png | 1 | 0.3 |
1 One article can have multiple files. |
a community effort to bring open data practices to the WASH sector
Global Health Engineering, ETH Zurich
September 12, 2024
Data: R package washopenresearch to be published at https://github.com/openwashdata/washopenresearch
Take-away: Not a single file is in machine-readable, non-proprietary file type format that would qualify for following FAIR principles for data sharing (Wilkinson et al. 2016).
Good practice: CSV file (comma-separated values), including a data dictionary for all variables/columns in the data
Supplementary Material | ||
---|---|---|
Articles published 2020 or later | ||
file type | n1 | % |
missing | 202 | 51.4 |
docx | 149 | 37.9 |
xlsx | 24 | 6.1 |
13 | 3.3 | |
pptx | 4 | 1.0 |
png | 1 | 0.3 |
1 One article can have multiple files. |
Data: R package washopenresearch to be published at https://github.com/openwashdata/washopenresearch
Screenshot from Soeters, Mukheibir, and Willetts (2021)
Screenshot from Soeters, Mukheibir, and Willetts (2021)
An active global community that applies FAIR principles (Wilkinson et al. 2016) to data generated in the greater water, sanitation, and hygiene sector.
Empower WASH professionals to engage with tools and workflows for open data and code.
fsmglobal documenation website by Greene et al. (2023) built with pkgdown R package
Read full proposal for Phase 2 at: openwashdata.org/pages/gallery/proposal-02/
Read full proposal for Phase 2 at: openwashdata.org/pages/gallery/proposal-02/
Read full proposal for Phase 2 at: openwashdata.org/pages/gallery/proposal-02/
Read full proposal for Phase 2 at: openwashdata.org/pages/gallery/proposal-02/
Read full proposal for Phase 2 at: openwashdata.org/pages/gallery/proposal-02/
Read full proposal for Phase 2 at: openwashdata.org/pages/gallery/proposal-02/
Read full proposal for Phase 2 at: openwashdata.org/pages/gallery/proposal-02/
Read full proposal for Phase 2 at: openwashdata.org/pages/gallery/proposal-02/
Read full proposal for Phase 2 at: openwashdata.org/pages/gallery/proposal-02/
Read full proposal for Phase 2 at: openwashdata.org/pages/gallery/proposal-02/
Who has an ORCID iD?
Who has published a scientific article in a journal?
I have:
I don’t have:
Job Description: Open Science Specialist
For 1 minute, think about these two questions and take some notes for later:
How should I be rewarded scientifically?
Which career paths are there for data stewards?
term | explanation | file format |
---|---|---|
unprocessed raw data | data that is not processed and remains in its original form and file type | often XLSX, also CSV and others |
term | explanation | file format |
---|---|---|
unprocessed raw data | data that is not processed and remains in its original form and file type | often XLSX, also CSV and others |
processed analysis-ready data | data that is processed to prepare for an analysis and is exported in its new form as a new file | CSV, R data package |
term | explanation | file format |
---|---|---|
unprocessed raw data | data that is not processed and remains in its original form and file type | often XLSX, also CSV and others |
processed analysis-ready data | data that is processed to prepare for an analysis and is exported in its new form as a new file | CSV, R data package |
final data underlying a publication | data that is the result of an analysis (e.g descriptive statistics or data visualization) and shown in a publication, but then also exported in its new form as a new file | CSV |
Self-nomination for Swiss Reproducibility Award 2024: https://ghe-open.ch/blog/posts/2024-02-27-swissrn-award/
Activity 1.3: Identify how ethical approval for data collection differs for types of organizations (university, NGO) and types of data (quantitative, qualitative).
Activity 1.4: Identify current data management practices and develop a draft data management strategy for organization.
Activity 1.5: Publish at least 10 datasets of two different types that are available to the organization, following openwashdata data publishing workflow.
A the end of the workshop, participants will be able to:
Describe how data published using the washr package follows FAIR principles compared to data shared in an appendix of a PDF or DOCX document.
Follow step by step instruction to create an R data package using the washr package.
Understand the difference between human-readible and machine-readible documentation.
https://buttondown.email/openwashdata
This project was supported by the Open Research Data Program of the ETH Board.
The slides were created via revealjs and Quarto: https://quarto.org/docs/presentations/revealjs/
You can view source code of slides on GitHub
Or you can download slides in PDF format
This material is licensed under Creative Commons Attribution Share Alike 4.0 International.