openwashdata 101

A starter kit to become a part of the openwashdata community

In this blog post, you will first find ways to access, download, and share our published data. Are you interested to publish your data with us? We talk about the starting steps to launch your open data practices, get your unique ORCID iD, sign up for GitHub, and join the chat on Matrix.
Author
Affiliation

Mian Zhong

Global Health Engineering, ETH Zurich

Published

July 3, 2023

Welcome! We are so happy that you join the openwashdata community!

While our common goal is more accessible data in the WASH sector, we do have different backgrounds and skill-sets. So feel free to skip some parts that are familiar to you. On the other hand, if you find some terms confusing, like “R programming” or “GitHub”, stay tuned by subcribing to our newsletter or get in touch. We will provide more detailed posts about open data practices and opportunities for training in the future.

1. Access and use data

You could get involved with the openwashdata community by using our published data:

  • If you are a researcher, you may analyze and cite our data in your scientific work.
  • If you are a lecturer, consider to integrate our data into your training and course materials.
  • If you do business in the WASH sector, you may refer to some of the updated data for your business cases.

The development team is constantly working on organizing, cleaning, and releasing new datasets. The following sections introduce two ways to use the published data.

1.1 Let’s do R, why not 🥳

If you have worked with R programming1, we are actively developing new R data packages. You could access the data by installing the package in R and start from there. For example, we have released the dataset package wasteskipsblantyre that contains locations of the waste skips in Blantyre, Malawi from the year 2021.

```{r}
# install devtools package that facilitates installation from GitHub
install.packages("devtools")

# Install data package from GitHub
devtools::install_github("openwashdata/wasteskipsblantyre")

# Import data package
library(wasteskipsblantyre)

# Check documentation
?wasteskipsblantyre
```

Now you have successfully loaded the data and can start to explore this dataset with your creativity!

1.2 Nah, just give me the files 😈

For the published datasets, we also provide the csv files (.csv) and the spreadsheets (.xlsx) that you can directly download from the associated website. For example, with the wasteskipsblantyre dataset, you may find the files as shown in the following figure2.

screenshot of downloading spreadsheet data

We are opinionated to discourage this direction because the analysis work with the data might be less reproducible. Nevertheless, for the moment, we still think it’s a good way to make data as accessible as possible.

1.3 Cite openwashdata

All datasets by default are released under CC-BY license. When using openwashdata, consider to cite the data package to acknowledge data collectors and maintainers. Each dataset has its citation information on the documentation website. For example, you may find how to cite package wasteskipsblantyre here.

2. Share your data with openwashdata

You got some amazing data and love to share? Great! Become a contributor by getting started with some tools and submit your data ideas.

You can also always get in touch with us via Email.

2.1 Join platforms for open data practices

We probably all have some social media to share life updates. For an open data community, likewise, we need tools to help us share, track, and communicate progress. Hence, we have picked widely used tools as part of the starter-kit to build openwashdata.

  • ORCID iD: a personal identifier of scholarly contribution, like a business card.
  • GitHub: our digital headquarter of openwashdata datasets.
  • Matrix: an instant message chatroom for the openwashdata community, like Slack or Messenger.

ORCID iD

  1. Go to Register an ORCID iD

    1. If your institute has an affiliation, you could choose “Access through your institution” to register and sign in.

    2. Otherwise, you may opt in th “Google” or “Facebook” sign-in options.

    3. If the above-mentioned options are not for you, click “Register Now” and follow the instructions.

  2. Save your ORCID iD

GitHub

  1. Go to Sign up for Github and follow the instructions to get an account.
  2. Open openwashdata GitHub organization and click “Follow” on the upper right corner to connect with us on GitHub.

Matrix

  1. If you do not have a Matrix account yet, follow these instructions to set up!
  2. Join our chatroom with a direct link to: openwashdata-lobby
  3. Say “hi” in openwashdata-lobby!

2.2 Submit a data idea

Now that you have a GitHub account, you can submit your data idea and collaborate with us by:

  • Log into GitHub
  • Go to our data idea tracker
  • Follow the instructions to submit your idea

We hope this post not only helps you with a smooth start to collaborate with us in the openwashdata community, but also benefit you with long-term open science practice. Join the Matrix chatroom to get support and advice from now on.

Footnotes

  1. “A free software environment for statistical computing and graphics.”↩︎

  2. The figure is a screenshot from wasteskipsblantyre package website and you may also want to check out this awesome example data analysis article.↩︎