Uploading and Downloading Data

This guide explains how to upload datasets to the CEMA Warehouse and how to download them again. The warehouse accepts several file formats and keeps provenance details alongside each upload so that the data stays well documented and easy to find.

Uploads Interface

Open the Upload tab from the navigation bar. You must be logged in.

Any logged-in user can upload datasets. Please fill in all sections carefully, since good provenance makes datasets useful to others.

How to upload

  • Click Upload in the navigation bar.
  • Fill in the dataset details:

| Menu item | What it means |
| --- | --- |
| Display name | A clear, descriptive name (e.g. "STH Prevalence by Subcounty - 2024") |
| Description | What the dataset contains and what it is useful for |
| Keywords | Comma-separated words to help others find it (e.g. "STH, helminth, subcounty, 2024") |
| Category | Select the most appropriate category from the dropdown. A subcategory dropdown will then appear. |
| Tag | Filled in automatically from your subcategory selection |

  • Add the file you want to upload:
    1. Click the dotted upload box, or drag and drop your file onto it.
    2. Accepted tabular formats: CSV, Excel (.xlsx/.xls), Parquet. These are stored in the database and support live preview and API access.
    3. Accepted document formats: PDF, Word (.docx), PowerPoint, ZIP, PNG, JPG. These are stored as files and support download only.
  • Fill in the provenance details (very important!):

| Menu item | What it means |
| --- | --- |
| Data source | The institution that originally collected the data (e.g. "Ministry of Health Kenya"). This is pre-filled based on your category, but you can change it. |
| Source URL | A link to the original data source, if available |
| Processing description | Any cleaning or transformation you applied |
| Code repository URL | A GitHub or GitLab link to your processing scripts, if applicable |
| Steward name and email | Your name and email, so users can contact you with questions |

  • Click Upload dataset.

You will be taken to the dataset's detail page once the upload is complete.
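
Before opening the upload form, it can help to check a file and its details against the lists above. The following is a minimal Python sketch of such a pre-upload check, not part of the warehouse itself. The field keys are hypothetical names that mirror the form labels, the `.pptx` and `.jpeg` extensions are assumptions (the guide names only "PowerPoint" and "JPG"), and treating every field as required except the two marked "if available" or "if applicable" is also an assumption.

```python
from pathlib import Path

# Extensions from the accepted-formats list in this guide.
# ".pptx" and ".jpeg" are assumptions; the guide says only "PowerPoint" and "JPG".
TABULAR_EXTENSIONS = {".csv", ".xlsx", ".xls", ".parquet"}
DOCUMENT_EXTENSIONS = {".pdf", ".docx", ".pptx", ".zip", ".png", ".jpg", ".jpeg"}

# Hypothetical keys mirroring the form labels; not the warehouse's real field names.
FORM_FIELDS = [
    "display_name", "description", "keywords", "category",
    "data_source", "source_url", "processing_description",
    "code_repository_url", "steward_name", "steward_email",
]

# Fields the guide marks "if available" or "if applicable".
OPTIONAL_FIELDS = {"source_url", "code_repository_url"}


def storage_kind(filename):
    """Return 'tabular', 'document', or 'unsupported' based on the file extension."""
    extension = Path(filename).suffix.lower()
    if extension in TABULAR_EXTENSIONS:
        return "tabular"      # stored in the database: live preview and API access
    if extension in DOCUMENT_EXTENSIONS:
        return "document"     # stored as a file: download only
    return "unsupported"


def missing_fields(details):
    """List the non-optional form fields that are absent or blank in `details`."""
    return [
        name for name in FORM_FIELDS
        if name not in OPTIONAL_FIELDS and not str(details.get(name) or "").strip()
    ]
```

Calling `storage_kind("sth_prevalence_2024.csv")` classifies the (hypothetical) file as tabular, and `missing_fields(details)` returns the keys that still need a value before you click Upload dataset.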


Downloading Data

From a dataset detail page

On the right side of any dataset page, you will find three download buttons:

| Format | Best for |
| --- | --- |
| CSV | Excel, Google Sheets, any data analysis tool |
| Excel (.xlsx) | Opening directly in Microsoft Excel |
| Parquet | Python or R users who want a compact, fast-loading format |

Click the button for the format you want. The file will download to your computer's default downloads folder.
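
Once a CSV download is on disk, it can be read with a few lines of Python. This is a minimal sketch using only the standard library. The function name `load_rows` is illustrative, and the file path you pass in depends on where your browser saved the download.

```python
import csv
from pathlib import Path


def load_rows(csv_path):
    """Read a downloaded CSV into a list of dicts keyed by column header."""
    # "utf-8-sig" reads plain UTF-8 and also strips a byte-order mark if one is present.
    with Path(csv_path).open(newline="", encoding="utf-8-sig") as handle:
        return list(csv.DictReader(handle))
```

For example, `load_rows(Path.home() / "Downloads" / "sth_prevalence.csv")` would load a hypothetical file of that name, with every value returned as a string. For the Parquet download, `pandas.read_parquet` is a common choice, provided pandas and a Parquet engine such as pyarrow are installed.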

Downloading a specific version

If you need an older version of a dataset (for reproducibility or comparison), ask an administrator.


Need help? Contact me or consult the documentation sections for detailed guidance on specific tasks.