FAQ
Frequently Asked Questions¶
Q: What is CEMA Warehouse? The CEMA Data Warehouse is a secure, internal website that stores health-related datasets produced by or used by researchers at the Centre for Epidemiological Modelling and Analysis (CEMA) at the University of Nairobi.
Think of it as a shared drive, but smarter. Instead of hunting through emails or shared folders for a spreadsheet, you:
- Search by name or keyword
- Browse datasets by category (e.g. Neglected Tropical Diseases, Demography, Health Systems)
- Preview the data before downloading
- Download in the format you need (CSV, Excel, or Parquet)
- Access data programmatically from Python or R via a web link
- See exactly where the data came from and how it was processed (provenance)
Q: Is the data free to use? Yes, most datasets in the warehouse are freely accessible for research, analysis, and educational purposes. However, for usesr outside CEMA, data can be requested by contacting support.
Q: I registered but cannot log in. What do I do? Your account is pending admin approval. Email anne.kariithi@cema.africa to follow up.
Q: I can see most datasets but not some specific ones. Why? Some datasets are restricted to HBTAP users. If you need access to those datasets, contact an administrator.
Q: Can I edit a dataset I uploaded? You cannot edit the data itself after uploading. To update the data, an administrator needs to upload a new version. You can update the provenance information by contacting an admin. If you are an admin, you can do both yourself.
Q: What is the maximum file size I can upload? The warehouse accepts files up to 1 GB. For larger files, contact the admin team.
Q: My upload failed. What should I do?
Check the error message shown on screen — it will tell you exactly what went wrong (e.g. wrong file format, missing shapefile components, empty file). For shapefile uploads, make sure your ZIP contains all required files (.shp, .shx, .dbf). If the problem persists, contact the admin.
Q: I need data from a specific year but the current version is newer. Can I get the old data? Ask an administrator. All previous versions are preserved and admins can download them. If you are an admin, go to the dataset's Versions tab.
Q: How do I know the data is reliable? Each dataset has a Provenance tab showing where the data came from, how it was processed, who is responsible for it, and a link to any processing code. If you have questions about a specific dataset, contact the Data Steward listed on that dataset's page.
Q: Can I use the API without coding skills? The API is designed for Python and R users. If you just need to download data, use the Download buttons on any dataset page — no coding required.
Q: My API token stopped working. What happened? Your token may have been regenerated (by you or, if you shared it, by someone else). Go to your Profile page to see your current token.
Q: How do I use the API? Our REST API provides programmatic access to datasets. Basic usage:
Python Example:
import pandas as pd
df = pd.read_csv(
"https://warehouse.cema.africa/api/v2/dataset/your-dataset-slug/YOUR_TOKEN?fmt=csv"
)
print(df.head())
R Example:
library(data.table)
dt <- fread(
"https://warehouse.cema.africa/api/v2/dataset/your-dataset-slug/YOUR_TOKEN?fmt=csv"
)
head(dt)
Q: Can I integrate warehouse data into my application? Absolutely! The warehouse is designed for integration:
- REST API: Direct data access for web and mobile applications
- Data Formats: The APIs return CSV data formats.
- No Downloads Required: Reduce data management overhead
Q: How do you handle data privacy?
- No Personal Data: Individual-level data with personal identifiers is not accepted, this can be added as private data from admin side.
- Aggregated Data Only: Focus on population-level and summary statistics data. Upload clean and comprehensible data to other people other than you.
- Source Compliance: Ensure all data complies with original privacy requirements and you should acknowledge the source in the description section.
- Secure Storage: All data stored securely with appropriate access controls either private or public access. It is also guarded by user authentication.
Can't find the answer to your question? Contact.