The WHO Global Health Observatory offers datasets on a wide range of health-related indicators, including disease prevalence, mortality rates, healthcare access and more.
The WHO GHO provides a REST API and bulk CSV downloads. Engineers use `requests` against `ghoapi.azureedge.net` with indicator codes and country filters. The `who-gho` Python client simplifies authentication-free API access for health indicator time-series.
WHO GHO data builds authoritative health AI knowledge bases. Index all 1,000+ indicators for a RAG system that answers any global health question with official WHO data. AI health analytics models use GHO time-series to predict country health system performance and identify intervention priorities.
# pip install requests pandas
import requests, pandas as pd
# Life expectancy at birth for all countries (latest)
resp = requests.get("https://ghoapi.azureedge.net/api/WHOSIS_000001",
params={"$filter": "Dim1 eq 'BTSX' and TimeDim eq 2022"})
df = pd.DataFrame(resp.json()["value"])[["SpatialDim", "NumericValue"]]
df.columns = ["country", "life_expectancy"]
print(df.nlargest(10, "life_expectancy"))Official dataset source
More datasets used by Python data engineers.
The World Bank World Development Indicators provides 1,600+ time-series indicators covering poverty, health, education, infrastructure, and environment for 217 countries from 1960 onwards. Used in data engineering for global development dashboards, longitudinal analysis pipelines, and economic research systems in Python.
Access datasets on child well-being, education enrolment, nutrition, immunisation, child mortality, and child protection indicators worldwide from UNICEF. Used in data engineering for humanitarian analytics pipelines, SDG progress tracking, and building global child health indicator dashboards in Python.
New York City's open data portal provides 3,000+ datasets covering taxi trips, 311 complaints, crime statistics, building permits, health inspections, and transit data. Used in urban data engineering pipelines for city analytics, transportation modelling, and building geospatial dashboards in Python.