data and software

datasets

World of International Non-Governmental Organizations (WINGO) Database, hosted on GitHub

The World of International Non-Governmental Organizations (WINGO) database is a comprehensive catalogue of formal international non-governmental organizations (INGOs). Covering more than 2,600 organizations, the dataset is based primarily on the Yearbook of International Organizations, published by the Union of International Associations (UIA). It also contains original data on INGOs’ founding and dissolution dates, headquarters locations, and issue areas.

Accountability in Global Governance (AGG) Database, hosted on GitHub

The Accountability in Global Governance (AGG) database measures the strength of formal accountability mechanisms in major international institutions. The dataset covers 52 institutions between 1960 and 2018 and includes five categories of accountability mechanisms: transparency, evaluation, redress, investigation, and participation. It additionally provides information on institutions’ governance tasks, decision-making procedures, financial resources, and media coverage.

Performance of International Institutions Project (PIIP), hosted on GitHub

The Performance of International Institutions Project (PIIP) is a wide-ranging dataset on the performance of international institutions. It includes performance assessments of 54 major institutions between 2008 and 2018 produced by the governments of Australia, Denmark, the Netherlands, Sweden, and the United Kingdom, as well as by the Multilateral Organisation Performance Assessment Network (MOPAN). The dataset also offers extensive information on institutions’ policy autonomy, governance tasks, and operational partnerships.

Project Performance Database (PPD) 2.1, hosted by AidData

The Project Performance Database (PPD) is the world’s largest dataset of international development projects with quantitative measures of overall project performance. Version 2.1, developed jointly with Dan Honig and Bradley Parks, contains evaluations of more than 20,000 foreign aid projects in 183 countries between 1956 and 2016 by 12 bilateral and multilateral aid agencies.

Other data

Most of the datasets used in my other research are available through the Harvard Dataverse.

software

MIDASverse: a suite of Python and R packages for missing-data analysis using machine learning methods. Developed with Thomas Robinson.

citest — Python (PyPI): tests conditional independence in incomplete regression analysis with machine learning.

citestR — R (CRAN): R equivalent of citest.

MIDASpy — Python (PyPI): accurate and efficient multiple imputation with deep learning.

rMIDAS — R (CRAN): R equivalent of MIDASpy.