For a considerable portion of human history, Africa has been overlooked as a key region for foreign investments due to a few key reasons. The first is that there is a longer time horizon to see one's return on investment, and second, the low access to high-quality data and accurate information. Yet as clearly seen in our graphs below, Africa; specifically Sub-Saharan Africa, has seen a steady increase in its GDP compared to the rest of the world. As the political instability throughout the Sub-Saharan region has begun to subside, many countries in this region have been able to develop and industrialize their economy. Many countries have been able to move away from predominantly agriculture and mining-based economies to manufacturing and service-based economies. Therefore, Sub-Saharan Africa has become more open to more complex and sustainable corporate foreign investments over the last decade and has the potential to shape the global economic landscape as we move forwards and look toward the rest of the 21st century.
Another critical reason why Sub-Saharan Africa is an attractive investment option is due to the overall steady decline in year-over-year GDP growth for many developed and developing countries. A principal cause for this is a demographic shift in such countries. Many of these countries suffer from an aging population which has a detrimental impact on the labor market's growth. Other demographic indicators such as fertility rates, education, and income inequality have caused these countries' economies to stagnate or, in some instances, even decline. Sub-Saharan Africa, on the other hand, has seen a significant improvement in these demographic indicators. Many of these countries have an extremely young population, which will help drive the growth of their labor market. Furthermore, they have seen an improvement in healthcare, education, and many other demographic metrics, allowing them to sustain an expanding economy.
With the rising economic potential in Sub-Saharan Africa, it is imperative that we close the gap in high-quality, accurate socioeconomic data. As stated previously, one of the main detriments to investing in this region is the lack of accurate local information. Many institutions, such as USAID, the UN, and World Bank, track country-level information very closely, but accessing local information within a specific country is extremely limited and cumbersome. Microdata is essential to clearly understand the economic potential of an investment in a country, and without such information can have a detrimental impact on the sustainability of foreign corporate investment. Therefore, we aim to illustrate through our example below how one can potentially aggregate and model historical trends and project future trends using the limited historical data available. Such models can guide policymakers, both national and international, in making key investment decisions of a critical nature that can benefit the region and its countries.
Within Sub-Saharan Africa, Kenya would be a prime location for future long-term foreign corporate investment due to the country's recent changes. The country has developed the right combination of a politically stable environment and government focus on improving key demographics to help improve and diversify its economy.
Kenya has made staggering strides in developing its political environment, especially after passing its new constitution in 2010. Their new constitution enacted a bicameral legislative house, devolved county government, and a constitutionally tenured judiciary and electoral body. This was a crucial step towards moving away from historical corruptive government practices and pushing towards a more decentralized democratic nation.
Furthermore, Kenya has had a significant increase in its younger population over the past decade. One of the country's main objectives is to continue improving its ability to provide better healthcare and education to this younger demographic. This will not only help give the country an extensive and more specialized labor force but also help improve the poverty level seen throughout the country, especially in rural regions.
Even though the region has seen some hardships due to the Ukraine War and the global rise in inflation, it has still been able to limit the negative impact of these externalities compared to many other countries worldwide, as seen in the Chart below. Therefore, Kenya is a significant opportunity to focus on while exploring possible investment opportunities.
Aggregating sub-national historical data in Kenya and other countries in sub-Saharan Africa is challenging due to the numerous governmental and institutional changes that have occurred in the region. Many global non-profit institutions, such as the UN and the World Bank, collect data on vital socioeconomic indicators on a national level. Still, only a few collect this information on a sub-national level. Yet, as a foreign corporate investor, understanding the local socioeconomic landscape is key to making sound investments in sub-Saharan Africa.
A few organizations have closely watched and surveyed Kenya’s progress on a sub-national level, such as DHS and KNBS. Although, there are a few critical issues regarding the data collection process historically and the hurdles faced when attempting to understand the historical trends of the country’s socioeconomic indicators on a sub-national level. Before 2009, actual reported surveyed data was collected on a province level, only starting in 2009 (for KNBS Census), and 2009/2014(DHS) were the county-level data reported. This is a crucial issue since in 2010 passed, a new constitution enacted devolved county governments. The other problem is the inconsistency of the type of data collected. The surveyed data differs from year to year. Finally, many other institutions that do provide historical sub-national information tend to estimate using DHS and the UN Projection model. The key issue with using such models is that many of the data points are health-focused due to the immense amount of global funding provided to Kenya and other similar countries to improve health care in the region. Many of these indicators are not directly interpreted as an indicator of future economic growth, but as you will see below, even such modeled information can indicate the future trend for key economic factors.
We developed custom models to understand the historical county-level trends of some adjusted critical economic indicators.
We've performed linear regression using the KNBS's reported GCP from (2013 - 2020) as a % of Kenya's Total GVA to understand the possible historical trend of a county's GCP.
We've used the reported four-year (2017-2020) average sector-level GCP along with the back-casted regressed GCP share of Kenya's total GDP to understand the historical sector-level trends by county.
Furthermore, we've used the IHME's Modeled Average Years of Education based on DHS surveys by Gender to develop a county-specific autoregressive model to understand historical and future education trends by county.
Finally, we used the UN's subnational custom R simulation to project population, TFR, and Demographic Breakdown to the year 2050. We've used IHME's estimated TFR and Mortality Rate by county (using the 5yr average from DHS) as inputs for the custom simulation. A key aspect to remember is that we could not use the migration parameter because such information is not available on a local-level; therefore, the real modeled value may be slightly higher than the reported value from KNBS.
Given the minimal sources of accurate data currently available for Kenya (e.g. Garissa’s census crisis, high fluctuation of gross county product (GCP) (defined as a proportion of Kenya's GVA attributed to each county) in numerous counties within Kenya, etc…), our team has decided to focus on bridging the gap between sources and accessibility by building a tool that allows users to visualize and further explore key indicators that can provide some color as to what is occurring in Kenya locally.
With this aggregation of data and back-casting of current data, users can understand the future economic growth trends at the county level in Kenya and how key local-social indicators impact economic growth in an emerging country. This tool will provide policymakers and investors a window into the future economic trends of each county and guide their decision-making process while implementing new policies and making potential investments within Kenya.
A key aspect to keep in mind is our ability to measure the accuracy of our data to the actual values reported. As mentioned above, until 2010, Kenya was divided into provinces, and only after the ratification of their new constitution in 2010 has Kenya started monitoring micro-level data by county. Furthermore, they only started reporting some of these socioeconomic county-level indicators in 2013 yearly and have expanded sector-level indicators in 2017. Therefore, we need more references regarding how accurate our models are compared to the actual reported real value results for each specific county historically. To accommodate for these restrictions, we compared our results to the data provided by DHS (since they reported their information at 5-year intervals using geospatial locations), and the average MSE ranged from 0.0038 to 0.34914 depending on the county for ASFR, Years of Education, and Household Size for the years 1998-2014. Using these metrics, we could see that our models provide an accurate representation of the trends based on historically geospatial locations within the newly created county boundaries.
Our tool's key applications are identifying critical trends of a county that can indicate a county's potential for sustainable economic diversification and growth. Some key indications of such growth would be seeing the change in the youth population, an upward trend in education, specifically for women, a steeper downward trend in TFR, even gender breakdown, and a slowly reducing agriculture sector.
Key applications of our tool include identifying key trends in socioeconomic indicators that could potentially identify local areas that would help expand their already diversifying economy. Some key indicators would be seeing the growth in the youth population, trends in education and TFR, gender breakdown, and sector level diversification in the county's economy.
This tab allows the user to explore both reported and modeled data by geolocation of the county on a map. In the top right corner, we have provided the option of adjusting the year of the map you want to see and the type of map you want to see. There are three types of maps: GCP(Modeled), the Gini Index (Reported), and GCP YoY growth (Modeled), and each county is highlighted according to the map selected. By clicking on a specific county within the map, the left-hand panel will present the individual metrics for the particular county compared to the rest of the counties.
This tab provides both reported information from the most recent census data (2019) and our custom models that use data from a variety of sources.
This section provides the reported information demographic and agriculture breakdown by KNBS from the 2019 census for each county.
This section presents the results from the UN’s subnational custom r simulation, mentioned above.
We've provided the simulations projections for population, TFR, and Gender/Age Breakdown to the year 2050 and their corresponding confidence intervals.
From our dashboard, we were able to identify that this county has seen the largest increase in overall GCP (Gross County Product) from 2000 to 2020, even though it is landlocked, as seen in our Interactive Map. Our dashboard was able to highlight the increase in Nyandarua's impact on the increase in Kenya's agricultural GDP, rising from 1% to 4%. This is the epitome of the information that would benefit policymakers and investors to explore further locally and identify the best use of invested funds. Through our preliminary research, we could determine that farmers in the county cite a significant improvement in farming tactics, such as using fertilizer and manure, along with decreasing costs of raw materials provided substantial gains over the recent years. Agriculture service offices have been providing farmers with knowledge on how to improve their farming practices using western research and overall support. However, there are still some constraints on logistics, like poor roads that limit the distribution of goods leading to wasted crops, and more needed investments in upgrading technology that can improve the efficiency of agriculture logistics. What must also be recognized is that the significant improvement in Nyandarua's agriculture based economy can be replicated elsewhere. Thus such information from our dashboard guiding force to help farmers in other counties with limited resources continue to strengthen Kenya's agricultural economy.
Another example is how our dashboard identified the rise in major cities' overall economies by using demographic indicators. For example, our dashboard shows that Kisumu, and other similar major counties, depict how Kenya's fertility rate is falling below the replacement rate and stabilizing between 1.5 and 1.7. Also, education, specifically for women, is generally increasing, signaling a transition to greater industrialization. These demographic trends will lead to a peak in population around 2060.
After further exploration of the results for these indicators from our dashboard, we were able to determine that the increase in educational years, and decrease in fertility is due to Kenya's strong stance on bolstering education, rolling out a free primary education program in 2003, revamping its curriculum in 2017, and increasing the supply of key materials, such as books, to the students in these counties. Providing such resources has allowed cities and counties to maintain their economy but not significantly expand their economy. This could be because resources such as teachers can be scarce, unlike supplies, where large teacher-student ratios are common. Further, focusing on educational personnel and providing incentives for individuals to become teachers through student loan forgiveness programs or scholarships could ameliorate academic personnel constraints and help build a more specialized, younger labor force, improving the county's economy overall.
We would like to thank our Partner Microsoft AI for Good Lab, specifically Lucas Meyer for providing us with this wonderful opportunity. We would also like to thank our Professor, Weiwei Pan, and our Teaching Fellow, Sarah Rathnam, for providing insightful guidance throughout our project.
Aditi Memani
Master of Science (S.M.), Computational Science and Engineering
aditimemani@g.harvard.edu
github: aditimemani-harvard
Geoff Liu
geoffreyliu@fas.harvard.edu
Angel Hsu
Master of Science (S.M.), Data Science
angelyinhuahsu@g.harvard.edu
github: angelhsu
Alex Kashi
Master of Science (S.M.), Computational Science and Engineering
alexkashi@g.harvard.edu
github: alexkashi
Aggregated Data: Population ( Census 2019), Age Demographic ( Census 2019), Gender Demographic (Cenus 2019), GCP (2013-2020), GCP by Industry (2017-2020). Main Cash Crops (Census 2019), Farming Households (Census 2019), Average Household Size (Census 2019),
Source: Wiesmann, U., Kiteme, B., Mwangi, Z. (2019-2021). Socio-Economic Atlas of Kenya: Depicting the National Population Census by County and Sub-Location and Economic Survey. Second, revised edition. KNBS, Nairobi. CETRAD, Nanyuki. CDE, Bern. https://www.knbs.or.ke/
Aggregated Data: Median Number of Education Years, TFR, and Life Expectancy
Source: Global Burden of Disease Collaborative Network. Global Burden of Disease Study 2019 (GBD 2019) Life Expectancy and Healthy Life Expectancy 1990-2019. Seattle, United States of America: Institute for Health Metrics and Evaluation (IHME), 2020.
Aggregated Data: Global, Country, and Country Sector-level GDP and GVA
Source: World Bank, World Development Indicators. (2021).
Aggregated Data: Population, Age Demographic, Gender Demographic
Source: WorldPop (www.worldpop.org - School of Geography and Environmental Science, University of Southampton; Department of Geography and Geosciences, University of Louisville; Departement de Geographie, Universite de Namur) and Center for International Earth Science Information Network (CIESIN), Columbia University (2018). Global High Resolution Population Denominators Project - Funded by The Bill and Melinda Gates Foundation (OPP1134076).
Data Inputs: IHME's county-level TFR & Life Expactancy, World Pop County-level population, UN's Country-level TFR, life expectancy, and migration
Summary: We used BayesTFR and BayesLife to project make and female expected life expectancy at birth and total fertility rate from Ševčíková et. al. (2018) We then used BayesPop with those the projected life expectancy at birth and total fertility rate to forecast future population.
Package Information: Ševčíková H, Raftery AE (2016). “bayesPop: Probabilistic Population Projections.” Journal of Statistical Software, 75(5), 1–29. doi:10.18637/jss.v075.i05.
Aggregated Data: pre-2010 Historical geospatial location data for TFR, province-level location, education, household size, Median Number of Education Years, ASFR
Spatial Data Source: ICF. The DHS Program Spatial Data Repository. Funded by USAID. spatialdata.dhsprogram.com. [October, 2022].
Survey Data Source: Kenya National Bureau of Statistics, Ministry of Health/Kenya, National AIDS Control Council/Kenya, Kenya Medical Research Institute, and National Council for Population and Development/Kenya. (1989-2020). Kenya Demographic and Health Survey (1989-2014). Rockville, MD, USA