The validity of an area-based method to estimate the size of hard-to-reach populations using satellite images: The example of fishing populations of Lake Victoria

Emerging Themes in Epidemiology, Volume 15, No. 1, Year 2018

Ukuhunyushwa

Background: Information on the size of populations is crucial for planning of service and resource allocation to communities in need of health interventions. In resource limited settings, reliable census data are often not available. Using publicly available Google Earth Pro and available local household survey data from fishing communities (FC) on Lake Victoria in Uganda, we compared two simple methods (using average population density) and one simple linear regression model to estimate populations of small rural FC in Uganda. We split the dataset into two sections; one to obtain parameters and one to test the validity of the models. Results: Out of 66 FC, we were able to estimate populations for 47. There were 16 FC in the test set. The estimates for total population from all three methods were similar, with errors less than 2.2%. Estimates of individual FC populations were more widely discrepant. Conclusions: In our rural Ugandan setting, it was possible to use a simple area based model to get reasonable estimates of total population. However, there were often large errors in estimates for individual villages. We use data from the fishing communities of Lake Victoria in Uganda. The villages were selected as already having been surveyed in previous research by the research teams from the MRC/UVRI Uganda Research Unit and therefore accurate population data and global positioning system (GPS) location for each village were available. All estimates obtained from the methods described below were compared to these ground survey data. A fishing community (FC) was defined as a residential area in which the majority of the residents rely on Lake Victoria for income generation. Household surveys were conducted in 2012–13 and counted number of households and number of people in each household [12, 13]. All of the villages are fishing communities, with 39 on the mainland and 27 on the islands of Lake Victoria. These communities are characterised by single storey buildings, with the majority used for residential purposes. These communities are hard-to-reach, poorly served by skilled health care providers and have poor access to clean water and sanitation. Health issues include HIV, helminth infection, malaria, and high maternal and newborn morbidity. The populations of these communities are typically very mobile, consisting of transient populations who move between villages and within the wider region and country. Each community was viewed in Google Earth Pro software (GEP) and communities with no central cluster of residential structures were excluded. We also excluded fishing communities for which GPS coordinates did not show up as a village on the available satellite imagery, or where satellite images were unavailable. For each fishing community with satellite imagery available, we used GEP software to assess the area as follows. A member of our team [CG] estimated the perimeter of each community based on where structures were observable, and assessed density as either low or high, based on the space visible between structures on the satellite image (see Fig. 1). Although the perimeter was drawn so as to enclose the majority of structures which naturally formed the community, it was occasionally the case that some structures were excluded. The area enclosed within the perimeter was calculated automatically by the GEP software. We estimate that this process took less than 1 min per FC. Examples of boundaries fitted to the typical satellite images of FC We compared three methods of estimating populations: two using the average density and one using a regression model. The two average density methods calculated the average in different ways: the first used the average of the individual FC densities; the second used the overall population density calculated by summing the population of all FCs and dividing by the total area. We refer to these two methods as AD1 and AD2. The simple linear regression model we used consisted of a constant term and the FC area as the single predictor. The average density methods can be considered as regression models without a constant term; this allows the first two methods to be described as: where Yi is the predicted population for village i, and β is the average population density (however calculated). The regression method can be described as where α* and β* are the regression coefficients representing the intercept and slope respectively. All population estimates are presented rounded to the nearest whole number; when calculating total populations by summing individual populations the original estimates were used. To allow us to test and compare these approaches we randomly split the data into two sets: an index set of 31 FCs which we used to calculate the parameters (average density and regression coefficients) and a test set of 16 FCs which we used to compare the predictions made by these parameters with the values from the earlier surveys. We also calculated the unstratified parameters in the entire dataset of 47 FC’s, as these are the best available estimates from the data we have. We report each of these parameters with a 95% confidence interval (CI), with the exception of the M2 parameter for which a CI cannot be calculated as it is the simple ratio of total population to total area. To calculate the average density of FC for M1 we first calculated the density in each of the 31 index FCs and then used the mean of these figures as the parameter βM1. We then applied this value to each of the 16 test FCs to predict their population, and summed these estimates to give a total population for the test FCs. For M2 we calculated the total population of the 31 index communities and divided by their total area, and again used this parameter βM2 to calculate the populations of the remaining FC. We ran a simple linear regression, using area as a predictor of population, on the 31 index FCs. We took the parameters from this regression (α*, the intercept and β*, the coefficient for area) applied them to the 16 test FCs. We summed these individual estimates to get an estimate for the total population of the 16 test FCs. Note that because the constant is calculated at the village level, it was not possible to apply these parameters to an entire region; they must be applied at the village level. We repeated the above twice: once stratifying on location (island/mainland) and once stratified on assessed density category (low/high). In each case, we used the same original set of index and test FC, to enable comparison between the methods. We then separately calculated parameters in each stratum, and applied them to the test FC according to stratification level. This is equivalent to allowing an interaction between area and the stratification factor in Eqs. 1 and 2; alternatively it can simply be expressed as separate equations with equivalent parameters for each level of the stratification factor. That is, parameters βisland, βmainland, βlow-density, and βhigh-density, and similarly for β*, α and α*. Stata v15.0 was used for population estimation and GEP was used to obtain satellite images and estimate areas.

I-AI Digest

Study Justification:

The study aimed to address the challenge of estimating the size of hard-to-reach populations in resource-limited settings where reliable census data is often unavailable. By using satellite images and local household survey data from fishing communities on Lake Victoria in Uganda, the study aimed to develop a simple and cost-effective method to estimate population sizes. This information is crucial for planning service and resource allocation to communities in need of health interventions.

Highlights:

– The study compared three methods to estimate populations: two using average population density and one using a regression model.
– The estimates for total population from all three methods were similar, with errors less than 2.2%.
– However, there were often large errors in estimates for individual villages.
– The study demonstrated that a simple area-based model can provide reasonable estimates of total population in rural Ugandan fishing communities.

Recommendations for Lay Reader and Policy Maker:

1. Consider implementing the area-based method to estimate population sizes in hard-to-reach communities where reliable census data is unavailable.
2. Recognize that while the method provides reasonable estimates of total population, there may be larger errors in estimating population sizes for individual villages.
3. Explore additional methods or data sources to improve the accuracy of population estimates for individual villages.

Key Role Players:

1. Research teams from the MRC/UVRI Uganda Research Unit: They have expertise in conducting surveys and collecting accurate population data.
2. Local community leaders and representatives: They can provide valuable insights and local knowledge to assist in estimating population sizes.
3. Satellite imagery providers: They play a crucial role in providing access to satellite images for estimating community areas.

Cost Items for Planning Recommendations:

1. Satellite imagery access and processing: Budget for obtaining satellite images and using software tools like Google Earth Pro for estimating community areas.
2. Survey and data collection: Allocate resources for conducting household surveys to gather accurate population data.
3. Research team and staff: Consider the cost of employing researchers and staff members to analyze data and develop population estimation models.
4. Training and capacity building: Provide training and capacity building programs for local community leaders and representatives to enhance their involvement in population estimation efforts.

Amandla Obufakazi

The strength of evidence for this abstract is 7 out of 10.
The evidence in the abstract is moderately strong, but there are some limitations. The study used publicly available Google Earth Pro and local household survey data to estimate populations of fishing communities in Uganda. The results showed that the estimates for total population were similar among the three methods, with errors less than 2.2%. However, there were often large errors in estimates for individual villages. The study also mentioned that accurate population data and GPS locations were available for the selected villages. While this provides some level of validity, it is unclear how representative these villages are of the entire population. To improve the strength of the evidence, future studies could consider using a larger and more diverse sample of fishing communities and validate the estimates against more comprehensive census data. Additionally, conducting sensitivity analyses to assess the robustness of the estimates would be beneficial.

Abstract

We use data from the fishing communities of Lake Victoria in Uganda. The villages were selected as already having been surveyed in previous research by the research teams from the MRC/UVRI Uganda Research Unit and therefore accurate population data and global positioning system (GPS) location for each village were available. All estimates obtained from the methods described below were compared to these ground survey data. A fishing community (FC) was defined as a residential area in which the majority of the residents rely on Lake Victoria for income generation. Household surveys were conducted in 2012–13 and counted number of households and number of people in each household [12, 13]. All of the villages are fishing communities, with 39 on the mainland and 27 on the islands of Lake Victoria. These communities are characterised by single storey buildings, with the majority used for residential purposes. These communities are hard-to-reach, poorly served by skilled health care providers and have poor access to clean water and sanitation. Health issues include HIV, helminth infection, malaria, and high maternal and newborn morbidity. The populations of these communities are typically very mobile, consisting of transient populations who move between villages and within the wider region and country. Each community was viewed in Google Earth Pro software (GEP) and communities with no central cluster of residential structures were excluded. We also excluded fishing communities for which GPS coordinates did not show up as a village on the available satellite imagery, or where satellite images were unavailable. For each fishing community with satellite imagery available, we used GEP software to assess the area as follows. A member of our team [CG] estimated the perimeter of each community based on where structures were observable, and assessed density as either low or high, based on the space visible between structures on the satellite image (see Fig. 1). Although the perimeter was drawn so as to enclose the majority of structures which naturally formed the community, it was occasionally the case that some structures were excluded. The area enclosed within the perimeter was calculated automatically by the GEP software. We estimate that this process took less than 1 min per FC. Examples of boundaries fitted to the typical satellite images of FC We compared three methods of estimating populations: two using the average density and one using a regression model. The two average density methods calculated the average in different ways: the first used the average of the individual FC densities; the second used the overall population density calculated by summing the population of all FCs and dividing by the total area. We refer to these two methods as AD1 and AD2. The simple linear regression model we used consisted of a constant term and the FC area as the single predictor. The average density methods can be considered as regression models without a constant term; this allows the first two methods to be described as: where Yi is the predicted population for village i, and β is the average population density (however calculated). The regression method can be described as where α* and β* are the regression coefficients representing the intercept and slope respectively. All population estimates are presented rounded to the nearest whole number; when calculating total populations by summing individual populations the original estimates were used. To allow us to test and compare these approaches we randomly split the data into two sets: an index set of 31 FCs which we used to calculate the parameters (average density and regression coefficients) and a test set of 16 FCs which we used to compare the predictions made by these parameters with the values from the earlier surveys. We also calculated the unstratified parameters in the entire dataset of 47 FC’s, as these are the best available estimates from the data we have. We report each of these parameters with a 95% confidence interval (CI), with the exception of the M2 parameter for which a CI cannot be calculated as it is the simple ratio of total population to total area. To calculate the average density of FC for M1 we first calculated the density in each of the 31 index FCs and then used the mean of these figures as the parameter βM1. We then applied this value to each of the 16 test FCs to predict their population, and summed these estimates to give a total population for the test FCs. For M2 we calculated the total population of the 31 index communities and divided by their total area, and again used this parameter βM2 to calculate the populations of the remaining FC. We ran a simple linear regression, using area as a predictor of population, on the 31 index FCs. We took the parameters from this regression (α*, the intercept and β*, the coefficient for area) applied them to the 16 test FCs. We summed these individual estimates to get an estimate for the total population of the 16 test FCs. Note that because the constant is calculated at the village level, it was not possible to apply these parameters to an entire region; they must be applied at the village level. We repeated the above twice: once stratifying on location (island/mainland) and once stratified on assessed density category (low/high). In each case, we used the same original set of index and test FC, to enable comparison between the methods. We then separately calculated parameters in each stratum, and applied them to the test FC according to stratification level. This is equivalent to allowing an interaction between area and the stratification factor in Eqs. 1 and 2; alternatively it can simply be expressed as separate equations with equivalent parameters for each level of the stratification factor. That is, parameters βisland, βmainland, βlow-density, and βhigh-density, and similarly for β*, α and α*. Stata v15.0 was used for population estimation and GEP was used to obtain satellite images and estimate areas.

Izinto zokwakha

N/A

Emisha

Based on the provided description, here are some potential innovations that can be used to improve access to maternal health in fishing communities of Lake Victoria in Uganda:

1. Mobile Clinics: Implementing mobile clinics that can travel to fishing communities, providing essential maternal health services such as prenatal care, postnatal care, and family planning. These clinics can be equipped with necessary medical equipment and staffed with healthcare professionals.

2. Telemedicine: Utilizing telemedicine technology to connect healthcare providers with pregnant women in fishing communities. This would allow for remote consultations, monitoring, and guidance, reducing the need for women to travel long distances for healthcare services.

3. Community Health Workers: Training and deploying community health workers within fishing communities to provide basic maternal health education, screenings, and referrals. These workers can act as a bridge between the community and formal healthcare systems.

4. Water and Sanitation Improvements: Implementing initiatives to improve access to clean water and sanitation facilities within fishing communities. This can help reduce the risk of infections and improve overall maternal health outcomes.

5. Transportation Solutions: Developing transportation solutions specifically tailored to the needs of pregnant women in fishing communities. This can include providing affordable and reliable transportation options to healthcare facilities for prenatal visits, delivery, and postnatal care.

6. Health Education Programs: Implementing comprehensive health education programs within fishing communities to raise awareness about maternal health, family planning, and the importance of seeking timely healthcare services.

7. Partnerships with NGOs and Local Organizations: Collaborating with non-governmental organizations (NGOs) and local organizations to provide resources, funding, and expertise to improve access to maternal health services in fishing communities.

8. Data Collection and Monitoring: Establishing a robust data collection and monitoring system to track maternal health indicators and identify areas for improvement. This can help inform targeted interventions and measure the impact of implemented innovations.

It is important to note that the feasibility and effectiveness of these innovations would need to be assessed through further research and pilot programs in the specific context of fishing communities in Lake Victoria, Uganda.

AI Innovations Description

The recommendation to improve access to maternal health based on the provided information is to use satellite images and area-based methods to estimate the size of hard-to-reach populations in fishing communities of Lake Victoria in Uganda. This method can help in planning for service and resource allocation to these communities in need of health interventions. By using publicly available Google Earth Pro and local household survey data, it is possible to estimate the populations of small rural fishing communities. However, it is important to note that there may be large errors in estimating populations for individual villages. This approach can be used to identify and target areas with poor access to maternal health services and allocate resources accordingly.

AI Innovations Methodology

Based on the provided description, here are some potential recommendations for improving access to maternal health in fishing communities of Lake Victoria in Uganda:

1. Mobile Clinics: Implement mobile clinics that can travel to different fishing communities, providing essential maternal health services such as prenatal care, antenatal check-ups, and delivery assistance. These clinics can be equipped with skilled healthcare providers, necessary medical equipment, and supplies.

2. Community Health Workers: Train and deploy community health workers within the fishing communities. These workers can provide basic maternal health education, conduct regular check-ups, and refer pregnant women to appropriate healthcare facilities when needed.

3. Telemedicine: Establish telemedicine services that allow pregnant women in fishing communities to consult with healthcare professionals remotely. This can help overcome the challenge of limited access to skilled healthcare providers in these remote areas.

4. Health Education Programs: Develop and implement health education programs specifically focused on maternal health. These programs can raise awareness about the importance of prenatal care, safe delivery practices, and postnatal care, empowering women to make informed decisions regarding their health.

To simulate the impact of these recommendations on improving access to maternal health, a methodology could be developed as follows:

1. Baseline Data Collection: Collect baseline data on the current state of maternal health access in the fishing communities. This can include information on the number of pregnant women, availability of healthcare facilities, distance to the nearest facility, and utilization of maternal health services.

2. Define Metrics: Determine specific metrics to measure the impact of the recommendations, such as the number of pregnant women receiving prenatal care, the number of safe deliveries, or the reduction in maternal mortality rates.

3. Intervention Implementation: Implement the recommended interventions, such as mobile clinics, community health worker programs, telemedicine services, and health education programs.

4. Data Collection: Continuously collect data on the implementation of the interventions, including the number of pregnant women reached, services provided, and any challenges or barriers encountered.

5. Impact Evaluation: Analyze the collected data to assess the impact of the interventions on improving access to maternal health. Compare the baseline data with the post-intervention data to identify any changes or improvements in the defined metrics.

6. Adjust and Refine: Based on the evaluation results, make any necessary adjustments or refinements to the interventions to further improve access to maternal health.

7. Continuous Monitoring: Continuously monitor the implementation and impact of the interventions to ensure sustained improvements in access to maternal health in the fishing communities.

By following this methodology, it would be possible to simulate and evaluate the impact of the recommended innovations on improving access to maternal health in the fishing communities of Lake Victoria in Uganda.

Statistics:

Citations: 2

Authors: 26

Identifiers:

DOI: 10.1186/s12982-018-0079-5

Research Areas:

Community Interventions, Environmental, Health System and Policy, Infectious Diseases, Maternal and Child Health, Quality of Care, Social Determinants

Study Countries:

Uganda

Study Design:

Cross Sectional Study

Study Approach:

Quantitative

Yabelana ngalokhu: