|
|
||||||||
| ABSTRACT |
|
|
|---|
28 x 16 km) study area. The multivariate logistic regression models showed strong positive associations of infection at baseline (baseline prevalence = 83.2%) with settlement density (odds ratio [OR] = 1.24, 95% confidence interval [CI] = 1.101.38) and vegetation density (OR = 1.66, 95% CI = 1.252.22) and a strong negative association with the clay content of the soil (OR = 0.67, 95% CI = 0.620.73). Similar but weaker correlations were found after re-infection. Socioeconomic status and behavior did not seem to confound these associations. Spatial analysis of the model residuals suggested that because the models accounted for most of the spatial pattern, the model standard errors were not affected by spatial clustering. Our study shows that the pattern of hookworm infection is strongly influenced by several environmental factors. The GIS-aided prediction of areas in need of treatment is therefore a promising tool to guide control efforts when epidemiologic information is insufficient. | INTRODUCTION |
|
|
|---|
Geographic information systems (GISs) have gained importance for research into and prediction of the occurrence of parasitic diseases during the last two decades.2 The rationale behind this is that parasitic diseases where the parasite life-cycle either entails a free-living stage or an animal intermediate host are more sensitive to environmental influences than diseases with direct person to person transmission. A GIS provides a convenient way to integrate large spatially referenced datasets concerning various aspects of geographic entities into one spatial database that can be used for research into disease etiology and for disease prediction. This integration of datasets is particularly important since it avoids the need for lengthy and costly data collection while providing a means for spatial prediction that can direct control efforts to the worst affected areas.3
A GIS has been used to study and predict the occurrence of malaria, trypanosomiasis, schistosomiasis, and tick-borne and numerous other parasitic diseases.410 However, there are only few published reports where a GIS is used to investigate the association of environmental conditions with intestinal helminth infections.11
It is generally believed, that hookworms thrive in areas with sandy soils.1113 However, evidence in support of this hypothesis either originates from experiments that were concerned with larval survival and did not examine transmission,14,15 or from ecologic studies16 that were unable to account for confounders,1720 such as climatic, socioeconomic, and other factors.
The objective of this study was to explore associations of hookworm infection with environmental factors to further investigate the environmental epidemiology of this helminthiasis and to identify possible disease predictors. More specifically, we tried to establish whether hookworm infection is actually influenced by the sand content of the soil. The small spatial scale of the study allowed for collection of socioeconomic and behavioral data at an individual level and avoided bias caused by large-scale climatic differences. This enabled us to effectively control for confounders.
| MATERIALS AND METHODS |
|
|
|---|
|
|
|
Specimens were kept cool prior to preparation of the slides. Duplicate 50-mg Kato-Katz cellophane thick smears were prepared from every fecal sample and were examined for hookworm eggs within one hour after preparation.24 Diarrheal specimens and slides that were too dark were not examined; instead the respective pupils were asked to provide another specimen. Infection intensities are expressed as eggs per gram (epg) of feces. For individuals, these were calculated as the arithmetic mean of the number of eggs per thick smear multiplied by 20. The geometric mean of the individual values was used to characterize infection intensity on a population level. Children who still passed hookworm eggs after the two rounds of treatment were not included in the analyses of reinfection.
Ethical clearance was obtained from the Ethics Committee of the Faculty of Medicine of the University of Natal/Durban and the study was also reviewed and approved by the Central Medical Ethics Committee in Denmark. Before the onset of the study, information meetings were held with the staff and parents of the schools in the study. At these meetings, informed consent was obtained from the parents. Informed consent from the children was obtained directly before the first specimen collection.
Socioeconomic, behavioral, and environmental data. Interviews with the children were conducted during June and July 1999, immediately after the last parasitologic survey. The interviewers were not aware of the actual or former infection status of the children. During the interviews, children were asked about behavior and knowledge relevant to worm infections and about markers of socioeconomic status.
Environmental data were obtained from various sources: Information on population density (persons/km2) was obtained from the 1996 population census and settlement density information (as a map of homesteads in the area) was obtained from the South African Medical Research Council GIS malaria database for the area.10,25 Because information about the sand content of the soil or its porosity was difficult to obtain, we used the soil clay content as a reverse proxy. This was estimated using a land type map of the area and the accompanying key that provide detailed information on soil composition.26 Elevation, slope, and exposure to the sun were calculated using elevation contours from electronic versions of 1:50.000 topographic maps of the area and the respective surface analysis components of Idrisi32 GIS software.27,28
Vegetation density was characterized using data for the normalized difference vegetation index (NDVI) with a spatial resolution of 1 km. These data are distributed at no cost to the user by the Land Processes Distributed Active Archive Center (LP DAAC), located at the U.S. Geological Survey EROS Data Center (Sioux Falls, SD) (http://LPDAAC.usgs.gov).29 It is obvious that data recorded closer to and during the study period might have better reflected the actual vegetation conditions. However, the alternatives would have been to either use 8-km resolution data, which seem too coarse for an area of approximately 28 x 16 km, or to buy higher resolution data for the respective period. Since we were concerned that our results should be of direct relevance to future studies, we decided to use the 1-km data because they are affordable, relatively easy to use, and available with worldwide coverage.
The basis for relating these data to individuals were the homestead positions of the children that we had previously determined using a global positioning system (GPS) receiver (GPS 12; Garmin, Olathe, KS). The environmental data were averaged for an area within a 1-km radius around the home of each child. This radius was chosen since a buffer of this size should represent the environmental conditions around the home relatively well.
Data analysis. Data were double-entered and corrected for data entry errors. Statistical analysis was carried out in Stata 7 for Windows (Stata Corp., College Station, TX) and Microsoft (Redmond, WA) Excel® was used to create the related figures. Prevalence maps were calculated and drawn using ArcView 3.2.30,31
Exploratory analysis showed that infection was clustered on several levels: children from the same home, same school and the same part of the study area (east or west of the Pongola River) were likely to be more similar to each other regarding hookworm infection when compared with children from elsewhere. In addition, we observed limited positive spatial autocorrelation, meaning that children who were living close to each other were also more similar with regard to hookworm infection.32
Clustering and spatial correlation of responses invalidates the statistical assumption of independence of observations, resulting in an under-estimation of standard errors and thus exaggerated statistical significance.3335 Several techniques have been proposed to overcome the problems of hierarchical clustering and spatial autocorrelation, but there is no single accepted way to do this for non-Gaussian data.3638
To overcome the problem of clustering within homesteads (several participants shared their homes with one, two, or three other participants) dependent and independent variables were averaged for each home, and the homestead was made the unit of analysis. This reduced the effective sample size from 825 pupils to 716 participant homes for the baseline data and from 753 to 666 for the re-infection data. Binary disease status of the homestead for logistic regression purposes was coded as 0 if only uninfected participants lived there, and as 1 if at least one of the participating children was infected. Clustering on other levels was addressed during analysis.
Model identification. Data for hookworm infection at baseline and after re-infection were analyzed separately. Explanatory variables for both outcomes were selected systematically in three consecutive steps: first socioeconomic factors that were associated with hookworm infection and were therefore regarded as possible confounders were identified. Similarly, behavioral factors and in a third step environmental factors that were associated with hookworm infection were selected.
Stepwise forward logistic regression was used to select the best model in each case. P values for inclusion and exclusion of variables during the stepwise procedure were set to 0.10 and 0.11 respectively. Age, sex, and residence in the eastern or western part of the study area were retained in all models during the model identification process. Robust (i.e., "Huber-White" or "Sandwich") estimates of the standard errors were calculated to allow for clustering of responses within the same school.39
Resulting models. Single variable logistic regressions were carried out to assess unadjusted correlations of the identified predictor variables with the disease outcomes. Because we wanted to determine whether the results of the multivariate models were sensitive to the technique used to adjust for clustering of responses, we used two alternative methods to model within-cluster correlation in addition to the above described ordinary logistic model: 1) Generalized estimating equations (GEEs) logistic models with exchangeable correlation structure and with school as the unit of clustering;40 and 2) negative binomial regression modeling with the untransformed egg count data (epg) as the outcome that, like the logistic models, also used robust standard errors to adjust for clustering within schools. Effect estimates of the negative binomial models were labeled egg count ratios by analogy to incidence rate ratios.
Spatial statistics. Spatial autocorrelation was assessed using two different approaches. Semi-variograms were drawn using GS+ software for the calculations and Microsoft Excel® 97 to produce the graphs. Because the semi-variogram approach does not provide estimates of statistical significance Morans I and the related P values were calculated using the Stata 7 module spatgsa.41,42
| RESULTS |
|
|
|---|
Figure 2
demonstrates spatial clustering of infection in the study area. Apart from the generally focal distribution, it shows that hookworm infection was much more prevalent in the eastern part of the study area than in the western part. Prevalence in the two areas was 94.3% versus 76.6% at baseline and 59.6% versus 20.8% after re-infection.
|
|
The results of the different multivariate models agree relatively well with each other, but there are some differences when comparing them with the unadjusted results. Most obvious are these differences for the predictors settlement density (at baseline) and population density (after re-infection), where analysis showed that the unadjusted associations were confounded by vegetation density and clay content (at baseline) and by residence in the eastern or western part of the study area (after re-infection).
When comparing the ordinary and the GEE logistic model with the negative binomial model, it should be kept in mind that the former two report odds ratios, whereas the effect estimates of the negative binomial model are closely related to incidence rate ratios. In a high prevalence situation, they should therefore be expected to differ numerically.43
The semi-variogram (Figure 3
) shows how similar or dissimilar observations are at different separation distances and thus characterizes spatial autocorrelation in the data. Every point in the graph represents the semi-variance of all possible pairs of observations that are separated by the same distance. To give an example: The first diamond from the left in the baseline variogram characterizes the variability of hookworm infection status within all pairs of children that lived between > 0 and 250 meters from each other, the next diamond stands for those who lived between 250 and 500 meters apart, etc. Values less than one indicate that the variability at this distance is lower than the overall variability and thus signify positive spatial autocorrelation whereas values above unity indicate the opposite.
|
When comparing semi-variograms of the raw data with those of the respective deviance residuals, it is obvious that the latter were much less autocorrelated. This indicates that the independent variables included in the model explained most of the spatial correlation in infection status.
Table 4
is in agreement with this. It shows that spatial autocorrelation as indicated by Morans I (which ranges from +1 = strong positive autocorrelation to 1 = strong negative autocorrelation) was significant in the raw data, but greatly reduced in the model residuals. Nevertheless, the baseline data still display some significant residual autocorrelation for children living within 1,000 meters from each other, indicating unexplained residual correlation in infection rates.
|
| DISCUSSION |
|
|
|---|
The clay content was included as an inverse proxy for soil porosity because data on particle size or sand content were not available. Soil texture is believed to be important because porous soils are well aerated and allow for vertical migration of the larvae, enabling them to actively avoid desiccation and solar radiation by moving deeper into the soil, and to avoid rising water levels by moving upwards. In contrast, clay soil seems to inhibit larval migration.13,15,4447 In accordance with this reasoning, a positive association of sandy soils with hookworm infection has been demonstrated in many studies.1719,4850
The association of NDVI with hookworm infection that we found is also plausible. Soil moisture is an essential prerequisite for the development and infectivity of hookworm larvae and it is also an important determinant of plant cover. Plant cover provides shade and thus prevents desiccation of the top soil and shelters surface dwelling hookworm larvae from ultraviolet radiation. Associations of soil moisture and exposure to the sun with larval survival and infectivity have been demonstrated in numerous studies.15,44,45,47,5153 However, the only article we know of that demonstrates an association of NDVI with the occurrence of geohelminth infection does not consider hookworm.54 Our study is therefore unique in demonstrating an association between hookworm and NDVI specifically.
The relationship of population density with hookworm infection seems also sensible because if sanitary facilities are lacking, fecal contamination of the soil increases with population, thus leading to elevated levels of transmission in disease-endemic areas.55
The lack of residual autocorrelation demonstrated in Figure 3
and Table 4
shows that our models accounted for the autocorrelation in the raw infection data. This is very important since it implies that the model standard errors are not underestimated due to the presence of spatial autocorrelation and therefore the calculated significance estimates are valid.56
As for the suspected socioeconomic and behavioral confounding, it may be worth noting that effect estimates and standard errors from models that did not include these factors but only contained the environmental covariates (not shown) were very similar to those of the full models shown in Table 3
. This indicates that in the study area with its relatively homogenous population structure, socioeconomic and behavioral confounding seems to have been negligible. However, we believe that this conclusion cannot be extended to larger-scale studies with more diverse populations and locally clustered conditions.
It may be seen as a limitation of this study that information on socioeconomic status was obtained from the children and not from their parents. However, because most of our questions were concerned with relatively practical things (absence or presence of certain items in the household, number of cattle, literacy of mother and father, etc.) and thus quite straightforward, we believe that the information should nevertheless be valid. The only more complicated questions were concerned with the parental employment status. This information was collected by asking the child two questions about each parent: whether the mother or father was working, and also what the parent was doing to earn money. The information from the second question was used to validate the answer from the first question. In most cases, however, the two answers were in agreement.
Our study shows that soil composition, vegetation, and population density can strongly influence prevalence and intensity of hookworm infection and that they should therefore be considered when trying to develop predictive models and intervention strategies if there is insufficient epidemiologic evidence.
The NDVI and population density are especially promising for prediction because relatively detailed information with wide coverage for both factors is available in the public domain.57,58 Soil composition is more problematic because detailed information is lacking for most developing countries where hookworm is most prevalent. Furthermore, comparisons of this measure between different countries pose problems because differences in survey methods, soil classification systems, etc. complicate a unified approach. However, with the advent of new remote sensing tools, this situation might soon change.59,60
The inclusion of socioeconomic factors that we believe are important determinants of large-scale variation is an even bigger challenge: detailed information is lacking, and the available information would be hard to generalize over country and cultural boundaries.
Our study did not assess the influence of climatic factors because we assumed that their variation in the study area would be small and because climatic data with the necessary high spatial resolution were not available. However, there is overwhelming evidence that rainfall and temperature are important determinants of the larger-scale geographic pattern of hookworm infection.53,6166
Although elevation or altitude, which is closely related to the climatic factors, did not seem to play an important role in our small study area, it has been shown to be associated with the occurrence of hookworm infection in other studies on larger scales.50,67,68 Like the NDVI and climate data, elevation data are available in the public domain at adequate resolution.69,70
In accordance with our expectations, this study shows that the pattern of hookworm infection is strongly influenced by several environmental factors. The GIS-aided prediction of areas in need of treatment is therefore a promising tool to guide control efforts when epidemiologic information is insufficient.
Received May 27, 2004. Accepted for publication November 4, 2004.
Acknowledgments: We thank the pupils, staff, and parents of the participating schools for their cooperation and their patience, and the KwaZulu-Natal Department of Health for providing treatment, laboratory space, and general assistance.
Financial support: This study was supported by the Danish Bilharziasis Laboratory. Elmar Saathoff was supported by a PhD scholarship from Evangelisches Studienwerk Villigst/Germany.
Authors addresses: Elmar Saathoff, Harvard School of Public Health, 665 Huntington Avenue, Boston MA 02115, E-mail: elmarsaathoff{at}compuserve.de. Annette Olsen, Danish Bilharziasis Laboratory, Jaegersborg Allé 1 D, DK-2920, Charlottenlund, Denmark, E-mail: ao{at}bilharziasis.dk. Brian Sharp and Immo Kleinschmidt, Malaria Lead Programme of the Medical Research Council, PO Box 17120, Congella 4013, Durban, South Africa, E-mails: sharpb{at}mrc.ac.za and immo.kleinschmidt{at}mrc.ac.za. Jane D. Kvalsvig, Child, Youth and Family Development, Human Sciences Research Council, Private Bag X07, Dalbridge 4014, South Africa, E-mail: JKvalsvig{at}hsrc.ac.za. Chris C. Appleton, School of Life and Environmental Sciences, University of KwaZulu-Natal, Durban 4041, South Africa, E-mail: appleton{at}biology.und.ac.za.
| REFERENCES |
|
|
|---|
This article has been cited by other articles:
![]() |
B. Dondji, R. D. Bungiro, L. M. Harrison, J. J. Vermeire, C. Bifulco, D. McMahon-Pratt, and M. Cappello Role for Nitric Oxide in Hookworm-Associated Immune Suppression Infect. Immun., June 1, 2008; 76(6): 2560 - 2567. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |