Link to home

Case Study #2: Primary and Secondary Gradients

Parker, S. K., Nutter, F. W., and Gleason, M. L. 1997. Directional spread of Septoria leaf spot in tomato rows . Plant Disease 81: 272-276.

Septoria leaf spot, caused by Septoria lycopersici on greenhouse tomato. Photo courtesy M. Bates, Kansas State University, Manhattan, Kansas.

Septoria leaf spot, caused by Septoria lycopersici, infects solanaceous species, including many weeds. Conidia are spores borne in pycnidia and these spores may infect tomato plants at any developmental growth stage, but controlling new infections is most critical around the time of anthesis. Infection typically occurs first within the bottom leaves of the tomato plant and then progresses upward. The leaf spotting leads to leaflet necrosis and leaf abscission. This loss of leaves renders tomato fruit susceptible to sunscalding, and puts the plant in an unfavorably stressed condition. Both of these factors contribute to the economic importance of this fungal pathogen.

Because S. lycopersici is both parasitic and saprophytic, managing the crop residue is a major method for reducing inoculum from season to season. Weed management is also critical, since S. lycopersici is known to infect weed hosts.

Dissemination of S. lycopersici spores occurs via multiple mechanisms, including, wind-blown water, water-splash, insect transfer, and mechanical transfer (e.g. equipment, humans, etc.), where wind-blown water and/or water-splash are the most important.

As part of a larger study of the epidemiology of Septoria leaf spot, Parker et al. (1997) examined disease gradients of S. lycopersici over two years. In their experiments, a single plant, positioned in the center of 25 plants within a single row was inoculated with S. lycopersici at five (1992) or six (1993) weeks after transplanting. Plants were spaced at 60 cm intervals. Disease severity was assessed as defoliation and this was defined as the number of abscised leaves plus the number of completely necrotic (still attached) leaves. Gradients were measured over time and this enabled a comparison of primary and secondary dispersal/disease gradients.

Parker et al. (1997) observed that gradients became relatively constant three to five weeks after inoculation. The observed gradients reflected secondary foci. Parker et al. (1997) concluded that the logistic model best described the secondary disease gradients for this pathogen.

The following exercise examines the model assumptions for primary and secondary gradients. For many situations where a secondary gradient is measured, the exponential and power law distributions do not reflect the inherent dispersal process. This is because gradients tend to flatten over time as the source of inoculum expands. An extension of the exponential model for secondary gradients is the logistic model. Also, for power law models, an extension for secondary gradients is the use of a power-logistic model. To follow the results of Parker et al., who found that the logistic model best fit most of the disease gradient curves, an example is presented based on data from Figures 1 and 2 of their paper. In this example, disease gradients have been constructed for two sampling dates, where one represents a gradient assessed just after inoculation of the focus, while the second is a gradient for an assessment that occurred approximately three weeks later (assuming 7-day assessment intervals). Table 2 presents the raw data.

Table 2. Percent defoliation (adapted from Parker et al. (1997)) for a single dispersal gradient for tomato plants, where assessment date 1 was just after inoculation of a focus, while assessment date 4 was approximately three weeks later. Percent defoliation included abscised and necrotic leaves.

Distance
from Focus (cm)
Date 1 Date 4

60

75

87

120

40

78

180

30

68

240

20

62

300

15

50

360

12

27

420

10

32

480

12

12

540

8

13

600

5

5

660

4

4

To examine these two gradients in R, the following code may be used.

#Construct vectors for the distance and defoliation data
distance <- c(60,120,180,240,300,360,420,480,540,600,660)
date1 <- c(75,40,30,20,15,12,10,12,8,5,4)
date4 <- c(87,78,68,62,50,27,32,12,13,5,4)

#Compare the two gradient curves on the same plot
plot(
  distance,
  date1,
  type='b',
  lty=1,
  pch=19,
  col='black',
  ylim=c(0,100),
  xlab='Distance (cm)',
  ylab='Percent defoliation'
)

lines(
  distance,
  date4,
  type='b',
  lty=1,
  pch=20,
  col='red'
)

legend(
  'topright',
  c('Date1','Date4'),
  lty=c(1,1),
  pch=c(19,20),
  col=c('black','red'),
  title='Legend',
  inset=0.02
)

From which the following plot is obtained:

Output

Click on the image for larger version.

It appears that for date four, the percent defoliation is beginning to flatten out, exhibiting a more sigmoidal (S-shaped) curve. This would be expected when secondary dispersal occurs, as newly infected plants also become a source of inoculum for dispersal to longer distances from the original source. Regression analysis may be used to compare which model best fits data from each date.

Suggested exercises

Examine date 1 and date 4 using the power and power-logistic distributions.

Following the methods of Parker

log.date1=log(date1)
log.date1
log.date4=log(date4)
log.date4
logit.date1=log(date1/(100-date1))
logit.date1
logit.date4=log(date4/(100-date4))
logit.date4

 

Now using the transformed data, linear regression methods in R (Sparks et al. 2008) may be used to compare the two models:

#Set up data frames for each set of regressions
regression1A=as.data.frame(cbind(distance,log.date1))
regression1B=as.data.frame(cbind(distance,logit.date1))
regression4A=as.data.frame(cbind(distance,log.date4))
regression4B=as.data.frame(cbind(distance,logit.date4))

#Perform linear regressions for the four modeling scenarios
dispersal.1A=lm(log.date1~distance,data=regression1A)
dispersal.1B=lm(logit.date1~distance,data=regression1B)
dispersal.4A=lm(log.date4~distance,data=regression4A)
dispersal.4B=lm(logit.date4~distance,data=regression4B)

 

What information might we want from linear regression? In addition to the methods described in Sparks et al. (200x) for assessing a linear regression fit, some of the summary statistics that provide insight into how well a model fits data include the coefficient of determination (R2), residual mean square error (RMSE), as well as plots of residuals. These may be obtained using the summary function for a linear model.

#For further information on the summary function in R,
# type ?summary.lm
summary(dispersal.1A) Call: lm(formula = log.date1 ~ distance, data = regression1A) Residuals: Min 1Q Median 3Q Max -0.22791 -0.15540 -0.03715 0.08628 0.35004 Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) 4.2253264 0.1369926 30.84 1.94e-10 *** distance -0.0042979 0.0003366 -12.77 4.53e-07 *** --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 Residual standard error: 0.2118 on 9 degrees of freedom Multiple R-Squared: 0.9477, Adjusted R-squared: 0.9419 F-statistic: 163 on 1 and 9 DF, p-value: 4.535e-07
summary(dispersal.1B)

Call:
lm(formula = logit.date1 ~ distance, data = regression1B)

Residuals:
    Min      1Q  Median      3Q     Max 
-0.4417 -0.2997 -0.1489  0.2100  1.0097 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)    
(Intercept)  0.4343264  0.2946757   1.474    0.175    
distance    -0.0057575  0.0007241  -7.951 2.32e-05 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 

Residual standard error: 0.4557 on 9 degrees of freedom
Multiple R-Squared: 0.8754,     Adjusted R-squared: 0.8615 
F-statistic: 63.22 on 1 and 9 DF,  p-value: 2.325e-05
summary(dispersal.4A)

Call:
lm(formula = log.date4 ~ distance, data = regression4A)

Residuals:
       Min         1Q     Median         3Q        Max 
-0.3912944 -0.2316225  0.0001519  0.2429178  0.5220735 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)    
(Intercept)  5.1761259  0.2077517   24.91 1.30e-09 ***
distance    -0.0053154  0.0005105  -10.41 2.56e-06 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 

Residual standard error: 0.3213 on 9 degrees of freedom
Multiple R-Squared: 0.9233,     Adjusted R-squared: 0.9148 
F-statistic: 108.4 on 1 and 9 DF,  p-value: 2.555e-06
summary(dispersal.4B)

Call:
lm(formula = logit.date4 ~ distance, data = regression4B)

Residuals:
     Min       1Q   Median       3Q      Max 
-0.32605 -0.17259  0.02783  0.15084  0.42314 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)    
(Intercept)  2.3814647  0.1658552   14.36 1.65e-07 ***
distance    -0.0084723  0.0004076  -20.79 6.46e-09 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 

Residual standard error: 0.2565 on 9 degrees of freedom
Multiple R-Squared: 0.9796,     Adjusted R-squared: 0.9773 
F-statistic: 432.1 on 1 and 9 DF,  p-value: 6.457e-09

 

It appears that for date one, the exponential model provides a better fit for the gradient, while for date four, the logistic model provides a better fit. To visually compare the fits of these models with the original raw data, the fitted values for each regression can be backtransformed into the original units (% defoliation) using the following R code:

#The lines below take the fitted value from each
# regression analysis
# and backtransforms the data into % defoliation
back.fitted.1A=exp(dispersal.1A$fitted.values) back.fitted.1B=(exp(dispersal.1B$fitted.values)/
(exp(dispersal.1B$fitted.values)+1))*100 back.fitted.4A=exp(dispersal.4A$fitted.values) back.fitted.4B=(exp(dispersal.4B$fitted.values)/
(exp(dispersal.4B$fitted.values)+1))*100 #The subsequent plots may be created: plot( distance, date1, type='p', pch=19, xlab='Distance', ylab='Percent defoliation', ylim=c(0,100)) lines(distance,back.fitted.1A,type='l',lty=1,col='black') lines(distance,back.fitted.1B,type='l',lty=1,col='red') legend('topright', c('Original data','Exponential','Logistic'), lty=c(NA,1,1), pch=c(19,NA,NA), col=c('black','black','red'), title='Legend', inset=0.05 )

which leads to (date 1):

Output

Click on the image for larger version.

#Date 4 Plots:
plot(
  distance,
  date4,
  type='p',
  pch=19,
  xlab='Distance',
  ylab='Percent defoliation',
  ylim=c(0,100))
  lines(distance,back.fitted.4A,type='l',lty=1,col='black')
  lines(distance,back.fitted.4B,type='l',lty=1,col='red')
  legend('topright',
  c('Original data','Exponential','Logistic'),
  lty=c(NA,1,1),
  pch=c(19,NA,NA),
  col=c('black','black','red'),
  title='Legend',
  inset=0.05
)

(date 4):

Output