...

–nitrogen in the Regional hybrid geospatial modeling of soil nitrate

by user

on
Category: Documents
8

views

Report

Comments

Transcript

–nitrogen in the Regional hybrid geospatial modeling of soil nitrate
+ MODEL
ARTICLE IN PRESS
Geoderma xx (2006) xxx – xxx
www.elsevier.com/locate/geoderma
Regional hybrid geospatial modeling of soil nitrate–nitrogen in the
Santa Fe River Watershed
S. Lamsal, S. Grunwald ⁎, G.L. Bruland, C.M. Bliss, N.B. Comerford
Soil and Water Science Department, IFAS, University of Florida, 2169 McCarty Hall; PO Box 110290, Gainesville FL, 32611, USA
Received 1 July 2005; received in revised form 29 November 2005; accepted 27 December 2005
Abstract
Typically, regional assessment of the spatial variability and distribution of environmental properties are constrained by sparse
field observations that are costly and labor intensive. We adopted a hybrid geospatial modeling approach that combined sparsely
measured soil NO3–N observations collected in three seasons (Sept. 2003, Jan. and May 2004) with dense auxiliary environmental
datasets to predict NO3–N within the Santa Fe River Watershed (3585 km2) in north-east Florida. Elevated nitrate–nitrogen
concentrations have been found in this watershed in spring, surface and ground water. We collected soil samples at four depths (0–
30, 30–60, 60–120, 120–180cm) based on a random-stratified sampling design. Classification and regression trees were used to
develop trend models for soil NO3–N predictions based on environmental correlation and predict values at the watershed scale.
Residuals were spatially autocorrelated only for the Jan. 2004 sampling and regression kriging was used to combine the kriged
residuals with tree-based trend estimates for this event. At each step of the upscaling process, error assessment provided important
information about the uncertainty of predictions, which was lowest for the Jan. sampling event. Sites that showed consistently high
NO3–N values throughout the cropping season (Jan–May 2004) with values ≤5 μg g− 1 covered 95.7% (3363.9 km2) of the
watershed. Values in the 5–10 μg g− 1 range covered 4.3% (150.7km2), while values exceeding 10μg g− 1 covered only 0.59%
(20.7 km2) of the watershed. Elevated soil NO3–N on karst, unconfined areas with sand-rich soils, or in close proximity to streams
and water bodies pose the greatest risk for accelerated nitrate leaching contributing to elevated nitrogen found in spring, surface and
ground water in the watershed. This approach is transferable to other land resource problems that require the upscaling of sparse
site-specific data to large watersheds.
© 2006 Elsevier B.V. All rights reserved.
Keywords: Nitrate–nitrogen; Geospatial; Classification and regression trees; Regression kriging
1. Introduction
Typically, regional assessment of the spatial variability and distribution of soil properties are constrained
by sparse field observations that are costly and labor
intensive. Therefore, hybrid modeling techniques have
⁎ Corresponding author. Tel.: +1 352 392 1951x204; fax: +1 352 392
3902.
E-mail address: [email protected] (S. Grunwald).
been proposed to combine sparsely sampled field data
with dense auxiliary environmental datasets to predict
soil properties at regional scale (Goovaerts, 1999;
McBratney et al., 2000; van Meirvenne and van
Cleemput, 2005). Commonly, regression modeling is
used to characterize relationships between environmental variables whereas geospatial analysis is focused on
the interpretation of soil-landscape variability. Least
squares linear regression and multiple regressions are
among the most commonly used analytical techniques
0016-7061/$ - see front matter © 2006 Elsevier B.V. All rights reserved.
doi:10.1016/j.geoderma.2005.12.009
GEODER-02541; No of Pages 15
ARTICLE IN PRESS
2
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
of ecologists and soil scientists (Trexler and Travis,
1993). Linear models are associated with assumptions
about the statistical distribution of a response variable,
the form of variance structure and the independence of
observations which are often difficult to meet with
environmental data (James et al., 2004). Furthermore,
environmental datasets are often complex and unbalanced with non-linear relationships that vary in space
and time. Therefore, linear regression models often fail
to effectively describe landscape pattern to address
environmental issues. Numerous studies employed
multivariate non-linear methods to interpret environmental behavior and model the spatial distribution of
environmental properties. Odeh et al. (1995), McBratney et al. (2000) and Hengl et al. (2004) used a suite of
different environmental variables such as soil, land use,
topographic and other landscape properties to predict
target soil variables.
These studies were rooted in the conceptual framework called SCORPAN described by McBratney et al.
(2003). Grunwald and Lamsal (2005) provided an
overview over mapping techniques to collect environmental data that can be included in a SCORPAN model.
McBratney et al. (2000) compared 24 different
univariate and multivariate statistical, geostatistical and
hybrid models at field, sub-catchment and regional
scales and found that hybrid techniques outperformed all
other methods at all scales. Hybrid models have the
ability to mix deterministic and stochastic components,
link data collected at different spatial scales, and
integrate sparse point datasets (e.g. soil samples) with
exhaustively measured environmental datasets (e.g.
digital elevation model—DEM, satellite imagery).
Bishop and McBratney (2001) argued that models that
incorporate secondary data (e.g. regression kriging—
RK) are superior in terms of their prediction performance
to generic geostatistical models such as ordinary kriging.
The performance of regression kriging was found to be
superior when compared to other prediction methods by
Odeh et al. (1995), Knotters et al. (1995), Odeh and
McBratney (2000), Hengl et al. (2004) and Triantafilis et
al. (2001) to predict a variety of different soil properties.
To address elevated nitrogen values found in the
Santa Fe River Watershed (SFRW) we present a case
study that uses land uses, soils, and landscape
characteristics in a multi-tier geospatial hybrid approach to model seasonal and regional patterns of soil
NO3–N. Increasing concentrations of NO3–N were
found in the ground, spring and surface water of the
Suwannee Basin in north-east Florida (Hornsby et al.,
2001). The SFRW is one of the tributaries of the
Suwannee River and has greatly contributed to the
NO3–N loads delivered by the Suwannee River to the
Gulf of Mexico (Hornsby et al., 2002). The watershed
covers 3585 km2 (13.8 %) of the Suwannee Basin but
contributed 20% of the total NO3–N loads that
drained from the Suwannee Basin into the Gulf of
Mexico accounting for about 2900 tons NO3–N
(Suwannee River Water Management District, 2003).
Our objectives were to upscale site-specific NO3–N
measurements to the watershed scale using a mixed
modeling approach based on environmental correlation
and geostatistical modeling.
2. Materials and methods
2.1. Study area
The SFRW, a tributary of the Suwannee Basin that
drains into the Gulf of Mexico, spans over an area of
approximately 3585 km2 across eight counties in northeast Florida (Fig. 1). The soils of the SFRW are
predominantly sandy in texture with loamy to clayey
deposits, organics and sites with sand hill karst terrain
with many solution basins. Ultisols with 36.7% aerial
coverage, Spodosols (25.8%), and Entisols (14.7%) are
the dominant soil orders in the watershed. Less
prominent are Histosols (2.0%), Inceptisols (1.1%),
and Alfisols (1.0%). Land use consists of pine plantation
(32.2%), wetlands (16.2%), upland forest (14.7%),
improved pasture (14.0%), urban (8.8%), forest regeneration (6.0%), crops (5.0%), rangeland (3.7%) and a
variety of high intensity land uses such as tree groves,
dairies, and feeding operations (SRWMD and SJWMD,
1995). Agricultural land uses in the watershed are
diverse, ranging from corn, peanuts, tobacco, vegetables, watermelons, strawberries, blueberries, and
pecans. The elevation ranges from around 3 m to over
91 m above mean sea level. Generally, the land is level
(0–2% slopes) to gently sloping and undulating (0–5%
slopes), with the major exception to this pattern being
the moderately and strongly sloping land (5–12%
slopes) along the Cody Scarp. Two main physiographic
regions in the watershed are the Gulf Coastal Lowlands
and the Northern Highlands, which are separated from
one another by the Cody Scarp (Schmidt, 1997).
Underlying geologic units include Eocene limestone
(which occurs near the ground surface in the highrecharge, strongly karst-influenced Gulf Coastal Lowlands), capped by Miocene sediments which tend to be
rather clayey and phosphatic (occurring at or near the
surface along the Cody Scarp), in turn capped by
Pliocene and Pleistocene–Holocene sediments which
tend to be sandy at the surface but having loamy subsoils
ARTICLE IN PRESS
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
3
Fig. 1. The Santa Fe River Watershed in north-east Florida. The enlarged watershed (right) shows the boundary of the eight counties with sampling
locations (May 2004 event).
or substrate at varying depths (Randazzo and Jones,
1997; Brown et al., 1990). Based on the National
Oceanic and Atmospheric Administration (NOAA)
records at seven monitoring stations across the watershed, the total precipitation was 680.7 mm in Sept. 2003,
274.3 mm in Jan. 2004 and 231.4 mm in May 2004
representing “wet/end of cropping season” (Sept.), “dry
winter season” (Jan.) and “dry spring season” (May),
respectively. The mean annual precipitation based on
30 years of records from 1971 to 2000 across the
watershed was 1334 mm and the mean annual temperature 20.4 °C (National Climatic Data Center, NOAA).
selected privately owned land. Therefore, the number
of samples collected varied slightly by season and depth
(Table 1). The samples were extracted with 2 M KCl
(Keeney and Nelson, 1982) and the extraction solution
was analyzed for NO3–N content and expressed in μg
g− 1 of dry soil. Detection limits were 0.05 ppm NO3–N.
Samples below the detection limit were assigned the
mean value derived from averaging the lowest observed
value in the respective sampling event and Null.
Profile average NO3–N values (za) were derived
according to Eq. (1):
4
X
zðxi Þ⁎di
2.2. Field data collection
za ¼
We selected 151 sampling locations using a stratified-random sampling design targeted to areas with land
use–soil combinations proportional to their aerial extent
and other land uses (e.g. tree groves) that were expected
to show high NO3–N in soils according to expertknowledge from experienced local extension specialists
(personal communication Dr. M.W. Clark). Soil samples
were collected during Sept. 2003, Jan. 2004 and May
2004 from four depth increments (0–30, 30–60, 60–120
and 120–180 cm) in composites proportional to the
depth of sampling to ensure a constant sampling support
for each layer increment. Composite soil samples from
each site lumped the local variability of soil properties
providing a local signature of average site conditions.
Not all sampling depths could be sampled due to field
conditions (e.g. high water table during rainy season) or
due to lack in receiving sampling permissions on
i¼1
4
X
ð1Þ
di
i¼1
where, z(xi) is the measured NO3–N in μg g− 1 soil at
sampling sites (xi) for the ith layer (i = 1, 2, 3 and 4) and
di is the thickness in cm of the ith layer.
Table 1
Number of samples for three sampling events
Sampling
time
Soil layers
Layer 1
(0–30cm)
Layer 2
(30–60cm)
Sept.,
2003
Jan.,
2004
May,
2004
101
101
89
59
123
122
116
103
128
128
125
122
Layer 3
(60–120cm)
Layer 4
(120–180cm)
ARTICLE IN PRESS
4
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
2.3. Spatial datasets
A comprehensive suite of soil-landscape and environmental predictor variables were assembled using
Table 2
Summary of predictor variables used for CART modeling
Soil-landscape and environmental attributes
Soil data: a
• Taxonomic data: Soil order, suborder, great group; subgroup,
component name
• Soil properties: Available water capacity low (AWCL) (cm/cm),
available water capacity high (AWCH) (cm/cm), surface texture,
water table depth (cm), drainage class, hydric/non-hydric, b
clay class, c depth to clay (cm), d bulk density (g cm− 3), e
organic matter (%),e spodic horizonb
Climate data: f
• Precipitation amount sampling month (mm)
• Precipitation amount previous 3 sampling months (mm)
• Precipitation amount previous 12 sampling months (mm)
Land cover/land use/landscape data:
• Landsat ETM+ 7 spectral bands—local pixel value g
• Landsat ETM+ 7 spectral bands—mean within 3 × 3 and
10 × 10 window g
• Landsat ETM+ 7 spectral bands—std. dev. within 3 × 3 and
10 × 10 window g
• Normalized Difference Vegetation Index (NDVI) g h
• Land use class (2003) i
• Majority land use within 3 × 3 windowi
• Land use change class j [agricultural/non-agricultural]
(period: 1990–2003)
• Population density k
• Physiographic division l
• Ecoregion m
Topographic data:
• Elevation (m)—local pixel value n
• Elevation (m)—3 × 3 windown
• Slope (percentage)—local pixel value n, o
• Slope (percentage)—3 × 3 window n, o
• Upslope drainage area value (m2) n, o
• Upslope drainage area class n, o, p
• Compound topographic index (CTI)—local pixel value n, o, q
• CTI within 3 × 3 window n, o, q
• Distance to stream value (m) r
• Distance to stream classr
Geologic data: s
• Environmental geology
• Floridian aquifer drastic index
• Intermediate aquifer drastic index
• Surficial aquifer drastic index
Geographic data: t
• x coordinates (m)
• y coordinates (m)
• xy coordinates
• Squared xy coordinates
local and focal geographic information system (GIS)
methods (Table 2). We derived soil attributes from the
Soil Survey Geographic Database (SSURGO) using
ArcGIS version 9.0 (Environmental System Research
Institute, Redlands, CA) including taxonomic (e.g. soil
order, suborder) and soil characteristics such as
available water capacity, bulk density and organic
matter content. We gathered precipitation data for the
sampling month (e.g. Sept. 2003), the past three months
of sampling (e.g. June–August for the September 2003
sampling event) and average annual precipitation for the
watershed from NOAA. Spectral reflectance data
(digital numbers—DN) were derived from Landsat
Enhanced Thematic Mapper (ETM+) satellite images. In
order to account for the dominant signature and
variability in the proximity of sampling locations, the
mean and standard deviation of the band reflectance
values were calculated using moving window neighborhoods of 3 × 3 pixels representing a neighborhood of
8100 m2 and 10 × 10 pixels corresponding to a neighborhood of size 90,000 m2, respectively. Land use data
were derived from a supervised classification of Landsat
ETM+ 2003 imagery with an overall classification
a
Soil Survey Geographic Database (SSURGO), Natural Resources
Conservation Service–U.S. Dept. of Agriculture.
b
Presence (1) or absence (0) of soil property.
c
Three clay classes corresponding to the clay content in the soil
profile: 1 = sand, sandy loam, loamy sand, fine sand; 2 = sandy clay
loam; and 3 = clay.
d
Depth to clay—four categories corresponding to depth at which
clay layer was encountered. 1 = clay layer within 50cm of the soil
surface; 2 = clay layer within 100cm of the soil surface; 3 = clay layer
within 200cm of the soil surface; and 4 = no clay layer in the soil
profile.
e
Depth averaged value within soil profile.
f
National Climatic Data Center.
g
Landsat Enhanced Thematic Mapper (ETM+) satellite image
(Febr. 13th, 2003).
h
Normalized Difference Vegetation Index (NDVI) (Jensen, 1996).
i
Seven land use classes (Sabesan, 2004).
j
Nine land use change classes based on agriculture and nonagriculture land uses from 1990, 2000 to 2003 (Sabesan, 2004).
k
U.S. Census Bureau (http://www.census.gov/main/www/cen2000.
html).
l
St. Johns River Water Management District.
m
World Wildlife Fund.
n
National Elevation Dataset (NED) U.S. Geological Survey.
o
Derived topographic attributed from NED.
p
Five classes based on the size of upslope drainage area: A ≤ 1800,
B 1800–3600, C 3600–5400, D 5400–9000, E >9000m2.
q
Compound topographic index (Wilson and Gallant, 2000).
r
Euclidean distance from sampling site to closest stream.
s
Florida Dept. of Environment Protection (1998).
t
Geographic coordinates of sampling sites in meters (Albers Equal
Area Conic map projection).
ARTICLE IN PRESS
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
accuracy of 82% (Sabesan, 2004). Demographic data
(e.g. population density) that served as a proxy
representing anthropogenic induced impact in the
SFRW were derived from U.S. Census data (http://
www.census.gov/main/www/cen2000.html). We used
the National Elevation Dataset (NED) with a pixel
resolution of 30 m to delineate multiple secondary
topographic attributes including slope, upslope drainage
area, and compound topographic index (CTI) using the
ArcHydro extension of ArcGIS. We used geologic
datasets developed by the Florida Department of
Environment Protection such as the Drastic Index that
was derived as a weighting average using the following
variables: (i) Depth to water, (ii) Net recharge, (iii)
Aquifer media, (iv) Soil media, (v) Topography, (vi)
Impact of the vadose zone, and (vii) Hydraulic
conductivity (Florida Dept. of Environmental Protection, 1998). Other landscape metrics included the
distance of each pixel in the watershed to the closest
streams and the geographic coordinates (x and y
coordinates).
2.4. Upscaling methods
Regional predictions of soil NO3–N across the
SFRW were conducted separately for each sampling
event using field observations and a set of environmental spatial variables summarized in Table 2. The
upscaling procedure followed the following basic
assumptions that the prediction of a variable Ẑ (xi) at
an unvisited location (xi) is made by summing three
components that describe the spatial variation of an
environmental variable (Matheron, 1965) (Eq. (2)).
Z ̂ðxi Þ ¼ mðxi Þ þ eðxi Þ þ e V
ð2Þ
where, Ẑ(xi) is the predicted value at location xi, m(xi) is
the trend/drift for the region (or the structural component); ε is a stochastic, locally varying but spatially
dependent residual from m(xi) known as the variation of
the regionalized variable, and ε′ is a residual error term
(spatially independent noise term).
To describe m(xi) we used Classification and
Regression Trees (CART) relating za to environmental
datasets summarized in Table 2. Classification and
Regression Trees are statistical data mining procedures
that were introduced by Breiman et al. (1984). Treebased modeling is suited for the analysis of complex
environmental datasets that require flexible and robust
analytical methods to deal with nonlinear relationships,
high order interactions and missing values (DeAth and
Fabricius, 2000). The CART methodology is based on
5
binary recursive partitioning; binary because parent
nodes are always split into exactly two child nodes and
recursive because the process can be repeated by treating
each child node as a parent node. Classification and
Regression Trees use either categorical or continuous
data types which predict the data class (classification
tree) or the data values (regression tree). Optimized
splitting rules are identified at each level of the tree. The
goal of regression tree models is to partition the data into
relatively homogenous (low deviation) terminal nodes,
and the mean of the values in each node is the predicted
value for that node. The errors from the classification
tree model are expressed with two measures of relative
cost (Breiman et al., 1984): (i) V-fold validation relative
cost which sets aside independent cases that are used to
evaluate tree predictions, and (ii) resubstitution relative
cost, which is computed using the same data used to
train/develop the tree.
We used the CART software package (Salford
Systems, San Diego, CA) to test a variety of tree
models. Trees were pruned to optimize the tree models
for each soil sampling event separately. Our goal was
to identify parsimonious trees that included environmental variables with large power to predict soil
NO3–N. We adopted the concept of minimal costcomplexity measure that is derived using the resubstitution misclassification rate and a penalty imposed
per each additional terminal node to optimize trees
(Breiman et al., 1984). We calculated the resubstitution relative cost and used V-fold validation with a
90% to 10% split between model and validation
datasets. For each of the three sampling events, a tree
was built and the environmental predictor variables
and splitting rules identified which were subsequently
used to predict soil NO3–N across the SFRW
representing the deterministic trend component of
Eq. (2). At each sampling location the residuals were
derived by subtracting the tree-derived trend model
from the soil NO3–N observations according to the
procedure outlined in Fig. 2.
The Moran's Index and semivariogram models were
used to test the residuals for spatial autocorrelation
(Moran, 1948). The Moran's index (I) is calculated
according to Eq. (3) (compare Goodchild, 1986):
XX
XX
I¼
wij ½ðzi −z̄ Þðzj −z̄ Þ=s
wij
ð3Þ
i
j
i
j
where zi and zj refer to attribute values measured at
different locations i and j, z̄ denotes the mean of the
attribute value, s is the sample variance, and the wij
term represents the spatial proximity (or similarity) of
i's and j's locations, with wii = 0 for all i and wjj = 0
ARTICLE IN PRESS
6
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
Fig. 2. Flow diagram summarizing our adopted methodology for regression-kriging.
for all j. The Moran index values that are positive
spatial autocorrelation, whereas Moran index values
smaller than 1 indicate independent, dissimilar attribute
values.
We found that the residuals of only one sampling
event (January 2004) were spatially autocorrelated. For
these spatially autocorrelated residuals a semivariogram
was generated and ordinary kriging used to generate soil
NO3–N residual predictions. According to Eq. (2) the
residuals were added to the trend component to create
the final output map showing the predicted spatial
distribution of soil NO3–N. This procedure resembles
regression kriging as outlined by Odeh et al. (1995)
(called “model C—RK”) and Hengl et al. (2004). The
advantage of adopting a hybrid modeling approach is
that kriging of regression residuals may improve the
prediction performance compared to the performance
when kriging or regression is done solely. While the
regression model predicts the trend component across
the area for which the explanatory variables are
available, residuals are restricted to the region for
which stationarity of the residuals can be assumed.
Regression kriging is a hybrid technique that combines a
multi-linear regression model (or Regression Tree or
Generalized Linear Model) with kriging of the regression residuals (Odeh et al., 1995; Goovaerts, 1997). This
method is based on the assumption that the deterministic
component of the target (soil) variable is accounted for
by the regression model, while the model residuals
represent the spatially dependent component.
Spatial modeling and upscaling were conducted
using the ISATIS software (Geovariances Americas
Inc., Houston, TX) and visualizations in ArcGIS 9.0
(ESRI, Redlands, CA). The prediction quality consists of
two components: map precision that measures the
residual variability in prediction, and map accuracy
that measures the closeness of the predictions to true
conditions (Mueller et al., 2001). We used the following
error measures (Mueller et al., 2001; Schloeder et al.,
2001; Webster and Oliver, 2001; Erxleben et al., 2002):
(i) Mean Prediction Error (MPE) to assess the systematic
error of the model; Mean Absolute Error (MAE) that is
indicative of the most accurate global or large-scale
estimates; Mean Square Error (MSE) that is more
sensitive to outliers than the MPE; Precision which is the
inverse of the variance; and the G value which measures
how effective a prediction might be relative to that which
could have been derived by using the sample mean.
3. Results and discussion
3.1. The NO3–N dataset—exploratory data analysis
Statistical properties for the depth averaged soil
NO3–N content at the different sampling events are
summarized in Table 3. Variations in soil NO3–N in
different seasons were due to land use management
practices (e.g. fertilizer applications), crop growth,
climatic conditions, nitrogen cycling and leaching. The
mean NO3–N was highest in Jan. 2004 with 4.1 μg g− 1
ARTICLE IN PRESS
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
Table 3
Statistical properties of profile averaged soil NO3–N (μg g− 1) for
different seasons
Statistics
Sept. 2003
Jan. 2004
May 2004
Number of observations
Mean
Standard error of the mean
95 % confidence of mean
— Lower
— Upper
Median
Min.
Max.
Standard deviation
Skewness
Kurtosis
101
0.70
0.13
0.45
123
4.09
1.45
1.22
128
1.17
0.24
0.69
0.97
0.23
0.05
6.54
1.31
2.77
7.59
6.96
0.11
0.06
103.71
16.09
4.91
25.36
1.64
0.29
0.05
19.92
2.74
4.37
22.30
followed by May 2004 with 1.2 μg g− 1 and Sept. 2003
with 0.7μg g− 1. The low NO3–N content during Sept.
2003 could be attributed to the influence of precipitation. High rainfall during the rainy season, with 600 mm
from June–Aug. 2003, favored leaching losses of NO3–
N from soils. The highest NO3–N values with a
maximum of 103.7 μg g− 1 encountered in Jan. 2004
were attributed to fertilization of crops and pastures, low
plant uptake as well as low microbial immobilization
and nitrification during the winter period. Similar results
were found in two other studies that documented the
impact of fertilization on soil NO3–N (Pérez et al.,
2003) and NO3–N leaching due to rainfall–runoff
events (Guo et al., 2001).
7
All events showed positively skewed NO3–N
distributions with many measurements below the
detection limit (BDL). It is a common phenomenon
that environmental datasets show many more low
observed values and fewer high values. For example,
Johnson et al. (2003) observed positively skewed
distribution of NO3–N content in 19 ground water
monitoring wells with values ranging from BDL of 0.10
up to 350.0 mg L− 1 with a median concentration of
2.0 mg L− 1. The soil NO3–N content by land use was
diverse showing high mean values in crops with 2.0μg
g− 1 (Sept. 2003), 26.9μg g− 1 (Jan. 2004) and 3.5 μg g− 1
(May 2004). Improved pasture showed mean NO3–N of
1.2 (Sept. 2003), 11.4 (Jan. 2004) and 2.9 (May
2004) μg g− 1 . Soil nitrate–nitrogen values in this
study were similar to other studies conducted on
comparable soils in Florida (Woodard et al., 2002).
Throughout all sampling events, pine plantations
showed low NO3–N measurements with means of
0.2 μg g− 1. These low values are typical for a Florida
forest ecosystem with tight cycling of nitrogen and low
fertilizer input that may only apply 50kg N ha− 1 once in
the first 5 years (forest regeneration phase) and as much
as 200kg N ha− 1 once during the following 15 years (N.
B. Comerford personal. communication, 2005). Low
soil NO3–N values were also found in wetlands, with
means of 0.3μg g− 1 (Sept. 2003), 0.1 μg g− 1 (Jan. 2004)
and 0.5μg g− 1 (May 2004), respectively. The low NO3–
N concentrations in wetlands are assumed to be due to
denitrification as well as mixing of water from the river
Fig. 3. Classification tree model of soil nitrate–nitrogen (NO3–N) for the Sept. 2003 event. The non-terminal nodes contain the node number and
number of predicted sites. The terminal nodes (TN) are shown in bold and contain terminal node number, the number of data (N) and the predicted
class.
ARTICLE IN PRESS
8
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
Table 4
Variable splitting criteria for the classification tree to predict NO3–N
classes (Sept. 2003)
Node
Splitting
number variable
1
2
3
Splitting criteria
Routing to left
child node
Land use
Agriculture,
rangeland
Physiographic High flatwoods,
divisions
Lake city karst,
lower, mcalpin
plain
Soil order
Alfisols,
Inceptisols,
Ultisols
Routing to right child
node
Pine plantation, upland
forest, urban, wetland
Bell sand hills and
willford flats, Haile
limestone plain, lake
city ridge, newberry
sand hills, perched
lakes and prairies, pose
creek scarp, san felasco
hammock, trail ridge
Entisols, Histosols,
Spodosols
riverbed and terrace deposits was accounted for by
denitrification and the rest by mixing of river and
floodplain water.
The chronological assessment using age dating
techniques made by Katz et al. (2001) have shown
that the NO3–N concentrations of springs of the
Suwannee River Basin have responded to increased
nitrogen loads from various sources in the basin in the
order of decades. Long term trends of NO3–N in the
basin showed that the increasing NO3–N concentrations in spring waters followed the steady increase in
fertilizer use (Katz et al., 1999). Katz (2004) found that
inorganic fertilizers (agricultural land use) were the
dominant source of nitrogen in spring waters in the
Suwannee River Basin based on δ15N isotope tracers.
Our study confirmed that land use is closely related to
soil NO3–N.
3.2. Regional predictions of soil NO3–N
and groundwater by water level fluctuations. In a
comparative study, McMahon and Böhlke (1996)
showed that the median NO3–N concentration in
groundwater from adjacent floodplain deposits
(468 μmol L− 1 ) and riverbed sediments (461μmol
L− 1) were lower than the median concentration in the
terrace deposits (1857 μmol L− 1). Authors estimated
that 15–30% of the difference between floodplain/
We calculated Moran's indices and semivariograms
using observed soil NO3–N values for each sampling
event, but no spatial autocorrelation was found. Because
all three seasonal soil datasets showed spatially
independent observations, we proceeded with relating
environmental datasets to soil NO3–N measurements to
generate trend models using CART.
Fig. 4. Regression tree model of NO3–N for Jan. (a) and May (b) 2004 sampling events.
ARTICLE IN PRESS
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
9
Table 5
Variable splitting criteria for regression tree model for Jan. 2004
Table 7
Relative cost of classification tree model for Sept. 2003 sampling event
Node
Class
No. of
observations
Error method
Misclassified
counts
Cost
measure
A
30
B
20
C
26
D
25
Resubstitution
Validation
Resubstitution
Validation
Resubstitution
Validation
Resubstitution
Validation
17
20
3
11
4
11
6
14
0.567
0.667
0.150
0.550
0.154
0.423
0.240
0.560
1
2
3
Splitting variable
Band 4 reflectance
Land use
Available water
capacity low
(AWCL)
Splitting criteria
Routing to the
left node
Routing to
the right node
≤113.5
Pine plantation,
upland forest, urban,
wetland
≤0.025
>113.5
Agriculture,
rangeland
>0.025
3.2.1. Classification tree model—Sept. 2003 sampling
event
The regression tree model of Sept. 2003 produced
only 2 nodes, suggesting that (i) either the NO3–N
distribution could not be predicted based on the selected
predictor variables, (ii) the size of the dataset was too
small to build a predictive tree, or (iii) the range of the
target soil variable values was too small as soil NO3–N
was relatively homogenous throughout the watershed.
To optimize tree-based predictions, a classification tree
Table 6
Variable splitting criteria for regression tree model for the May 2004
event
Node Splitting variable
Splitting criteria
Routing to left
child node
Routing to
right child
node
Pine plantation,
rangeland, urban,
wetland
≤14.0
Agriculture
1
Land use
2
Mean compound
topographic index
(CTI) 3 × 3 window
Component name
Arredondo, Bigbee,
Bivans, Blanton,
Blichton, Bonneau,
Chiefland, Dorovan,
Fluvaquents, Kendrick,
Kershaw, Lake, Lakeland,
Leon, Lochloosa, Lynn
Haven, Mascotte,
Millhopper, Monteocha,
Myakka, Neilhurst,
Newnan, Ocilla, Oleno,
Penney, Plummer,
Pomona, Ridgewood,
Sapelo, Sparr, Tavares,
Wauchula, Zolfo
Band 7 avg.
≤113.5
reflectance in
10 × 10 window
3
4
>14.0
Albany,
Chipley,
Pelham
was built to predict the four quartile classes of NO3–N:
class A (≤ 0.01μg g− 1), B (0.01–0.23 μg g− 1), C (0.23–
0.56 μg g− 1) and D (≥ 0.56 μg g− 1). The optimum tree
(Fig. 3) consisted of four nodes using land use,
physiographic divisions and soil orders as predictor
variables with different splitting criteria (Table 4).
3.2.2. Regression tree model—Jan. and May 2004
sampling events
The optimal regression tree models for the Jan. and
May soil sampling events are shown in Fig. 4(a) and (b)
and their splitting criteria are summarized in Tables 5
and 6, respectively. The Jan. model had 4 predicted
values of NO3–N, while the May model had 5 values.
The seasonal regression tree models used different sets
of environmental variables to predict NO3–N for
sampling events reflecting different seasonal patterns.
In Jan., spectral reflectance (band 4), land use and
available water capacity (low) were used as predictor
variables (Fig. 4). According to Lillesand and Kiefer
(2000) band 4 (0.76–0.90 μm near-infrared) absorbs
reflectance from healthy green vegetation and is
commonly used to estimate biomass. This band
emphasizes soil–crop and land–water contrasts (Jensen,
1996). In May 2004, predictor variables included land
use, mean compound topographic index within a local
Table 8
Errors associated with different sized regression tree models for the
Jan. 2004 sampling campaign
No. of nodes Validation relative error Resubstitution relative error
7
6
5
4a
2
1
>113.5
0.89 ± 0.26
0.89 ± 0.26
0.89 ± 0.26
0.86 ± 0.25
1.13 ± 0.16
1.01 ± 0.00
0.46
0.46
0.46
0.47
0.71
1.00
The mean and variance associated with one node tree were 1.47 and
24.56, respectively.
a
The optimal tree model.
ARTICLE IN PRESS
10
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
Table 9
Errors associated with different sized regression tree models for the
May 2004 sampling campaign
No. of nodes
Validation
relative error
Resubstitution
relative error
7
6
5a
4
2
1
0.59 ± 0.21
0.55 ± 0.19
0.51 ± 0.18
0.52 ± 0.18
0.85 ± 0.07
1.01 ± 0.00
0.08
0.10
0.14
0.20
0.34
1.00
The mean and variance associated with one node tree were 1.67 and
7.43, respectively.
a
The optimal tree model.
neighborhood of 3 × 3 pixels, the main component soils
and mean reflectance band 7 within a 10 × 10 pixel
neighborhood (Fig. 4b). Band 7 has a spectral resolution
from 2.08 to 2.35 μm representing the mid-infrared
range and is important to discriminate geologic rock
formations (Jensen, 1996). Though band 7 showed a
statistical relationship to soil NO3–N, it has less
meaning to describe nitrogen cycling or leaching
behavior. Land use was consistently used as a predictor
variable in all three tree models (Sept. 2003, Jan. and
May 2004), illustrating the importance of land use
management (e.g. fertilizer input), density and structure
of vegetation that are likely impacting nitrate leaching.
3.2.3. CART model error
The costs associated with the classification tree
model are summarized in Table 7. The validation cost
measure is considered more rigorous and conservative
than the resubstitution cost measure. Class C showed the
lowest costs with 0.15 based on resubstitution and 0.42
based on validation indicating the lowest misclassification rate. From an environmental perspective, it is more
important to predict high soil NO3–N values (classes C
and D) more accurately when compared to lower values
(classes A and B) that are closer to “natural” background
concentrations.
The errors associated with multiple regression tree
models for the Jan. 2004 event are summarized in Table
8. We selected the optimum regression tree with 4 nodes
(i.e., 3 splitting variables) based on validation results
with a minimum variance of 0.86. The simplest tree
consisted of a single node, with mean value of 1.47 and
variance of 24.56. The errors associated with regression
tree models for the May 2004 sampling event are
summarized in Table 9. The one node tree model had a
mean and variance of 1.17 and 7.43, respectively, and
the optimum tree with 5 nodes had a validation relative
error of 0.51 and a resubstitution relative error of 0.14.
V-fold validation is considered more rigorous than
resubstitution to evaluate modeling success. The NO3–
N predictions made for the May event are more accurate
compared to Jan. predictions according to the validation
relative errors (Tables 8 and 9).
3.2.4. Upscaling of the tree model predictions
The predicted NO3–N classes for the Sept. 2003
event (Fig. 5) were upscaled to unsampled locations
(pixels) across the watershed based on predictor
variables in the respective tree model. This model
shows the deterministic trend of NO3–N across the
watershed. Class D of the Sept. sampling event was
solely based on land use (agriculture and rangeland).
Predictions of class C were derived from land use and
physiographic division whereas classes A and B were
Fig. 5. Classification tree predictions of NO3–N for Sept. 2003 upscaled to the watershed scale. The unmapped areas (white) are water bodies.
ARTICLE IN PRESS
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
distinguished based on soil orders. Class B extended
along a NW–SE axis corresponding to the Cody Scarp.
Overall, in the western part of the watershed higher
NO3–N values dominated (classes C and D) whereas
east of the Cody Scarp classes A and C were prominent.
Since soil NO3–N observations were relatively low in
the Fall season (Sept. 2003), generated patterns of soil
NO3–N were relatively homogenous throughout the
mixed-use watershed.
Predicted NO3–N values for the Jan. and May 2004
events (Fig. 6) were also upscaled to the unsampled
locations across the watershed. Both the Jan. and May
trend models showed mixed distributions of high and
11
low NO3–N predictions across the watershed. Land use
was an important predictor variable in both seasons.
Speckled patterns of soil NO3–N were predicted based
on sets of environmental variables including land use,
spectral data, soil and topographic attributes. Although
the predicted values differed for the Jan. and May
events, the spatial distributions of high and low values
were similar. In general, the predicted higher soil NO3–
N values in the western part of the watershed pose a high
risk to be leached into the aquifer due to predominantly
sand-rich soils that are unconfined coinciding with karst
terrain. On the Lowland side of the watershed (west of
the Cody Scarp) the Floridian aquifer is unconfined
Fig. 6. (a) Regression tree predictions of NO3–N for Jan. and (b) May 2004 upscaled to the watershed scale.
ARTICLE IN PRESS
12
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
(Fernald and Purdum, 1998) posing a high risk for soil
NO3–N to be leached into the aquifer. Because the
Lowland side is also dominated by karst terrain and
sand-rich soils potential risks of leaching is higher in the
western region than the eastern Highland part of the
watershed where the Floridian aquifer system is
confined. The Cody Scarp represents the area containing
the most mature karst features including artesian
springs, steep-sided sinkholes, lakes that periodically
drain downward, disappearing and resurgences of
streams (Fernald and Purdum, 1998). High soil NO3–
N found along the Cody Scarp possibly poses the
greatest risk of accelerated nitrate leaching through the
vadose zone into the aquifer.
The residuals derived from the regression tree model
predictions at sampled sites account for the error of the
tree model and their error statistics are summarized in
Table 10. Error statistics showed smaller MPE, MAE,
MSE, and Precision for the May NO3–N trend model
compared to the Jan. 2004 model. The G values for both
models were high. The Moran's I analysis of the NO3–
N residuals for the Jan. 2004 event indicated that there
was significant positive autocorrelation with a positive I
value of 0.24 at the first distance interval (0–4500m).
For all other distance intervals, Moran's I values showed
neither significant positive nor negative autocorrelation.
Moran's I analysis for the May 2004 event revealed that
there was no significant autocorrelation of residuals at
any distance intervals indicating that the CART-based
trend map provided the best estimate.
For the Jan. dataset, a semivariogram model using the
residuals was derived by subtracting the trend values
from the observed values. A spherical semivariogram
model was fitted with the following parameters: range
5000 m, partial sill 9.2 and nugget of 1.8 (Fig. 7).
According to Camberdella et al. (1994), there is a strong
spatial dependence of the observations if the ratio of
nugget to total semivariance is <25%, which was the
case for the Jan. soil NO3–N residual dataset but not for
the residual May dataset. The interpolated residual
Table 10
Error statistics for the regression tree and regression kriging
predictions in different seasons
Error
measures
Jan. 2004
Trend model
(CART)
RK model
Trend model
(CART)
May 2004
MPE
MAE
MSE
Precision
G value
−0.034
1.129
11.321
11.414
53.900
0.230
0.738
3.043
3.090
82.570
−0.057
0.537
1.053
1.061
85.833
Fig. 7. Experimental (thin line) and fitted (bold line) semivariogram of
the NO3–N residuals for Jan sampling event.
model was added to the trend model to produce the final
NO3–N distribution map for the Jan. sampling campaign (Fig. 8). The addition of kriged residual data
corrected for the over- and under-predictions made by
the trend model. Cross-validation of soil NO3–N
predictions derived via regression kriging showed a
MPE of 0.23, indicating that predictions slightly overestimated observed values. Overall, the RK model
performed slightly better than the trend model for the
Jan. sampling event which was confirmed by a lower
MAE, MSE and Precision (Table 10). The G value was
higher for the RK model (82.57) when compared to the
trend model (53.90), suggesting that the former
provided better regional predictions of soil NO3–N
across the watershed (Table 10). The G value for the RK
model was slightly lower than that for the May
predictions which could be attributed to the higher
variance in the Jan. NO3–N data compared to the May
data. With a given set of predictor variables, it becomes
increasingly difficult to predict the target variables when
the target variables become more variable.
Spatial patterns of soil NO3–N differed among
seasons. In Sept., class A covered an area of 11.3%,
class B 6.2%, class C 40.4% and class D 42.1%. For the
Jan. sampling event, 90.7% of the watershed was
predicted to have ≤ 5μg g− 1 soil NO3–N, while 4.2%
was predicted to be within 5–10μg g− 1 soil NO3–N,
and 5.1% had predicted values ≥ 10 μg g− 1. During the
May 2004 sampling event, 81.9% showed soil NO3–N
predictions ≤5 μg g− 1 while 14.0% were in the range
between 5 and 10μg g− 1 and 4.1% of sites ≥ 10 μg g− 1.
Identifying sites that showed consistently high soil
NO3–N predictions in different cropping seasons (Jan.
and May) may help to provide a stronger signature of
“high risk areas” that potentially contribute to excessive
nitrate leaching. Sites that showed consistently higher
values throughout the cropping season, as represented
ARTICLE IN PRESS
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
13
Fig. 8. Predictions derived from regression kriging of NO3–N distribution across the watershed for the Jan. 2004 event (search neighborhood:
1500m).
by the Jan and May sampling events, in the predicted
5–10μg g− 1 range covered 4.3% (150.7 km2) of the
watershed, while sites exceeding predicted values of
10 μg g− 1 occurred on only 0.59% (20.7 km2) of the
watershed. Over 95.7% of the watershed expanding
over an area of 3363.9 km2 predicted soil NO3–N
were ≤5 μg g− 1 (Jan. and May, 2004).
In the western part of the watershed that is
characterized by the presence of sinkholes and karst
terrain, 93.5% of the area had soil NO3–N predictions
≤ 5μg g− 1, 2.8% within predicted 5–10μg g− 1 and
3.7% exceeding 10 μg g− 1 in Jan. 2004. Katz et al.
(2004) pointed out that karst areas in Florida are
susceptible to accelerated leaching of NO3–N, allowing
for ground water pollution. High soil NO3–N values in
close proximity to streams and water bodies pose a
greater risk to increase NO3–N in surface waters. We
found that only a small area (2.5%) showed soil NO3–N
exceeding 10 μg g− 1 within a 500 m buffer of streams
and water bodies in Jan 2004 (area of the buffer = 110.8km2). Within the same buffer, 0.2% showed
predicted soil NO3–N in the 5–10 μg g− 1 range while
97.3% showed predicted values ≤ 5 μg g− 1. Similarly,
the May prediction model showed 2.5%, 8.3% and
89.1% of the buffer zone accounting for > 10, 5–10 and
< 5μg g− 1, respectively.
4. Summary and conclusions
We adopted a hybrid geospatial modeling approach
that combined sparsely measured soil NO3–N observations collected in three seasons (Sept. 2003, Jan.
and May 2004) with dense auxiliary environmental
datasets to predict NO3–N within the SFRW in northeast Florida. For each season, different sets of
environmental variables had predictive power. Based
on the relationships between environmental predictor
variables and soil NO3–N, we selected season-specific
upscaling strategies. The weakest relationships were
found for the Sept. 2003 sampling event for which a
classification tree provided the best estimate. Stronger
relationships between environmental variables, especially land use and soil NO3–N, were found for the
Jan. and May 2004 events using regression trees. For
the Jan. event we found that soil NO3–N residuals
were spatially autocorrelated and therefore proceeded
with kriging of the residuals. Overall, NO3–N
prediction quality was highest in Jan. and May
2004. At each step of the upscaling process, error
assessment provided important information about the
uncertainty of predictions. We found that no universal
methodology (e.g. CART, RK) was applicable to
predict seasonal and regional patterns of soil NO3–N
within the SFRW. This might be due to the fact that
NO3–N is a highly dynamic soil property that leaches
readily and is highly dependent on climate, soil,
topography and season-specific land use management.
Therefore, a mixed approach that combines exhaustive
spatial environmental data (e.g. spectral data derived
from remote sensing) and sparse, season-specific field
observations of NO3–N provides a compromise
between labor, costs, and accuracy for regional
space–time predictions of soil NO3–N across a large
watershed. We were able to identify sites that showed
persistent high values in soil NO3–N in different
seasons that may be targeted first to implement best
ARTICLE IN PRESS
14
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
management practices. Other sites that show high soil
NO3–N and occur in proximity to streams, water
bodies or karst features pose a potential high risk for
accelerated nitrate leaching.
Acknowledgements
This project was supported by the USDA grant #200200501 funded through the “Nutrient Science for Improved
Watershed Management” program. This research was
supported by the Florida Agricultural Experiment Station
and approved for publication as Journal Series No. R10792. We would like to thank the support of Dr. M.W.
Clark and Dr. D.A. Graetz, and Dr. R.B. Brown.
References
ArcGIS software (Vers. 9.0). Environmental Systems Research
Institute (ESRI), Redlands, CA.
Bishop, T.F.A., McBratney, A.B., 2001. A comparison of prediction
methods for the creation of field-extent soil property maps.
Geoderma 103, 149–160.
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J., 1984.
Classification and Regression Trees. Wadsworth International
Group, CA.
Brown, R.B., Stone, E.L., Carlisle, V.W., 1990. Soils. In: Myers, R.L.,
Ewel, J.J. (Eds.), Ecosystems of Florida. University of Central
Florida Press, Orlando.
Camberdella, C.A., Moorman, T.B., Novak, J.M., Parking, T.B.,
Karlen, D.L., Turco, R.F., Konopka, E.E., 1994. Field-scale
variability of soil properties in central Iowa soils. Soil. Sci. Soc.
Am. J. 58, 1501–1511.
DeAth, G., Fabricius, K.E., 2000. Classification and regression trees: a
powerful yet simple technique for ecological data analysis.
Ecology 81 (11), 3178–3192.
Erxleben, J., Elder, K., Davis, R., 2002. Comparison of spatial interpolation methods for estimating snow distribution
in the Colorado Rocky Mountains. Hydrol. Process. 16,
3627–3649.
Fernald, E.A., Purdum, E.D. (Eds.), 1998. Water Resources Atlas of
Florida. Institute of Science and Public Affairs, Florida Sate
University, FL.
Florida Department of Environmental Protection, 1998. State wide
geologic data for Florida. Available at: http://www.fgdl.org.
Goodchild, M.F., 1986. Spatial autocorrelation. Concepts and
Techniques in Modern Geography, vol. 47. Geo Books, Norwich.
Goovaerts, P., 1997. Geostatistics for Natural Resources Evaluation.
Oxford University Press, NY.
Goovaerts, P., 1999. Using elevation to aid the geostatistical mapping
of rainfall erosivity. Catena 34 (3–4), 227–242.
Guo, L.P., Zhang, F.S., Wang, X.R., Mao, D.R., Chen, X.P., 2001.
Effect of long-term fertilization on soil nitrate distribution.
J. Environ. Sci. 13 (1), 58–63.
Grunwald, S., Lamsal, S., 2005. Emerging geographic information
technologies and soil information systems. In: Grunwald, S. (Ed.),
Environmental Soil-Landscape Modeling—Geographic Information Technologies and Pedometrics. CRC Press, New York,
pp. 127–154.
Hengl, T., Heuvelink, G.B.M., Stein, A., 2004. A generic framework
for spatial prediction of soil variables based on regression-kriging.
Geoderma 120, 75–93.
Hornsby, D., Mattson, R., Mirti, T., 2001. Surface Water Quality and
Biological Annual Report. Suwannee River Water Management
District, Florida.
Hornsby, D., Mirti, T., Zwanka, W., 2002. Nitrate Profile of the Lower
Santa Fe River. Department of Water Resources, Suwannee River
Water Management District, Live Oak Florida.
ISATIS software (Vers. 4.0.1). Geovariances, Houston, TX.
James, M.R., Turner, M.G., Smithwick, E.A.H., Dent, C.L., Stanley,
E.H., 2004. Spatial extrapolation: the science of predicting
ecological patterns and processes. Bioscience 54 (4), 310–320.
Jensen, J.R., 1996. Introductory Digital Image Processing, a Remote
Sensing Perspective. Prentice-Hall, Upper Saddle River, New
Jersey.
Johnson, D.M., Osiensky, J.L., Miller, S.M., 2003. Geostatistical
ground water monitoring of a point source NO3–N plume
entering a restored riparian zone. J. Environ. Sci. Health 38 (5),
719–735.
Katz, B.G., 2004. Sources of nitrate contamination and age of water in
large karstic springs of Florida. Environ. Geol. 46, 689–706.
Katz, B.G., Hornsby, H.D., Bohlke, J.F., Mokray, M.F., 1999. Sources
and chronology of nitrate contamination in spring waters,
Suwannee River Basin, Florida. USGS Water-Resources Investigations Report 99-4252. Tallahassee, Florida.
Katz, B.G., Bohlke, J.K., Hornsby, H.D., 2001. Timescales for nitrate
contamination of spring waters, northern Florida, USA. Chem.
Geol. 179, 167–186.
Katz, B.G., Chelette, A.R., Pratt, T.R., 2004. Use of chemical and
isotopic tracers to assess nitrate contamination and ground water
age, Woodville Karst Plain, USA. J. Hydrol. 289, 36–61.
Keeney, D.R., Nelson, D.W., 1982. Nitrogen—inorganic forms. In:
Page, A.L., Miller, R.H., Keeney, D.R. (Eds.). Methods of Soil
Analysis: Part 2. American Society of Agronomy Inc., Soil Sci.
Soc. of Am. Inc., Madison, WI.
Knotters, M., Brus, D.J., Voshaar, J.H.O., 1995. A comparison of
kriging, co-kriging and kriging combined with regression for
spatial interpolation of horizon depth with censored observations.
Geoderma 67, 227–246.
Lillesand, T., Kiefer, R., 2000. Remote Sensing and Image
Interpretation. John Wiley and Sons, New York, NY.
Matheron, G., 1965. Les Variable Régionalisees et leur Estimation.
Masson, Paris, France.
McBratney, A.B., Odeh, I.O.A., Bishop, T.F.A., Dunbar, M.S., Shatar,
T.M., 2000. An overview of pedometric techniques for use in soil
survey. Geoderma 97, 293–327.
McBratney, A.B., Mendonça Santos, M.L., Minasny, B., 2003. On
digital soil mapping. Geoderma 117, 3–52.
McMahon, P.B., Böhlke, J.K., 1996. Denitrification and mixing in a
stream-aquifer system: effects on nitrate loading to surface water.
J. Hydrol. 186, 105–128.
Moran, P.A.P., 1948. The interpretation of statistical maps. J. R. Stat.
Soc., Ser. B Methodol. 10, 243–251.
Mueller, T.G., Pierce, F.J., Schabenberger, O., Warncke, D.D., 2001.
Map quality for site-specific fertility management. Soil Sci. Soc.
Am. J. 65, 1547–1558.
Odeh, I.O.A., McBratney, A.B., 2000. Using AVHRR imageries for
spatial prediction of clay content in the lower Namoi Valley of
eastern Australia. Geoderma 97, 237–254.
Odeh, I.O.A., McBratney, A.B., Chittleborough, D.J., 1995. Further
results on prediction of soil properties from terrain attributes:
ARTICLE IN PRESS
S. Lamsal et al. / Geoderma xx (2006) xxx–xxx
heterotopic cokriging and regression-kriging. Geoderma 67,
215–226.
Pérez, J.M.S., Antigüedad, I., Arate, I., Linares, C.G., Morell, I., 2003.
The influence of nitrate leaching through unsaturated soil on
groundwater pollution in an agricultural area of the Basque
country: a case study. J. Sci. Total Environ. 317, 173–187.
Randazzo, A.F., Jones, D.S., 1997. The Geology of Florida. University
Press of Florida, Gainesville, FL.
Sabesan, A., 2004. Geo-spatial assessment of the impact of land cover
dynamics and distribution of land resources on soil and water
quality in the Santa Fe River Watershed. M.S. thesis, University of
Florida, Gainesville, FL.
Schloeder, C.A., Zimmerman, N.E., Jacobs, M.J., 2001. Comparison
of methods for interpolating soil properties using limited data. Soil
Sci. Soc. Am. J. 65, 470–479.
Schmidt, W., 1997. Geomorphology and physiography of Florida. In:
Randazzo, A.F., Hones, D.S. (Eds.), The Geology of Florida.
University Press of Florida, Gainesville, FL.
SRWMD and SJRWMD (Suwannee River Water Management District
and St. Johns River Water Management District), 1995. Remote
Sensing Derived Land Use According to the Florida Land Use and
Cover Classification System (FLUCCS). Available at: http://www.
fgdl.org).
15
Suwannee River Water Management District, 2003. Surface Water
Quality and Biological Annual Report. WR-02/03-03.
Trexler, J.C., Travis, J., 1993. Nontraditional regression analyses.
Ecology 76 (6), 1629–1637.
Triantafilis, J., Odeh, I.O.A., McBratney, A.B., 2001. Five geostatistical models to predict soil salinity from electromagnetic induction
data across irrigated cotton. Soil Sci. Soc. Am. J. 65, 869–879.
Van Meirvenne, M., van Cleemput, I., 2005. Pedometrical techniques
for soil texture mapping at a regional scale. In: Grunwald, S. (Ed.),
Environmental Soil-Landscape Modeling—Geographic Information Technologies and Pedometrics. CRC Press, New York,
pp. 323–341.
Webster, R., Oliver, M.A., 2001. Geostatistics for Environmental
Scientists. John Wiley and Sons, Ltd., Chichester, England.
Wilson, J.P., Gallant, J.C., 2000. Terrain analysis: principles and
applications. John Wiley & Sons, Inc., NY.
Woodard, R.K., French, C.E., Sweat, A.L., Graetz, D.A., Sollenberger,
L.E., Macoon, B., Portier, M.P., Wade, L.B., Rymph, J.S., Prine, G.
M., van Horn, H.H., 2002. Nitrate removal and nitrate leaching
from forage systems receiving dairy effluent. J. Environ. Qual. 31,
1980–1992.
Fly UP