• No results found

A random field model for estimating the content of soil layers

N/A
N/A
Protected

Academic year: 2021

Share "A random field model for estimating the content of soil layers"

Copied!
10
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

A random field model for estimating the content of soil layers

Citation for published version (APA):

Boender, C. G. E., Rinnooy Kan, A. H. G., & Vellekoop, A. H. (1988). A random field model for estimating the content of soil layers. (Designing decision support systems notes; Vol. 8804). Eindhoven University of Technology.

Document status and date: Published: 01/01/1988 Document Version:

Publisher’s PDF, also known as Version of Record (includes final page, issue and volume numbers) Please check the document version of this publication:

• A submitted manuscript is the version of the article upon submission and before peer-review. There can be important differences between the submitted version and the official published version of record. People interested in the research are advised to contact the author for the final version of the publication, or visit the DOI to the publisher's website.

• The final author version and the galley proof are versions of the publication after peer review.

• The final published version features the final layout of the paper including the volume, issue and page numbers.

Link to publication

General rights

Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain

• You may freely distribute the URL identifying the publication in the public portal.

If the publication is distributed under the terms of Article 25fa of the Dutch Copyright Act, indicated by the “Taverne” license above, please follow below link for the End User Agreement:

www.tue.nl/taverne Take down policy

If you believe that this document breaches copyright please contact us at: openaccess@tue.nl

providing details and we will investigate your claim.

(2)

Editors: prof.dr. K.M. van Hee prof.dr. H.G. Sol

A RANDOM FIELD MODEL FOR ESTIMATING THE CONTENT OF SOIL LAYERS

by

C.G.E. Boender A.H.G. Rinnooy Kan A.H. Vellekoop

EINDHOVEN UNIVERSITY OF TECHNOLOGY l i u Buisson

Vepartment of Mathematics and Computing Science P.O. Box 513

5600 MB EINDHOVEN September 1988

(3)

C.G.E. Boender

Econometric Institute, Erasmus University

P.O. Box 1738, 3000 DR Rotterdam, the Netherlands

A.H.G. Rinnooy Kan

Econometric Institute, Erasmus University

P.O. Box 1738,3000 DR Rotterdam, the Netherlands

A.H. Vellekoop

M.I.S.-Department, Erasmus University Rotterdam P.O. Box 1738,3000 DR Rotterdam, the Netherlands

(4)

A RANDOM FIELD MODEL FOR ESTIMATING THE CONTENT OF SOIL LAYERS

C.G.E. Ecender A.H.G. Rinnooy Kan A.H. Vellekoop

Abstract

o

We define a smooth stationary Gaussian random field on a set ScR (0)=1), for which, given a number of observations, the posterior expected integral over

S

and its variance are calculated analytically. These formulae ~re useful, for ex.ample, to dredging companies who want to use a small number of extremely expensive measurements to arrive at an estimate (and its accuracy) of the total content of soil layers.

1. INTRODUCTION

A typical dredging project is the excavation of an area to a certain depth so as to create a harbour or canal. Frequently, the company who tenders the

1m-lest offer ':'li11 be allowed to do the job.

Usually several soil layers have to be removed, containing sand, mud, clay or rock. The cost to remove such a layer is proportional to its content. Drilling information that can be used to obtain an estimate of the content is almost always scarce, and extremely expensive if not impossible to obtain.

Thu3, a dredging company is likely to specify a price which is either too high or too low. If the price is specified too high the company will probably not be granted the job since there is a severe competition, due to an enormous world-wide surplus of dredging material. If the price is set too low and the project is assigned to the company, it will incur a potentially substantial loss which is porportional to the error of the estimation.

Facing the above problem, a Dutch dredging company posed the following challenge:

A. Construct a mathematical model which, given a number of measurements of the thickness of a soil layer in different locations, answers the following questions:

(5)

1. What is the thickness of the layer in other locations? 2. What is the total quantity of soil in the area of interest? 3. What 1s the accuracy of the above estimates?

B. How can available expert-opinion be incorporated?

Question A3 motivated the use of stochastic methods. A natural way to proceed

(cf. Schagen (19791) is to specify a prior random field for the layer. The random field is a probability distribution defined on a class of functions, in which the layer is assumed to be imbedded. Then the posterior random field is computed, conditional on the set of measurements in the observation points.

Clearly, any question than can be asked about the layer can just as readily be asked about its prior and posterior counterpart. The prior properties have to be in accordance with known properties of the layer to assure that it provides a sound mathematical model; given the observations, the posterior properties of the field serve to answer the questions AI, A2 and A3. Thus, for our problem the posterior expected value of the field in any location, the posterior expected value of the integral of the field over the area of interest, as well as their corresponding pos terior variances are of special interest.

In the literature various stationary Gaussian random fields are used as mathematical models. However, all these fields suffer from the serious drawback that the posterior expected value of the integral, as well as its-variance, can only be computed using numerical approximation methods. To overcome this difficulty we will define a smooth stationary Gaussian random field which we have not encountered in the literature sofar, whose sample paths are continuously differentiable infinitely often with probability 1, and for which the posterior expectation of the integral and its posterior variance can be calculated analytically.

Currently we are building a Decision Support System, which is based on the mathematical framework described above. Due to the analytical formulae for the quantities of interest, the system is very user-friendly with respect to the response-time. Expert-knowlegde about layers is incorporated in the system through so-called fictive measurements which are of vital importance for the estimation of the unknown parameters of the field.

(6)

2. STATIONARY GAUSSIAN RANDOM FIELDS.

A random field can be viewed as a probability distribution on a space of functions, called realizations or sample paths. For n points xl"",xn in the

domain S, the function values f(xl~, •••• f(Xn) will have a joint n-dimensional distribution. The collection of all these joint distributions for all possible values of nand xl' •••• Xn constitutes the family of finite-dimensional distributions of the field. A random field is called Gaussian if each member of the family is multivariate Gaussian. These field are characterized by the mean- and covariance function:

(1) lJ(X) = E(f(x» (XES)

( 2) r(x,y)

=

E(ff(x) - U(x»).[f(y) - lJ(Y)]) (x,y£S).

A Gaussian field is called stationary or homogeneous if its covariance function only depends on t=x-y, and if the mean function is equal to a constant.

Conversely, we can define a stationary Gaussian field by specifying a constant

u,

and a nonnegative definite function ret). Then, due to a famous result of Kolmogorov [1933], there exists a stationary Gaussian random field possessing

lJ and ret) as mean- and covariance functions. Furthermore, an important result

of Bochner [1933] states that a function is nonnegative definite i.f.f. it is' the characteristic function of a distribution function. Thus, the following characteristic functions may serve as covariance functions

2 D d d (3) r(x-y) .. a II exp{-I x ~y

I }

d=l a 2 D

C(d_~d )2 }

(4 ) r(x-y) = a II exp

{-t

d=l a 2 D 1 (5) r(x-y) '= a II d d d=l 1

+

(X

-a

)2

a

(7)

The random fields with covariance functions (3) and (4) which correspond respectively to the characteristic functions of the product of D independent Cauchy- and Gaussian distributions are well known. The stationary field with covariance function (5) corresponds to the characteristic function of the product of D independent bilateral exponential distributions (cf. Feller

[1971]).

from the theorems in the book of Cramer and Leadbetter [1967J it follows that the realizations of the field with covariance function (3) are continuous with probability 1. The realizations of the fields with covariance function (4) and

(5) can be proven to be continuously differentiable infinitely often with probability 1, so that these field are good mathematical models for the soil layers of interest, possibly with the exception of thin rocklayers in a small region.

3. INTERPOLATION AND INTEGRATION.

Suppose that n observations fl=f(xl), ••• ,fn=f(~) are taken. Then it is well known (c.f. e.g. Adler [1981)) that for each stationary Gaussian field the conditional field or so-called posterior field is also Gaussian with mean-function (6) n

L L

n (f.-~).rij -1 .r(x-x.), i=l j=l' 1. J and variance-function ( 7) n

L L

n r(x-xi)·r-1 ij .r(x-x.), i=1 j=1 J -1

where r .. is the (i,j)-th element of the inverse of the matrix of covariances

1.J

between the function values in the observation points xl""'xn •

The posterior mean function (6) is the best approximation of the field in the least squares sense, and passes through all the observed points, so that it can be used as a proper interpolation function.

For each stationary Gaussian field the expected value of the integral I over a

D d d

(8)

D

(8) E(l) D

u.

IT (bd_ad).

d-l

For the field with covariance function (5) the corresponding variance can be seen to be equal ~o:

( 9) a 2 (I) ,. a 2 ';;;;"':"'~d::---'-·arctan( 2(bd_a d) bd_ad d )-1n[1+( bd_a d 2 d)

J}.

a a a

Given a set of observations (f1=f(xl), ••• ,fnDf(~)} for the field with covariance function (5) the posterior results corresponding to (8) and (9) can be seen to be equal to (10) E(I

I

f1, ••• ,fn) = E(I) bd_x~ d d 2 n n D d a -x

I I

-1 (T)-arctan ( d

I)}

+

a (fj-lJ)rij IT a {arctan i=l jal d=l a a d d d d 4

-a n

L

n

I

D IT (a ) d 2 (arctan( b -xi d )-arctan(---d---)]·r i ·• a -xi -1

i=l j=l d-l a a J

d d d d

b -x. a -x. [arctan( d J)-arctan( d

J)J.

a a

T~e results (8) up to (11) can be proven by using the result that the expectation and variance of the integral of the field are equal to the integral of the mean- and covariance ~unction respectively. Details will appear in Vellekoop [1988].

4. EXPERT KNOWLEDGE AND A PRACTICAL EXAMPLE.

In this section we briefly describe the results we obtained for a layer of mud on a square region of 500 x 500 metres, for which only 10 measurements of the thic.kness of the layer were available. To these 10 real measurements the experts of the dredging company added 15 extra points where the thickness of the layer was guessed. This information is referred to as so-c.alled fictive measurements.

(9)

We only describe the results which have been obtained with the field with covariance function (5). First, the parameters a1.a Z and 1.1,02 have to be

determined. Suppose al ad a Z are given. Then, given the n=10 real measurements

?

the maximum likelihood estimates of u and 0- are:

n n -1 n n -1 (12) uMLE = E

r

f i ' rij

I

I: I: rij i=1 j=1 i-I j-l 2 2 n n -1 (13) a MLE

= -

n a I:

r

(fi-\.I)·rij .(fr\.l)· i-I j=1

Thus, given our choice of a 1 and a 2, and the resulting maximum likelihood estimates of \.I and 02 we can compute the difference Ai of a fictive

measurement and the posterior expected value in this point (1=1, ••• ,15). Our procedure is to choose values for a l and a 2 for which the sum of squared

errors

r~51 A~

is minimal. The rationale of this procedure is as follows. The

~= ~

values of a l and a 2 determine how the correlation between the function values of two points decreases as a function of the distance between the points (c.f. (5». Hence, these values determine the rate In which the prior assumptions are abandoned in favour of the 10 real observations. The procedure will be incorporated in a decision support system that will assist the dredging company in arriving at a proper tender price.

MUDLAYER

Prior expected content 1.25 x

Posterior expected content 1.21 x

Prior standard deviation 0.24 x

Posterior standard deviation 0.05 x

al 120.0

a2 120.0

maximum likelihood estimate \.I 5.0

maximum likelihood estimate a 2.0

106m3 106m3 106m3 106m3 ( 8) (10) ( 9) (11 ) (12) (3)

(10)

References

Adler, J.A. (1981), The geometry of random fields, Wiley, New York. Bochner, S. (1935), Monotone Funktionen Stieltjessche Integrale und

Harmonische Analyse, Mathematische Annalen 108.

Cramer, H. & M.R. Leadbetter (1967), Stationary and related stochastic processes, Wiley, New York.

Feller, W. (1971), An introduction to probability theory and its applications II, Wiley, New York.

Kolmogorov, A.N. (1933), Grundbegriffe der Wahrscheinlichkeitsrechnung, Erg. Mat. 2.

Schagen, I.P. (1979), Interpolation in two dimensions - a new technique. J.

Inst. Maths. Applies. 23.

Vellekoop, A.H. (1988), The design of decision support systems: a case study, to appear.

Referenties

GERELATEERDE DOCUMENTEN

In the following subsections of the Introduction we define the model of interest and formulate our main results on the fluctuations of the partition function of the random energy

We adapt a regeneration-time argument originally developed by Comets and Zeitouni [8] for static random environments to prove that, under a space-time mixing property for the

a general locally finite connected graph, the mixing measure is unique if there exists a mixing measure Q which is supported on transition probabilities of irreducible Markov

In Section 3 we assume a stronger space-time mixing property, namely, exponential mixing, and derive a series expansion for the global speed of the random walk in powers of the size

• Next, we investigate the background effective field theory (EFT) of inflation with the dimension-five (dim-5) and dimension-six (dim-6) mixing operators. This EFT approach, which

This condition is given in terms of the original system coefficients and is furthermore necessary and sufficient for the existence of a positive semi-definite

Here we present findings of a study conducted to deter- mine the prevalence of bTB and risk factors associated with bTB infection in pastoral and agro-pastoral communities at

In een recent rapport van het Engelse Institution of Engineering and Technology (IET, zie www.theiet.org) wordt een overzicht gegeven van de redenen waarom 16-