On estimation of the intensity function of a point process
Citation for published version (APA):Lieshout, van, M. N. M. (2010). On estimation of the intensity function of a point process. In Fifth International Workshop in Applied Probability (IWAP 2010, Madrid, Spain, July 5-8, 2010. Extended abstracts) (pp. 1-3)
Document status and date: Published: 01/01/2010
Document Version:
Publisher’s PDF, also known as Version of Record (includes final page, issue and volume numbers)
Please check the document version of this publication:
• A submitted manuscript is the version of the article upon submission and before peer-review. There can be important differences between the submitted version and the official published version of record. People interested in the research are advised to contact the author for the final version of the publication, or visit the DOI to the publisher's website.
• The final author version and the galley proof are versions of the publication after peer review.
• The final published version features the final layout of the paper including the volume, issue and page numbers.
Link to publication
General rights
Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain
• You may freely distribute the URL identifying the publication in the public portal.
If the publication is distributed under the terms of Article 25fa of the Dutch Copyright Act, indicated by the “Taverne” license above, please follow below link for the End User Agreement:
www.tue.nl/taverne Take down policy
If you believe that this document breaches copyright please contact us at: openaccess@tue.nl
providing details and we will investigate your claim.
On estimation of the intensity function of a point process
M.N.M. van Lieshout1
1 CWI and Eindhoven University of Technology,
Science Park 123, 1098 XG Amsterdam, The Netherlands
Abstract. Estimation of the intensity function of spatial point processes is a fundamental problem. In this paper, we interpret the Delaunay tessellation field estimator recently introduced by Schaap and Van de Weygaert as an adaptive kernel estimator and give explicit expressions for the mean and variance. Special attention is paid to Poisson processes.
Keywords.Delaunay tessellation field estimator, generalised weight function estimator, intensity func-tion, kernel estimator, mass preservafunc-tion, Poisson process, second order product density.
1
Introduction
Let Φ be a simple point process onRd whose moment measure M exists and is locally finite.
Make the further assumption that M is absolutely continuous with respect to Lebesgue measure, so that
M (A) = Z
A
λ(x) dx
for some measurable function λ(x) ≥ 0 for every bounded Borel set A. The goal is to estimate λ(·) based on a realisation Φ ∩ A of Φ in some open, convex and bounded Borel set A 6= ∅.
The classic approach is to use the Berman–Diggle (1989) estimator \
λBD(x0) :=
N (b(x0, h) ∩ A)
|b(x0, h) ∩ A|
, (1)
for x0∈ A. Here the notation N (B) is used for the cardinality of Φ ∩ B for bounded Borel sets
B ⊂Rd, and |B| for its Lebesgue mass. The bandwidth h > 0 controls the amount of smoothing.
Note that as A is open, one never divides by zero. By construction, the expectation of (1) is M (b(x0, h) ∩ A)/|b(x0, h) ∩ A|.
Although (1) is a natural estimator, it does not necessarily preserve the total mass in W , nor is it based on a (generalised) weight function. In this paper we consider alternative estimators that do possess these two properties.
2
Mass preserving kernel estimation
Note that (1) can be regarded as a kernel estimator with kernel 1{||x − x0|| < h}/|b(x0, h) ∩ A|.
The edge correction term |b(x0, h) ∩ A| is ‘global’ as it does not depend on x. Using a ‘local’
edge correction instead suggests the estimator \ λK(x0) := X x∈Φ∩A 1{||x − x0|| < h} |b(x,h) ∩ A| (2) based on the kernel kh(x0 | x) = 1{||x − x0|| < h}/|b(x, h) ∩ A| for x0 ∈ A. Note that (2) is
2 M.N.M. van Lieshout
(2) is based on a proper weight function as Z A kh(x0| x) dx0= Z A 1{||x − x0|| < h} |b(x, h) ∩ A| dx0≡ 1 for all x ∈ A. Consequently, (2) is mass preserving, that is, R
Aλ\K(x0) dx0 = N (A) almost
surely.
The first two moments of (2) are given by Eh \λK(x0) i = Z A∩b(x0,h) λ(x) |b(x, h) ∩ A| dx; E · \ λK(x0) 2¸ = Z (b(x0,h)∩A)2 ρ(2)(x, y) |b(x, h) ∩ A| |b(y, h) ∩ A| dx dy + Z b(x0,h)∩A λ(x) |b(x, h) ∩ A|2 dx,
provided the second order factorial moment measure of Φ exists as a locally finite measure that is absolutely continuous with respect to the product Lebesgue measure with Radon–Nikodym derivative ρ(2).
3
Local versus global edge correction
Neither (1) nor (2) is universally better in terms of integrated mean squared error than its competitor. To see this, first consider a homogeneous Poisson process Φ with intensity λ > 0. Then, the integrated variance of both (1) and (2) is equal to λR
A|b(x, h) ∩ A|
−1dx. The bias of
the Berman–Diggle estimator is zero, whereas (2) is biased unlessR
b(x0,h)∩A|b(x, h) ∩ A|
−1dx =
1. So, in general, (1) will be preferred.
Next, let Φ be a Poisson process on A with intensity function λ(x) = λ |b(x, h) ∩ A|, for some λ > 0. Then, (2) is unbiased with integrated variance λ |A|. Write
m(x0) := Z b(x0,h)∩A |b(x, h) ∩ A| |b(x0, h) ∩ A|2 dx.
Then,E \λBD(x0) can be expressed as λ(x0) m(x0), so its integrated squared bias is zero if and
only if m(x0) = 1 for almost all x0∈ A. The integrated variance of (1) is λRAm(x0) dx0which
reduces to λ|A| if m(x0) = 1 for almost all x0∈ A so that the estimators are indistinguishable.
Otherwise the mass preserving kernel estimator should be preferred sinceR
Am(x0) dx0≥ |A|.
4
Delaunay tessellation field estimator
Suppose that realisations of the point process Φ are almost surely in general quadratic position, that is, no d + 2 points are located on the boundary of a sphere and no k + 1 points lie in a k − 1 dimensional affine subspace for k = 2, . . . d. Then the Delaunay tessellation of Φ is well defined. The union of Delaunay cells containing a point xi∈ Φ is the contiguous Voronoi cell of
xi in Φ and will be denoted by W (xi | Φ). For further details, see e.g. Møller (1994) or Okabe
et al. (2000).
Note that the tessellation cells described above can be seen as adaptive neighbourhoods of a point of Φ. In contrast to the balls of fixed radius h used before, the size of the cells depend on the point process: In densely populated regions, the cells will be small, whereas they tend to be larger in regions of low intensity. Based on this idea, Schaap and Van de Weygaert (2000,
On estimation of the intensity function of a point process 3
2007) introduced the Delaunay tessellation field estimator (DTFE) as follows. For x ∈ Φ ∩ A, set \λD(x) := (d + 1)/|W (x | Φ ∩ A)|, and for any x0∈ A in the interior of some Delaunay cell,
define \ λD(x0) := 1 d + 1 X x∈Φ∩D(x0|Φ∩A) \ λD(x) (3)
by averaging over the vertices of the Delaunay cell D(x0| Φ ∩ A) that contains x0.
The DTFE preserves total mass and is an adaptive kernel estimator. To see this, write D(ϕ ∩ A) for the family of Delaunay cells of ϕ ∩ A, and set
g(x0| x, ϕ) := P Dj∈D(ϕ∩A)1{x0∈ D ◦ j; x ∈ Dj} |W (x | ϕ ∩ A)| ,
for x0∈ A \ ϕ, x ∈ ϕ, and g(x | x, ϕ) := (d + 1)/|W (x | ϕ ∩ A)| if x ∈ ϕ ∩ A. Then \λD(x0) =
P
x∈Φ∩Ag(x0| x, Φ), and, asRAg(x0| x, ϕ) dx0= 1, mass preservation follows. Note that there
is no need for a subjective choice of bandwidth, though at some computational cost. The first two moments of (3) are given by
Eh \λD(x0) i = Z A Ex[g(x0| x, Φ)] λ(x) dx E · \ λD(x0) 2¸ = Z A2E (2) x,y[g(x0| x, Φ) g(x0| y, Φ)] ρ(2)(x, y) dx dy + Z A Ex£g2(x0| x, Φ)¤ λ(x) dx
provided the second order factorial moment measure of Φ exists as a locally finite measure that is absolutely continuous with respect to the product Lebesgue measure with Radon–Nikodym derivative ρ(2). HereE
x (E (2)
x,y) denotes expectation with respect to the Palm distribution of Φ
at x (the two-fold Palm distribution at x, y). If Φ is a Poisson process with intensity function λ(·), P!
x= P and ρ(2)(x, y) = λ(x) λ(y). The result should be compared to its kernel estimation
counterpart.
Finally, assume that Φ is a stationary Poisson process on Rd. Then, (3) is unbiased with
variance cdλ2. The constant cd depends on the dimension. For example if d = 1, c1 = 2 (2 −
π2/6) ≈ 0.7. Since the Berman–Diggle estimator is unbiased with variance λ ω−1
d h
−d, where ω d
is the volume of the unit ball, it is more efficient than (3) wheneverEN(b(0, h)) > 1/cd. Hence on
the line, (1) is the better choice ifEN((−h, h)) = 2λh > 1.4. For comparison, the computation of (3) in this case requires four points. Simulations by the author indicate that DTFE would be preferred for strongly oscillating intensity functions in contexts where peak preservation is important; for mildly fluctuating intensity functions, kernel estimation seems more efficient.
References
Berman, M. and Diggle, P.J. (1989): Estimating Weighted Integrals of the Second-Order Intensity of a Spatial Point Process. Journal of the Royal Statistical Society Series B, 51, 81–92.
Møller, J (1994): Lectures on Random Voronoi Tessellations. Lecture Notes in Statistics 87. Springer-Verlag.
Okabe, A. and Boots, B. and Sugihara, K. and Chiu, S.N. (2000): Spatial Tessellations. Concepts and Applications of Voronoi Diagrams.Second Edition. Wiley.
Schaap, W.E. (2007): DTFE. The Delaunay Tessellation Field Estimator. Ph.D. Thesis, University of Groningen.
Schaap, W.E. and Weygaert, R. van de (2000): Letter to the Editor. Continuous Fields and Discrete Samples: Reconstruction through Delaunay Tessellations. Astronomy and Astrophysics, 363, L29– L32.