Identifying fine-scale habitat preferences of threatened butterflies using country-wide airborne laser scanning data

(1)

BIODIVERSITY RESEARCH

1

2

Identifying fine-scale habitat preferences of threatened

3

butterflies using country-wide airborne laser scanning

4

data

5

6

Jan Peter Reinier de Vries1, Zsófia Koma1, Michiel F. WallisDeVries2,3 & W. Daniel Kissling1,*

7 1

Institute of Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, Amsterdam, 8

the Netherlands. 9

2

De Vlinderstichting / Dutch Butterfly Conservation, Wageningen, the Netherlands 10

3_{Plant Ecology and Nature Conservation Group, Wageningen University, Wageningen, the} 11

Netherlands 12

*Corresponding author: E-mail address: wdkissling@gmail.com (W.D. Kissling). 13

14

Acknowledgements

15

This work is part of the project “eScience infrastructure for Ecological applications of LiDAR point 16

clouds” (eEcoLiDAR) (Kissling et al., 2017), funded by the Netherlands eScience Center 17

(https://www.esciencecenter.nl), grant number ASDI.2016.014. The Dutch Butterfly Monitoring 18

Scheme is a joint scheme of Dutch Butterfly Conservation and Statistics Netherlands and is financed 19

by the Ministry of Agriculture, Nature and Food Quality in the framework of the Dutch Network 20

Ecological Monitoring programme. We are greatly indebted to the many volunteers that contributed to 21

the data collection. We thank Chris van Swaay for discussion and providing photographs of butterflies. 22

23

Biosketch

24

The research team is interested in understanding the spatial distribution of biodiversity and the habitat 25

preferences of species, using a combination of in-situ (species) observations with remote sensing. The 26

extraction of vegetation structural parameters from 3D point clouds derived from Light Detection And 27

Ranging (LiDAR) is of particular interest for quantifying ecosystem structure across broad spatial 28

extents. The research team is also interested in applying ecological knowledge and theory to the 29

conservation of species and habitats. 30

(2)

BIODIVERSITY RESEARCH

32

33

Identifying fine-scale habitat preferences of threatened

34

butterflies using country-wide airborne laser scanning

35

data

36

37

Running title: LiDAR-based butterfly habitats

38

Word count: 5362 (excluding the title, abstract, references, figure captions, and tables)

39 Number of references: 63 40 41 Abstract 42

Aim: Light Detection And Ranging (LiDAR) is a promising remote sensing technique for ecological

43

applications because it can quantify vegetation and habitat structure at high resolution over broad 44

spatial extents. Using country-wide airborne laser scanning (ALS) data, we test to what extent LiDAR 45

metrics capturing low vegetation, medium-to-high vegetation and landscape-scale habitat structures 46

can explain the fine-scale habitat preferences of threatened butterflies. 47

Location: The Netherlands.

48

Methods: We applied a machine learning (random forest) algorithm to build species distribution

49

models (SDMs) for grassland and woodland butterflies in wet and dry habitats using various LiDAR 50

metrics and butterfly presence-absence data collected by a national butterfly monitoring scheme. The 51

LiDAR metrics captured vertical vegetation complexity (e.g. height and vegetation density of different 52

strata) and horizontal heterogeneity (e.g. canopy roughness, terrain slope, vegetation openness and 53

edge extent). We assessed the relative variable importance and response curves of each LiDAR metric 54

for explaining butterfly occurrences. 55

Results: All SDMs showed a good to excellent fit, but woodland butterfly SDMs performed better

56

than those of grassland butterflies. Grassland butterfly occurrences were best explained by landscape-57

scale habitat structures (e.g. open patches, terrain slope) and vegetation height. Woodland butterfly 58

occurrences were mainly determined by vegetation density of medium-to-high vegetation and 59

landscape-scale LiDAR metrics (open patches and edge extent). The importance of metrics generally 60

differed between wet and dry habitats for both grassland and woodland species. 61

Main conclusions: LiDAR metrics derived from country-wide ALS data provide detailed insights into

62

the fine-scale habitat preferences of butterflies, even in low-stature habitats such as grasslands. 63

However, the information content of low vegetation metrics derived from leaf-off ALS is limited and 64

may not capture all key habitat structures of open habitats. Nevertheless, country-wide ALS and other 65

LiDAR data offer great potential for ecological studies and provide detailed insights into invertebrate-66

habitat relationships. 67

Keywords: active remote sensing, ecological niche, ecosystem structure, environmental heterogeneity,

68

habitat suitability, insects, microhabitat 69

(3)

1 | INTRODUCTION

70

Butterflies and other invertebrates have declined severely in recent decades, especially in parts of 71

Europe where well-monitored populations have revealed long-term trends (van Swaay, Warren & 72

Loïs, 2006; Hallmann et al., 2017). The specialised niches of many butterflies in terms of habitat and 73

food plant requirements make them vulnerable to ongoing habitat modification and other global 74

change drivers (Thomas et al., 2004). Butterflies are generally a well-known group that can be easily 75

detected, they are diverse and often bound to specific habitats, and hence a very good indicator and 76

umbrella taxon for invertebrate conservation (Thomas, 2005; van Swaay et al., 2006). Comprehensive 77

survey efforts have revealed severe population declines and extinctions, especially of specialist 78

species, e.g. in the Netherlands (Bos, Bosveld, Groenendijk, van Swaay & Wynhoff, 2006; van Strien, 79

van Swaay, van Strien-van Liempt, Poot & WallisDeVries, 2019), Flanders (Maes & Van Dyck, 80

2001), Denmark (Eskildsen et al., 2015) and Great Britain (Fox et al., 2015). In the Netherlands 81

butterflies have declined by 50% since 1992 and over 80% since 1890 (van Strien et al., 2019). The 82

major causes of these declines have been the intensification of human land use, the modification of 83

heterogeneous (semi-)natural landscapes, and an increase in habitat fragmentation (e.g. Thomas et al., 84

2004; van Swaay et al., 2006; Aguirre‐Gutiérrez et al. 2017; van Strien et al., 2019). Although a 85

reduction of landscape conversion and an increase in conservation efforts have slowed down butterfly 86

declines since 1990 (Carvalheiro et al., 2013; van Strien et al., 2016), a large part of the Dutch 87

butterfly species remain highly vulnerable and are still declining (van Strien et al., 2019; van Swaay, 88

2019). This shows the urgent need of sustaining and increasing efforts to preserve butterflies and their 89

habitats. 90

The preservation of habitats is of critical importance to prevent further losses and declines of 91

butterflies and other invertebrates (van Swaay et al., 2006; van Strien et al., 2019). Since most 92

invertebrates depend on specific habitat elements that provide food resources, nesting sites and shelter, 93

understanding how the fine-scale structure and distribution of habitats determines species distribution 94

is crucial for biodiversity science and conservation (Thomas, 1995; Dennis, Schreeve & van Dyck, 95

2003, 2006). Habitat structure has also many indirect effects on invertebrates, e.g. by influencing 96

microclimate, light availability, and floristic composition (Davies & Asner, 2014; Müller, Bae, Röder, 97

Chao & Didham, 2014; Aguirre-Gutiérrez et al., 2017). The fine-scale habitat suitability of 98

invertebrates is typically driven by various aspects of vegetation structure, including vertical 99

vegetation complexity (e.g. the density of specific strata), horizontal heterogeneity (e.g. canopy 100

roughness) or the horizontal structure of vegetation at the landscape scale (e.g. the extent of edges and 101

open spaces) (Bakx, Koma, Seijmonsbergen & Kissling, 2019; Davies & Asner, 2014; Glad, 102

Reineking, Montadert, Depraz & Monnet, 2020; Simonson, Allen & Coomes, 2014). Despite many 103

local field studies on butterfly-habitat relationships, the generality of these relationships remains 104

unclear because quantifying vegetation structure across broad spatial extents has often been limited by 105

the difficulty to obtain detailed, high resolution data in a standardised, comparable and spatially 106

contiguous way (Davies & Asner, 2014; Kissling, Seijmonsbergen, Foppen & Bouten, 2017; Valbuena 107

et al., 2020). 108

Recent developments in remote sensing show great potential to fill this gap. For instance, 109

LiDAR (Light Detection And Ranging) can produce standardised 3D measurements of vegetation 110

structure at high resolution and over broad spatial extents, with relatively low costs (Davies & Asner, 111

2014; Kissling et al., 2017). LiDAR data derived from country-wide Airborne Laser Scanning (ALS) 112

are also increasingly becoming available from free and open sources (Valbuena et al., 2020). LiDAR 113

uses short-range laser pulses to measure the x,y,z-coordinates of reflective objects, often from 114

aircrafts. Since the exact timing and position of the sensor are known, the distance to each point can be 115

calculated and a 3D point cloud with high precision can be derived, from which a large number of 116

vegetation structure parameters can be calculated (Bakx et al., 2019; Davies & Asner, 2014). These 117

parameters —often referred to as LiDAR metrics— are statistical properties of the point cloud 118

(4)

describing the mean, variability or proportions of returns for vertical strata. They can capture 119

information on vegetation structure at a local scale (e.g. for a high resolution grid cell or a radius 120

around a focal observation point) or at the landscape scale (e.g. measuring habitat patches and edges 121

based on grid cells that capture LiDAR-derived vegetation height) (Bakx et al., 2019). LiDAR metrics 122

can thus directly be used to quantify ecological niches and habitat requirements of species. This makes 123

LiDAR a transformative resource for ecological studies, enabling a detailed understanding of the 124

specific and scale-dependent habitat preferences of species across broad spatial extents, and with 125

direct insights for management and conservation (Müller & Brandl, 2009; Davies & Asner, 2014; 126

Simonson et al., 2014; Moeslund et al., 2019). 127

Only few LiDAR studies have so far focused on invertebrates (Davies & Asner, 2014). While 128

such studies generally emphasize the importance of vegetation structure and landscape heterogeneity 129

for species diversity, they also show that individual taxa respond differently to specific habitat 130

characteristics (Vierling et al., 2011; Hess et al., 2013; Davies & Asner, 2014). As most LiDAR 131

studies have focussed on forests and woody habitats (Bakx et al., 2019; Davies & Asner, 2014), it 132

remains open to what extent LiDAR can capture vegetation structure of low-stature habitats such as 133

grasslands, dunes and wetlands. Some previous studies show promising results for measuring 3D 134

vegetation structure in grasslands and wetlands (e.g. Alexander, Deák, Kania, Mücke & Heilmeier, 135

2015; Koma, Seijmonsbergen & Kissling, 2020; Zlinszky et al., 2014). However, country-wide 136

LiDAR surveys are often conducted in the leaf-off season to optimise terrain mapping (Reutebuch, 137

Andersen & McGaughey, 2005) and may then contain little information for quantifying the vertical 138

structure within low-stature vegetation (Alexander et al., 2015). Moreover, measuring vegetation 139

structure in the understory of forests with dense canopies can also be challenging because laser returns 140

might predominantly be recorded from the canopy, especially with discrete return data (Anderson, 141

Hancock, Disney & Gaston, 2016). Comparing the explanatory power and information content of a 142

suite of LiDAR metrics in open and woody habitats is thus important to better understand the potential 143

of LiDAR data for ecological research, biodiversity conservation and nature management (Davies & 144

Asner, 2014). 145

Here, we analyse to what extent specific LiDAR metrics derived from country-wide ALS data 146

can explain the fine-scale habitat preferences of threatened butterflies in the Netherlands. We focus on 147

four species that are all of conservation concern (van Swaay, 2019) and which are bound to specific 148

habitats, representing grassland and woodland habitats in wet or dry conditions. We build species 149

distribution models (SDMs) with LiDAR metrics that capture the vertical complexity and horizontal 150

heterogeneity of vegetation, and use species presence-absence data derived from a national butterfly 151

monitoring scheme (van Swaay, Nowicki, Settele & van Strien, 2008) as the response variable. We 152

expect that (H1) LiDAR metrics reflecting low vegetation (e.g. forest understory or grasses and herbs 153

in open habitats) show little importance in explaining habitat preferences of butterflies because of the 154

limitations of LiDAR data in dense forests (due to the low penetrability of the canopy) or in grasslands 155

(due to the leaf-off acquisition of LiDAR data), (H2) metrics reflecting medium to high vegetation 156

(e.g. the density or heterogeneity of shrub and tree layers) are especially important to explain habitat 157

preferences of woodland butterflies, and (H3) metrics reflecting landscape-scale habitat structures (e.g. 158

terrain, woodland edges and vegetation openness) are important to explain habitat preferences of both 159

grassland and woodland butterflies. Our analyses gain new insights into habitat preferences of 160

butterflies at local and landscape scales, and show that LiDAR can improve our yet limited knowledge 161

on invertebrate-habitat relationships at a national extent. 162 163

2 | METHODS

164

2.1 | Butterfly data

165

(5)

We focus on four butterfly species with different habitat preferences: (1) the small pearl-bordered 166

fritillary (Boloria selene), a specialist of wet grasslands with a low and flower-rich vegetation 167

(Bergman et al., 2008; Bos et al., 2006; Cozzi, Müller & Krauss, 2008; van Swaay, 2019); (2) the 168

grayling (Hipparchia semele), a species inhabiting dry open habitats with a heterogeneous cover of 169

bare sand, low grasses, nectar plants and scattered woody vegetation (Bos et al., 2006; Vanreusel, 170

Maes & van Dyck, 2007; van Swaay, 2019); (3) the white admiral (Limenitis camilla), a butterfly of 171

moist deciduous woodlands with open patches providing sunlight throughfall (Bos et al., 2006; van 172

Swaay, 2019); and (4) the heath fritillary (Melitaea athalia), a dry woodland species which is mostly 173

found on sheltered open spaces with a flower-rich herb vegetation near woodland edges (Bergman et 174

al., 2008; Bos et al., 2006; van Swaay, 2019; Warren, 1978a,b). More details on their habitat 175

preferences are summarized in Supporting Information Table S1. All four species have a localised 176

distribution in the Netherlands and have strongly declined over the last century (van Strien et al., 2019; 177

van Swaay, 2019). Two species (L. camilla and M. athalia) are confined to regions in the east and 178

centre of the Netherlands, whereas the other two (B. selene and H. semele) occur on both inland and 179

coastal (dune) locations (Figure 1). 180

Presence-absence data of all four species were derived from the Dutch butterfly monitoring 181

scheme, which conducts weakly repeated surveys along transects throughout the flight season (April to 182

September) (van Swaay et al., 2008). Each monitoring transect is about 1 km long and consists of 183

sequences of 50×5 m sections, typically placed in one habitat type. We used the nation-wide 184

monitoring data from 2014–2018, in correspondence with the LiDAR data collection period (winter 185

2014 – winter 2019). This comprised a total of >10.000 transect sections across the Netherlands, from 186

which the focal species were recorded in 371 (B. selene), 807 (H. semele), 369 (L. camilla) and 119 187

(M. athalia) sections, respectively. The recorded presence and absence of each species was assigned to 188

the centre point of each 50×5 m transect section and later used as the response variable in the SDMs 189

(see below). 190

Since the number of individuals of all monitored butterfly species is also recorded per transect 191

section, we used this information not only to identify absences but also incidental records. Presence 192

points for which only one individual was observed during all 2014–2018 surveys were excluded, as 193

these records could represent misidentifications or wandering individuals. For the analyses, we only 194

included absences in a 10 km buffer around presence points to account for the limited mobility of the 195

species (Essens, van Langevelde, Vos, van Swaay & WallisDeVries, 2017). This selection was done 196

using QGIS 3.4 (QGIS Development Team, 2009). We further identified the soil type of each transect 197

section —a key determinant of vegetation and thus an indirect driver of butterfly distributions— using 198

national soil classification data (Wösten, de Vries, Denneboom & van Holst, 1988). We included 199

absence points only from those soil types that also host presence points to focus on habitats in which 200

the species can potentially occur. We excluded the soil types ‘water’, ‘urban’ and ‘zero’ because they 201

do not represent key habitats of the focal species. Selecting absence points that are in principle 202

reachable for the focal species (10 km radius) and potentially suitable given the abiotic environment 203

(soil conditions), but yet remain unoccupied, enables to identify the effect of vegetation structure on 204

the presence-absence of the species (Zellweger, Braunisch, Baltensweiler & Bollmann, 2013). 205

To reduce the spatial clustering of data points induced by the transect sampling design, we 206

discarded presence and absence points that were located within 100 m distance from their nearest 207

neighbour (compare Zielewska-Büttner, Heurich, Müller & Braunisch, 2018). A 100 meter distance 208

was chosen because it roughly represents the home range of the focal species (Bos et al., 2006; Essens 209

et al., 2017) and because it corresponds to the radius from which we derived landscape-scale LiDAR 210

metrics (see below). We used the thinning optimisation algorithm ‘spThin’ (Aiello-Lammens, Boria, 211

Radosavljevic, Vilela & Anderson, 2015) in R 3.5.3 (R Core Team, 2019) with 1,000 repetitions per 212

species to derive the maximum number of data points given the 100 m distance criterion. This resulted 213

in a final sample size of 248 presence and 610 absence points for H. semele, 106 presence and 384 214

(6)

absence points for L. camilla, 92 presence and 151 absence points for B. selene, and 45 presence and 215

101 absence points for M. athalia (Figure 1). 216

2.2 | LiDAR data

217

We used LiDAR data from the third country-wide ALS campaign (AHN3) in the Netherlands (see 218

https://ahn.arcgisonline.nl/ahnviewer), conducted in the years 2014–2019 in leaf-off conditions 219

(northern hemisphere winter, December–March). The data has an average point density of 6–10 points 220

per m2, an overall point cloud accuracy of 10 cm and a vertical standard deviation of 5 cm 221

(https://ahn.nl/kwaliteitsbeschrijving). Further details — including the scanner type, pulse reputation 222

frequency, flight lines and flight elevations — are not provided with the published dataset. Information 223

on uncalibrated intensity and the number of returns is provided, but as the intensity data are not 224

radiometrically corrected, their use is limited because of the potential influence of the flight pattern 225

and laser scanner type. Ground points, buildings and water are pre-classified, which enables (1) 226

vegetation points to be distinguished from ground points, and (2) to exclude infrastructure and water 227

as non-vegetation elements. We downloaded the LiDAR data in a 1000 m radius around the boundary 228

polygon of each transect, corresponding to 483 LiDAR tiles, from which point clouds of a 100 m 229

radius around the transect section centroids were extracted. 230

From the LiDAR point clouds, we derived 12 LiDAR metrics that captured the vertical 231

complexity and horizontal heterogeneity of the vegetation (Table 1). This was done using the R 232

package ‘lidR’ (Roussel & Auty, 2019). Each metric was chosen to reflect vegetation structure-related 233

habitat preferences of the focal species as reported in the ecological literature, either from field or from 234

LiDAR studies (see Table 1). A total of six LiDAR metrics reflected the vertical complexity of 235

vegetation. Those were directly derived from the LiDAR point cloud using a 25 m radius around each 236

centroid (Table 1). This scale matches the length of a transect section and was chosen to describe the 237

local habitat conditions in which the presence (or absence) of a species was recorded. A further six 238

metrics reflected the horizontal heterogeneity of vegetation in either 25 m (vegetation roughness) or 239

100 m (landscape-scale terrain or vegetation structure) around each centroid. The 100 m scale reflects 240

the home range scale of the butterflies (Bos et al., 2006; Maes, Ghesquiere, Logie & Bonte, 2006; 241

Warren, 1987b). Vegetation roughness and landscape-scale vegetation structure metrics were derived 242

from the variability of a digital surface model (DSM), based on LiDAR-derived vegetation height (90th 243

percentile of z) within 1 m resolution grid cells, using the R package ’landscapemetrics’ (Hesselbarth, 244

Sciaini, With, Wiegand & Nowosad, 2019). The mean slope of the terrain was quantified with a 1 m 245

resolution digital terrain model (DTM) extracted as the minimum height of ground points in each cell. 246

2.3 | Statistical analysis

247

We build species distribution models (SDMs) to analyse whether and how specific LiDAR metrics 248

(Table 1) can explain the presence-absence of the four butterfly species. We used the ODMAP 249

protocol (Zurell et al., 2020) to document the modelling objectives and decisions (Supporting 250

Information ODMAP protocol). We carefully explored multi-collinearity among the LiDAR metrics 251

with Spearman rank correlations (Supporting Information Figure S1). Metrics that showed high 252

pairwise Spearman rank correlations (r > |0.70|) were discarded in the SDMs by first removing the 253

metric with the largest variance inflation factor (VIF) using the function vifcor in the R package 254

‘usdm’ (Naimi, Hamm, Groen, Skidmore & Toxopeus, 2014). For conceptually related metrics that 255

were highly collinear with each other or other metrics (e.g. open area, open patches and edge extent), 256

we kept the metric that best reflected the ecology and habitat preferences of a specific species 257

(compare Supporting Information Table S1). All metrics in the final SDMs were not strongly 258

correlated (r < |0.70|) and had VIF < 3, as suggested for model implementation (Naimi et al., 2014). 259

We initially tested three different SDM algorithms for modelling butterfly species 260

distributions and habitat suitability: General Linear Models (GLM), Random Forest (RF) and 261

(7)

Maximum Entropy (Maxent) (Breiman, 2001; Naimi & Araújo, 2016; Phillips, Anderson & Schapire, 262

2006). Model accuracy was examined with the Area Under Curve (AUC) (Pearce & Ferrier, 2000; 263

Brotons, Thuiller, Araújo & Hirzel, 2004) and the True Skill Statistic (TSS) (Allouche, Tsoar & 264

Kadmon, 2006), and visualised using Receiver Operating Characteristics (ROC) curves (Pearce & 265

Ferrier, 2000). Since RF outperformed the other two SDM algorithms for all species in terms of AUC 266

and TSS (Supporting Information Table S2), we only present the results of the RF below. The RF 267

algorithm is a machine learning method which easily deals with non-linear relationships. We used the 268

R package ‘sdm’ (Naimi & Araújo, 2016) to build RF models using 100 decision trees with 20 nodes 269

(splits) per tree. Model calibration was performed on 100 random bootstrap subsets of 70% of the data, 270

and predictive performance was then validated with the remaining 30% of the data in each run. 271

To test our three hypotheses, we assessed whether and to what extent specific LiDAR metrics 272

reflecting low vegetation (H1), medium-to-high vegetation (H2) and landscape-scale habitat structure 273

(H3) can explain the presence-absence of the four butterfly species (Table 1). Specifically, we used the 274

relative variable importance and the response curves of each metric as implemented in the R package 275

'sdm' (aggregated over 100 model runs for each species) to interpret the role of LiDAR metrics in 276

explaining butterfly habitat preferences. Relative variable importance —measured by AUC 277

improvements of model performance due to inclusion of the focal variable— was obtained using the 278

function ‘getVarImp’ (Naimi & Araújo, 2016). The species-specific responses to each LiDAR metric 279

were visualised with response curves using the function ‘getResponseCurve’ (Naimi & Araújo, 2016, 280

following Elith, Ferrier, Huettmann & Leathwick, 2005). The response curves show the probability of 281

occurrence along the gradient of vegetation structure as measured by a given LiDAR metric. Since the 282

two grassland species occur in both coastal (dune) and inland habitats, we additionally implemented 283

separate RF models for coastal and inland populations to explore whether habitat relationships differ 284

between these habitats. For inland populations, this included 71 and 137 (B. selene) and 122 and 181 285

(H. semele) presence and absence points, respectively. For coastal populations, sample size was only 286

sufficient for H. semele (126 presence and 429 absence points, respectively) whereas records for B. 287

selene were too limited (21 presence and 14 absence points, respectively).

288 289

3 | Results

290

3.1 | Metrics selection

291

Spearman rank correlations were high (r = 0.7–0.9) between several LiDAR metric pairs (Supporting 292

Information Figure S1). For instance, total vegetation roughness and low vegetation roughness and 293

most density metrics of adjacent strata were discarded from the models based on the VIF. Vegetation 294

height and the 5–20 m vegetation density were highly correlated with each other but also with 295

landscape-scale habitat structure metrics (particularly open area). Vegetation height was selected for 296

both grassland species as it best reflects the ecological conditions in grasslands (e.g. shelter), whereas 297

the 5–20 m density was selected for both woodland species as it reflects their association with trees. 298

Terrain slope was only weakly correlated with other metrics and thus kept in all models. Landscape-299

scale vegetation structure metrics were highly correlated with each other. Open area and edge extent 300

were selected for B. selene, reflecting shelter and open vegetation in wet grasslands. Open patches was 301

selected for H. semele, reflecting patchiness of open ground in dry habitats such as grasslands and 302

heathlands. Open patches and edge extent were selected for L. camilla, reflecting canopy gaps in moist 303

deciduous woodlands. Edge extent was selected for M. athalia, as it represents open spaces in dry 304

woodlands. The final selection of LiDAR metrics partly differed among species due to their specific 305

habitat preferences and comprised seven (B. selene, H. semele and L. camilla) and six (M. athalia) 306

LiDAR metrics, respectively (Table 1), overall avoiding high correlations (all r < 0.7 and VIF < 3). 307

3.2 | Model performance

308

(8)

The ROC curves of the SDMs revealed a good fit of the test data for B. selene, H. semele and M. 309

athalia (AUC = 0.87, 0.82 and 0.89; TSS = 065, 0.52 and 0.70) and an excellent fit for L. camilla

310

(AUC = 0.96, TSS = 0.82) (Supporting Information Figure S2). The deviance D between model 311

repetitions was acceptable for B. selene, H. semele and M. athalia (D = 0.89, 0.94 and 0.79), but low 312

for L. camilla (D = 0.48). This provided a robust basis for interpreting the contributions of predictor 313

variables, especially for both woodland species. For the grassland species, model performance 314

increased when their coastal and inland distributions were modelled separately, especially of inland 315

habitats for both B. selene and H. semele (AUC = 0.90 and 0.87, TSS = 0.71 and 0.66, D = 0.80 and 316

0.92), and to a lesser extent also of coastal habitats for H. semele (AUC = 0.83, TSS = 0.53, D = 0.78). 317

3.3 | Effects of low vegetation

318

Low vegetation metrics were of minor importance in most SDMs (Figure 2 and Figure 3). B. selene 319

showed a weak response to the density < 0.2 m in wet grasslands. The probability of occurrence of H. 320

semele increased with the density of low vegetation (<0.2 m density) (Figure 2). This effect was

321

particularly pronounced in inland populations, but not in coastal habitats (Supporting Information 322

Figure S3). The response of H. semele to vegetation density between 0.2 and 1 m (reflecting tall herbs 323

and low shrubs) was generally weak (Fig. 2b and Fig. S3 in Appendix 1). For woodland butterflies, 324

low vegetation density was unimportant for L. camilla (occurring in moist deciduous woodlands), but 325

important for M. athalia (a dry woodland species) which was associated with a high vegetation density 326

<0.2 m (Figure 3b). 327

3.4 | Medium to high vegetation

328

Medium to high vegetation metrics were of major importance for both woodland species (Figure 3) 329

and for the wet grassland species B. selene (Figure 2). L. camilla was especially associated with a high 330

density of trees >20 m whereas M. athalia was most strongly associated with a high 5–20 m vegetation 331

density (Figure 3). L. camilla was also associated with a high 5–20 m vegetation density, whereas M. 332

athalia was only weakly associated with the density of >20 m trees. For the wet grassland species B.

333

selene, a low vegetation height (< 10 m) strongly increased its probability of occurrence (Figure 2).

334

The dry grassland species H. semele was only weakly associated with medium to high vegetation 335

metrics (Figure 2), but the importance of vegetation height increased in coastal habitats (Supporting 336

Information Figure S3). The density of 1–5 m shrubs was unimportant in all SDMs and weakly 337

associated with the species’ probability of occurrence, but M. athalia and H. semele in inland habitats 338

responded positively to a low 1–5 m density of vegetation (Fig. 3 and Supporting Information Figure 339

S3). 340

3.5 | Landscape-scale habitat structure

341

Metrics related to landscape-scale habitat structure were of key importance in all SDMs, but the 342

specific metrics partly differed among species (Figure 2 and Figure 3). B. selene, H. semele and M. 343

athalia mainly occurred in flat terrain (mean slope <10 degrees), and this effect was particularly

344

pronounced in inland populations of B. selene and H. semele (Supporting Information Figure S3). In 345

addition to terrain slope, H. semele was strongly associated with a low number of open patches in dry 346

habitats. L. camilla was most strongly associated with a high number of open patches in moist 347

deciduous woodland. L. camilla and B. selene in inland habitats responded positively to a high edge 348

extent, and M. athalia showed a strong preference for high edge extents in dry woodlands (Figure 3b). 349

350

4 | Discussion

351

Our LiDAR-based analysis of four threatened butterfly species in the Netherlands provides detailed 352

insights into how vegetation structure shapes habitat preferences at a national extent. Using high-353

(9)

quality butterfly presence-absence data derived from a national monitoring scheme we show that 354

landscape-level habitat structures are especially important for both grassland and woodland species 355

and that medium-to-high vegetation structures are crucial for woodland species. Low vegetation 356

structure LiDAR metrics were generally of minor importance across all four species. 357

The weak association of low vegetation LiDAR metrics with grassland butterflies reflects our 358

expectation that low vegetation elements are difficult to capture with leaf-off ALS data. However, 359

ecological field studies show the critical importance of such low vegetation elements for butterflies in 360

grassland habitats (Bos et al., 2006; van Swaay, 2019). This indicates that the leaf-off conditions in 361

which many airborne LiDAR data are captured are not well suited to capture the seasonal structure of 362

annual herbs and grasses in grassland habitats. The distinction of vegetation <0.2 m height and bare 363

ground was especially problematic, but could be improved at specific study sites with LiDAR data 364

captured from unmanned aerial vehicles (UAV) or terrestrial laser scanners (TLS). Low vegetation 365

structure was especially difficult to measure in wet grassland habitats of B. selene, which are nearly all 366

mown in winter (van Swaay, 2019). In dry grassland habitats of H. semele, structures of low height 367

such as low grasses and bare ground were also difficult to quantify, but perennial dwarf scrub 368

vegetation such as heather (Calluna vulgaris) was well captured in the metric of vegetation density 369

<0.2 m, being of major importance in inland habitats. Heather is abundant and an important nectar 370

source in these habitats (mainly heathlands), and a patchy cover is preferred over monotonous heather 371

fields (Bos et al., 2006; Vanreusel et al., 2007; van Swaay, 2019). The response curves of the SDM 372

indicated that a vegetation density <0.2 m between 10–30% promotes the occurrence of H. semele. 373

The density of perennial dwarf shrubs probably also drives the strong response of M. athalia to the 374

<0.2 m vegetation density, reflecting its dependence on open patches with low vegetation in dry 375

forests (Bos et al., 2006; van Swaay, 2019). Heather and other perennial dwarf shrubs (e.g. Vaccinium) 376

are typically abundant in these places. The weak responses of L. camilla to understory vegetation 377

strata (both <0.2 m and 1–5 m) may reflect the difficulty of capturing understory vegetation structure 378

in dense woodlands using discrete return LiDAR data. While L. camilla mainly flies in high vegetation 379

strata, understory vegetation is also important, e.g. for nectar sources such as bramble (Rubus) (Bos et 380

al., 2006). 381

The structure of medium to high vegetation is particularly important in woodland habitats 382

where species may differ in their vertical habitat niches and use of different vegetation strata. As a 383

moist deciduous woodland species, the white admiral (Limenitis camilla) is thought to primarily use 384

the forest canopy layer (Bos et al., 2006), e.g. as gathering places of territorial males or as resting 385

places (Lederer 1960). Our analysis supports the importance of the forest canopy for this species by 386

showing that sites with >20% density of tall (> 20 m high) trees are preferred, while sites where trees 387

of this height are absent are avoided. In contrast, the heath fritillary (Melitaea athalia), a dry woodland 388

species, flies low to the ground and needs trees to provide sheltered conditions and to support its main 389

host plant, the parasitic cow-wheat (Melampyrum) (Bos et al., 2006; Warren, 1987a). This association 390

is reflected by a strong preference for sites with a high vegetation density of 5–20 m tall shrubs and 391

trees as derived from LiDAR. Dense low vegetation may act as a barrier for M. athalia (Warren, 392

1987b), reflected in its preference of a low vegetation density within 1–5 m height. Both grassland 393

butterflies (B. selene and H. semele) use open habitats with preferably few tall vegetation elements 394

(Bos et al., 2006; van Swaay, 2019), as indicated by a preference for low vegetation height in B. selene 395

and a low number of open patches (i.e. large open areas) in H. semele. 396

Both grassland and woodland habitats are shaped by their structural heterogeneity at the 397

landscape scale. This can be captured with LiDAR metrics that quantify the amount of microhabitats 398

(e.g. via terrain ruggedness), or the extent and patchiness of open vegetation or the length of woodland 399

edges (Table 1). The main effect of terrain ruggedness was found for B. selene, which prefers flat 400

terrain, especially on inland locations that have marshy habitat conditions. Both B. selene and H. 401

semele occur on steeper terrain in dune habitats, which for H. semele might reflect the amount of bare

(10)

sand patches (van Swaay, 2019; Maes et al., 2006). Another important landscape-scale LiDAR metric 403

was the number of open habitat patches which showed a contrasting response for H. semele and L. 404

camilla, reflecting their different habitats (dry grasslands vs. moist woodlands). Whereas H. semele

405

was associated with low-stature landscapes that have only few open patches (with vegetation height 406

<1m), L. camilla prefers woodlands with many small patches of open vegetation (roughly >70 patches 407

of >2 m2 per ha) where sunlight can penetrate through the canopy (Bos et al., 2006; van Swaay, 2019). 408

This shows that the same LiDAR metric can have opposing effects depending on which species or 409

habitat is considered. The extent of woodland edges was especially important for the dry woodland 410

species M. athalia which reflects is association with woodland edges, where it finds shelter and host 411

plant habitat (van Swaay, 2019; Warren, 1978a,b). Woodland edges can also provide suitable habitat 412

features for L. camilla (e.g. sunny conditions within woodland) and shelter for inland populations of B. 413

selene (Bos et al., 2006; van Swaay, 2019), but effects were only weakly reflected in the SDMs.

414

An important finding of our study is that SDMs of woodland species achieved higher 415

performances and stronger response curves than those of grassland species. This probably reflects that 416

LiDAR data obtained under leaf-off conditions are better suited to capture the physical structure of tall 417

woody vegetation compared to low-stature vegetation in open habitats such as grasslands. This means 418

that several habitat structures that are potentially preferred by grassland butterflies may be 419

insufficiently captured by the available country-wide LiDAR data. Nevertheless, further developments 420

in LiDAR technology (e.g. full-waveform data instead of discrete echoes, provisioning of high density 421

point clouds, information for calibrating intensity data, complementary data from UAV or TLS) and 422

LiDAR flight campaigns during leaf-on conditions could strongly improve ecological analyses of 423

vegetation structure in open, low-stature habitats such as grasslands and wetlands (Alexander et al., 424

2015; Zlinszky et al., 2014). Additionally, full-waveform LiDAR could also improve the ability to 425

quantify understory vegetation in forests and woody habitats (Anderson et al., 2016). The availability 426

of such LiDAR data over broad spatial extents would greatly improve analyses of species distributions 427

and habitat preferences of invertebrates and other taxa. Moreover, synergies with other remote sensing 428

data such as spectral imagery or information obtained from synthetic aperture radar (SAR) could 429

improve the quantification of habitat aspects in grasslands that are not captured by LiDAR, e.g. the 430

identification of specific grasses or host plant species (Marcinkowska-Ochtyra, Jarocińska, Bzdęga & 431

Tokarska-Guzik, 2019), the quantification of seasonal growth dynamics in grasslands (Metz, 432

Marconcini, Esch, Reinartz & Ehlers, 2014), or the extent of bare ground patches. Furthermore, SDMs 433

of host plant species could be used to improve butterfly SDMs through an assessment of host plant 434

availability, a crucial habitat factor for most butterfly species. 435

436

5 | Conclusions

437

Our study shows how the distribution of grassland and woodland butterflies depends on different 438

aspects of vegetation structure, including the vertical variability of vegetation and the horizontal 439

heterogeneity of vegetation and terrain at the landscape scale. Since vegetation structure is a key 440

habitat determinant for many invertebrate species (Dennis et al., 2003, 2006), the ability of LiDAR to 441

quantify vegetation structure offers promising new ways to gain insights into invertebrate-habitat 442

relationships (Davies & Asner, 2014; Moeslund et al., 2019; Zellweger et al., 2013). Such information 443

can be beneficial for improving the management and conservation of threatened species and their 444

habitats. Our study demonstrates that LiDAR metrics are not only informative for species inhabiting 445

woody habitats, but also for invertebrates occurring in low-stature habitats. This offers ample new 446

opportunities for applying LiDAR-based habitat analyses to invertebrates, not only in grasslands but 447

also in dunes, heathlands and wetlands, and other non-forest habitats that are threatened throughout 448

Europe and other parts of the world. 449

(11)

REFERENCES

451

Aiello‐Lammens, M. E., Boria, R. A., Radosavljevic, A., Vilela, B., & Anderson, R. P. (2015). 452

spThin: an R package for spatial thinning of species occurrence records for use in ecological niche 453

models. Ecography, 38(5), 541-545. 454

Aguirre‐Gutiérrez, J., WallisDeVries, M. F., Marshall, L., van't Zelfde, M., Villalobos‐Arámbula, A. 455

R., Boekelo, B., ... Biesmeijer, J. C. (2017). Butterflies show different functional and species diversity 456

in relationship to vegetation structure and land use. Global Ecology and Biogeography, 26(10), 1126-457

1137. 458

Alexander, C., Deák, B., Kania, A., Mücke, W., & Heilmeier, H. (2015). Classification of vegetation 459

in an open landscape using full-waveform airborne laser scanner data. International Journal of 460

Applied Earth Observation and Geoinformation, 41, 76-87.

461

Allouche, O., Tsoar, A., & Kadmon, R. (2006). Assessing the accuracy of species distribution models: 462

prevalence, kappa and the true skill statistic (TSS). Journal of Applied Ecology, 43(6), 1223-1232. 463

Anderson, K., Hancock, S., Disney, M., & Gaston, K. J. (2016). Is waveform worth it? A comparison 464

of LiDAR approaches for vegetation and landscape characterization. Remote Sensing in Ecology and 465

Conservation, 2(1), 5-15.

466

Bakx, T. R., Koma, Z., Seijmonsbergen, A. C., & Kissling, W. D. (2019). Use and categorization of 467

Light Detection and Ranging vegetation metrics in avian diversity and species distribution research. 468

Diversity and Distributions, 25(7), 1045-1059.

469

Bergman, K. O., Ask, L., Askling, J., Ignell, H., Wahlman, H., & Milberg, P. (2008). Importance of 470

boreal grasslands in Sweden for butterfly diversity and effects of local and landscape habitat factors. 471

Biodiversity and Conservation, 17(1), 139-153.

472

Bos, F., Bosveld, M., Groenendijk, D., van Swaay, C. A., Wynhoff, I., De Vlinderstichting (2006). De 473

dagvlinders van Nederland, verspreiding en bescherming (Lepidoptera: Hesperioidea, Papilionidea). 474

Nederlandse Fauna 7. Leiden. Nationaal Natuurhistorisch Museum Naturalis, KNNV Uitgeverij &

475

European Invertebrate Survey – Netherlands. 476

Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5-32. 477

Brotons, L., Thuiller, W., Araújo, M. B., & Hirzel, A. H. (2004). Presence‐absence versus presence‐ 478

only modelling methods for predicting bird habitat suitability. Ecography, 27(4), 437-448. 479

Carvalheiro, L. G., Kunin, W. E., Keil, P., Aguirre‐Gutiérrez, J., Ellis, W. N., Fox, R., ... van de 480

Meutter, F. (2013). Species richness declines and biotic homogenisation have slowed down for NW‐ 481

European pollinators and plants. Ecology Letters, 16(7), 870-878. 482

Cozzi, G., Müller, C. B., & Krauss, J. (2008). How do local habitat management and landscape 483

structure at different spatial scales affect fritillary butterfly distribution on fragmented wetlands? 484

Landscape Ecology, 23(3), 269-283.

485

Davies, A. B., & Asner, G. P. (2014). Advances in animal ecology from 3D-LiDAR ecosystem 486

mapping. Trends in Ecology & Evolution, 29(12), 681-691. 487

Dennis R. L. H., Shreeve, T. G. & van Dyck, H. (2003). Towards a functional resource-based concept 488

for habitat: a butterﬂy biology viewpoint. Oikos, 102, 417–426. 489

Dennis, R. L. H., Shreeve, T. G. & van Dyck, H. (2006). Habitats and resources: the need for a 490

resource-based definition to conserve butterflies. Biodiversity & Conservation, 15(6), 1943–1966. 491

(12)

Elith, J., Ferrier, S., Huettmann, F., & Leathwick, J. (2005). The evaluation strip: a new and robust 492

method for plotting predicted responses from species distribution models. Ecological Modelling, 493

186(3), 280-289. 494

Eskildsen, A., Carvalheiro, L. G., Kissling, W. D., Biesmeijer, J. C., Schweiger, O. & Høye, T. T. 495

(2015) Ecological specialization matters: long-term trends in butterfly species richness and 496

assemblage composition depend on multiple functional traits. Diversity and Distributions, 21(7), 792– 497

802. 498

Essens, T., van Langevelde, F., Vos, R. A., van Swaay, C. A., & WallisDeVries, M. F. (2017). 499

Ecological determinants of butterfly vulnerability across the European continent. Journal of Insect 500

Conservation, 21(3), 439-450.

501

Fox, R., Brereton, T. M., Asher, J., August, T. A., Botham, M. S., Bourn, N. A. D., ... Middlebrook, I. 502

(2015). The State of the UK’s Butterflies 2015. Butterfly Conservation and the Centre for Ecology & 503

Hydrology, Wareham.

504

Glad, A., Reineking, B., Montadert, M., Depraz, A. & Monnet, J.-M. (2020). Assessing the 505

performance of object-oriented LiDAR predictors for forest bird habitat suitability modelling. Remote 506

Sensing in Ecology and Conservation, 6(1), 5-19.

507

Graham, L. J., Spake, R., Gillings, S., Watts, K., & Eigenbrod, F. (2019). Incorporating fine‐scale 508

environmental heterogeneity into broad‐extent models. Methods in Ecology and Evolution, 10(6), 767-509

778. 510

Hallmann, C. A., Sorg, M., Jongejans, E., Siepel, H., Hofland, N., Schwan, H., ... Goulson, D. (2017). 511

More than 75 percent decline over 27 years in total flying insect biomass in protected areas. PloS One, 512

12(10), e0185809. 513

Hess, A. N., Falkowski, M. J., Webster, C. R., Storer, A. J., Pocewicz, A., & Martinuzzi, S. (2013). 514

Employing lidar data to identify butterfly habitat characteristics of four contrasting butterfly species 515

across a diverse landscape. Remote Sensing Letters, 4(4), 354-363. 516

Hesselbarth, M. H. K., Sciaini, M., With, K. A., Wiegand, K., Nowosad, J. (2019). landscapemetrics: 517

an open-source R tool to calculate landscape metrics. Ecography, 42(10), 1648-1657. 518

Karlsson, B., & Wiklund, C. (2005). Butterfly life history and temperature adaptations; dry open 519

habitats select for increased fecundity and longevity. Journal of Animal Ecology, 74(1), 99-104. 520

Kissling, W. D., Seijmonsbergen, A., Foppen, R. & Bouten, W. (2017). eEcoLiDAR, eScience 521

infrastructure for ecological applications of LiDAR point clouds: reconstructing the 3D ecosystem 522

structure for animals at regional to continental scales. Research Ideas and Outcomes, 3, e14939. 523

Koma, Z., Seijmonsbergen, A.C. & Kissling, W.D. (2020) Classifying wetland-related land cover 524

types and habitats using fine-scale lidar metrics derived from country-wide Airborne Laser Scanning. 525

Remote Sensing in Ecology and Conservation, DOI: https://doi.org/10.1002/rse2.170

526

Lederer, G. (1960). Verhaltensweisen der Imagines und der Entwicklungsstadien von Limenitis 527

camilla camilla L.(Lep. Nymphalidae). Zeitschrift für Tierpsychologie, 17(5), 521-546.

528

Maes, D., & van Dyck, H. (2001). Butterfly diversity loss in Flanders (north Belgium): Europe's worst 529

case scenario? Biological Conservation, 99(3), 263-276. 530

Maes, D., Ghesquiere, A., Logie, M., & Bonte, D. (2006). Habitat use and mobility of two threatened 531

coastal dune insects: implications for conservation. Journal of Insect Conservation, 10(2), 105-115. 532

(13)

Marcinkowska-Ochtyra, A., Jarocińska, A., Bzdęga, K., & Tokarska-Guzik, B. (2018). Classification 533

of expansive grassland species in different growth stages based on hyperspectral and LiDAR data. 534

Remote Sensing, 10(12), 2019.

535

Metz, A., Marconcini, M., Esch, T., Reinartz, P., & Ehlers, M. (2014). Classification of grassland 536

types by means of multi-seasonal TerraSAR-X and RADARSAT-2 imagery. In 2014 IEEE 537

Geoscience and Remote Sensing Symposium (pp. 1202-1205). IEEE, Quebec City, QC, Canada. 538

Moeslund, J. E., Zlinszky, A., Ejrnæs, R., Brunbjerg, A. K., Bøcher, P. K., Svenning, J. C., & 539

Normand, S. (2019). Light detection and ranging explains diversity of plants, fungi, lichens and 540

bryophytes across multiple habitats and large geographic extent. Ecological Applications, 29(5), 541

e01907. 542

Müller, J., Bae, S., Röder, J., Chao, A., & Didham, R. K. (2014). Airborne LiDAR reveals context 543

dependence in the effects of canopy architecture on arthropod diversity. Forest Ecology and 544

Management, 312, 129-137.

545

Müller, J., & Brandl, R. (2009). Assessing biodiversity by remote sensing in mountainous terrain: the 546

potential of LiDAR to predict forest beetle assemblages. Journal of Applied Ecology, 46(4), 897-905. 547

Naimi, B., Hamm, N. A., Groen, T. A., Skidmore, A. K., & Toxopeus, A. G. (2014). Where is 548

positional uncertainty a problem for species distribution modelling? Ecography, 37(2), 191-203. 549

Naimi, B., & Araújo, M. B. (2016). sdm: a reproducible and extensible R platform for species 550

distribution modelling. Ecography, 39(4), 368-375. 551

Pearce, J., & Ferrier, S. (2000). Evaluating the predictive performance of habitat models developed 552

using logistic regression. Ecological Modelling, 133(3), 225-245. 553

Phillips, S. J., Anderson, R. P., & Schapire, R. E. (2006). Maximum entropy modelling of species 554

geographic distributions. Ecological Modelling, 190(3-4), 231-259. 555

QGIS Development Team (2009). QGIS Geographic Information System. Open Source Geospatial 556

Foundation Project. https://qgis.osgeo.org

557

R Core Team (2019). R: A language and environment for statistical computing. R Foundation for 558

Statistical Computing, Vienna, Austria. https://www.R-project.org. 559

Reutebuch, S. E., Andersen, H. E., & McGaughey, R. J. (2005). Light detection and ranging (LIDAR): 560

an emerging tool for multiple resource inventory. Journal of Forestry, 103(6), 286-292. 561

Roussel, J. R. & Auty, D. (2019). lidR: Airborne LiDAR data manipulation and visualization for 562

forestry applications. R package version 2.1.1. https://CRAN.R-project.org/package=lidR. 563

Simonson, W. D., Allen, H. D., & Coomes, D. A. (2014). Applications of airborne lidar for the 564

assessment of animal species diversity. Methods in Ecology and Evolution, 5(8), 719-729. 565

Thomas, J. A. (1995). Why small cold‐blooded insects pose different conservation problems to birds 566

in modern landscapes. Ibis, 137, 112-119. 567

Thomas, J. A., Telfer, M. G., Roy, D. B., Preston, C. D., Greenwood, J. J. D., Asher, J., ... & Lawton, 568

J. H. (2004). Comparative losses of British butterflies, birds, and plants and the global extinction 569

crisis. Science, 303(5665), 1879-1881. 570

Thomas, J. A. (2005). Monitoring change in the abundance and distribution of insects using butterflies 571

and other indicator groups. Philosophical Transactions of the Royal Society B: Biological Sciences, 572

360(1454), 339-357. 573

(14)

Valbuena, R., O’Connor, B., Zellweger, F., Simonson, W., Vihervaara, P., Maltamo, M., ... & Chirici, 574

G. (2020). Standardizing Ecosystem Morphological Traits from 3D Information Sources. Trends in 575

Ecology & Evolution, 35(8), 656-667.

576

Vanreusel, W., Maes, D., & van Dyck, H. (2007). Transferability of species distribution models: a 577

functional habitat approach for two regionally threatened butterflies. Conservation Biology, 21(1), 578

201-212. 579

van Strien, A. J., Meyling, A. W. G., Herder, J. E., Hollander, H., Kalkman, V. J., Poot, M. J., ... & 580

van Turnhout, C. A. (2016). Modest recovery of biodiversity in a western European country: The 581

Living Planet Index for the Netherlands. Biological Conservation, 200, 44-50. 582

van Strien, A. J., van Swaay, C. A., van Strien-van Liempt, W. T., Poot, M. J., & WallisDeVries, M. 583

F. (2019). Over a century of data reveal more than 80% decline in butterflies in the Netherlands. 584

Biological Conservation, 234, 116-122.

585

van Swaay, C. A., Warren, M., & Loïs, G. (2006). Biotope use and trends of European butterflies. 586

Journal of Insect Conservation, 10(2), 189-209.

587

van Swaay, C. A., Nowicki, P., Settele, J., & van Strien, A. J. (2008). Butterfly monitoring in Europe: 588

methods, applications and perspectives. Biodiversity and Conservation, 17(14), 3455-3469. 589

van Swaay, C.A. (2019). Basisrapport Rode Lijst Dagvlinders 2019 volgens Nederlandse en 590

IUCNcriteria. Rapport VS2019.001, De Vlinderstichting. Wageningen, the Netherlands. 591

Vierling, K. T., Bässler, C., Brandl, R., Vierling, L. A., Weiß, I., & Müller, J. (2011). Spinning a laser 592

web: predicting spider distributions using LiDAR. Ecological Applications, 21(2), 577-588. 593

Warren, M. S. (1987a). The ecology and conservation of the heath fritillary butterfly, Mellicta athalia. 594

I. Host selection and phenology. Journal of Applied Ecology, 24(2), 467-482. 595

Warren, M. S. (1987b). The ecology and conservation of the heath fritillary butterfly, Mellicta athalia. 596

II. Adult population structure and mobility. Journal of Applied Ecology, 24(2), 483-498. 597

Wösten, J. H. M., de Vries, F., Denneboom, J. & van Holst, A. F. (1988). Generalisatie en 598

bodemfysische vertaling van de Bodemkaart van Nederland 1 : 250 000 ten behoeve van de PAWN-599

studie. Rapport 2055, STIBOKA, Wageningen, the Netherlands. 600

Zellweger, F., Braunisch, V., Baltensweiler, A., & Bollmann, K. (2013). Remotely sensed forest 601

structural complexity predicts multi species occurrence at the landscape scale. Forest Ecology and 602

Management, 307, 303-312.

603

Zielewska-Büttner, K., Heurich, M., Müller, J., & Braunisch, V. (2018). Remotely sensed single tree 604

data enable the determination of habitat thresholds for the three-toed woodpecker (Picoides 605

tridactylus). Remote Sensing, 10(12), 1972.

606

Zlinszky, A., Schroiff, A., Kania, A., Deák, B., Mücke, W., Vári, Á., ... & Pfeifer, N. (2014). 607

Categorizing grassland vegetation with full-waveform airborne laser scanning: A feasibility study for 608

detecting Natura 2000 habitat types. Remote Sensing, 6(9), 8056-8087. 609

Zurell, D., Franklin, J., König, C., Bouchet, P. J., Dormann, C. F., Elith, J., ... & Lahoz‐Monfort, J. J. 610

(2020). A standard protocol for reporting species distribution models. Ecography, 43(9), 1261-1277. 611

612 613 614

(15)

Data Accessibility Statement

615

All scripts for processing the raw LiDAR data (point clouds) into LiDAR metrics are available on 616

GitHub (LINK WILL BE PROVIDED). The data and R scripts to reproduce the statistical analyses are 617

available on DRYAD (LINK WILL BE PROVIDED) 618

(16)

Tables

620

TABLE 1 LiDAR metrics capturing different aspects of the vertical complexity and horizontal heterogeneity of

621

vegetation. For each metric, a description, the vegetation part, relation to hypotheses and inclusion in the 622

modelling is provided. Vertical complexity metrics were directly derived from LiDAR point clouds within a 25 623

m radius. Horizontal heterogeneity metrics were derived in a 25 m radius (vegetation roughness) or 100 m radius 624

(landscape-scale terrain or vegetation structure), either from a 1 m resolution digital surface model (DSM) based 625

on the 90th_{percentile height of vegetation (vegetation structure metrics), or from a 1 m digital terrain model}

626

(DTM)based on the minimum height of the ground points in each cell (slope). Normalization of the point cloud 627

was carried out on a local surface based on an inverse-distance weighting (IDW) interpolation of the 10 nearest 628

ground points. The inclusion column indicates which metrics were finally included in the species distribution 629

models for each species (All = all species; B = B. selene; H = H. semele; L = L. camilla; M = M. athalia). 630

References 1–10 refer to ecological field studies and 11–18 to LiDAR studies. 631

Metric [unit] Description Vegetation Hypothesis Inclusion References*

Vertical complexity (25 m radius)

<0.2 m density [%]

Vegetation density as proportion of returns below 0.2 m relative to all vegetation and ground points

Herb/grass layer H1 (low vegetation) All 1; 2; 4; 8; 9; 10; 12; 16 0.2–1 m density [%]

Vegetation density as proportion of returns between 0.2–1 m relative to all vegetation and ground points

Tall herbs/ low shrubs layer H1 (low vegetation) H 2; 4; 8; 9; 12; 16 1–5 m density [%]

Vegetation density as proportion of returns between 1–5 m relative to all vegetation and ground points

Shrub layer H2 (medium to high vegetation)

All 1; 2; 4; 9; 12; 16

5–20 m density [%]

Vegetation density as proportion of returns between 5–20 m relative to all vegetation and ground points

Tree layer H2 (medium to high vegetation)

L, M 1; 2; 4; 6; 8; 9; 10; 12; 13; 16 >20 m

density [%]

Vegetation density as proportion of returns between >20 m relative to all vegetation and ground points

Canopy trees H2 (medium to high vegetation)

All 2; 4; 6; 9; 13; 16

Height [m] 90th_{percentile of normalized height} of vegetation points Tall vegetation H2 (medium to high vegetation) B, H 1; 2; 9; 12; 16; 17; 19 Horizontal heterogeneity Total veg. roughness [m]

Roughness of total vegetation DSM (maximum difference in total vegetation height between focal and 8 neighbouring cells, averaged across all 1 m cells in 25 m radius)

Total vegetation H2 (medium to high vegetation) - 2; 7; 9; 14; 19 Low veg. roughness [m]

Roughness of low vegetation DSM (maximum difference in vegetation height <1 m between focal and 8 neighbouring cells, averaged across all 1 m cells in 25 m radius)

Low vegetation H1 (low vegetation) - 2; 7; 9; 14; 19 Slope [degree]

Mean slope derived from DTM using maximum ground height difference between focal and 8 neighbouring 1 m cells in 100 m radius Terrain ruggedness H3 (landscape-scale habitat structure) All 5; 7; 16; 17; 18 Open area [ha]

Total low vegetation area (cells with 90th_{percentile height of vegetation} <1m) in the DSM in 100 m radius Extent of open vegetation H3 (landscape-scale habitat structure) B 1; 2; 3; 9; 11; 13; 15; 18 Open patches [count]

Number of patches of connected low vegetation (90th_{percentile height} <1m) cells separated by other cells in the DSM in 100 m radius Patchiness of open areas H3 (landscape-scale habitat structure) H, L 2; 9; 11; 15; 18 Edge extent [m]

Length of the edges between interfacing low (90th_{percentile height} <1m) and non-low vegetation cells in the DSM in 100 m radius Extent of woodland edges H3 (landscape-scale habitat structure) B, L, M 2; 3; 9; 10; 11; 15; 18

*References: 1 = Bergman et al. (2008), 2 = Bos et al. (2006), 3 = Cozzi et al. (2008), 4 = Dennis et al. (2006), 5= Karlsson

632

& Wiklund (2005), 6 = Lederer (1960), 7 = Maes et al. (2007), 8 = Vanreusel et al. (2007), 9 = van Swaay (2019), 10 =

633

Warren (1987a), 11 = Warren (1987b), 12 = Aguirre-Gutiérrez et al. (2017), 13 = Glad et al. (2020), 14 = Graham, Spake,

634

Gillings, Watts & Eigenbrod (2019), 15 = Hesselbarth et al. (2019), 16 = Moeslund et al. (2019), 17 = Müller & Brandl

635

(2009), 18 = Zellweger et al. (2013), 19 = Zlinszky et al. (2014).

(17)

Figure legends

637 638

FIGURE 1 Spatial distribution of presence and absence points of four grassland and woodland

639

butterflies in the Netherlands. (a) Two grassland species, namely the small pearl-bordered fritillary 640

(Boloria selene) (green) and the grayling (Hipparchia semele) (red). (b) Two woodland species, 641

namely the white admiral (Limenitis camilla) (yellow) and the heath fritillary (Melitaea athalia) 642

(blue). Presences are given in bold colours, absences in light colours. Distributional overlap does not 643

occur between the two woodland species but occasionally between the two grassland species, e.g. on 644

the island of Terschelling (northwest cluster of B. selene), where H. semele also occurs. Photo credits: 645

(a) top: Boloria selene (Michiel WallisDeVries), bottom: Hipparchia semele (Chris van Swaay), (b) 646

top: Limenitis camilla (Chris van Swaay), bottom: Melitaea athalia (Chris van Swaay). 647

FIGURE 2 Associations of two grassland butterflies with LiDAR metrics. (a) Typical habitats of the

648

small pearl-bordered fritillary (Boloria selene) (a wet grassland species, left) and the grayling 649

(Hipparchia semele) (a dry grassland species, right) (photos: Reinier de Vries). (b) Relative variable 650

importance, showing the contribution of each LiDAR metric to explain butterfly distirbutions by the 651

mean and deviance of 100 random forest (RF) model runs (empty rows are metrics discarded from the 652

RF). (c) Response curves of included LiDAR metrics, showing how they are associated with the 653

species’ probability of occurrence by the mean and confidence interval of 100 RF model runs. In (b) 654

and (c), colours indicate LiDAR metrics related to low vegetation (red), medium-to-high vegetation 655

(green) and landscape-scale habitat structure (blue). Photo credits: Reinier de Vries. 656

FIGURE 3 Associations of two woodland butterflies with LiDAR metrics. (a) Typical habitats of the

657

white admiral (Limenitis camilla) (a moist woodland species, left) and the heath fritillary (Melitaea 658

athalia) (a dry woodland species, right). (b) Relative variable importance, showing the contribution of

659

each LiDAR metric to explain butterfly distirbutions by the mean and deviance of 100 random forest 660

(RF) model runs (empty rows are metrics discarded from the RF). (c) Response curves of included 661

LiDAR metrics, showing how they are associated with the species’ probability of occurrence by the 662

mean and confidence interval of 100 RF model runs. In (b) and (c), colours indicate LiDAR metrics 663

related to low vegetation (red), medium-to-high vegetation (green) and landscape-scale habitat 664

structure (blue). Photo credits: Reinier de Vries. 665

(18)

Figure 1

667

(19)

Figure 2

(20)

Figure 3

670 671

(21)

Supporting Information

672

Identifying fine-scale habitat preferences of threatened butterflies

673

using country-wide airborne laser scanning data

674

675

Content:

676 677

Table S1: Butterfly habitat preferences 678

Table S2: Performance of three species distribution modelling algorithms 679

Figure S1: Spearman rank correlations 680

Figure S2: Receiver Operating Characteristics (ROC) curves 681

Figure S3: Associations of wet and dry grassland butterflies with LiDAR metrics in inland and coastal 682 habitats 683 ODMAP protocol 684 References 685

(22)

TABLE S1 Butterfly habitat preferences of four species: (1) the small pearl-bordered fritillary

686

(Boloria selene), a wet grassland species; (2) the grayling (Hipparchia semele), a dry grassland 687

species; (3) the white admiral (Limenitis camilla), a moist woodland species; and (4) the heath 688

fritillary (Melitaea athalia), a dry woodland species. Preferred microhabitat elements in relation to 689

vegetation structure are provided, with an explanation of their ecological function. The full list of 690

references* is provided at the end of the Supporting Information file. 691

Species Microhabitat elements Explanation References*

Boloria selene Flower-rich herb vegetation Host plant habitat & nectar sources 1; 2; 3; 8

Low sward height Requirement of host plant Viola 1; 2; 3; 8

Scattered woody vegetation Provides shelter 1; 2

Hipparchia semele Herb & dwarf scrub vegetation Nectar sources 2; 6; 7

Bare sand with low grass tufts Reproduction habitat with host plants 2; 6; 7

Scattered woody vegetation Provide shelter & food (tree liquids) 2; 6; 7

South slopes Prefers warm sunny conditions 4

Limenitis camilla Discontinuous forest cover Sustains direct sunlight penetration 2; 5; 8

Canopy structure Spends most time in the canopy 2; 5

Vines Include host plant Lonicera 2; 5; 8

Understory vegetation Provides nectar sources (e.g. bramble) 2; 8

Melitaea athalia Flower-rich herb vegetation Nectar sources 2; 8; 9

Low sward height Favours nectar sources & warm conditions 1; 2; 9; 10

Edges of woody vegetation Provide shelter & host plant habitat 2; 8; 9; 10 South slopes & edges Prefers warm sunny conditions 8; 10 *References: 1 = Bergman et al. (2008), 2 = Bos et al. (2006), 3= Cozzi et al. (2008), 4 = Karlsson & Wiklund 692

(2005), 5 = Lederer (1960), 6 = Maes et al. (2006), 7 = Vanreusel et al. (2007), 8 = van Swaay (2019), 9 = 693

Warren (1987a), 10 = Warren (1987b). 694

(23)

TABLE S2 Performance of three species distribution modelling algorithms (GLM = general linear

696

model; RF = random forest; Maxent = maximum entropy) as applied to the four focal butterfly species 697

in this study: Boloria selene, Hipparchia semele, Limenitis camilla and Melitaea athalia. Inland 698

locations of B. selene and both coastal and inland locations H. semele are additionally analysed in 699

separate models. Model accuracy is given as the area under the Receiver Operating Characteristics 700

(ROC) curve (AUC), the true skill statistic (TSS) and the deviance (D). 701

Species Model AUC TSS D

Boloria selene GLM 0.7 0.4 1.38 RF 0.88 0.67 0.88 Maxent 0.8 0.54 1.09 Hipparchia semele GLM 0.73 0.39 1.1 RF 0.82 0.52 0.94 Maxent 0.77 0.44 1.18 Limenitis camilla GLM 0.91 0.72 0.66 RF 0.96 0.83 0.46 Maxent 0.94 0.77 0.56 Melitaea athalia GLM 0.81 0.57 1.15 RF 0.89 0.71 0.75 Maxent 0.81 0.58 1.05

Inland and coastal models of the two grassland species

Boloria selene, GLM 0.8 0.53 1.16 inland habitats RF 0.8 0.68 0.8 Maxent 0.84 0.61 0.97 Hipparchia semele, coastal habitats GLM 0.73 0.4 1.03 RF 0.83 0.57 0.8 Maxent 0.76 0.44 1.13 Hipparchia semele, inland habitats GLM 0.76 0.46 1.21 RF 0.88 0.67 0.9 Maxent 0.79 0.5 1.13 702