• No results found

VU Research Portal

N/A
N/A
Protected

Academic year: 2021

Share "VU Research Portal"

Copied!
23
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

VU Research Portal

Cracking the code-ing sequence for Parkinson’s disease

Jansen, I.E.

2017

document version

Publisher's PDF, also known as Version of record

Link to publication in VU Research Portal

citation for published version (APA)

Jansen, I. E. (2017). Cracking the code-ing sequence for Parkinson’s disease.

General rights

Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain

• You may freely distribute the URL identifying the publication in the public portal ? Take down policy

If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.

E-mail address:

(2)
(3)

CHAPTER 3

Establishing the role of

rare coding variants in known

Parkinson’s disease risk loci

(4)

3

ABSTRACT

Importance͗DĂŶLJĐŽŵŵŽŶŐĞŶĞƟĐĨĂĐƚŽƌƐŚĂǀĞďĞĞŶŝĚĞŶƟĮĞĚƚŽĐŽŶƚƌŝďƵƚĞƚŽ WƐƵƐĐĞƉƟďŝůŝƚLJ͕ŝŵƉƌŽǀŝŶŐŽƵƌƵŶĚĞƌƐƚĂŶĚŝŶŐŽĨƚŚĞƌĞůĂƚĞĚƵŶĚĞƌůLJŝŶŐďŝŽůŽŐŝĐĂů mechanisms. KďũĞĐƟǀĞ͗dŽĞdžƉůŽƌĞƚŚĞŝŶǀŽůǀĞŵĞŶƚŽĨƌĂƌĞŐĞŶĞƟĐǀĂƌŝĂŶƚƐŝŶĐŽŵŵŽŶƐƉŽƌĂĚŝĐ

risk loci for Parkinson’s disease.

ĞƐŝŐŶ͕ƐĞƫŶŐĂŶĚƉĂƌƟĐŝƉĂŶƚƐ͗hƐŝŶŐ/ŶƚĞƌŶĂƟŽŶĂůWĂƌŬŝŶƐŽŶ͛ƐŝƐĞĂƐĞ'ĞŶŽŵŝĐƐ

ŽŶƐŽƌƟƵŵ;/W'ͿĚĂƚĂƐĞƚƐ͕ǁĞƉĞƌĨŽƌŵĞĚĂĐŽŵƉƌĞŚĞŶƐŝǀĞƐƚƵĚLJƚŽĚĞƚĞƌŵŝŶĞ ƚŚĞ ŝŵƉĂĐƚ ŽĨ ƌĂƌĞ ǀĂƌŝĂŶƚƐ ŝŶ Ϯϲ ƉƌĞǀŝŽƵƐůLJ ƉƵďůŝƐŚĞĚ 't^ ůŽĐŝ ŝŶ W͘ dŚĞƐĞ ĚĂƚĂƐĞƚƐ ĞŶĐŽŵƉĂƐƐĞĚ ǁŚŽůĞͲĞdžŽŵĞ ƐĞƋƵĞŶĐŝŶŐ ĚĂƚĂ ŽĨ ϭ͕ϭϲϳ ĐĂƐĞƐ ĂŶĚ ϭ͕ϲϴϱ controls and a genotype-array NeuroX dataset, comprising 6,801 cases and 5,970 ĐŽŶƚƌŽůƐ͘tĞĂƉƉůŝĞĚWƌŝdžĮdžĞƚŽƐĞůĞĐƚƚŚĞƉƵƚĂƟǀĞĐĂƵƐĂůŐĞŶĞƐƵŶĚĞƌŶĞĂƚŚƚŚĞ 't^ƉĞĂŬƐ͕ǁŚŝĐŚǁĂƐďĂƐĞĚŽŶƵŶĚĞƌůLJŝŶŐĨƵŶĐƟŽŶĂůƐŝŵŝůĂƌŝƟĞƐ͘dŚĞ^ĞƋƵĞŶĐĞ <ĞƌŶĞůƐƐŽĐŝĂƟŽŶdĞƐƚ;^<dͿǁĂƐƵƐĞĚƚŽĂŶĂůLJnjĞƚŚĞũŽŝŶƚĞīĞĐƚŽĨƌĂƌĞ͕ĐŽŵŵŽŶ ŽƌďŽƚŚƚLJƉĞƐŽĨǀĂƌŝĂŶƚƐŽŶWƐƵƐĐĞƉƟďŝůŝƚLJ͘ůůŐĞŶĞƐǁĞƌĞƚĞƐƚĞĚƐŝŵƵůƚĂŶĞŽƵƐůLJ as a gene-set and each gene individually.

DĂŝŶŽƵƚĐŽŵĞĂŶĚŵĞĂƐƵƌĞƐ͗WŚĞŶŽƚLJƉŝĐŝŶĨŽƌŵĂƟŽŶǁĂƐĞdžƚƌĂĐƚĞĚĨƌŽŵĐůŝŶŝĐĂů

ŝŶĨŽƌŵĂƟŽŶŐĂƚŚĞƌĞĚďLJůŽĐĂůŝŶƐƟƚƵƚĞƐƉĂƌƟĐŝƉĂƟŶŐŝŶ/W'͘dŚĞŐĞŶĞƟĐĚĂƚĂ ĐŽŶƐŝƐƚƐŽĨǁŚŽůĞĞdžŽŵĞƐĞƋƵĞŶĐŝŶŐĂŶĚĂŐĞŶŽƚLJƉĞĂƌƌĂLJĞŶƌŝĐŚĞĚĨŽƌ;ŶĞƵƌŽůŽŐŝĐͿ rare coding variants.

(5)

3

/EdZKhd/KE

'ĞŶĞƟĐ ĨĂĐƚŽƌƐ ƉůĂLJ ĂŶ ŝŵƉŽƌƚĂŶƚ ƌŽůĞ ŝŶ WĂƌŬŝŶƐŽŶ͛Ɛ ĚŝƐĞĂƐĞ ;WͿ ƉĂƚŚŽŐĞŶĞƐŝƐ͘ /Ŷ ĂĚĚŝƟŽŶƚŽƚŚĞĚŝƐĐŽǀĞƌLJŽĨƌĂƌĞǀĂƌŝĂŶƚƐƵƐŝŶŐĨĂŵŝůLJͲďĂƐĞĚůŝŶŬĂŐĞƐƚƵĚŝĞƐ͕ƌĞƐƵůƟŶŐŝŶ ƚŚĞŝĚĞŶƟĮĐĂƟŽŶŽĨĨŽƌĞdžĂŵƉůĞSNCA, LRRK2, parkin, DJ-1, PINK1 and VPS35, numerous ŐĞŶŽŵĞͲǁŝĚĞ ĂƐƐŽĐŝĂƟŽŶ ƐƚƵĚŝĞƐ ;'t^Ϳ ŚĂǀĞ ƐŚŽǁŶ ƚŚĂƚ ĐŽŵŵŽŶ ŐĞŶĞƟĐ ǀĂƌŝĂŶƚƐ increase PD risk.1dŚĞŵŽƐƚƌĞĐĞŶƚĂŶĚůĂƌŐĞƐƚWĂƐƐŽĐŝĂƟŽŶƐƚƵĚLJ2ŝĚĞŶƟĮĞĚŽǀĞƌϮϬ ĐŽŵŵŽŶƌŝƐŬǀĂƌŝĂŶƚƐ͕ĐŽŶĮƌŵŝŶŐŵĂŶLJƉƌĞǀŝŽƵƐůLJĂƐƐŽĐŝĂƚĞĚƌŝƐŬĨĂĐƚŽƌƐ͘ EĞǀĞƌƚŚĞůĞƐƐ͕ŚĞƌŝƚĂďŝůŝƚLJĞƐƟŵĂƚĞƐŝŶĚŝĐĂƚĞƚŚĂƚĂĚĚŝƟŽŶĂůŐĞŶĞƟĐƌŝƐŬĨĂĐƚŽƌƐ ƌĞŵĂŝŶ ƚŽ ďĞ ĚŝƐĐŽǀĞƌĞĚ ƐŝŶĐĞ Ă ƌĞůĂƟǀĞůLJ ůĂƌŐĞ ĨƌĂĐƟŽŶ ŽĨ W ŚĞƌŝƚĂďŝůŝƚLJ ĐĂŶŶŽƚ ďĞ ĞdžƉůĂŝŶĞĚďLJŬŶŽǁŶWƌŝƐŬůŽĐŝŽƌDĞŶĚĞůŝĂŶŐĞŶĞƐ͘3-5't^ĂƉƉƌŽĂĐŚĞƐĂƌĞƉƌŝŵĂƌŝůLJ ĚĞƐŝŐŶĞĚƚŽŝĚĞŶƟĨLJĐŽŵŵŽŶƌŝƐŬǀĂƌŝĂŶƚƐďLJƚŚĞƵƐĂŐĞŽĨŐĞŶŽƚLJƉŝŶŐĂƌƌĂLJƐ͘,ŽǁĞǀĞƌ͕ ĞŵĞƌŐŝŶŐĞǀŝĚĞŶĐĞƐƵŐŐĞƐƚƐƚŚĂƚƌĂƌĞǀĂƌŝĂŶƚƐŵĂLJĞdžƉůĂŝŶƉĂƌƚŽĨƚŚĞŵŝƐƐŝŶŐŚĞƌŝƚĂďŝůŝƚLJ͘6,7 ZĂƌĞǀĂƌŝĂŶƚƐŝŶƉƌŽƚĞŝŶĐŽĚŝŶŐƌĞŐŝŽŶƐĂƌĞŵŽƌĞůŝŬĞůLJƚŽĂīĞĐƚƚŚĞĨƵŶĐƟŽŶŽĨĂŐĞŶĞ than common variants which tag the causal variants via linkage disequilibrium (LD) and ĂƌĞŽŌĞŶůŽĐĂƚĞĚŝŶŶŽŶͲĐŽĚŝŶŐƌĞŐŝŽŶƐŽĨƚŚĞŐĞŶŽŵĞ͘8,9 Therefore, rare variants might ďĞ ŽĨ ŵŽƌĞ ŝŵƉŽƌƚĂŶĐĞ ƚŽ ĐŽŵƉůĞdž ĚŝƐĞĂƐĞƐ ƚŚĂŶ ƉƌĞĚŝĐƚĞĚ ďLJ ƚŚĞ ŽŵŵŽŶ ŝƐĞĂƐĞͲ Common Variant hypothesis.10-13 /Ŷ ĐŽŶƚƌĂƐƚ ƚŽ 't^͕ ĞdžŽŵĞ ƐĞƋƵĞŶĐŝŶŐ ƐƚƵĚŝĞƐ Ăŝŵ ĂƚƐLJƐƚĞŵĂƟĐĂůůLJĂŶĂůLJnjŝŶŐĐŽĚŝŶŐ ƌĞŐŝŽŶƐ ŽĨƚŚĞŐĞŶŽŵĞƚŽŝĚĞŶƟĨLJĐĂƵƐĂů ǀĂƌŝĂŶƚƐŝŶ ĐŽŵƉůĞdžĚŝƐĞĂƐĞƐ͘14džŽŵĞƐƚƵĚŝĞƐŚĂǀĞďĞĞŶƉƌŽǀĞŶƚŽďĞĞīĞĐƟǀĞĨŽƌƐƚƵĚLJŝŶŐĨĂŵŝůŝĂů disease15ďƵƚĂŶŝŶĐƌĞĂƐŝŶŐŶƵŵďĞƌŽĨĂƉƉůŝĐĂƟŽŶƐĨŽƌƉŽƉƵůĂƟŽŶƐͲďĂƐĞĚ ƐƚƵĚŝĞƐ ŚĂǀĞ been developed.16,17

(6)

3

Dd,K^

Subjects ůůWĐĂƐĞƐŝŶĐůƵĚĞĚŝŶƚŚŝƐƐƚƵĚLJŚĂǀĞŐŝǀĞŶǁƌŝƩĞŶŝŶĨŽƌŵĞĚĐŽŶƐĞŶƚ͘ZĞůĞǀĂŶƚůŽĐĂů ĞƚŚŝĐĂůĐŽŵŵŝƩĞĞƐĨŽƌŵĞĚŝĐĂůƌĞƐĞĂƌĐŚĂƉƉƌŽǀĞĚŝŶǀŽůǀĞŵĞŶƚŝŶŐĞŶĞƟĐƐƚƵĚŝĞƐ͘dŚĞW ƉĂƟĞŶƚƐǁĞƌĞĚŝĂŐŶŽƐĞĚƵƐŝŶŐƚŚĞh<ƌĂŝŶĂŶŬĐƌŝƚĞƌŝĂ͘26 tŚŽůĞĞdžŽŵĞƐĞƋƵĞŶĐŝŶŐĚĂƚĂƐĞƚ dŚĞǁŚŽůĞĞdžŽŵĞƐĞƋƵĞŶĐŝŶŐ;t^ͿĚĂƚĂƐĞƚŝŶĐůƵĚĞƐϭ͕ϭϲϳWĐĂƐĞƐĂŶĚϭ͕ϲϴϱĐŽŶƚƌŽůƐ ;ƉŽƐƚYͿŽĨƵƌŽƉĞĂŶĂŶĐĞƐƚƌLJ͘dŚĞWƉĂƟĞŶƚƐŚĂǀĞĂƚĞŶĚĞŶĐLJƚŽǁĂƌĚƐĂLJŽƵŶŐĂŐĞ of onset with an average of 41.2 years (SD = 10.9). 1,201 controls originate from the ZŽƩĞƌĚĂŵ^ƚƵĚLJǀĞƌƐŝŽŶϭ;Z^yϭͿ͕ĂƐǁĞŵĞƌŐĞĚƚŚĞ/W't^ĚĂƚĂǁŝƚŚƚŚĞZ^yϭt^ data.27dŚĞƐĂŵƉůĞƐǁĞƌĞƐĞƋƵĞŶĐĞĚŝŶĚŝīĞƌĞŶƚďĂƚĐŚĞƐǁŝƚŚƚǁŽĞdžŽŵĞĐĂƉƚƵƌĞŬŝƚƐ͗ džŽŵĞ>ŝďƌĂƌLJǀϮ͘Ϭ;ZŽĐŚĞͬEŝŵďůĞŐĞŶͿĂŶĚdƌƵƐĞƋdžŽŵĞŶƌŝĐŚŵĞŶƚ<ŝƚƚĂƌŐĞƟŶŐϰϰ͘ϭ DďĂŶĚϲϮDď͕ƌĞƐƉĞĐƟǀĞůLJ;^ƵƉƉůĞŵĞŶƚĂƌLJdĂďůĞϭͿ͘dŽĂĐĐŽƵŶƚĨŽƌƉƵƚĂƟǀĞƚĞĐŚŶŝĐĂů ĚŝīĞƌĞŶĐĞƐ ďĞƚǁĞĞŶ ƚŚĞ ĚŝīĞƌĞŶƚ ĐĂƉƚƵƌĞ ŬŝƚƐ͕ ǁĞ ŽŶůLJ ĐŽŶƐŝĚĞƌĞĚ ǀĂƌŝĂŶƚƐ ƚŚĂƚ ǁĞƌĞ targeted by both capture protocols and included preQC individual sample missingness (as ĂƌĞĨĞƌĞŶĐĞƚŽƐĞƋƵĞŶĐŝŶŐĐŽǀĞƌĂŐĞͿĂƐĐŽǀĂƌŝĂƚĞƐĚƵƌŝŶŐĂůůŐĞŶĞƟĐĂŶĂůLJƐĞƐ͘

KŶĂǀĞƌĂŐĞ͕ϵϰ͘ϰйŽĨƚŚĞĞdžŽŵĞǁĂƐĐŽǀĞƌĞĚĨŽƌĂƚůĞĂƐƚϭϬdž͘dŚĞϭϬϬͲďƉƉĂŝƌĞĚͲ end reads were sequenced on a HiSeq2000 and aligned to the human reference genome (build hg19) using Barrow Wheeler Aligner (BWA)-MEM28͘ 'ĞŶŽŵĞ ŶĂůLJƐŝƐ dŽŽůŬŝƚ29 ;'d<ͿĐĂůůĞĚƐŝŶŐůĞŶƵĐůĞŽƟĚĞǀĂƌŝĂŶƚƐ;^EsƐͿĂŶĚƐŵĂůůŝŶƐĞƌƟŽŶƐͬĚĞůĞƟŽŶƐ;ŝŶĚĞůƐͿĨŽƌ ĞĂĐŚƐĂŵƉůĞ͕ƌĞƐƵůƟŶŐŝŶŝŶĚŝǀŝĚƵĂůŐs&ĮůĞƐ͘'ĞŶŽƚLJƉĞƐŽĨĂůů/W'ĂŶĚZ^yϭĞdžŽŵĞ ƐĂŵƉůĞƐ ǁĞƌĞ ƚŚĞŶ ũŽŝŶƚůLJ ĐĂůůĞĚ ĂŶĚ ƌĞĐĂůŝďƌĂƚĞĚ͕ ĂůůŽǁŝŶŐ ƚŽ ŵĞƌŐĞ ƚŚĞ ĚŝƐƟŶĐƚ t^ ĚĂƚĂƐĞƚƐŝŶĂĐŽƌƌĞĐƚŵĂŶŶĞƌ͘^ƚĂŶĚĂƌĚ'd<ĮůƚĞƌƐƚĞƉƐǁĞƌĞĂƉƉůŝĞĚ͕ƚŽŐĞƚŚĞƌǁŝƚŚĂ minimum genotype quality Phred-score of 20 and depth of 8, to only select high-quality ǀĂƌŝĂŶƚƐ͘ KŶůLJ ďŝͲĂůůĞůŝĐ ĐĂůůƐ ǁĞƌĞ ĐŽŶƐŝĚĞƌĞĚ ƚŚĂƚ ǁĞƌĞ ůŽĐĂƚĞĚ ŝŶ ƌĞŐŝŽŶƐ ƚĂƌŐĞƚĞĚ ďLJ ďŽƚŚĐĂƉƚƵƌĞŬŝƚƐ͘^ƵƉƉůĞŵĞŶƚĂƌLJdĂďůĞϮƌĞƉŽƌƚƐƚŚĞĞdžŽŶƐƚŚĂƚŚĂǀĞďĞĞŶĞdžĐůƵĚĞĚĚƵĞ ƚŽŝŶƐƵĸĐŝĞŶƚĐŽǀĞƌĂŐĞǁŝƚŚŝŶŽŶĞŽĨƚŚĞĞdžŽŵĞĐĂƉƚƵƌĞƉƌŽƚŽĐŽůƐ͘

NeuroX dataset

(7)

3

YƵĂůŝƚLJƉƌŽĐĞĚƵƌĞƐ

&Žƌ ŝŶĚŝǀŝĚƵĂů Y ŝŶ ďŽƚŚ ƚŚĞ t^ ĂŶĚ ƚŚĞ EĞƵƌŽy ĚĂƚĂƐĞƚƐ͕ ƐĂŵƉůĞƐ ǁĞƌĞ ƌĞŵŽǀĞĚ ǁŚĞŶ ƐŚŽǁŝŶŐ ŐĞŶĚĞƌ ĂŵďŝŐƵŝƚLJ͕ ĚƵďŝŽƵƐ ŚĞƚĞƌŽnjLJŐŽƐŝƚLJͬŐĞŶŽƚLJƉĞ ĐĂůůƐ͕ ĞǀŝĚĞŶĐĞ ŽĨ ƌĞůĂƚĞĚŶĞƐƐ͕ŽƌďĞŝŶŐĂƉŽƉƵůĂƟŽŶŽƵƚůŝĞƌ͘dŚĞůĂƩĞƌƚǁŽǁĞƌĞĐĂůĐƵůĂƚĞĚǁŝƚŚ>ͲƉƌƵŶĞĚ ĐŽŵŵŽŶ ǀĂƌŝĂŶƚƐ͘ sĂƌŝĂŶƚ Y ƉƌŽĐĞĚƵƌĞƐ ǁĞƌĞ ƐůŝŐŚƚůLJ ĚŝīĞƌĞŶƚ ĨŽƌ ƚŚĞ ƚǁŽ ĚŝīĞƌĞŶƚ ĚĂƚĂƐĞƚƐ͘&ŽƌƚŚĞt^ĚĂƚĂƐĞƚ͕ǀĂƌŝĂŶƚƐƉĂƐƐĞĚYǁŚĞŶŚĂǀŝŶŐĂŵŝŶŝŵƵŵĐĂůůƌĂƚĞх 85% and being in Hardy-Weinberg equilibrium (HWE pͲǀĂůƵĞƐхϭĞͲϴďĂƐĞĚŽŶĐŽŶƚƌŽůƐͿ͘ &ŽƌƚŚĞEĞƵƌŽyĚĂƚĂƐĞƚ͕ǀĂƌŝĂŶƚƐǁĞƌĞĞdžĐůƵĚĞĚĨŽƌƐƵďƐĞƋƵĞŶƚĂŶĂůLJƐĞƐǁŝƚŚĂŵŝŶŝŵƵŵ call rate < 95%, a HWE pͲǀĂůƵĞфϭĞͲϲ͕ŽƌǁŝƚŚƐŝŐŶŝĮĐĂŶƚĚŝīĞƌĞŶĐĞƐŝŶŵŝƐƐŝŶŐŶĞƐƐƌĂƚĞ between cases and controls.

ĂƵƐĂůŐĞŶĞƐĞůĞĐƟŽŶǁŝƚŚŝŶWƌŝƐŬůŽĐŝ

(8)

3

ŽŵďŝŶĞĚ ŶŶŽƚĂƟŽŶ ĞƉĞŶĚĞŶƚ ĞƉůĞƟŽŶ ;Ϳ ǀϭ͘34 &ŝŐƵƌĞ ϭ ĚŝƐƉůĂLJƐ Ă ǁŽƌŬŇŽǁ ŽĨ ƚŚĞ ĐůĂƐƐŝĮĐĂƟŽŶ ŽĨ ƚŚĞ ĚŝīĞƌĞŶƚ ǀĂƌŝĂŶƚ ƐƵďƐĞƚƐ͘ dŚĞ ĞdžŽŶŝĐ ƐƵďƐĞƚ ǁĂƐ ĞdžĐůƵƐŝǀĞůLJ tested for the gene-set analysis to determine the involvement of common PD risk loci in ƚŚĞt^ĂŶĚEĞƵƌŽyĚĂƚĂƐĞƚ͘dŚĞ^ĞƋƵĞŶĐĞ<ĞƌŶĞůƐƐŽĐŝĂƟŽŶdĞƐƚ;^<dͿ35,36was used to ƉĞƌĨŽƌŵďƵƌĚĞŶĂŶĂůLJƐĞƐ͘dŚĞD&ƚŚƌĞƐŚŽůĚ͕ƐĞƉĂƌĂƟŶŐƚŚĞƌĂƌĞĂŶĚĐŽŵŵŽŶǀĂƌŝĂŶƚƐ͕ ǁĂƐďĂƐĞĚŽŶƚŚĞƚŽƚĂůƐĂŵƉůĞƐŝnjĞƵƐŝŶŐƚŚĞĨŽƌŵƵůĂ;dсϭͬ;я;ϮŶͿͿͿƐƵŐŐĞƐƚĞĚďLJ^<d͕36

dĂďůĞϭ͘Selected set of genes

WŽůLJŵŽƌƉŚŝƐŵ >ŽĐĂƟŽŶ;ŚŐϭϵͿ PͲǀĂůƵĞ Seeding SNP WƌŝdžĮdžĞŐĞŶĞ rs71628662 chr1:155359992 ϲ͘ϴϲdžϭϬ-28 NA rs823118 chr1:205723572 ϭ͘ϵϲdžϭϬ-16 rs823114 RAB7L1 rs10797576 chr1:232664611 ϭ͘ϳϲdžϭϬ-10 rs2182431 SIPA1L2 rs6430538 chr2:135539967 ϯ͘ϯϱdžϭϬ-19 rs6430538 ACMSD rs1955337 chr2:169129145 ϭ͘ϲϳdžϭϬ-20 rs2390669 STK39 rs12637471 chr3:182762437 ϱ͘ϯϴdžϭϬ-22 rs12637471 LAMP3 rs11724635 chr4:15737101 ϰ͘ϮϲdžϭϬ-17 rs11724635 &y>ϱ rs6812193 chr4:77198986 ϭ͘ϴϱdžϭϬ-11 rs6812193 STBD1 rs356182 chr4:90626111 ϭ͘ϴϱdžϭϬ-82 rs356219 SNCA rs34311866 chr4:951947 ϲ͘ϬdžϭϬ-41 rs748483 D&^ϳ rs9275326 chr6:32666660 ϱ͘ϴϭdžϭϬ-13 rs9275311 HLA-DRB5 rs199347 chr7:23293746 ϱ͘ϲϮdžϭϬ-14 rs199347 GPNMB rs591323 chr8:16697091 ϯ͘ϭϳdžϭϬ-8 NA rs117896735 chr10:121536327 ϭ͘ϮϭdžϭϬ-11 rs10886515 RGS10 rs329648 chr11:133765367 ϴ͘ϬϱdžϭϬ-12 rs329648 SPATA19 rs3793947 chr11:83544472 Ϯ͘ϱϵdžϭϬ-08 rs1400313 DLG2 rs11060180 chr12:123303586 ϯ͘ϬϴdžϭϬ-11 rs11060180 HIP1R rs76904798 chr12:40614434 ϰ͘ϴϲdžϭϬ-14 rs2708435 LRRK2 rs7155501 chr14:55347827 ϭ͘ϮϱdžϭϬ-10 rs2878174 LGALS3 rs1555399 chr14:67984370 ϱ͘ϳϬdžϭϬ-16 rs7155830 ARG2 rs2414739 chr15:61994134 ϯ͘ϱϵdžϭϬ-12 NA rs14235 chr16:31121793 ϯ͘ϲϯdžϭϬ-12 rs14235 PRSS8 rs17649553 chr17:43994648 ϲ͘ϭϭdžϭϬ-49 rs17649553 MAPT rs12456492 chr18:40673380 Ϯ͘ϭϱdžϭϬ-11 rs12456492 RIT2 rs62120679 chr19:2363319 Ϯ͘ϱϮdžϭϬ-09 rs2074546 PLEKHJ1 rs55785911 chr20:3153503 ϯ͘ϯϬdžϭϬ-10 rs2295545 AVP

(9)
(10)

3

dŽ ĐŽƌƌĞĐƚ ĨŽƌ ĐŽŶĨŽƵŶĚŝŶŐ ĨĂĐƚŽƌƐ ;Ğ͘Ő͘ ƉŽƉƵůĂƟŽŶ ƐƚƌĂƟĮĐĂƟŽŶ ĂŶĚ ƚĞĐŚŶŝĐĂů ĂƌƟĨĂĐƚƐͿ͕ǁĞŝŶĐůƵĚĞĚϮϬŵƵůƟͲĚŝŵĞŶƐŝŽŶĂůƐĐĂůŝŶŐĐŽŵƉŽŶĞŶƚƐ͕ŐĞŶĚĞƌĂŶĚŝŶĚŝǀŝĚƵĂů missingness rate pre QC (as a reference to the individual WES coverage) for the WES ĚĂƚĂƐĞƚ͘ƐƚŚĞEĞƵƌŽyĚĂƚĂƐĞƚŝƐŵŽƌĞŚŽŵŽŐĞŶĞŽƵƐ͕ǁĞĐŽƌƌĞĐƚĞĚĨŽƌƚŚĞĮƌƐƚϰD^ components and gender. Empirical pͲǀĂůƵĞƐǁĞƌĞĐĂůĐƵůĂƚĞĚĨŽƌƐŝŐŶŝĮĐĂŶƚƐĂŵƉůĞƌĞƐƵůƚƐ (pфϬ͘ϬϱͿ͘&ŽƌƚŚĞŐĞŶĞͲƐĞƚĂŶĂůLJƐŝƐ͕ƚŚĞŽƌŝŐŝŶĂůƐĂŵƉůĞp-value of the gene-set of interest was compared to pͲǀĂůƵĞƐŽĨϭ͕ϬϬϬƌĂŶĚŽŵůLJĚƌĂǁŶŐĞŶĞͲƐĞƚƐŽĨƚŚĞƐĂŵĞƐŝnjĞ͘&ŽƌƚŚĞ ŝŶĚŝǀŝĚƵĂů ŐĞŶĞ ĂƐƐŽĐŝĂƟŽŶƐ͕ ĞŵƉŝƌŝĐĂů p-values were calculated using the resampling ŵĞƚŚŽĚŝŵƉůĞŵĞŶƚĞĚďLJ^<d͕ďLJϭϬ͕ϬϬϬƉĞƌŵƵƚĂƟŽŶƐŽĨƚŚĞĂīĞĐƟŽŶƐƚĂƚƵƐ͘ŵƉŝƌŝĐĂů

p-values are calculated by (n1+1)/(n+1), where n1 = the number of resampling p-values

smaller than the original sample p-value and n = the number of resampling.

WŽǁĞƌĐĂůĐƵůĂƟŽŶƐ tĞ ĞƐƟŵĂƚĞĚ ƚŚĞ ƉŽǁĞƌ ŽĨ ŽƵƌ ƐƚƵĚLJ ĚĞƐŝŐŶ ƚŽ ĚĞƚĞĐƚ ƌĂƌĞ ǀĂƌŝĂŶƚ ĂƐƐŽĐŝĂƟŽŶƐ͘ ^ƵƉƉůĞŵĞŶƚĂƌLJ dĂďůĞ ϯ ĚŝƐƉůĂLJƐ ƚŚĞ ƉĂƌĂŵĞƚĞƌƐ ƚŚĂƚ ǁĞƌĞĐŚŽƐĞŶ ĨŽƌ ƚŚĞ ĐĂůĐƵůĂƟŽŶƐ͘ &ŽƌďŽƚŚĚĂƚĂƐĞƚƐ͕ƚŚĞWƉƌĞǀĂůĞŶĐĞǁĂƐƐĞƚƚŽϬ͘ϬϬϱϳ38͘ƐĂƉƉƌŽdžŝŵĂƚĞůLJŚĂůĨŽĨƚŚĞ ůŽĐŝŝŶWŐĞŶĞ͘ŽƌŐŚĂǀĞĂŶŽĚĚƐƌĂƟŽďĞůŽǁϭ͕ƚŚĞƉĞƌĐĞŶƚĂŐĞƉƌŽƚĞĐƟǀĞĞīĞĐƚǁĂƐƐĞƚ ƚŽϱϬй͘ƚŚŽƵƐĂŶĚƐŝŵƵůĂƟŽŶƐ;ɲсϬ͘ϬϮϱͿǁĞƌĞƉĞƌĨŽƌŵĞĚŽŶĂŚĂƉůŽƚLJƉĞŵĂƚƌŝdžŽĨ ^<d͕ŵŝŵŝĐŬŝŶŐůŝŶŬĂŐĞĚŝƐĞƋƵŝůŝďƌŝƵŵƐƚƌƵĐƚƵƌĞŽĨƵƌŽƉĞĂŶĂŶĐĞƐƚƌLJ͕ĐŽŵƉƌŝƐŝŶŐϭϬ͕ϬϬϬ haplotypes over 200 kb regions.

Z^h>d^

t^ĂŶĚƌĂƌĞǀĂƌŝĂŶƚƐ

&ŝƌƐƚ͕ǁĞĂŶĂůLJnjĞĚƚŚĞt^ĚĂƚĂƐĞƚĂƐŝƚƌĞƉƌĞƐĞŶƚƐĂůůĞdžŽŶŝĐǀĂƌŝĂŶƚƐ͕ŽĨǁŚŝĐŚƚŚĞƐƚƵĚLJ ĚĞƐŝŐŶŚĂƐϲϱйƉŽǁĞƌƚŽĚĞƚĞĐƚĂƌĂƌĞǀĂƌŝĂŶƚĂƐƐŽĐŝĂƟŽŶƐŝŐŶĂůĐŽŶƐŝĚĞƌŝŶŐŝŶĚŝǀŝĚƵĂů ŐĞŶĞƐ͘dĞƐƟŶŐƚŚĞĂŐŐƌĞŐĂƚĞĚĞīĞĐƚŽĨŐƌŽƵƉĞĚǀĂƌŝĂŶƚƐǁŝƚŚŝŶĂŐĞŶĞͲƐĞƚŚĂƐƚŚĞƉŽƚĞŶƟĂů to increase power. Supplementary Table 4 shows the results of the gene-set analyses in ƚŚĞt^ĚĂƚĂƐĞƚ͘ŽŵŵŽŶĞdžŽŶŝĐǀĂƌŝĂŶƚƐĂƌĞŵŽĚĞƌĂƚĞůLJĂƐƐŽĐŝĂƚĞĚƚŽW͘dŚĞŶŽŵŝŶĂů p-ǀĂůƵĞŝƐƐŝŐŶŝĮĐĂŶƚ͕ďƵƚƚŚĞĞŵƉŝƌŝĐĂů p-ǀĂůƵĞĞdžĐĞĞĚƐϬ͘Ϭϱ͘ůƚŚŽƵŐŚǁĞĂŶƟĐŝƉĂƚĞĚ Ă ƐŝŐŶŝĮĐĂŶƚ ĂƐƐŽĐŝĂƟŽŶ ŽĨ ĐŽŵŵŽŶ ǀĂƌŝĂŶƚƐ͕ ǁĞ ĂƩƌŝďƵƚĞ ƚŚĞ ŵŽĚĞƌĂƚĞ ĂƐƐŽĐŝĂƟŽŶ ƚŽ Ă ƌĞůĂƟǀĞůLJ ůŽǁ ƐĂŵƉůĞ ƐŝnjĞ ;ĐŽŵƉĂƌĞĚ ƚŽ ƚŚĞ ŽƌŝŐŝŶĂů 't^Ϳ͕ ĂŶĚ ƚŚĞ ƐĞůĞĐƟŽŶ ŽĨ ŐĞŶĞƐ;ďLJWƌŝdžĮdžĞͿǁŝƚŚǀĂƌŝĂŶƚƐŝŶŵŽĚĞƌĂƚĞ>ǁŝƚŚƚŚĞŽƌŝŐŝŶĂůŚŝŐŚĞƐƚ^EW͘dŚĞŐĞŶĞͲ ƐĞƚĂƐƐŽĐŝĂƟŽŶŝƐĂďƐĞŶƚǁŚĞŶĨŽĐƵƐŝŶŐŽŶƚŚĞĐŽŵŵŽŶĂŵŝŶŽĂĐŝĚĐŚĂŶŐŝŶŐĂŶĚ variants, which is probably due to a decrease in power as the number of variants drops.

(11)

3

ŐĞŶĞƐƉĞƌǀĂƌŝĂŶƚƐƵďƐĞƚ͘&ŽƌƚŚĞǀĂƌŝĂŶƚƐ͕ŶŽŐĞŶĞŝƐŝŶĚĞƉĞŶĚĞŶƚůLJĂƐƐŽĐŝĂƚĞĚƚŽ W͘,ŽǁĞǀĞƌ͕ŝŶǀĞƐƟŐĂƟŶŐƚŚĞĂŵŝŶŽͲĂĐŝĚĐŚĂŶŐŝŶŐǀĂƌŝĂŶƚƐƵďƐĞƚƌĞƐƵůƚĞĚŝŶĂƐŝŐŶŝĮĐĂŶƚ ĂƐƐŽĐŝĂƟŽŶĨŽƌSTBD1 (empirical p = 0.046).

NeuroX and rare variants

dŚĞ EĞƵƌŽy ĚĂƚĂƐĞƚ ĐŽŶƚĂŝŶƐ ƉƌĞǀŝŽƵƐůLJ ŝĚĞŶƟĮĞĚ ĞdžŽŶŝĐ ǀĂƌŝĂŶƚƐ͕ ŽĨ ǁŚŝĐŚ Ă ůĂƌŐĞ ƉƌŽƉŽƌƟŽŶ ŝƐ ƌĂƌĞ͘30 dŚĞ ůĂƌŐĞƌ ƐĂŵƉůĞ ƐŝnjĞ ;ϲ͕ϴϬϭ ĐĂƐĞƐ ĂŶĚ ϱ͕ϵϳϬ ĐŽŶƚƌŽůƐͿ ŝŶĐƌĞĂƐĞƐ ƚŚĞƉŽǁĞƌ;ĞƐƟŵĂƚĞĚĂƚϵϲйͿƚŽĚĞƚĞĐƚĂƌĂƌĞǀĂƌŝĂŶƚĂƐƐŽĐŝĂƟŽŶƐŝŐŶĂů͘^ŝŵŝůĂƌůLJƚŽƚŚĞ t^ĚĂƚĂƐĞƚ͕ĂŵŽĚĞƌĂƚĞĐŽŵŵŽŶǀĂƌŝĂŶƚĂƐƐŽĐŝĂƟŽŶŝƐĚĞƚĞĐƚĞĚ;ŶŽŵŝŶĂůp = 0.031). /ŶĐŽŶƚƌĂƐƚƚŽƚŚĞt^ĚĂƚĂƐĞƚ͕ǁĞĚŽŽďƐĞƌǀĞƐŝŐŶŝĮĐĂŶƚĂƐƐŽĐŝĂƟŽŶƐŽĨƚŚĞŐĞŶĞͲƐĞƚ ǁŝƚŚW͕ĞǀĞŶǁŚĞŶŽŶůLJĐŽŶƐŝĚĞƌŝŶŐƌĂƌĞǀĂƌŝĂŶƚƐ;ĐŚĂŶŐŝŶŐсϬ͘ϬϬϳ͖сϬ͘ϬϬϮ͖ Supplementary Table 5a).

dŽĚŝƐĐŽǀĞƌǁŚĞƚŚĞƌƐƉĞĐŝĮĐŐĞŶĞƐĚƌŝǀĞƚŚŝƐŽďƐĞƌǀĞĚƌĂƌĞǀĂƌŝĂŶƚĂƐƐŽĐŝĂƟŽŶ ĚĞƚĞĐƚĞĚǁŝƚŚƚŚĞƚǁŽͲƐŝĚĞĚ^<dƚĞƐƚ͕ƚŚĞǀĂƌŝĂŶƚƐǁĞƌĞŐƌŽƵƉĞĚƉĞƌŐĞŶĞĂŶĚĂŐĂŝŶ ƚǁŽͲƐŝĚĞĚƚĞƐƚĞĚĨŽƌƚŚĞŝƌĂƐƐŽĐŝĂƟŽŶƚŽW͘LRRK2 ŝƐƚŚĞŐĞŶĞĚƌŝǀŝŶŐƚŚĞĂƐƐŽĐŝĂƟŽŶ ŽďƐĞƌǀĞĚŝŶƚŚĞƚŽƚĂůŐĞŶĞͲƐĞƚ;^ƵƉƉůĞŵĞŶƚĂƌLJdĂďůĞϲͿ͘&ŽĐƵƐŝŶŐŽŶƚŚĞƐƵďƐĞƚ͕ ƚŚŝƐĂƐƐŽĐŝĂƟŽŶ;ŶŽŵŝŶĂůp сϱ͘ϭϳdžϭϬ-13) is considerably stronger than the second most ƐŝŐŶŝĮĐĂŶƚŐĞŶĞ͗SPATA19 (nominal p сϬ͘ϬϱϬͿ͘ƐƚŚĞEĞƵƌŽyĂƌƌĂLJŝƐŶĞƵƌŽůŽŐLJƐƉĞĐŝĮĐ͕ŝƚ harbors many variants of the known PD gene LRRK2͘&ƵƌƚŚĞƌŵŽƌĞ͕LRRK2 is a large gene coding for 2,527 amino acids. As a comparison, NeuroX contains 32 harmful (predicted by CADD) LRRK2 variants, while only 2 harmful variants are present for SPATA19. The variants in LRRK2 are overrepresented and biasing the results of the total gene-sets. We therefore ƉĞƌĨŽƌŵĞĚƚŚĞƐĂŵĞŐĞŶĞͲƐĞƚĂŶĂůLJƐĞƐŽŶƚŚĞEĞƵƌŽyĚĂƚĂƐĞƚĞdžĐůƵĚŝŶŐƚŚĞǀĂƌŝĂŶƚƐŽĨ

LRRK2;^ƵƉƉůĞŵĞŶƚĂƌLJdĂďůĞϱďͿ͕ƌĞƐƵůƟŶŐŝŶƚŚĞĂďƐĞŶĐĞŽĨĂƌĂƌĞǀĂƌŝĂŶƚĂƐƐŽĐŝĂƟŽŶ

in the NeuroX dataset (nominal p ш Ϭ͘ϮϴͿ͘ dŚŝƐ ƐƵŐŐĞƐƚƐ ƚŚĂƚ ƚŚĞ ƉƌĞǀŝŽƵƐůLJ ŽďƐĞƌǀĞĚ ĂƐƐŽĐŝĂƟŽŶŽĨƌĂƌĞǀĂƌŝĂŶƚƐǁŝƚŚŝŶƚŚĞƚŽƚĂůŐĞŶĞͲƐĞƚƚŽWǁĂƐƐŽůĞůLJĚƌŝǀĞŶďLJLRRK2.

dŚĞƚǁŽͲƐŝĚĞĚ^<dĂŶĂůLJƐŝƐƉĞƌŐĞŶĞĂŝŵĞĚĂƚƚŚĞĚŝƐĐŽǀĞƌLJŽĨŐĞŶĞƐĚƌŝǀŝŶŐ ƚŚĞƌĂƌĞǀĂƌŝĂŶƚĂƐƐŽĐŝĂƟŽŶŝŶƚŚĞƚŽƚĂůŐĞŶĞͲƐĞƚ͘EĞdžƚ͕ǁĞǁĞƌĞŝŶƚĞƌĞƐƚĞĚƚŽĞdžƉůŽƌĞƚŚĞ

dĂďůĞϮ͘'ĞŶĞͲďĂƐĞĚƌĂƌĞǀĂƌŝĂŶƚĂƐƐŽĐŝĂƟŽŶƌĞƐƵůƚƐĨŽƌt^ĚĂƚĂƐĞƚ͘

sĂƌŝĂŶƚƚLJƉĞ Gene pͲǀĂůƵĞ;ĞŵƉͿ n variants maf cases ŵĂĨĐŽŶƚƌŽůƐ

(12)

3

ŐĞŶĞƟĐďƵƌĚĞŶŽĨƌĂƌĞǀĂƌŝĂŶƚƐĨŽƌĞĂĐŚŐĞŶĞŝŶĚŝǀŝĚƵĂůůLJǁŚĞŶĂƐƐƵŵŝŶŐĂůůƌĂƌĞǀĂƌŝĂŶƚƐ ƚŽŚĂǀĞƚŚĞƐĂŵĞĚŝƌĞĐƟŽŶŽĨĞīĞĐƚ;ŽŶĞͲƐŝĚĞĚhZEƚĞƐƚͿ͘dĂďůĞϯƐŚŽǁƐĂŐĂŝŶƚŚĂƚ

LRRK2 (empirical p с Ϭ͘ϬϬϬϱͿ ŝƐ ƚŚĞ ƐƚƌŽŶŐĞƐƚ ĂƐƐŽĐŝĂƚĞĚ ŐĞŶĞ͘ &ƵƌƚŚĞƌŵŽƌĞ͕ SPATA19

(empirical p сϬ͘ϬϭϳͿŝƐƐŝŐŶŝĮĐĂŶƚůLJĂƐƐŽĐŝĂƚĞĚǁŚĞŶƐƉĞĐŝĮĐĂůůLJĐŽŶƐŝĚĞƌŝŶŐƌĂƌĞ variants.

dĂďůĞϯ͘'ĞŶĞͲďĂƐĞĚƌĂƌĞǀĂƌŝĂŶƚĂƐƐŽĐŝĂƟŽŶƌĞƐƵůƚƐĨŽƌEĞƵƌŽyĚĂƚĂƐĞƚ͘

sĂƌŝĂŶƚƚLJƉĞ Gene pͲǀĂůƵĞ;ĞŵƉͿ n variants maf cases ŵĂĨĐŽŶƚƌŽůƐ

AAchanging LRRK2 Ϭ͘ϬϬϬϰ;Ϭ͘ϬϬϬϱͿ 48 1.70% 1.13% RIT2 0.051 2 0.00% 0.03% PRSS8 0.098 1 0.04% 0.01% CADD LRRK2 Ϭ͘ϬϬϬϯ;Ϭ͘ϬϬϬϱͿ 32 1.38% 0.86% SPATA19 Ϭ͘Ϭϭϰ;Ϭ͘ϬϭϳͿ 2 0.05% 0.00% RIT2 0.051 2 0.00% 0.03% p-value = nominal pͲǀĂůƵĞ͖;ĞŵƉ͘ͿсĞŵƉŝƌŝĐĂůpͲǀĂůƵĞĐĂůĐƵůĂƚĞĚďLJĐŽŵƉĂƌŝƐŽŶƚŽϭϬ͕ϬϬϬƉĞƌŵƵƚĂƟŽŶƐŽĨĂīĞĐƟŽŶƐƚĂƚƵƐ͘ ĐŚĂŶŐŝŶŐсĂŵŝŶŽĂĐŝĚĐŚĂŶŐŝŶŐǀĂƌŝĂŶƚƐ͖сǀĂƌŝĂŶƚƐƉƌĞĚŝĐƚĞĚƉĂƚŚŽŐĞŶŝĐ ŝƌĞĐƟŽŶĂůŝƚLJŽĨĞīĞĐƚ tĞĨƵƌƚŚĞƌĞdžƉůŽƌĞĚƚŚĞƐŝŐŶŝĮĐĂŶƚŝŶĚŝǀŝĚƵĂůĂƐƐŽĐŝĂƟŽŶƐŝŐŶĂůƐ;ĞŵƉŝƌŝĐĂůp < 0.05) for

LRRK2, STBD1, and SPATA19. By focusing on the variant level we aimed to comprehend the

(13)

3

frequency of 6.2% in cases and 6.9% in controls. All 3 variants were detected in the WES ĚĂƚĂƐĞƚ͕ĂůůŽǁŝŶŐƚŽƚĞƐƚƚŚĞĨƵůůŚĂƉůŽƚLJƉĞ;'ͲͲͿ͘ůƚŚŽƵŐŚƚŚĞŚĂƉůŽƚLJƉĞĂƐƐŽĐŝĂƟŽŶ ǁĂƐŶŽƚƐŝŐŶŝĮĐĂŶƚŝŶƚŚĞt^ĚĂƚĂƐĞƚ;KZсϬ͘ϴϭ͕p Ϭ͘ϮϮϯͿ͕ƚŚĞƚƌĞŶĚŽĨĞīĞĐƚŝƐƐŝŵŝůĂƌǁŝƚŚ a minor haplotype frequency of 7.0% in cases and 7.5% in controls. The smaller sample ƐŝnjĞŽĨƚŚĞt^ĚĂƚĂƐĞƚŝƐĂƉůĂƵƐŝďůĞƌĞĂƐŽŶĨŽƌŶŽƚŽďƚĂŝŶŝŶŐĂƐŝŐŶŝĮĐĂŶƚĂƐƐŽĐŝĂƟŽŶ͘

EĞdžƚ͕ ƚŚĞ t^ͲďĂƐĞĚ STBD1 and NeuroX-based SPATA19 ǁĞƌĞ ŝŶǀĞƐƟŐĂƚĞĚ ĨŽƌ ƚŚĞŝƌ ǀĂƌŝĂŶƚ ĨƌĞƋƵĞŶĐŝĞƐ͘ ^ŝŶŐůĞͲŵĂƌŬĞƌ ĂƐƐŽĐŝĂƟŽŶ ĂŶĂůLJƐŝƐ ƐŚŽǁĞĚ ŶŽ ƐŝŐŶŝĮĐĂŶƚ individual results for the 8 variants within STBD1. It therefore appears that the observed ƌĂƌĞ ǀĂƌŝĂŶƚ ĂƐƐŽĐŝĂƟŽŶ ŝƐ ŶŽƚ ĐĂƵƐĞĚ ďLJ ŽŶĞ ĞdžĐůƵƐŝǀĞ ǀĂƌŝĂŶƚ ďƵƚ ŝƐ ƌĂƚŚĞƌ ƚŚĞ ĞīĞĐƚ ŽĨ ŵƵůƟƉůĞ ƌĂƌĞ ǀĂƌŝĂŶƚƐ͘ ^ĞǀĞŶ ŽĨ ƚŚĞ ϴ ǀĂƌŝĂŶƚƐ ĂƌĞ ĐŽŶƚƌŽůͲƐƉĞĐŝĮĐ ĂƐ ƚŚĞLJ ĂƌĞ ŽŶůLJ present in 10 control individuals. In contrast, only 1 variant is present in a single case. dŚĞĚŝƌĞĐƟŽŶŽĨĞīĞĐƚŽĨƚŚĞǀĂƌŝĂŶƚƐƚŚĂƚĂƌĞŐĞŶĞƌĂƟŶŐƚŚĞSTBD1 ŐĞŶĞĂƐƐŽĐŝĂƟŽŶŝƐ ƚŚĞƌĞĨŽƌĞŝŵƉůŝĞĚƚŽďĞƉƌŽƚĞĐƟǀĞ͘dŚĞƐŝŐŶŝĮĐĂŶƚŐĞŶĞͲďĂƐĞĚĂƐƐŽĐŝĂƟŽŶĨŽƌSPATA19 is ƌĞůĂƟǀĞůLJƐƚƌŽŶŐĐŽŶƐŝĚĞƌŝŶŐƚŚĂƚŝƚŝƐĚƌŝǀĞŶďLJŽŶůLJϮǀĂƌŝĂŶƚƐƚŚĂƚĂƌĞƉƌĞƐĞŶƚŝŶϳ cases and 0 controls. The absence of SPATA19 CADD variants in controls suggests that the ĂƐƐŽĐŝĂƟŽŶƐŝŐŶĂůŝƐĚĂŵĂŐŝŶŐ͘,ŽǁĞǀĞƌ͕SPATA19 variants could have been missed as the NeuroX dataset is array-based.

/^h^^/KE

dŽ ĞƐƚĂďůŝƐŚ ƚŚĞ ŝŶŇƵĞŶĐĞ ŽĨ ƌĂƌĞ ǀĂƌŝĂŶƚƐ ŝŶ ƐƉŽƌĂĚŝĐ W ƌŝƐŬ ůŽĐŝ͕ ǁĞ ĞdžƉůŽƌĞĚ ƚǁŽ independent PD datasets (WES and NeuroX) enriched for coding rare variants. We used ƚŚĞWƌŝdž&ŝdžĞƐƚƌĂƚĞŐLJƚŽƐĞůĞĐƚƚŚĞŵŽƐƚůŝŬĞůLJĐĂƵƐĂůŐĞŶĞƐƵŶĚĞƌůLJŝŶŐƚŚĞWůŽĐŝƉĞĂŬƐ͕ ǁŚŝĐŚŝƐďĂƐĞĚŽŶŽǀĞƌůĂƉƉŝŶŐďŝŽůŽŐŝĐĂůĨƵŶĐƟŽŶĂůƐŝŵŝůĂƌŝƟĞƐ͘tĞƚĞƐƚĞĚďŽƚŚƚŚĞĞīĞĐƚ ŽĨƌĂƌĞǀĂƌŝĂŶƚƐŝŶƚŚĞŐĞŶĞͲƐĞƚĂƚŽŶĐĞ͕ĂƐĞĂĐŚŐĞŶĞŝŶĚŝǀŝĚƵĂůůLJ͘ŐŐƌĞŐĂƟŶŐǀĂƌŝĂŶƚƐ ƐŝŵƵůƚĂŶĞŽƵƐůLJĂĐƌŽƐƐĂƐĞƚŽĨŐĞŶĞƐŚĂƐƚŚĞƉŽƚĞŶƟĂůƚŽŝŶĐƌĞĂƐĞƉŽǁĞƌƚŽĚĞƚĞĐƚĂŶ ĂƐƐŽĐŝĂƟŽŶƐŝŐŶĂů͕ŐŝǀĞŶƚŚĂƚƚŚĞƐĞůĞĐƚĞĚŐĞŶĞƐĂƌĞĞŶƌŝĐŚĞĚĨŽƌĂŐƌŽƵƉŽĨŐĞŶĞƐƚŚĂƚĂƌĞ genuinely involved in the disease pathogenesis.

dŚĞĂǀĞƌĂŐĞĂŐĞŽĨŽŶƐĞƚǁŝƚŚŝŶƚŚĞĐĂƐĞŐƌŽƵƉŽĨƚŚĞt^ĚĂƚĂƐĞƚ;ΕϰϭLJĞĂƌƐͿ ŝƐϮϬLJĞĂƌƐLJŽƵŶŐĞƌƚŚĂŶŝŶƚŚĞŵĞƚĂͲĂŶĂůLJƐŝƐŽĨƚŚĞŵŽƐƚƌĞĐĞŶƚW't^;ΕϲϭLJĞĂƌƐͿ ǁŚĞƌĞƚŚĞWƌŝƐŬůŽĐŝǁĞƌĞďĂƐĞĚŽŶ͘ƐƐŽŵĞƌĂƌĞŐĞŶĞƟĐƌŝƐŬĨĂĐƚŽƌƐ;DJ-1, parkin and PINK1)1ĂƌĞƐƉĞĐŝĮĐĨŽƌLJŽƵŶŐŽŶƐĞƚW;zKWͿ͕ǁĞĂĐŬŶŽǁůĞĚŐĞƚŚĞƉƵƚĂƟǀĞĞdžŝƐƚĞŶĐĞŽĨ zKWͲƐƉĞĐŝĮĐĐŽŵŵŽŶŐĞŶĞƟĐƌŝƐŬĨĂĐƚŽƌƐǁŝƚŚŝŶƚŚĞt^ĚĂƚĂƐĞƚ͘,ŽǁĞǀĞƌ͕ƌŝƐŬĨĂĐƚŽƌƐ ƌĞůĂƚĞĚ ƚŽ ůĂƚĞ ŽŶƐĞƚ ƐƉŽƌĂĚŝĐ W ŵŝŐŚƚ ĂůƐŽ ƉůĂLJ Ă ƌŽůĞ ŝŶ zKW͘ W ƌŝƐŬ ůŽĐŝ͕ ƐƵĐŚ ĂƐ

(14)

3

ŝŶŇƵĞŶĐĞĚ ďLJ ƌĂƌĞ ǀĂƌŝĂŶƚƐ͕ ƉŽƐƐŝďůLJ ŝŶĐƌĞĂƐŝŶŐ ƚŚĞ ůŝŬĞůŝŚŽŽĚ ŽĨ ĚĞƚĞĐƟŶŐ ƌĂƌĞ ǀĂƌŝĂŶƚ ĂƐƐŽĐŝĂƟŽŶƐ͘

hƐŝŶŐŐĞŶĞͲƐĞƚĂƉƉƌŽĂĐŚŝŶƚŚĞt^ĚĂƚĂƐĞƚ͕ǁĞĚŝĚŶŽƚĚĞƚĞĐƚĂďƵƌĚĞŶŽĨƌĂƌĞ variants when comparing PD subjects to controls. However, it remains unclear whether ƚŚĞĂďƐĞŶĐĞŽĨĂƐƐŽĐŝĂƟŽŶŝƐŐĞŶƵŝŶĞŽƌĚƵĞƚŽŝŶƐƵĸĐŝĞŶƚƉŽǁĞƌŽƌŝŶĐŽƌƌĞĐƚƐĞůĞĐƟŽŶ ŽĨĐĂƵƐĂůŐĞŶĞƐ͘/ŶĐŽŶƚƌĂƐƚ͕ǁŝƚŚƚŚĞŐĞŶĞďĂƐĞĚĂƐƐŽĐŝĂƟŽŶƚĞƐƚĨŽƌƚŚĞŐĞŶĞƐƐĞůĞĐƚĞĚ ǁŝƚŚƚŚĞWƌŝdžĮdžĞƐƚƌĂƚĞŐLJǁĞŽďƐĞƌǀĞĚĂƌĂƌĞǀĂƌŝĂŶƚĂƐƐŽĐŝĂƟŽŶĨŽƌSTBD1, implying that rare variants in this gene could be a risk factor for PD. STBD1ŚĂƐŝƚƐĨƵŶĐƟŽŶŝŶůLJƐŽƐŽŵĂůͲ mediated autophagy that has been shown to contribute to PD pathogenesis.41'ĞŶĞƟĐ ƌĞƉůŝĐĂƟŽŶŽƌĨƵŶĐƟŽŶĂůǀĂůŝĚĂƟŽŶĨŽƌSTBD1 is required. tĞ ĚĞƚĞĐƚĞĚ ƐƚƌŽŶŐ ĂƐƐŽĐŝĂƟŽŶƐ ŽĨ ƌĂƌĞ ǀĂƌŝĂŶƚƐ ǁŝƚŚŝŶ ƚŚĞ ŐĞŶĞͲƐĞƚ ĨŽƌ ƚŚĞ EĞƵƌŽy ĚĂƚĂƐĞƚ͘ ,ŽǁĞǀĞƌ͕ ƐƵďƐĞƋƵĞŶƚ ĂŶĂůLJƐĞƐ ƐŚŽǁĞĚ ƚŚĂƚ ƚŚĞƐĞ ĂƐƐŽĐŝĂƟŽŶƐ ǁĞƌĞ dominated by LRRK2 ǀĂƌŝĂŶƚƐ͘ƐƐŽĐŝĂƟŽŶĂŶĂůLJƐŝƐŽŶǀĂƌŝĂŶƚůĞǀĞůƌĞǀĞĂůĞĚƚŚĂƚƚŚĞLRRK2 ŐĞŶĞƐŝŐŶĂůǁĂƐĚƌŝǀĞŶďLJƚŚĞŬŶŽǁŶƉ͘'ϮϬϭϵ^ǀĂƌŝĂŶƚ͘dŚŝƐŽďƐĞƌǀĂƟŽŶŚŝŐŚůŝŐŚƚƐƚŚĞ ŝŵƉŽƌƚĂŶĐĞĨŽƌĞǀĞƌLJĂŐŐƌĞŐĂƚĞĚǀĂƌŝĂŶƚƐƚƵĚLJƚŽĨƵƌƚŚĞƌŝŶǀĞƐƟŐĂƚĞǀĂƌŝĂŶƚĂŐŐƌĞŐĂƟŽŶ ĂƐƐŽĐŝĂƟŽŶ ƌĞƐƵůƚƐ ŽŶ ǀĂƌŝĂŶƚ ůĞǀĞů ƚŽ ĐŽƌƌĞĐƚůLJ ĚƌĂǁ ĐŽŶĐůƵƐŝŽŶƐ͘ Ɛ ƐŚŽǁŶ ĨŽƌ ƚŚĞ LRRK2 ĂƐƐŽĐŝĂƟŽŶĂŶĚĞǀĞŶƚŚĞƚŽƚĂůŐĞŶĞͲƐĞƚĂƐƐŽĐŝĂƟŽŶ͕ŝƚŝƐĚƌŝǀĞŶďLJŽŶůLJϭǀĂƌŝĂŶƚ͕

which also could have been detected with the performance of a simple single-marker ĂƐƐŽĐŝĂƟŽŶƚĞƐƚ͘ĞƐŝĚĞƐƚŚĞƉĂƚŚŽŐĞŶŝĐĂƐƐŽĐŝĂƟŽŶƐŝŐŶĂůŽĨƌĂƌĞǀĂƌŝĂŶƚƉ͘'ϮϬϭϵ^͕ǁĞ ŽďƐĞƌǀĞĚĂƐŝŐŶŝĮĐĂŶƚƉƌŽƚĞĐƟǀĞĞīĞĐƚŽĨĂƉƌĞǀŝŽƵƐůLJƉƵďůŝƐŚĞĚĐŽŵŵŽŶŚĂƉůŽƚLJƉĞ͘39 dŚŝƐ ŽďƐĞƌǀĂƟŽŶ ƐƵƉƉŽƌƚƐ ƚŚĞ ƚŚĞŽƌLJ ƚŚĂƚ ŽƚŚĞƌ ǀĂƌŝĂŶƚƐ ǁŝƚŚ ŽƉƉŽƐŝƚĞ ĞīĞĐƚƐ ĐŽƵůĚ ŝŶƚĞƌĂĐƚ ĂŶĚ ƉŽƚĞŶƟĂůůLJ ŝŶŇƵĞŶĐĞ ƚŚĞ ƉĞŶĞƚƌĂŶĐĞ ŽĨ ƉĂƚŚŽŐĞŶŝĐ LRRK2 variants, such ĂƐƉ͘'ϮϬϭϵ^͘ĞƐŝĚĞƐLRRK2, we furthermore detected a NeuroX-based burden of rare CADD variants for SPATA19 that increases PD risk (p = 0.017). Although the biological ĨƵŶĐƟŽŶŽĨSPATA19 ŝƐƉŽŽƌůLJƐƚƵĚŝĞĚ͕ƚŚĞ'ddžƉŽƌƚĂůĚŝƐƉůĂLJƐƐƉĞĐŝĮĐŚŝŐŚĞdžƉƌĞƐƐŝŽŶ ĨŽƌ ƚŚĞ ƚĞƐƟƐ͕ ĚŝŵŝŶŝƐŚŝŶŐ ƚŚĞ ůŝŬĞůŝŚŽŽĚ ƚŚĂƚ ĚĞĨĞĐƚƐ ŽĨ ƚŚŝƐ ŐĞŶĞ ǁŽƵůĚ ĐŽŶƚƌŝďƵƚĞ ƚŽ ŶĞƵƌŽĚĞŐĞŶĞƌĂƟŽŶ͘ LJƐĞůĞĐƟŶŐŐĞŶĞƐďĂƐĞĚŽŶďŝŽůŽŐŝĐĂůƐŝŵŝůĂƌŝƟĞƐǁĞĂŝŵĞĚƚŽŝĚĞŶƟĨLJƚŚĞƚƌƵĞ ĐĂƵƐĂůůŽĐŝ͘ƐǁĞĞdžƉĞĐƚƚŚĂƚŽŶůLJŽŶĞŐĞŶĞƉĞƌůŽĐƵƐŝƐƚŚĞƚƌƵĞĐĂƵƐĂůŐĞŶĞ͕ǁĞĚŝĚ ŶŽƚĚĞĮŶĞĂŐĞŶĞͲƐĞƚŝŶĐůƵĚŝŶŐĂůůƚŚĞŐĞŶĞƐƵŶĚĞƌŶĞĂƚŚƚŚĞ't^ůŽĐŝĂƐƐƵŵŝŶŐƚŚĞ ŽǀĞƌƌĞƉƌĞƐĞŶƚĂƟŽŶŽĨŶŽŶͲĐĂƵƐĂů ŐĞŶĞƐǁŽƵůĚ ĚŝůƵƚĞĂƉƵƚĂƟǀĞĂƐƐŽĐŝĂƟŽŶƐŝŐŶĂů͘ tĞ ĂĐŬŶŽǁůĞĚŐĞƚŚĂƚƚŚĞƵůƟŵĂƚĞƐƚƌĂƚĞŐLJƚŽƚĞƐƚƚŚĞĞīĞĐƚŽĨƌĂƌĞǀĂƌŝĂŶƚƐŝŶƚŚĞWůŽĐŝ ǁŽƵůĚďĞƚŽƐĞƋƵĞŶĐĞĂůůŐĞŶĞƐŝŶĂůĂƌŐĞĐŽŚŽƌƚ͕ĂŶĚƚĞƐƚƚŚĞĞīĞĐƚŽĨƌĂƌĞǀĂƌŝĂŶƚƐŝŶ ĞĂĐŚŐĞŶĞŝŶĚŝǀŝĚƵĂůůLJ͘&ƵƌƚŚĞƌŵŽƌĞ͕ƐĞƋƵĞŶĐŝŶŐƌĂƚŚĞƌƚŚĂŶŐĞŶŽƚLJƉŝŶŐǁŝůůĚĞĮŶĞŶŽǀĞů ƌĂƌĞǀĂƌŝĂŶƚƐĂŶĚĐŽŶƚƌŝďƵƚĞƚŽĐĂƚĂůŽŐƵŝŶŐƚŚĞŝŶŇƵĞŶĐĞŽĨƌĂƌĞǀĂƌŝĂŶƚƐƵŶĚĞƌŶĞĂƚŚƚŚĞ PD risk loci.

(15)
(16)

3

ƚŚĞ/ƚĂůŝĂŶDŝŶŝƐƚƌLJŽĨ,ĞĂůƚŚͬDŝŶŝƐƚƌLJŽĨĚƵĐĂƟŽŶ͕hŶŝǀĞƌƐŝƟĞƐĂŶĚZĞƐĞĂƌĐŚ the Israeli Ministry of Health

(17)

3

ϭ͘ ƌĂƐ :͕ 'ƵĞƌƌĞŝƌŽ Z͕ ,ĂƌĚLJ :͘ ^ŶĂƉ^ŚŽƚ͗

'ĞŶĞƟĐƐŽĨWĂƌŬŝŶƐŽŶ͛ƐĚŝƐĞĂƐĞ͘CellϮϬϭϱ͖ 160(3): 570-.e1.

Ϯ͘ EĂůůƐ D͕ WĂŶŬƌĂƚnj E͕ >ŝůů D͕ Ğƚ Ăů͘ >ĂƌŐĞͲ scale meta-analysis of genome-wide ĂƐƐŽĐŝĂƟŽŶĚĂƚĂŝĚĞŶƟĮĞƐƐŝdžŶĞǁƌŝƐŬůŽĐŝ for Parkinson’s disease. EĂƚƵƌĞ ŐĞŶĞƟĐƐ ϮϬϭϰ͖ϰϲ;ϵͿ͗ϵϴϵͲϵϯ͘ ϯ͘WŝŚůƐƚƌŽŵ>͕dŽŌD͘WĂƌŬŝŶƐŽŶ͛ƐĚŝƐĞĂƐĞ͗tŚĂƚ ƌĞŵĂŝŶƐ ŽĨ ƚŚĞ ͞ŵŝƐƐŝŶŐ ŚĞƌŝƚĂďŝůŝƚLJ͍͟ DŽǀĞŵĞŶƚ ĚŝƐŽƌĚĞƌƐ ͗ ŽĸĐŝĂů ũŽƵƌŶĂů ŽĨ ƚŚĞ DŽǀĞŵĞŶƚ ŝƐŽƌĚĞƌ ^ŽĐŝĞƚLJ ϮϬϭϭ͖ 26(11): 1971-3. ϰ͘<ĞůůĞƌD&͕^ĂĂĚD͕ƌĂƐ:͕ĞƚĂů͘hƐŝŶŐŐĞŶŽŵĞͲ ǁŝĚĞ ĐŽŵƉůĞdž ƚƌĂŝƚ ĂŶĂůLJƐŝƐ ƚŽ ƋƵĂŶƟĨLJ ͚ŵŝƐƐŝŶŐ ŚĞƌŝƚĂďŝůŝƚLJ͛ ŝŶ WĂƌŬŝŶƐŽŶ͛Ɛ disease. ,ƵŵĂŶŵŽůĞĐƵůĂƌŐĞŶĞƟĐƐϮϬϭϮ͖ 21(22): 4996-5009.

5. Do CB, Tung JY, Dorfman E, et al. Web-based ŐĞŶŽŵĞͲǁŝĚĞ ĂƐƐŽĐŝĂƟŽŶ ƐƚƵĚLJ ŝĚĞŶƟĮĞƐ ƚǁŽ ŶŽǀĞů ůŽĐŝ ĂŶĚ Ă ƐƵďƐƚĂŶƟĂů ŐĞŶĞƟĐ component for Parkinson’s disease. PLoS ŐĞŶĞƟĐƐϮϬϭϭ͖ϳ;ϲͿ͗ĞϭϬϬϮϭϰϭ͘

ϲ͘DĂŶŽůŝŽd͕ŽůůŝŶƐ&^͕ŽdžE:͕ĞƚĂů͘&ŝŶĚŝŶŐ ƚŚĞ ŵŝƐƐŝŶŐ ŚĞƌŝƚĂďŝůŝƚLJ ŽĨ ĐŽŵƉůĞdž diseases. NatureϮϬϬϵ͖ϰϲϭ;ϳϮϲϱͿ͗ϳϰϳͲϱϯ͘ ϳ͘ ƵŬ K͕ ^ĐŚĂīŶĞƌ ^&͕ ^ĂŵŽĐŚĂ <͕ Ğƚ Ăů͘

Searching for missing heritability: ĚĞƐŝŐŶŝŶŐƌĂƌĞǀĂƌŝĂŶƚĂƐƐŽĐŝĂƟŽŶƐƚƵĚŝĞƐ͘ WƌŽĐĞĞĚŝŶŐƐ ŽĨ ƚŚĞ EĂƟŽŶĂů ĐĂĚĞŵLJ ŽĨ ^ĐŝĞŶĐĞƐ ŽĨ ƚŚĞ hŶŝƚĞĚ ^ƚĂƚĞƐ ŽĨ ŵĞƌŝĐĂ ϮϬϭϰ͖ϭϭϭ;ϰͿ͗ϰϱϱͲϲϰ͘ ϴ͘ EĞůƐŽŶ DZ͕ tĞŐŵĂŶŶ ͕ Śŵ D'͕ Ğƚ Ăů͘ ŶĂďƵŶĚĂŶĐĞŽĨƌĂƌĞĨƵŶĐƟŽŶĂů ǀĂƌŝĂŶƚƐ in 202 drug target genes sequenced in 14,002 people. ^ĐŝĞŶĐĞ ;EĞǁ zŽƌŬ͕ EzͿ ϮϬϭϮ͖ϯϯϳ;ϲϬϵϬͿ͗ϭϬϬͲϰ͘ ϵ͘ dĞŶŶĞƐƐĞŶ :͕ ŝŐŚĂŵ t͕ K͛ŽŶŶŽƌ d͕ Ğƚ Ăů͘ǀŽůƵƟŽŶĂŶĚĨƵŶĐƟŽŶĂůŝŵƉĂĐƚŽĨƌĂƌĞ ĐŽĚŝŶŐǀĂƌŝĂƟŽŶĨƌŽŵĚĞĞƉƐĞƋƵĞŶĐŝŶŐŽĨ ŚƵŵĂŶ ĞdžŽŵĞƐ͘ ^ĐŝĞŶĐĞ ;EĞǁ zŽƌŬ͕ EzͿ ϮϬϭϮ͖ϯϯϳ;ϲϬϵϬͿ͗ϲϰͲϵ͘

10. Lander ES. The new genomics: global views of biology. ^ĐŝĞŶĐĞ ;EĞǁ zŽƌŬ͕ EzͿ ϭϵϵϲ͖ 274(5287): 536-9.

ϭϭ͘WƌŝƚĐŚĂƌĚ:<͕ŽdžE:͘dŚĞĂůůĞůŝĐĂƌĐŚŝƚĞĐƚƵƌĞ of human disease genes: common ĚŝƐĞĂƐĞͲĐŽŵŵŽŶǀĂƌŝĂŶƚ͘͘͘ŽƌŶŽƚ͍Human ŵŽůĞĐƵůĂƌŐĞŶĞƟĐƐϮϬϬϮ͖ϭϭ;ϮϬͿ͗ϮϰϭϳͲϮϯ͘ ϭϮ͘ŽƚƐƚĞŝŶ͕ZŝƐĐŚE͘ŝƐĐŽǀĞƌŝŶŐŐĞŶŽƚLJƉĞƐ

underlying human phenotypes: past successes for mendelian disease, future ĂƉƉƌŽĂĐŚĞƐ ĨŽƌ ĐŽŵƉůĞdž ĚŝƐĞĂƐĞ͘ Nature ŐĞŶĞƟĐƐϮϬϬϯ͖ϯϯ^ƵƉƉů͗ϮϮϴͲϯϳ͘

ϭϯ͘ ^ŚĂƌŵĂ D͕ <ƌƵŐĞƌ Z͕ 'ĂƐƐĞƌ d͘ &ƌŽŵ ŐĞŶŽŵĞͲǁŝĚĞĂƐƐŽĐŝĂƟŽŶƐƚƵĚŝĞƐƚŽŶĞdžƚͲ ŐĞŶĞƌĂƟŽŶ ƐĞƋƵĞŶĐŝŶŐ͗ ůĞƐƐŽŶƐ ĨƌŽŵ ƚŚĞ past and planning for the future. JAMA neurologyϮϬϭϰ͖ϳϭ;ϭͿ͗ϱͲϲ͘

ϭϰ͘ <ŝĞnjƵŶ ͕ 'ĂƌŝŵĞůůĂ <͕ Ž Z͕ Ğƚ Ăů͘ džŽŵĞ ƐĞƋƵĞŶĐŝŶŐ ĂŶĚ ƚŚĞ ŐĞŶĞƟĐ ďĂƐŝƐ ŽĨ ĐŽŵƉůĞdž ƚƌĂŝƚƐ͘ EĂƚƵƌĞ ŐĞŶĞƟĐƐ ϮϬϭϮ͖ 44(6): 623-30.

15. Bamshad MJ, Ng SB, Bigham AW, et al. džŽŵĞƐĞƋƵĞŶĐŝŶŐĂƐĂƚŽŽůĨŽƌDĞŶĚĞůŝĂŶ disease gene discovery. Nature reviews 'ĞŶĞƟĐƐϮϬϭϭ͖ϭϮ;ϭϭͿ͗ϳϰϱͲϱϱ͘

ϭϲ͘ WƵƌĐĞůů ^D͕ DŽƌĂŶ :>͕ &ƌŽŵĞƌ D͕ Ğƚ Ăů͘  ƉŽůLJŐĞŶŝĐ ďƵƌĚĞŶ ŽĨ ƌĂƌĞ ĚŝƐƌƵƉƟǀĞ ŵƵƚĂƟŽŶƐŝŶƐĐŚŝnjŽƉŚƌĞŶŝĂ͘NatureϮϬϭϰ͖ 506(7487): 185-90.

17. Cirulli ET, Lasseigne BN, Petrovski S, et Ăů͘ džŽŵĞ ƐĞƋƵĞŶĐŝŶŐ ŝŶ ĂŵLJŽƚƌŽƉŚŝĐ ůĂƚĞƌĂů ƐĐůĞƌŽƐŝƐ ŝĚĞŶƟĮĞƐ ƌŝƐŬ ŐĞŶĞƐ ĂŶĚ pathways. ^ĐŝĞŶĐĞ ;EĞǁ zŽƌŬ͕ EzͿ ϮϬϭϱ͖ 347(6229): 1436-41.

18. Polymeropoulos MH, Lavedan C, Leroy E, et Ăů͘ DƵƚĂƟŽŶ ŝŶ ƚŚĞ ĂůƉŚĂͲƐLJŶƵĐůĞŝŶ ŐĞŶĞ ŝĚĞŶƟĮĞĚ ŝŶ ĨĂŵŝůŝĞƐ ǁŝƚŚ WĂƌŬŝŶƐŽŶ͛Ɛ disease. ^ĐŝĞŶĐĞ ;EĞǁ zŽƌŬ͕ EzͿ ϭϵϵϳ͖ 276(5321): 2045-7.

ϭϵ͘WĂŝƐĂŶͲZƵŝnj͕:ĂŝŶ^͕ǀĂŶƐt͕ĞƚĂů͘ůŽŶŝŶŐ

(18)

3

ŽĨ ƚŚĞ ŐĞŶĞ ĐŽŶƚĂŝŶŝŶŐ ŵƵƚĂƟŽŶƐ ƚŚĂƚ ĐĂƵƐĞ WZ<ϴͲůŝŶŬĞĚ WĂƌŬŝŶƐŽŶ͛Ɛ ĚŝƐĞĂƐĞ͘ NeuronϮϬϬϰ͖ϰϰ;ϰͿ͗ϱϵϱͲϲϬϬ͘ ϮϬ͘ ŝŵƉƌŝĐŚ ͕ ŝƐŬƵƉ ^͕ >ĞŝƚŶĞƌ W͕ Ğƚ Ăů͘ DƵƚĂƟŽŶƐ ŝŶ >ZZ<Ϯ ĐĂƵƐĞ ĂƵƚŽƐŽŵĂůͲ dominant parkinsonism with pleomorphic pathology. NeuronϮϬϬϰ͖ϰϰ;ϰͿ͗ϲϬϭͲϳ͘ Ϯϭ͘^ŝŵŽŶͲ^ĂŶĐŚĞnj:͕^ĐŚƵůƚĞ͕ƌĂƐ:D͕ĞƚĂů͘ 'ĞŶŽŵĞͲǁŝĚĞ ĂƐƐŽĐŝĂƟŽŶ ƐƚƵĚLJ ƌĞǀĞĂůƐ ŐĞŶĞƟĐƌŝƐŬƵŶĚĞƌůLJŝŶŐWĂƌŬŝŶƐŽŶ͛ƐĚŝƐĞĂƐĞ͘ EĂƚƵƌĞŐĞŶĞƟĐƐϮϬϬϵ͖ϰϭ;ϭϮͿ͗ϭϯϬϴͲϭϮ͘ ϮϮ͘ ĚǁĂƌĚƐ d>͕ ^ĐŽƩ t<͕ ůŵŽŶƚĞ ͕ Ğƚ Ăů͘ 'ĞŶŽŵĞͲǁŝĚĞ ĂƐƐŽĐŝĂƟŽŶ ƐƚƵĚLJ ĐŽŶĮƌŵƐ SNPs in SNCA and the MAPT region as common risk factors for Parkinson disease. ŶŶĂůƐŽĨŚƵŵĂŶŐĞŶĞƟĐƐϮϬϭϬ͖ϳϰ;ϮͿ͗ϵϳͲ 109.

Ϯϯ͘ EĂůůƐ D͕ WůĂŐŶŽů s͕ ,ĞƌŶĂŶĚĞnj '͕ Ğƚ Ăů͘ /ŵƉƵƚĂƟŽŶ ŽĨ ƐĞƋƵĞŶĐĞ ǀĂƌŝĂŶƚƐ ĨŽƌ ŝĚĞŶƟĮĐĂƟŽŶ ŽĨ ŐĞŶĞƟĐ ƌŝƐŬƐ ĨŽƌ Parkinson’s disease: a meta-analysis of ŐĞŶŽŵĞͲǁŝĚĞ ĂƐƐŽĐŝĂƟŽŶ ƐƚƵĚŝĞƐ͘ Lancet ϮϬϭϭ͖ϯϳϳ;ϵϳϲϲͿ͗ϲϰϭͲϵ͘ Ϯϰ͘'ŽŬĞƌͲůƉĂŶK͕^ĐŚŝīŵĂŶŶZ͕>ĂDĂƌĐĂD͕ EƵƐƐďĂƵŵZ>͕DĐ/ŶĞƌŶĞLJͲ>ĞŽ͕^ŝĚƌĂŶƐŬLJ ͘ WĂƌŬŝŶƐŽŶŝƐŵ ĂŵŽŶŐ 'ĂƵĐŚĞƌ ĚŝƐĞĂƐĞ carriers. :ŽƵƌŶĂůŽĨŵĞĚŝĐĂůŐĞŶĞƟĐƐϮϬϬϰ͖ 41(12): 937-40. Ϯϱ͘WĂŶŬƌĂƚnjE͕ĞĞĐŚĂŵ't͕Ğ^ƚĞĨĂŶŽ>͕Ğƚ al. Meta-analysis of Parkinson’s disease: ŝĚĞŶƟĮĐĂƟŽŶŽĨĂŶŽǀĞůůŽĐƵƐ͕Z/dϮ͘Annals of neurologyϮϬϭϮ͖ϳϭ;ϯͿ͗ϯϳϬͲϴϰ͘

Ϯϲ͘ ,ƵŐŚĞƐ :͕ ĂŶŝĞů ^͕ <ŝůĨŽƌĚ >͕ >ĞĞƐ :͘ Accuracy of clinical diagnosis of idiopathic Parkinson’s disease: a clinico-pathological study of 100 cases. Journal of neurology, ŶĞƵƌŽƐƵƌŐĞƌLJ͕ĂŶĚƉƐLJĐŚŝĂƚƌLJϭϵϵϮ͖ϱϱ;ϯͿ͗ 181-4.

Ϯϳ͘,ŽĨŵĂŶ͕ƌƵƐƐĞůůĞ''͕ĂƌǁŝƐŚDƵƌĂĚ^͕Ğƚ Ăů͘dŚĞZŽƩĞƌĚĂŵ^ƚƵĚLJ͗ϮϬϭϲŽďũĞĐƟǀĞƐ and design update. ƵƌŽƉĞĂŶ ũŽƵƌŶĂů ŽĨ ĞƉŝĚĞŵŝŽůŽŐLJϮϬϭϱ͖ϯϬ;ϴͿ͗ϲϲϭͲϳϬϴ͘ Ϯϴ͘ >ŝ ,͕ ƵƌďŝŶ Z͘ &ĂƐƚ ĂŶĚ ĂĐĐƵƌĂƚĞ ƐŚŽƌƚ

read alignment with Burrows-Wheeler transform. ŝŽŝŶĨŽƌŵĂƟĐƐ ;KdžĨŽƌĚ͕ ŶŐůĂŶĚͿϮϬϬϵ͖Ϯϱ;ϭϰͿ͗ϭϳϱϰͲϲϬ͘

Ϯϵ͘ DĐ<ĞŶŶĂ ͕ ,ĂŶŶĂ D͕ ĂŶŬƐ ͕ Ğƚ Ăů͘ dŚĞ 'ĞŶŽŵĞ ŶĂůLJƐŝƐ dŽŽůŬŝƚ͗ Ă DĂƉZĞĚƵĐĞ ĨƌĂŵĞǁŽƌŬ ĨŽƌ ĂŶĂůLJnjŝŶŐ ŶĞdžƚͲŐĞŶĞƌĂƟŽŶ DNA sequencing data. Genome research ϮϬϭϬ͖ϮϬ;ϵͿ͗ϭϮϵϳͲϯϬϯ͘

ϯϬ͘EĂůůƐD͕ƌĂƐ:͕,ĞƌŶĂŶĚĞnj'͕ĞƚĂů͘EĞƵƌŽy͕ Ă ĨĂƐƚ ĂŶĚ ĞĸĐŝĞŶƚ ŐĞŶŽƚLJƉŝŶŐ ƉůĂƞŽƌŵ ĨŽƌ ŝŶǀĞƐƟŐĂƟŽŶ ŽĨ ŶĞƵƌŽĚĞŐĞŶĞƌĂƟǀĞ diseases. Neurobiology of aging 2014. ϯϭ͘ >ŝůů D͕ ZŽĞŚƌ :d͕ DĐYƵĞĞŶ D͕ Ğƚ Ăů͘

Comprehensive research synopsis and ƐLJƐƚĞŵĂƟĐ ŵĞƚĂͲĂŶĂůLJƐĞƐ ŝŶ WĂƌŬŝŶƐŽŶ͛Ɛ ĚŝƐĞĂƐĞ ŐĞŶĞƟĐƐ͗ dŚĞ W'ĞŶĞ ĚĂƚĂďĂƐĞ͘ W>Ž^ŐĞŶĞƟĐƐϮϬϭϮ͖ϴ;ϯͿ͗ĞϭϬϬϮϱϰϴ͘ ϯϮ͘ tĂŶŐ <͕ >ŝ D͕ ,ĂŬŽŶĂƌƐŽŶ ,͘ EEKsZ͗

ĨƵŶĐƟŽŶĂů ĂŶŶŽƚĂƟŽŶ ŽĨ ŐĞŶĞƟĐ ǀĂƌŝĂŶƚƐ from high-throughput sequencing data. EƵĐůĞŝĐĂĐŝĚƐƌĞƐĞĂƌĐŚϮϬϭϬ͖ϯϴ;ϭϲͿ͗Ğϭϲϰ͘ ϯϯ͘ ŵĞŶĚŽůĂ >D͕ ŽƌƐĐŚŶĞƌ DK͕ ZŽďĞƌƚƐŽŶ W͕ Ğƚ Ăů͘ ĐƟŽŶĂďůĞ ĞdžŽŵŝĐ ŝŶĐŝĚĞŶƚĂů ĮŶĚŝŶŐƐ ŝŶ ϲϱϬϯ ƉĂƌƟĐŝƉĂŶƚƐ͗ ĐŚĂůůĞŶŐĞƐ ŽĨǀĂƌŝĂŶƚĐůĂƐƐŝĮĐĂƟŽŶ͘Genome research ϮϬϭϱ͖Ϯϱ;ϯͿ͗ϯϬϱͲϭϱ͘ ϯϰ͘ <ŝƌĐŚĞƌ D͕ tŝƩĞŶ D͕ :ĂŝŶ W͕ K͛ZŽĂŬ :͕ ŽŽƉĞƌ 'D͘  ŐĞŶĞƌĂů ĨƌĂŵĞǁŽƌŬ ĨŽƌ ĞƐƟŵĂƟŶŐ ƚŚĞ ƌĞůĂƟǀĞ ƉĂƚŚŽŐĞŶŝĐŝƚLJ ŽĨ ŚƵŵĂŶŐĞŶĞƟĐǀĂƌŝĂŶƚƐ͘ϮϬϭϰ͖ϰϲ;ϯͿ͗ϯϭϬͲ 5.

(19)

3

ĂŶĚ ƉŽƉƵůĂƟŽŶͲďĂƐĞĚ ůŝŶŬĂŐĞ ĂŶĂůLJƐĞƐ͘

ŵĞƌŝĐĂŶũŽƵƌŶĂůŽĨŚƵŵĂŶŐĞŶĞƟĐƐϮϬϬϳ͖ 81(3): 559-75.

(20)

3

^hWW>DEd>d

dĂďůĞϭ͘WES capture protocols

cases ĐŽŶƚƌŽůƐ /W' /W' RSX1 Nimblegenv2 252 37 1201 Truseq 912 446 0 DŝdžĞĚ 3 1 0 Total 1167 484 1201 DŝdžĞĚсƐĂŵƉůĞƐƚŚĂƚŚĂǀĞďĞĞŶĐĂƉƚƵƌĞĚƵƐŝŶŐƚŚĞϮĚŝƐƟŶĐƚĐĂƉƚƵƌĞŬŝƚƐ͘ dĂďůĞϮ͘džĐůƵƐŝŽŶŽĨĞdžŽŶƐďĂƐĞĚŽŶĐĂƉƚƵƌĞŝŶĐŽŶƐŝƐƚĞŶĐŝĞƐ͘

gene exon Source

PD meta ASH1L 21 Truseq

DLG2 1+2 Nimblegenv2

TMEM229B 1+2 Nimblegenv2

TMEM175 1 Nimblegenv2

dĂďůĞϯ͘WĂƌĂŵĞƚĞƌƐĨŽƌƉŽǁĞƌĐĂůĐƵůĂƟŽŶƐ͘

Arguments WES NeuroX

(21)
(22)
(23)

Referenties

GERELATEERDE DOCUMENTEN

hsͲǀŝƐ ĞdžƉĞƌŝŵĞŶƚƐ ŽĨ ĐŽŵƉůĞdžĞƐ ϭ ĂŶĚ ϯ ŝŶ ĂƋƵĞŽƵƐ ƐŽůƵƚŝŽŶ ƐŚŽǁ ƚŚĂƚ

ϭϵϳͮ Summary | ^ƵŵŵĂƌLJ /Ŷ ƚŚĞ ůĂƐƚ ĐŽƵƉůĞ ŽĨ ĚĞĐĂĚĞƐ͕ ƚŚĞƌĞ ŚĂƐ ďĞĞŶ ŵŽƌĞ ĂƩĞŶƟŽŶ ĨŽƌ ƚŚĞ ƌĞůĂƟŽŶ ďĞƚǁĞĞŶ ŚĞĂůƚŚ ƐƚĂƚƵƐ ĂŶĚ

ϯϯ ǁĞƌĞƌĞƐĞƌǀĞĚĨŽƌƌĞƉƌĞƐĞŶƚĂƟǀĞƐŽĨetnis  minoritas

ϭϬϮ

ϭϳϱ

ϮϯϬ

ƚŚĂƚ ŽĨ ŵŝĐƌŽZEƐ ĨƌŽŵ ƚŚĞ ϭϰƋϯϮ ĐůƵƐƚĞƌ͕ ďLJ ƐƚƵĚLJŝŶŐ ƚŚĞŝƌ ŝŶǀŽůǀĞŵĞŶƚ ŝŶ ĂƌƚĞƌŝŽŐĞŶĞƐŝƐ

ŵŽĚĞů ĨŽƌ ƚŚĞ ĚĞǀĞůŽƉŵĞŶƚ ŽĨ ĐĞůŝĂĐ ĚŝƐĞĂƐĞ͕ ǁŚŝĐŚ ŝƐ ĚĞƐĐƌŝďĞĚ ŝŶ ĐŚĂƉƚĞƌ ϭ͘ /ƚ