• No results found

University of Groningen Aneuploidy in the human brain and cancer van den Bos, Hilda

N/A
N/A
Protected

Academic year: 2021

Share "University of Groningen Aneuploidy in the human brain and cancer van den Bos, Hilda"

Copied!
21
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

Aneuploidy in the human brain and cancer

van den Bos, Hilda

IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document version below.

Document Version

Publisher's PDF, also known as Version of record

Publication date: 2017

Link to publication in University of Groningen/UMCG research database

Citation for published version (APA):

van den Bos, H. (2017). Aneuploidy in the human brain and cancer: Studying heterogeneity using single-cell sequencing. University of Groningen.

Copyright

Other than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons).

Take-down policy

If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.

Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons the number of authors shown on this cover page is limited to 10 maximum.

(2)

Chapter 5

Copy number alterations assessed at the single-cell

level revealed mono- and polyclonal seeding

patterns of distant metastases in a small cell lung

cancer patient

Hilda van den Bos*, Paranita Ferronika*, Aaron Taudt, Diana C.J. Spierings, Ali Saber, Thijo J.N. Hiltermann, Klaas Kok, David Porubsky, Anthonie J. van der Wekken, Wim Timens, Floris Foijer, Maria Colomé-Tatché, Harry J.M. Groen, Peter M. Lansdorp, and Anke van den Berg

*These authors contributed equally to the paper Adapted from Annals of oncology (2017) April 11

(3)

Abstract

Background: Intra-tumour heterogeneity is a common feature of many cancers and can facilitate tumour evolution. The aim of this study was to assess heterogeneity in genomic copy number alterations (CNAs) at the single-cell level in lung tumour samples using low coverage single-cell whole genome sequencing (scWGS).

Patient and methods: We used 586 nuclei from three metastases and two primary tumour samples from a small cell lung cancer (SCLC) patient to generate scWGS libraries. Good quality scWGS libraries (54-82 cells/sample) and CNAs were identified using AneuFinder.

Results: CNA plots of merged scWGS data for all five tumour samples revealed patterns very similar to those seen in array CGH-based plots of DNA derived from the total tumour. Analysis of the scWGS data revealed a high degree of intra-tumour CNA heterogeneity among single cells from the primary tumour, lymph node metastasis and adrenal gland metastasis, but a much lower degree of heterogeneity and a distinct CNA pattern in the liver metastasis cells. A CNA pattern identical to that of the merged liver metastasis was observed in two out of 74 cells from one of the two primary tumour samples and in five out of 72 cells from the adrenal metastasis. No cells with the CNA pattern observed in the liver metastasis were found in the other primary tumour sample or in the lymph node metastasis.

Conclusions: A high degree of CNA heterogeneity was observed among cells from five distinct tumour locations in a SCLC patient. Strikingly, a minority of tumour cells from one of the primary tumour samples and from the adrenal metastasis showed the dominant CNA pattern observed in liver metastasis cells. Our data suggests polyclonal tumour-cell seeding occurred in lymph node and adrenal metastases and monoclonal seeding occurred in the liver metastasis.

Key message

Heterogeneity in both primary tumour and metastasis is a prominent factor in tumour evolution. Using single-cell whole genome sequencing of multiple tumour samples from a small cell lung cancer patient, we found heterogeneity at the copy-number-alteration level that indicated mono- and polyclonal seeding of metastasis-specific subclones. These metastasis-specific subclones were already present in the primary tumour.

Introduction

Lung cancer is the main cause of cancer-related deaths in the world. The estimated number of new cases in the US in 2016 was 224,390 with approximately 158,080 deaths 1. Small-cell lung carcinoma (SCLC) represents approximately 12% of all lung cancer cases. Patients with SCLC have a very poor prognosis, with a 5-year survival of 20-25% for limited stage disease and a 2-year survival of less than 10% for extensive stage disease 2. The most striking clinical features of SCLC is the rapid development of metastasis and an initially good tumour response to chemotherapy that is often quickly followed by an early tumour relapse.

Phenotypic intra-tumour heterogeneity (ITH) has been reported in many tumour types in multiple studies as reviewed by Navin and Hicks 3. In addition to phenotypic ITH, studies using next generation sequencing techniques support a marked degree of genomic ITH 4–7. Occurrence of ITH implies the presence of different subclonal populations of cancer cells 8. Using a multi-region approach, it has been shown that subclones of cancer cells in the primary tumour can seed distant metastasis 5,9, and the metastases appeared to have a more homogeneous profile of somatic mutations and copy number alterations (CNAs) in comparison to the highly heterogeneous primary tumour 5.

Single-cell sequencing approaches allow analysis of ITH at the level of CNAs. In SCLC patients, low-coverage whole-genome sequencing (WGS) of single circulating tumour cells revealed different copy number alterations that partly resembled the aberrations in the primary tumour or metastasis 10. Single-cell sequencing studies in primary tumour and metastasis samples of SCLC patients have not been reported previously.

In this study, we used a recently developed single-cell sequencing platform 11,12 to quantify tumour heterogeneity by measuring CNAs, which are considered as events underlying oncogenic rearrangements 13. What we find is a strikingly high number of CNA heterogeneities in primary and metastatic SCLC samples, but a low degree of CNA heterogeneity in the liver metastasis.

Materials and Methods Patient

The patient was a heavily smoking 79-year-old woman without a history of malignancy who had been diagnosed with chronic obstructive pulmonary disease and diabetes mellitus more than thirty years prior. She was admitted to the hospital with superior vena cava syndrome and was found to have an enlarged right-sided adrenal gland due to metastasized small-cell lung cancer. She received one course of carboplatin and etoposide immediately, but died suddenly in the hospital eight days later due to heart failure. FDG-PET/CT showed a large mediastinal tumour in the upper lobe of the right lung with mediastinal involvement and compression of the upper caval vein, multiple liver metastases, metastases in the right adrenal gland, and bone metastases at the 6th rib, sacral bone, and 7th thoracic and 2nd lumbal vertebra (Figure 1). No brain metastases were detected on MRI. At autopsy, two fresh tissue samples were taken from different areas of the 7-cm-sized primary tumour and one

(4)

5

Abstract

Background: Intra-tumour heterogeneity is a common feature of many cancers and can facilitate tumour evolution. The aim of this study was to assess heterogeneity in genomic copy number alterations (CNAs) at the single-cell level in lung tumour samples using low coverage single-cell whole genome sequencing (scWGS).

Patient and methods: We used 586 nuclei from three metastases and two primary tumour samples from a small cell lung cancer (SCLC) patient to generate scWGS libraries. Good quality scWGS libraries (54-82 cells/sample) and CNAs were identified using AneuFinder.

Results: CNA plots of merged scWGS data for all five tumour samples revealed patterns very similar to those seen in array CGH-based plots of DNA derived from the total tumour. Analysis of the scWGS data revealed a high degree of intra-tumour CNA heterogeneity among single cells from the primary tumour, lymph node metastasis and adrenal gland metastasis, but a much lower degree of heterogeneity and a distinct CNA pattern in the liver metastasis cells. A CNA pattern identical to that of the merged liver metastasis was observed in two out of 74 cells from one of the two primary tumour samples and in five out of 72 cells from the adrenal metastasis. No cells with the CNA pattern observed in the liver metastasis were found in the other primary tumour sample or in the lymph node metastasis.

Conclusions: A high degree of CNA heterogeneity was observed among cells from five distinct tumour locations in a SCLC patient. Strikingly, a minority of tumour cells from one of the primary tumour samples and from the adrenal metastasis showed the dominant CNA pattern observed in liver metastasis cells. Our data suggests polyclonal tumour-cell seeding occurred in lymph node and adrenal metastases and monoclonal seeding occurred in the liver metastasis.

Key message

Heterogeneity in both primary tumour and metastasis is a prominent factor in tumour evolution. Using single-cell whole genome sequencing of multiple tumour samples from a small cell lung cancer patient, we found heterogeneity at the copy-number-alteration level that indicated mono- and polyclonal seeding of metastasis-specific subclones. These metastasis-specific subclones were already present in the primary tumour.

Introduction

Lung cancer is the main cause of cancer-related deaths in the world. The estimated number of new cases in the US in 2016 was 224,390 with approximately 158,080 deaths 1. Small-cell lung carcinoma (SCLC) represents approximately 12% of all lung cancer cases. Patients with SCLC have a very poor prognosis, with a 5-year survival of 20-25% for limited stage disease and a 2-year survival of less than 10% for extensive stage disease 2. The most striking clinical features of SCLC is the rapid development of metastasis and an initially good tumour response to chemotherapy that is often quickly followed by an early tumour relapse.

Phenotypic intra-tumour heterogeneity (ITH) has been reported in many tumour types in multiple studies as reviewed by Navin and Hicks 3. In addition to phenotypic ITH, studies using next generation sequencing techniques support a marked degree of genomic ITH 4–7. Occurrence of ITH implies the presence of different subclonal populations of cancer cells 8. Using a multi-region approach, it has been shown that subclones of cancer cells in the primary tumour can seed distant metastasis 5,9, and the metastases appeared to have a more homogeneous profile of somatic mutations and copy number alterations (CNAs) in comparison to the highly heterogeneous primary tumour 5.

Single-cell sequencing approaches allow analysis of ITH at the level of CNAs. In SCLC patients, low-coverage whole-genome sequencing (WGS) of single circulating tumour cells revealed different copy number alterations that partly resembled the aberrations in the primary tumour or metastasis 10. Single-cell sequencing studies in primary tumour and metastasis samples of SCLC patients have not been reported previously.

In this study, we used a recently developed single-cell sequencing platform 11,12 to quantify tumour heterogeneity by measuring CNAs, which are considered as events underlying oncogenic rearrangements 13. What we find is a strikingly high number of CNA heterogeneities in primary and metastatic SCLC samples, but a low degree of CNA heterogeneity in the liver metastasis.

Materials and Methods Patient

The patient was a heavily smoking 79-year-old woman without a history of malignancy who had been diagnosed with chronic obstructive pulmonary disease and diabetes mellitus more than thirty years prior. She was admitted to the hospital with superior vena cava syndrome and was found to have an enlarged right-sided adrenal gland due to metastasized small-cell lung cancer. She received one course of carboplatin and etoposide immediately, but died suddenly in the hospital eight days later due to heart failure. FDG-PET/CT showed a large mediastinal tumour in the upper lobe of the right lung with mediastinal involvement and compression of the upper caval vein, multiple liver metastases, metastases in the right adrenal gland, and bone metastases at the 6th rib, sacral bone, and 7th thoracic and 2nd lumbal vertebra (Figure 1). No brain metastases were detected on MRI. At autopsy, two fresh tissue samples were taken from different areas of the 7-cm-sized primary tumour and one

(5)

sample from multiple other metastatic lesions. All samples were frozen at -80°C. Haematoxylin and eosin staining was performed for each tumour sample to confirm diagnosis and to determine tumour cell percentage.

Figure 1. Presentation of tumour in different organs as detected by FDG-PET/CT (A-C); A) primary

tumour in right upper lobe of lung (circle), which consists of primary tumour 1 (*) and primary tumour 2 (**), mediastinal lymph node (arrow), B) metastasis in liver (circle), and C) metastasis in right adrenal gland (circle). (D) Schematic representation of the distribution pattern of tumour region specific cells calculated based on Pearson’s correlation coefficient. Colours indicate different tumour regions.

Array-based comparative genomic hybridization

Genomic DNA was isolated using standard laboratory procedures. 500ng tumour DNA and a mixed DNA pool of 15 normal subjects (from peripheral mononuclear blood cells) were labelled with Cy5 and Cy3, respectively, according to the Complete Genomic SureTag DNA Enzymatic Labelling Kit protocol (Agilent Technologies, Santa Clara, USA). The program setting for incubation for denaturation was 3 min at 95°C followed by 5 min at 4°C. DNA labelling was for 2h at 37°C and 10 min at 65°C. Hybridization was carried out using custom-designed Agilent 4x180K Human Genome CGH microarray slides (Agilent ID 027730) following the protocol OligoaCGH/ChIP-on-Chip Hybridization kit (Agilent Technologies). The program setting for incubation for blocking unspecific binding and removing repetitive sequences was 3 min at 95°C then 30 min at 37°C. Hybridization was for 22h at 65°C, 20 RPM. Slides were scanned using the G25052C DNA microarray scanner (Agilent Technologies). The data were analysed in Nexus 7.5, Biodiscovery using FE Agilent data type (Biodiscovery, California, USA).

Single cell whole genome sequencing, CNA, heterogeneity and aneuploidy scores

Three 30μm thick sections of fresh-frozen tissues were incubated in nuclear isolation medium [10mM Tris-Cl (pH8), 320mM sucrose, 5mM CaCl2, 3mM Mg(Ac)2, 0.1mM EDTA, 1mM

dithiothreitol (DTT) and 0.1% Triton X-100]. Nuclei were gently pushed out of the tissue pieces through a 70µm filter using a syringe plunger. After centrifugation, nuclei were re-suspended in PBS containing 2% BSA and the DNA-binding dye DAPI (10 μg/ml) for assessment of DNA content. Single G1 phase nuclei were sorted based on low DAPI fluorescence using a MoFloAstrios Cell Sorter (Beckman Coulter) into 96-well skirted PCR plates containing 5μl freeze medium [50% PBS, 7.5% DMSO and 42.5% 2X Pro-Freeze CDM Freeze Medium (Lonza)]. Wells with 10 nuclei were sorted as positive control and empty wells served as negative controls.

Library preparation was performed as described by van den Bos et al. (2016) 11. For sequencing, clusters were generated on the cBot (HiSeq2500) and single-end 50bp reads were generated using the HiSeq2500 sequencing platform (Illumina, San Diego, USA). Raw sequencing data were demultiplexed based on library-specific barcodes that were incorporated during library preparation. Demultiplexed fastq files were aligned to the human reference genome assembly (GRCh37) using Bowtie2 (version 2.2.4) 14 with default settings. The resulting BAM files were sorted using Samtools 15 and duplicate reads were marked using BamUtil (version 1.0.3).

CNAs were determined CNAs were determined of 586 tumour nuclei using AneuFinder 16 with the following settings. Duplicate reads and low-quality alignments (MAPQ<10) were discarded, read counts in 2Mb variable-width bins were GC-corrected, and CNA state was determined with a 10-state Hidden Markov Model (HMM) with copy-number states: zero-inflation, null-, mono-, di-, tri-, tetra-, penta-, hexa-, septa- and octasomy. Using the Aneufinder function, ClusterByQuality, the resulting single-cell libraries were clustered based on several quality criteria: bin-to-bin variation in read density (spikiness), entropy, number of ploidy state segments, Bhattacharyya distance and the log-likelihood of the fit as described by Bakker et al. 16. The highest quality cluster from each sample was then used in subsequent analysis. After this filtering step, 383 of 586 tumour nuclei remained.

After the second filtering step, 346 out of 383 tumour nuclei remained in the final libraries, in which the most common state of chromosome arm is disomy. For further analysis, two bins at the centromere of chromosome 3 (Chr3:82444744-86324666) that showed an artificially high variance in CNA among cells were excluded from each single-cell data. The heterogeneity score (HS) was measured as the number of cells with a distinct copy number state within the population as described by Bakker et al. (2016). HS was defined as:

𝐻𝐻𝐻𝐻 =𝑇𝑇𝑇𝑇1 ∑𝑇𝑇𝑡𝑡=1∑𝑆𝑆𝑓𝑓=0𝑓𝑓 ∙ 𝑚𝑚𝑓𝑓,𝑡𝑡

where N is total number of single cells, T is total number of bins, mf,t is the number of cells

with copy number states at bin t, and S is the total number of copy number states. The aneuploidy score (AS) was measured based on the divergence from the euploidy as described by Bakker et al. 16. The AS was defined as:

(6)

5

sample from multiple other metastatic lesions. All samples were frozen at -80°C. Haematoxylin and eosin staining was performed for each tumour sample to confirm diagnosis and to determine tumour cell percentage.

Figure 1. Presentation of tumour in different organs as detected by FDG-PET/CT (A-C); A) primary

tumour in right upper lobe of lung (circle), which consists of primary tumour 1 (*) and primary tumour 2 (**), mediastinal lymph node (arrow), B) metastasis in liver (circle), and C) metastasis in right adrenal gland (circle). (D) Schematic representation of the distribution pattern of tumour region specific cells calculated based on Pearson’s correlation coefficient. Colours indicate different tumour regions.

Array-based comparative genomic hybridization

Genomic DNA was isolated using standard laboratory procedures. 500ng tumour DNA and a mixed DNA pool of 15 normal subjects (from peripheral mononuclear blood cells) were labelled with Cy5 and Cy3, respectively, according to the Complete Genomic SureTag DNA Enzymatic Labelling Kit protocol (Agilent Technologies, Santa Clara, USA). The program setting for incubation for denaturation was 3 min at 95°C followed by 5 min at 4°C. DNA labelling was for 2h at 37°C and 10 min at 65°C. Hybridization was carried out using custom-designed Agilent 4x180K Human Genome CGH microarray slides (Agilent ID 027730) following the protocol OligoaCGH/ChIP-on-Chip Hybridization kit (Agilent Technologies). The program setting for incubation for blocking unspecific binding and removing repetitive sequences was 3 min at 95°C then 30 min at 37°C. Hybridization was for 22h at 65°C, 20 RPM. Slides were scanned using the G25052C DNA microarray scanner (Agilent Technologies). The data were analysed in Nexus 7.5, Biodiscovery using FE Agilent data type (Biodiscovery, California, USA).

Single cell whole genome sequencing, CNA, heterogeneity and aneuploidy scores

Three 30μm thick sections of fresh-frozen tissues were incubated in nuclear isolation medium [10mM Tris-Cl (pH8), 320mM sucrose, 5mM CaCl2, 3mM Mg(Ac)2, 0.1mM EDTA, 1mM

dithiothreitol (DTT) and 0.1% Triton X-100]. Nuclei were gently pushed out of the tissue pieces through a 70µm filter using a syringe plunger. After centrifugation, nuclei were re-suspended in PBS containing 2% BSA and the DNA-binding dye DAPI (10 μg/ml) for assessment of DNA content. Single G1 phase nuclei were sorted based on low DAPI fluorescence using a MoFloAstrios Cell Sorter (Beckman Coulter) into 96-well skirted PCR plates containing 5μl freeze medium [50% PBS, 7.5% DMSO and 42.5% 2X Pro-Freeze CDM Freeze Medium (Lonza)]. Wells with 10 nuclei were sorted as positive control and empty wells served as negative controls.

Library preparation was performed as described by van den Bos et al. (2016) 11. For sequencing, clusters were generated on the cBot (HiSeq2500) and single-end 50bp reads were generated using the HiSeq2500 sequencing platform (Illumina, San Diego, USA). Raw sequencing data were demultiplexed based on library-specific barcodes that were incorporated during library preparation. Demultiplexed fastq files were aligned to the human reference genome assembly (GRCh37) using Bowtie2 (version 2.2.4) 14 with default settings. The resulting BAM files were sorted using Samtools 15 and duplicate reads were marked using BamUtil (version 1.0.3).

CNAs were determined CNAs were determined of 586 tumour nuclei using AneuFinder 16 with the following settings. Duplicate reads and low-quality alignments (MAPQ<10) were discarded, read counts in 2Mb variable-width bins were GC-corrected, and CNA state was determined with a 10-state Hidden Markov Model (HMM) with copy-number states: zero-inflation, null-, mono-, di-, tri-, tetra-, penta-, hexa-, septa- and octasomy. Using the Aneufinder function, ClusterByQuality, the resulting single-cell libraries were clustered based on several quality criteria: bin-to-bin variation in read density (spikiness), entropy, number of ploidy state segments, Bhattacharyya distance and the log-likelihood of the fit as described by Bakker et al. 16. The highest quality cluster from each sample was then used in subsequent analysis. After this filtering step, 383 of 586 tumour nuclei remained.

After the second filtering step, 346 out of 383 tumour nuclei remained in the final libraries, in which the most common state of chromosome arm is disomy. For further analysis, two bins at the centromere of chromosome 3 (Chr3:82444744-86324666) that showed an artificially high variance in CNA among cells were excluded from each single-cell data. The heterogeneity score (HS) was measured as the number of cells with a distinct copy number state within the population as described by Bakker et al. (2016). HS was defined as:

𝐻𝐻𝐻𝐻 =𝑇𝑇𝑇𝑇1 ∑𝑇𝑇𝑡𝑡=1∑𝑆𝑆𝑓𝑓=0𝑓𝑓 ∙ 𝑚𝑚𝑓𝑓,𝑡𝑡

where N is total number of single cells, T is total number of bins, mf,t is the number of cells

with copy number states at bin t, and S is the total number of copy number states. The aneuploidy score (AS) was measured based on the divergence from the euploidy as described by Bakker et al. 16. The AS was defined as:

(7)

where N is total number of single cells, T is total number of bins, cn,t is the copy number state

of cell n in bin t, and et is the euploid copy number in bin t.

The hierarchical clustering of tumour cells was performed with complete linkage and Euclidean distance measure on the matrix of copy number states for each cell and bin. The correlation of the CNA pattern of each single cell to the merged CNAs of each of the five tumour samples was calculated using Pearson’s correlation based on copy number states per 2Mb. The data set(s) supporting the results of this article are available in the ArrayExpress repository under accession number E-MTAB-4186.

Results

Copy number alterations in single tumour cells

Low coverage scWGS was performed on 586 single tumour nuclei derived from five frozen tumour samples of the SCLC patient: two areas of the primary tumour and one each from mediastinal lymph node, liver and adrenal metastasis. Cells passing quality control and having their most common copy number state being disomy (Supplementary Table S1) revealed 74 (primary tumour 1), 54 (primary tumour 2), 82 (lymph node metastasis), 64 (liver metastasis) and 72 (adrenal metastasis) single cell libraries (n=346 cells) (Supplementary Table S2, Supplementary materials and methods).

To exclude experimental bias introduced by the settings of the sorting gates at the G1 peak, we performed array-based comparative genomic hybridization (aCGH) on DNA isolated from the same five tissue samples. Merged scWGS data of each tumour used to generate bulk CNA patterns were highly similar to those generated by aCGH, thus ruling out bias due to sorting gates (Supplementary Figure S1).

The overall CNA patterns of the three metastases were similar to the primary tumour, supporting their clonal relation to the primary tumour rather than representing secondary unrelated primary tumours.

Intra-tumour CNA heterogeneity

To identify intra-tumour CNA heterogeneity, we performed unsupervised clustering based on the CNA patterns of all individual cells per sample. As expected, the majority of the single tumour cells of each sample showed CNA patterns that were consistent with merged scWGS data and aCGH-based plots (Figure 2 and Supplementary Figure S1). Each sample also contained tumour cells with additional unique CNAs that were not detected by aCGH analysis of bulk material (Figure 2 and 3).

Clustering of primary tumour 1 cells revealed two main clusters (Figure 2A). The dominant CNA characteristics were disomy of chromosome 10 and pentasomy of 3q and 13q in one cluster and disomy or trisomy of chromosome 10 and tetrasomy of 3q and 13q in the other cluster. The overall HS of primary tumour sample 1 was 0.19 and the AS was 0.91 (Figure 2A, Supplementary Figure S2A). Primary tumour sample 2 cells grouped into several smaller clusters (Figure 2B), and the dominant cluster showed CNAs similar to one of the two clusters

in primary tumour 1 (Figure 2A). Primary tumour 2 had the highest HS (0.30) and AS (0.95) among all tumour regions (Figure 2B, Supplementary Figure S2B). The lymph node metastasis cells showed one main cluster with CNA patterns that were quite similar to primary tumours 1 and 2. The HS of the lymph node metastasis was 0.14 and the AS was 0.87 (Figure 2C, Supplementary Figure S2C). The adrenal metastasis cells clustered into several small clusters, which showed an overall disomic state of chromosomes 2, 4, 7, 8, 10 and 21 (Figure 2E). The HS of the adrenal metastasis was 0.23 and the AS was 0.92 (Figure 2E, Supplementary Figure S2E).

In contrast to the primary tumour and metastasis cells described above, the liver metastasis showed a much more homogenous CNA pattern that included trisomy of 18, disomy of 22 and pentasomy of Xp. None of these CNAs were prominent in any of the other tumour samples. Furthermore, a very characteristic liver-metastasis-specific CNA pattern was observed for chromosome 11, with one part being disomic, one part trisomic and one part tetrasomic (Figure 2D). This relatively low variation in CNAs in liver metastasis led to this region having the lowest HS (0.07). Liver metastasis also had the lowest AS among all tumour regions (0.85) (Figure 2D, Supplementary Figure S2D).

Altogether, our single cell CNA analyses revealed a marked variation in the degree of heterogeneity between liver metastasis on the one hand and the primary tumours, lymph node and adrenal of the same patient on the other.

(8)

5

where N is total number of single cells, T is total number of bins, cn,t is the copy number state

of cell n in bin t, and et is the euploid copy number in bin t.

The hierarchical clustering of tumour cells was performed with complete linkage and Euclidean distance measure on the matrix of copy number states for each cell and bin. The correlation of the CNA pattern of each single cell to the merged CNAs of each of the five tumour samples was calculated using Pearson’s correlation based on copy number states per 2Mb. The data set(s) supporting the results of this article are available in the ArrayExpress repository under accession number E-MTAB-4186.

Results

Copy number alterations in single tumour cells

Low coverage scWGS was performed on 586 single tumour nuclei derived from five frozen tumour samples of the SCLC patient: two areas of the primary tumour and one each from mediastinal lymph node, liver and adrenal metastasis. Cells passing quality control and having their most common copy number state being disomy (Supplementary Table S1) revealed 74 (primary tumour 1), 54 (primary tumour 2), 82 (lymph node metastasis), 64 (liver metastasis) and 72 (adrenal metastasis) single cell libraries (n=346 cells) (Supplementary Table S2, Supplementary materials and methods).

To exclude experimental bias introduced by the settings of the sorting gates at the G1 peak, we performed array-based comparative genomic hybridization (aCGH) on DNA isolated from the same five tissue samples. Merged scWGS data of each tumour used to generate bulk CNA patterns were highly similar to those generated by aCGH, thus ruling out bias due to sorting gates (Supplementary Figure S1).

The overall CNA patterns of the three metastases were similar to the primary tumour, supporting their clonal relation to the primary tumour rather than representing secondary unrelated primary tumours.

Intra-tumour CNA heterogeneity

To identify intra-tumour CNA heterogeneity, we performed unsupervised clustering based on the CNA patterns of all individual cells per sample. As expected, the majority of the single tumour cells of each sample showed CNA patterns that were consistent with merged scWGS data and aCGH-based plots (Figure 2 and Supplementary Figure S1). Each sample also contained tumour cells with additional unique CNAs that were not detected by aCGH analysis of bulk material (Figure 2 and 3).

Clustering of primary tumour 1 cells revealed two main clusters (Figure 2A). The dominant CNA characteristics were disomy of chromosome 10 and pentasomy of 3q and 13q in one cluster and disomy or trisomy of chromosome 10 and tetrasomy of 3q and 13q in the other cluster. The overall HS of primary tumour sample 1 was 0.19 and the AS was 0.91 (Figure 2A, Supplementary Figure S2A). Primary tumour sample 2 cells grouped into several smaller clusters (Figure 2B), and the dominant cluster showed CNAs similar to one of the two clusters

in primary tumour 1 (Figure 2A). Primary tumour 2 had the highest HS (0.30) and AS (0.95) among all tumour regions (Figure 2B, Supplementary Figure S2B). The lymph node metastasis cells showed one main cluster with CNA patterns that were quite similar to primary tumours 1 and 2. The HS of the lymph node metastasis was 0.14 and the AS was 0.87 (Figure 2C, Supplementary Figure S2C). The adrenal metastasis cells clustered into several small clusters, which showed an overall disomic state of chromosomes 2, 4, 7, 8, 10 and 21 (Figure 2E). The HS of the adrenal metastasis was 0.23 and the AS was 0.92 (Figure 2E, Supplementary Figure S2E).

In contrast to the primary tumour and metastasis cells described above, the liver metastasis showed a much more homogenous CNA pattern that included trisomy of 18, disomy of 22 and pentasomy of Xp. None of these CNAs were prominent in any of the other tumour samples. Furthermore, a very characteristic liver-metastasis-specific CNA pattern was observed for chromosome 11, with one part being disomic, one part trisomic and one part tetrasomic (Figure 2D). This relatively low variation in CNAs in liver metastasis led to this region having the lowest HS (0.07). Liver metastasis also had the lowest AS among all tumour regions (0.85) (Figure 2D, Supplementary Figure S2D).

Altogether, our single cell CNA analyses revealed a marked variation in the degree of heterogeneity between liver metastasis on the one hand and the primary tumours, lymph node and adrenal of the same patient on the other.

(9)

Figure 2. scWGS-derived whole genome copy number profiles of each site of tumour; A) 74 cells of

primary tumour 1, B) 54 cells of primary tumour 2, C) 82 cells of lymph node metastasis, D) 64 cells of liver metastasis, E) 72 cells of adrenal metastasis. Each row represents a single cell with chromosomes plotted as columns. Cells are clustered based on similarity of their copy number profile. Copy number states are indicated by colours as shown in the legend at the bottom of the figure (upper row). The coloured bar on the left represents the tumour region. The tumour region is indicated by colours as shown in the legend at the bottom of the figure (lower row). Heterogeneity score (HS) and aneuploidy score (AS) are shown at the right of each image.

Figure 3. Schematic representation of the CNA states of the four chromosomes that show differences between primary tumour 1, liver metastasis, and adrenal metastasis. The upper panel

represents the CNA patterns obtained from the merged scWGS data of the primary tumour, lymph node metastasis, liver metastasis, and adrenal metastasis. The second row shows the CNA patterns of the single cells obtained from two primary tumour 1 (#59 and #82) that are identical to the CNA pattern of the merged scWGS data of the liver metastasis. The third row gives the CNA patterns of the adrenal metastasis derived single cells (#26, #44, #74 #85 and #90) that clustered between the liver metastasis cells. Copy number states are indicated by colours as shown in the legend at the bottom of the figure.

Identification of metastasis founder cells

We followed two strategies to identify single cells in the primary tumour that could be metastasis founder cells based on their copy number state. The first strategy was based on hierarchical clustering of all sequenced primary tumour and metastasis cells. This revealed an overall intermixed pattern, especially for the primary tumours and the lymph node and adrenal metastasis (Supplementary Figure S3). In contrast, liver cells clustered separately, with almost all single cells of the liver metastasis clustering in one subcluster with two cells from primary tumour 1 (#59 and #82) and five cells of the adrenal metastasis (#26, #44, #74, #85 and #90), with all showing the liver-specific CNA pattern of four chromosomes identical to the merged liver CNA pattern (Figure 3).

(10)

5

Figure 2. scWGS-derived whole genome copy number profiles of each site of tumour; A) 74 cells of

primary tumour 1, B) 54 cells of primary tumour 2, C) 82 cells of lymph node metastasis, D) 64 cells of liver metastasis, E) 72 cells of adrenal metastasis. Each row represents a single cell with chromosomes plotted as columns. Cells are clustered based on similarity of their copy number profile. Copy number states are indicated by colours as shown in the legend at the bottom of the figure (upper row). The coloured bar on the left represents the tumour region. The tumour region is indicated by colours as shown in the legend at the bottom of the figure (lower row). Heterogeneity score (HS) and aneuploidy score (AS) are shown at the right of each image.

Figure 3. Schematic representation of the CNA states of the four chromosomes that show differences between primary tumour 1, liver metastasis, and adrenal metastasis. The upper panel

represents the CNA patterns obtained from the merged scWGS data of the primary tumour, lymph node metastasis, liver metastasis, and adrenal metastasis. The second row shows the CNA patterns of the single cells obtained from two primary tumour 1 (#59 and #82) that are identical to the CNA pattern of the merged scWGS data of the liver metastasis. The third row gives the CNA patterns of the adrenal metastasis derived single cells (#26, #44, #74 #85 and #90) that clustered between the liver metastasis cells. Copy number states are indicated by colours as shown in the legend at the bottom of the figure.

Identification of metastasis founder cells

We followed two strategies to identify single cells in the primary tumour that could be metastasis founder cells based on their copy number state. The first strategy was based on hierarchical clustering of all sequenced primary tumour and metastasis cells. This revealed an overall intermixed pattern, especially for the primary tumours and the lymph node and adrenal metastasis (Supplementary Figure S3). In contrast, liver cells clustered separately, with almost all single cells of the liver metastasis clustering in one subcluster with two cells from primary tumour 1 (#59 and #82) and five cells of the adrenal metastasis (#26, #44, #74, #85 and #90), with all showing the liver-specific CNA pattern of four chromosomes identical to the merged liver CNA pattern (Figure 3).

(11)

In the second strategy, we showed that the Pearson’s correlation coefficients of the liver metastasis cells to the merged liver CNA data were much higher than the correlation coefficients of the other single cells to their matching merged patterns (Supplementary Figure S4). This confirmed the more homogenous nature of the liver metastasis. A subset of the single tumour cells (19 out of 346) of the other four tumour samples showed a stronger association with the merged pattern of one of the other tumour samples as compared to their own merged pattern (Figure 1D and Supplementary Table S3). Three of the 74 cells of primary tumour 1 and six of the 72 adrenal gland tumour cells showed a stronger association with the merged liver metastasis, including the cells with the characteristic liver CNA patterns. The two cells strongly associated with the liver metastasis (cell #70 of primary tumour 1 and #82 of adrenal metastasis) did not show the characteristic CNA pattern of chromosomes 11, 18, 22 and Xp. Thus we clearly showed liver metastasis resembling cells in primary tumour and adrenal metastasis.

Discussion

Recent developments in single cell sequencing, in combination with improved bio-informatics tools, have greatly advanced the field of tumour genetics. Here, we generated scWGS-based CNA patterns to study heterogeneity within a single tumour region and between different tumour regions in a SCLC patient. The merged CNA patterns of the five tumour samples were similar, supporting a clonal relation of the metastasis to the primary tumour. In addition, we showed a marked degree of CNA heterogeneity at the single-cell level.

Mutational and CNA heterogeneity can vary greatly among tumour lesions and among patients 17. Smoking and ultraviolet radiation are the main causes of genomic alterations. Multi-region sampling of non-SCLC primary tumour samples revealed a marked degree of heterogeneity in both mutational and CNA patterns 18. However, aberrations present in a minority of cells can be missed due to dilution within the background of other tumour cells and normal cell admixture within a tumour. This can be circumvented in part by increasing sequencing depth 19 or by using single-cell sequencing. Using the latter approach, we were able to detect various distinct tumour subclones with unique CNA patterns both within and between different tumour regions in our SCLC patient. Whole exome sequencing of the same tumour samples in a previously published study revealed a low degree of heterogeneity with approximately 95% of the somatic mutations being shared 20. Thus, the high degree of heterogeneity among different tumour regions at the CNA level was undetectable at the mutational-level using DNA isolated from tumour bulk. This shows the power of single-cell analysis to truly determine heterogeneity in tumour samples. In fact, the true level of heterogeneity in this patient might be even higher because the single cells we analysed originated from only a limited region of the tumour mass.

Metastases are the result of multiple advantageous aberrations that support dissociation, migration and secondary colonization of specific subclones of the primary tumour. The ability to metastasize might innately be limited to a subset of the primary tumour cells or be gained through genetic or non-genetic changes throughout tumour progression. Such non-genetic

changes include the emergence of cancer stem cells, epithelial-mesenchymal transition, interclonal cooperation and tumour microenvironment such as stromal cells, extracellular matrix and immune cells. Metastasizing potential will be gained in specific subclones at different time points during evolution 8. In our study, some of the primary lung tumour-derived cells had a CNA pattern identical to those of the merged liver metastasis cells, indicating that liver-metastasis founder cells were present in the primary tumour as a minor clone with specific CNAs.

Indeed, single cells or collectively migrating groups of cells may very well represent the founder cells of the metastasis clone. Hou et al (2012) showed that circulating tumour cells (CTCs) in SCLC patients frequently clump together 21. Lack of Ki67 expression in these circulating tumour microemboli indicated that circulating cells are released from the primary tumour as a clump of cells through a collective migration mechanism, rather than being formed upon proliferation of solitary CTCs in the blood circulation 21,22. Presence of liver-metastasis-resembling cells in the primary tumour, but not the lymph node tumour, in our patient suggests that the liver metastasis clone formed independently of the lymph node metastasis clone and seeded through blood vessels. A similar lymph-node-independent seeding pattern has been shown for bone marrow metastasis of breast cancer 23. Interestingly, the presence of five cells in the adrenal metastasis with a CNA pattern similar to the liver metastasis may even suggest a possible migration of cells between different metastatic regions, although the actual order of events cannot be deduced.

The low degree of ITH suggests a monoclonal seeding pattern in the liver. This pattern is supported by WGS of single CTCs of lung cancer patients showing CNA patterns more similar to the CNA patterns of the metastasis tumour than they were to the primary tumour 10. However, the low degree of ITH could also be the result of a restrictive growth potential of a specific tumour-cell subclone in the liver. Another explanation could be that the liver-metastasis-founding cells have lost the potential to survive on-going chromosomal instability, resulting in the outgrowth of a more homogeneous cell population 8.

The higher degree of ITH observed in the lymph node and adrenal gland metastases potentially suggests polyclonal seeding during the development of these metastases in our patient. Subcutaneous injection of a combination of neuroendocrine and non-neuroendocrine SCLC cell lines into immune deficient mice resulted in more aggressive liver metastasis development than injection of only a single-cell clone 24. This supports more favourable growth of a polyclonal tumour cell population. Moreover, interaction among subclones with different tumour secreted factors has been shown to contribute to the development of new tumour phenotypes 25. Thus, tumour cells with unique CNA patterns, as identified in our SCLC patient, could represent newly emerging clones with new characteristics. Polyclonal seeding to metastatic sites may also occur from single or multiple subclones of the primary tumour and from one metastatic site into another 26,27. The fact that we identified multiple subclones in primary and metastasis tumour regions with identical CNA patterns could be consistent with a polyclonal seeding mechanism.

(12)

5

In the second strategy, we showed that the Pearson’s correlation coefficients of the liver metastasis cells to the merged liver CNA data were much higher than the correlation coefficients of the other single cells to their matching merged patterns (Supplementary Figure S4). This confirmed the more homogenous nature of the liver metastasis. A subset of the single tumour cells (19 out of 346) of the other four tumour samples showed a stronger association with the merged pattern of one of the other tumour samples as compared to their own merged pattern (Figure 1D and Supplementary Table S3). Three of the 74 cells of primary tumour 1 and six of the 72 adrenal gland tumour cells showed a stronger association with the merged liver metastasis, including the cells with the characteristic liver CNA patterns. The two cells strongly associated with the liver metastasis (cell #70 of primary tumour 1 and #82 of adrenal metastasis) did not show the characteristic CNA pattern of chromosomes 11, 18, 22 and Xp. Thus we clearly showed liver metastasis resembling cells in primary tumour and adrenal metastasis.

Discussion

Recent developments in single cell sequencing, in combination with improved bio-informatics tools, have greatly advanced the field of tumour genetics. Here, we generated scWGS-based CNA patterns to study heterogeneity within a single tumour region and between different tumour regions in a SCLC patient. The merged CNA patterns of the five tumour samples were similar, supporting a clonal relation of the metastasis to the primary tumour. In addition, we showed a marked degree of CNA heterogeneity at the single-cell level.

Mutational and CNA heterogeneity can vary greatly among tumour lesions and among patients 17. Smoking and ultraviolet radiation are the main causes of genomic alterations. Multi-region sampling of non-SCLC primary tumour samples revealed a marked degree of heterogeneity in both mutational and CNA patterns 18. However, aberrations present in a minority of cells can be missed due to dilution within the background of other tumour cells and normal cell admixture within a tumour. This can be circumvented in part by increasing sequencing depth 19 or by using single-cell sequencing. Using the latter approach, we were able to detect various distinct tumour subclones with unique CNA patterns both within and between different tumour regions in our SCLC patient. Whole exome sequencing of the same tumour samples in a previously published study revealed a low degree of heterogeneity with approximately 95% of the somatic mutations being shared 20. Thus, the high degree of heterogeneity among different tumour regions at the CNA level was undetectable at the mutational-level using DNA isolated from tumour bulk. This shows the power of single-cell analysis to truly determine heterogeneity in tumour samples. In fact, the true level of heterogeneity in this patient might be even higher because the single cells we analysed originated from only a limited region of the tumour mass.

Metastases are the result of multiple advantageous aberrations that support dissociation, migration and secondary colonization of specific subclones of the primary tumour. The ability to metastasize might innately be limited to a subset of the primary tumour cells or be gained through genetic or non-genetic changes throughout tumour progression. Such non-genetic

changes include the emergence of cancer stem cells, epithelial-mesenchymal transition, interclonal cooperation and tumour microenvironment such as stromal cells, extracellular matrix and immune cells. Metastasizing potential will be gained in specific subclones at different time points during evolution 8. In our study, some of the primary lung tumour-derived cells had a CNA pattern identical to those of the merged liver metastasis cells, indicating that liver-metastasis founder cells were present in the primary tumour as a minor clone with specific CNAs.

Indeed, single cells or collectively migrating groups of cells may very well represent the founder cells of the metastasis clone. Hou et al (2012) showed that circulating tumour cells (CTCs) in SCLC patients frequently clump together 21. Lack of Ki67 expression in these circulating tumour microemboli indicated that circulating cells are released from the primary tumour as a clump of cells through a collective migration mechanism, rather than being formed upon proliferation of solitary CTCs in the blood circulation 21,22. Presence of liver-metastasis-resembling cells in the primary tumour, but not the lymph node tumour, in our patient suggests that the liver metastasis clone formed independently of the lymph node metastasis clone and seeded through blood vessels. A similar lymph-node-independent seeding pattern has been shown for bone marrow metastasis of breast cancer 23. Interestingly, the presence of five cells in the adrenal metastasis with a CNA pattern similar to the liver metastasis may even suggest a possible migration of cells between different metastatic regions, although the actual order of events cannot be deduced.

The low degree of ITH suggests a monoclonal seeding pattern in the liver. This pattern is supported by WGS of single CTCs of lung cancer patients showing CNA patterns more similar to the CNA patterns of the metastasis tumour than they were to the primary tumour 10. However, the low degree of ITH could also be the result of a restrictive growth potential of a specific tumour-cell subclone in the liver. Another explanation could be that the liver-metastasis-founding cells have lost the potential to survive on-going chromosomal instability, resulting in the outgrowth of a more homogeneous cell population 8.

The higher degree of ITH observed in the lymph node and adrenal gland metastases potentially suggests polyclonal seeding during the development of these metastases in our patient. Subcutaneous injection of a combination of neuroendocrine and non-neuroendocrine SCLC cell lines into immune deficient mice resulted in more aggressive liver metastasis development than injection of only a single-cell clone 24. This supports more favourable growth of a polyclonal tumour cell population. Moreover, interaction among subclones with different tumour secreted factors has been shown to contribute to the development of new tumour phenotypes 25. Thus, tumour cells with unique CNA patterns, as identified in our SCLC patient, could represent newly emerging clones with new characteristics. Polyclonal seeding to metastatic sites may also occur from single or multiple subclones of the primary tumour and from one metastatic site into another 26,27. The fact that we identified multiple subclones in primary and metastasis tumour regions with identical CNA patterns could be consistent with a polyclonal seeding mechanism.

(13)

Conclusions

CNAs determined by scWGS varied markedly among distinct primary tumour and metastasis samples. Tumour cells derived from liver metastasis and a small subset of the cells derived from the adrenal metastasis resembled a minor subset of the cells in the primary tumour. Our data imply that metastases develop through either a monoclonal seeding pattern, as we observed in liver, or a polyclonal seeding patterns, as we observed in lymph node and adrenal gland metastases in our patient.

Competing interests

None of the authors have conflicts of interest to declare.

Authors' contributions

PF, HB and AS carried out the molecular genetic studies and drafted the manuscript. TJNH, AW, WT and HJMG collected patient material and participated in the design of the study and drafting of the manuscript. KK, AT, DP, FF, MCT and PML participated in the study design and data analysis. AB and DCJS participated in the study design, analysis and coordination to draft the manuscript. All authors read and approved the final manuscript.

Acknowledgements

We thank Martijn M. Terpstra for data analysis support and Nancy Halsema, Hinke G. Kazemier and Karina Hoekstra-Wakker for assistance with single-cell sequencing and sequencing support. We thank the operators at the central Flow Cytometry Unit (UMCG), Geert Mesander, Henk Moes and Roelof Jan van der Lei, for their assistance with sorting. This work was supported by a European Research Council Advanced grant [ROOTS-Grant Agreement 294740 to PML], the Pediatric Oncology Foundation Groningen (SKOG) and the Dutch Cancer Society [2012-RUG-5549 to FF], a MEERVOUD grant from the Netherlands Organization for Scientific Research (NWO) [836.12.011 to MCT] and Rosalind Franklin Fellowship from the University of Groningen [to MCT].

References

1. Siegel, R. L., Miller, K. D. & Jemal, A. Cancer statistics. CA Cancer J Clin 66, 7–30 (2016). 2. Jett, J. R., Schild, S. E., Kesler, K. A. & Kalemkerian, G. P. Treatment of small cell lung cancer:

Diagnosis and management of lung cancer, 3rd ed: American college of chest physicians evidence-based clinical practice guidelines. Chest 143, (2013).

3. Navin, N. E. & Hicks, J. Tracing the tumor lineage. Mol. Oncol. 4, 267–283 (2010). 4. Lamb, A. D. et al. Intratumoral and Intertumoral Genomic Heterogeneity of Multifocal

Localized Prostate Cancer Impacts Molecular Classifications and Genomic Prognosticators.

Eur. Urol. 71, 183–192 (2017).

5. Kim, T.-M. M. et al. Subclonal genomic architectures of primary and metastatic colorectal cancer based on intratumoral genetic heterogeneity. Clin. Cancer Res. 21, 4461–4472 (2015). 6. Gerlinger, M. et al. Genomic architecture and evolution of clear cell renal cell carcinomas

defined by multiregion sequencing. Nat. Genet. 46, (2014).

7. Navin, N. et al. Inferring tumor progression from genomic heterogeneity. Genome Res. 20, 68–80 (2010).

8. Neelakantan, D., Drasin, D. J. & Ford, H. L. Intratumoral heterogeneity: Clonal cooperation in epithelial-to-mesenchymal transition and metastasis. Cell Adhes. Migr. 9, 265–276 (2015). 9. Yachida, S. et al. Distant metastasis occurs late during the genetic evolution of pancreatic

cancer. Nature 467, 1114–1117 (2010).

10. Ni, X. et al. Reproducible copy number variation patterns among single circulating tumor cells of lung cancer patients. Pnas 110, 21083–8 (2013).

11. van den Bos, H. et al. Single-cell whole genome sequencing reveals no evidence for common aneuploidy in normal and Alzheimer’s disease neurons. Genome Biol. 17, 116 (2016). 12. Bakker, B. et al. Single-cell sequencing reveals karyotype heterogeneity in murine and human

malignancies. Genome Biol. 17, 115 (2016).

13. Beroukhim, R., Zhang, X. & Meyerson, M. Copy number alterations unmasked as enhancer hijackers. Nat. Genet. 49, 5–6 (2016).

14. Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–9 (2012).

15. Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078– 2079 (2009).

16. Bakker, B. et al. Single cell sequencing reveals karyotype heterogeneity in murine and human tumours. Genome Biol. 17, 1–15 (2016).

17. Xue, R. et al. Variable Intra-Tumor Genomic Heterogeneity of Multiple Lesions in Patients with Hepatocellular Carcinoma. Gastroenterology 150, 998–1008 (2016).

18. de Bruin, E. C. et al. Spatial and temporal diversity in genomic instability processes defines lung cancer evolution. Science (80-. ). 346, 251–256 (2014).

19. Zhang, J. et al. Intratumor heterogeneity in localized lung adenocarcinomas delineated by multiregion sequencing. Science (80-. ). 346, 256–259 (2014).

20. Saber, A. et al. Mutation patterns in small cell and non-small cell lung cancer patients suggest a different level of heterogeneity between primary and metastatic tumors. Carcinogenesis 38, bgw128 (2016).

21. Hou, J. M. et al. Clinical significance and molecular characteristics of circulating tumor cells and circulating tumor microemboli in patients with small-cell lung cancer. J. Clin. Oncol. 30, 525–532 (2012).

22. Friedl, P. & Gilmour, D. Collective cell migration in morphogenesis, regeneration and cancer.

Nat. Rev. Mol. Cell Biol. 10, 445–457 (2009).

23. Schmidt-Kittler, O. et al. From latent disseminated cells to overt metastasis: Genetic analysis of systemic breast cancer progression. Proc. Natl. Acad. Sci. 100, 7737–7742 (2003). 24. Calbo, J. et al. A Functional Role for Tumor Cell Heterogeneity in a Mouse Model of Small Cell

Lung Cancer. Cancer Cell 19, 244–256 (2011).

25. Marusyk, A. et al. Non-cell-autonomous driving of tumour growth supports sub-clonal heterogeneity. Nature 514, 54–58 (2014).

26. Gundem, G. et al. The evolutionary history of lethal metastatic prostate cancer. Nature 520, 353–357 (2015).

27. Hong, M. K. H. et al. Tracking the origins and drivers of subclonal metastatic expansion in prostate cancer. Nat. Commun. 6, 6605 (2015).

(14)

5

Conclusions

CNAs determined by scWGS varied markedly among distinct primary tumour and metastasis samples. Tumour cells derived from liver metastasis and a small subset of the cells derived from the adrenal metastasis resembled a minor subset of the cells in the primary tumour. Our data imply that metastases develop through either a monoclonal seeding pattern, as we observed in liver, or a polyclonal seeding patterns, as we observed in lymph node and adrenal gland metastases in our patient.

Competing interests

None of the authors have conflicts of interest to declare.

Authors' contributions

PF, HB and AS carried out the molecular genetic studies and drafted the manuscript. TJNH, AW, WT and HJMG collected patient material and participated in the design of the study and drafting of the manuscript. KK, AT, DP, FF, MCT and PML participated in the study design and data analysis. AB and DCJS participated in the study design, analysis and coordination to draft the manuscript. All authors read and approved the final manuscript.

Acknowledgements

We thank Martijn M. Terpstra for data analysis support and Nancy Halsema, Hinke G. Kazemier and Karina Hoekstra-Wakker for assistance with single-cell sequencing and sequencing support. We thank the operators at the central Flow Cytometry Unit (UMCG), Geert Mesander, Henk Moes and Roelof Jan van der Lei, for their assistance with sorting. This work was supported by a European Research Council Advanced grant [ROOTS-Grant Agreement 294740 to PML], the Pediatric Oncology Foundation Groningen (SKOG) and the Dutch Cancer Society [2012-RUG-5549 to FF], a MEERVOUD grant from the Netherlands Organization for Scientific Research (NWO) [836.12.011 to MCT] and Rosalind Franklin Fellowship from the University of Groningen [to MCT].

References

1. Siegel, R. L., Miller, K. D. & Jemal, A. Cancer statistics. CA Cancer J Clin 66, 7–30 (2016). 2. Jett, J. R., Schild, S. E., Kesler, K. A. & Kalemkerian, G. P. Treatment of small cell lung cancer:

Diagnosis and management of lung cancer, 3rd ed: American college of chest physicians evidence-based clinical practice guidelines. Chest 143, (2013).

3. Navin, N. E. & Hicks, J. Tracing the tumor lineage. Mol. Oncol. 4, 267–283 (2010). 4. Lamb, A. D. et al. Intratumoral and Intertumoral Genomic Heterogeneity of Multifocal

Localized Prostate Cancer Impacts Molecular Classifications and Genomic Prognosticators.

Eur. Urol. 71, 183–192 (2017).

5. Kim, T.-M. M. et al. Subclonal genomic architectures of primary and metastatic colorectal cancer based on intratumoral genetic heterogeneity. Clin. Cancer Res. 21, 4461–4472 (2015). 6. Gerlinger, M. et al. Genomic architecture and evolution of clear cell renal cell carcinomas

defined by multiregion sequencing. Nat. Genet. 46, (2014).

7. Navin, N. et al. Inferring tumor progression from genomic heterogeneity. Genome Res. 20, 68–80 (2010).

8. Neelakantan, D., Drasin, D. J. & Ford, H. L. Intratumoral heterogeneity: Clonal cooperation in epithelial-to-mesenchymal transition and metastasis. Cell Adhes. Migr. 9, 265–276 (2015). 9. Yachida, S. et al. Distant metastasis occurs late during the genetic evolution of pancreatic

cancer. Nature 467, 1114–1117 (2010).

10. Ni, X. et al. Reproducible copy number variation patterns among single circulating tumor cells of lung cancer patients. Pnas 110, 21083–8 (2013).

11. van den Bos, H. et al. Single-cell whole genome sequencing reveals no evidence for common aneuploidy in normal and Alzheimer’s disease neurons. Genome Biol. 17, 116 (2016). 12. Bakker, B. et al. Single-cell sequencing reveals karyotype heterogeneity in murine and human

malignancies. Genome Biol. 17, 115 (2016).

13. Beroukhim, R., Zhang, X. & Meyerson, M. Copy number alterations unmasked as enhancer hijackers. Nat. Genet. 49, 5–6 (2016).

14. Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–9 (2012).

15. Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078– 2079 (2009).

16. Bakker, B. et al. Single cell sequencing reveals karyotype heterogeneity in murine and human tumours. Genome Biol. 17, 1–15 (2016).

17. Xue, R. et al. Variable Intra-Tumor Genomic Heterogeneity of Multiple Lesions in Patients with Hepatocellular Carcinoma. Gastroenterology 150, 998–1008 (2016).

18. de Bruin, E. C. et al. Spatial and temporal diversity in genomic instability processes defines lung cancer evolution. Science (80-. ). 346, 251–256 (2014).

19. Zhang, J. et al. Intratumor heterogeneity in localized lung adenocarcinomas delineated by multiregion sequencing. Science (80-. ). 346, 256–259 (2014).

20. Saber, A. et al. Mutation patterns in small cell and non-small cell lung cancer patients suggest a different level of heterogeneity between primary and metastatic tumors. Carcinogenesis 38, bgw128 (2016).

21. Hou, J. M. et al. Clinical significance and molecular characteristics of circulating tumor cells and circulating tumor microemboli in patients with small-cell lung cancer. J. Clin. Oncol. 30, 525–532 (2012).

22. Friedl, P. & Gilmour, D. Collective cell migration in morphogenesis, regeneration and cancer.

Nat. Rev. Mol. Cell Biol. 10, 445–457 (2009).

23. Schmidt-Kittler, O. et al. From latent disseminated cells to overt metastasis: Genetic analysis of systemic breast cancer progression. Proc. Natl. Acad. Sci. 100, 7737–7742 (2003). 24. Calbo, J. et al. A Functional Role for Tumor Cell Heterogeneity in a Mouse Model of Small Cell

Lung Cancer. Cancer Cell 19, 244–256 (2011).

25. Marusyk, A. et al. Non-cell-autonomous driving of tumour growth supports sub-clonal heterogeneity. Nature 514, 54–58 (2014).

26. Gundem, G. et al. The evolutionary history of lethal metastatic prostate cancer. Nature 520, 353–357 (2015).

27. Hong, M. K. H. et al. Tracking the origins and drivers of subclonal metastatic expansion in prostate cancer. Nat. Commun. 6, 6605 (2015).

(15)

Supplementary Figure S1. Comparison of the array-based CGH patterns of total DNA with the merged scWGS-data based CNA plots. CNA plots of the A) primary tumour 1, B) primary tumour 2, C)

lymph node, D) liver metastasis, and E) adrenal metastasis. Array-based CGH plots are shown at the top and the merged scWGS-data based plots are shown below. The X-axis of array-based CGH plots represents each chromosome region, while the Y-axis represents log2 intensity ratio between tumour and reference. The diploid clones will correspond to log2 ratio of zero in ideal condition. The X-axis of the merged scWGS-data based plots represents each chromosome region, while the Y-axis represents the number of reads for each 1Mb bin of each copy number sate. Copy number states are indicated by colours as shown at the bottom right of the figure.

Supplementary Figure S2. The plots of aneuploidy score (AS) and heterogeneity score (HS) for each site of tumour. A) primary tumour 1, B) primary tumour 2, C) lymph node metastases, D) liver

metastasis, E) adrenal metastasis. The X-axis represents each chromosome region, while the Y-axis represents the score for aneuploidy or heterogeneity. The upper plots of each chromosome region show the aneuploidy (divergence from euploidy) score while the lower plots show the heterogeneity score. Each dot in the plot represents one chromosome.

(16)

5

Supplementary Figure S1. Comparison of the array-based CGH patterns of total DNA with the merged scWGS-data based CNA plots. CNA plots of the A) primary tumour 1, B) primary tumour 2, C)

lymph node, D) liver metastasis, and E) adrenal metastasis. Array-based CGH plots are shown at the top and the merged scWGS-data based plots are shown below. The X-axis of array-based CGH plots represents each chromosome region, while the Y-axis represents log2 intensity ratio between tumour and reference. The diploid clones will correspond to log2 ratio of zero in ideal condition. The X-axis of the merged scWGS-data based plots represents each chromosome region, while the Y-axis represents the number of reads for each 1Mb bin of each copy number sate. Copy number states are indicated by colours as shown at the bottom right of the figure.

Supplementary Figure S2. The plots of aneuploidy score (AS) and heterogeneity score (HS) for each site of tumour. A) primary tumour 1, B) primary tumour 2, C) lymph node metastases, D) liver

metastasis, E) adrenal metastasis. The X-axis represents each chromosome region, while the Y-axis represents the score for aneuploidy or heterogeneity. The upper plots of each chromosome region show the aneuploidy (divergence from euploidy) score while the lower plots show the heterogeneity score. Each dot in the plot represents one chromosome.

(17)

Supplementary Figure S3. Unsupervised hierarchical clustering of the single cells from primary tumour 1, primary tumour 2, lymph node metastasis, liver metastasis, and adrenal metastasis. Each

row represents a single cell with chromosomes plotted as columns. Cells are clustered based on similarity of their copy number profile. Copy number states are indicated by colours as shown in the legend at the bottom of the figure (upper row). The coloured bar on the left represents the tumour region. The tumour region is indicated by colours as shown in the legend at the bottom of the figure (lower row). The most characteristic cluster was dominated by single cells of the liver metastasis admixed with two cells of primary tumour 1, 5 cells of the adrenal metastasis and one cell of primary tumour 2. However, the primary 2 cell did not show specific CNA pattern as seen in liver metastasis cells.

Supplementary Figures S4. The correlation of single cells to their own merged CNA libraries data.

Each dot in the plot represents one single cell. Different tumour regions indicated by the colours of the dots.

Supplementary Table 1. Quality assessment by Aneufinder for single cell libraries filtering. Shown are

the mean values for the highest scoring cluster in each tumour region. Sample ID Spikiness Entropy

Log-likelihood Number of ploidy state segments Bhattacharyya distance

Primary 1 0.13 7.11 -6953.63 81.24 3.05

Primary 2 0.14 7.11 -6026.63 62.73 2.36

Lymph node 0.12 7.11 -7313.90 85.63 2.66

Liver 0.11 7.12 -7139.36 70.12 3.36

Adrenal 0.12 7.12 -6887.18 63.87 3.35

Supplementary Table 2. Overview of sequencing results.

Sample ID No. cells sequenced (excluding controls)

Mapped

reads Unique mapped reads Average no. of reads for each bin Coverage

(%) No. cells with aberrant CNV pattern that passed QC (%) Primary 1 92 780863 521965 386 0.87 74 Primary 2 93 1177623 545697 404 0.91 54 Lymph node 165 913704 458330 339 0.76 82 Liver 92 299575 211548 156 0.35 64 Adrenal 144 1007752 444981 329 0.74 72

(18)

5

Supplementary Figure S3. Unsupervised hierarchical clustering of the single cells from primary tumour 1, primary tumour 2, lymph node metastasis, liver metastasis, and adrenal metastasis. Each

row represents a single cell with chromosomes plotted as columns. Cells are clustered based on similarity of their copy number profile. Copy number states are indicated by colours as shown in the legend at the bottom of the figure (upper row). The coloured bar on the left represents the tumour region. The tumour region is indicated by colours as shown in the legend at the bottom of the figure (lower row). The most characteristic cluster was dominated by single cells of the liver metastasis admixed with two cells of primary tumour 1, 5 cells of the adrenal metastasis and one cell of primary tumour 2. However, the primary 2 cell did not show specific CNA pattern as seen in liver metastasis cells.

Supplementary Figures S4. The correlation of single cells to their own merged CNA libraries data.

Each dot in the plot represents one single cell. Different tumour regions indicated by the colours of the dots.

Supplementary Table 1. Quality assessment by Aneufinder for single cell libraries filtering. Shown are

the mean values for the highest scoring cluster in each tumour region. Sample ID Spikiness Entropy

Log-likelihood Number of ploidy state segments Bhattacharyya distance

Primary 1 0.13 7.11 -6953.63 81.24 3.05

Primary 2 0.14 7.11 -6026.63 62.73 2.36

Lymph node 0.12 7.11 -7313.90 85.63 2.66

Liver 0.11 7.12 -7139.36 70.12 3.36

Adrenal 0.12 7.12 -6887.18 63.87 3.35

Supplementary Table 2. Overview of sequencing results.

Sample ID No. cells sequenced (excluding controls)

Mapped

reads Unique mapped reads Average no. of reads for each bin Coverage

(%) No. cells with aberrant CNV pattern that passed QC (%) Primary 1 92 780863 521965 386 0.87 74 Primary 2 93 1177623 545697 404 0.91 54 Lymph node 165 913704 458330 339 0.76 82 Liver 92 299575 211548 156 0.35 64 Adrenal 144 1007752 444981 329 0.74 72

(19)

Supplementary Table 3. Cell correlation to merged libraries data from each tumour region.

Tumour region

Total cells Cells similar to other tumour region

Details

{Other tumour region [number of cells; cell ID (correlation value to other tumour region vs. correlation value to own merge)]}

primary 1 74 7 primary 2 [2 cells; #11 (0.89 vs. 0.88), #88 (0.91 vs. 0.89)] lymph node [2 cells; #13 (0.98 vs. 0.97), #80 (0.91 vs. 0.89)] liver [3 cells; #59 (0.94 vs. 0.88), #70 (0.81 vs. 0.76), #82 (0.93 vs. 0.87)]

primary 2 54 4 primary 1 [4 cells; #23 (0.93 vs. 0.87), #41 (0.59 vs. 0.52) #48 (0.81 vs. 0.80), #52 (0.93 vs. 0.92)]

lymph node

82 1 primary 1 [1 cell; #56 (0.93 vs. 0.91)]

liver 64 0 -

adrenal 72 7 primary 1 [1 cell; #51 (0.40 vs. 0.38)]

liver [6 cells; #26 (0.98 vs. 0.96), #44 (0.95 vs. 0.92), #74 (0.93 vs. 0.91), #82 (0.74 vs. 0.73), #85 (0.94 vs. 0.92), #90 (0.95 vs. 0.92)]

(20)

Supplementary Table 3. Cell correlation to merged libraries data from each tumour region.

Tumour region

Total cells Cells similar to other tumour region

Details

{Other tumour region [number of cells; cell ID (correlation value to other tumour region vs. correlation value to own merge)]}

primary 1 74 7 primary 2 [2 cells; #11 (0.89 vs. 0.88), #88 (0.91 vs. 0.89)] lymph node [2 cells; #13 (0.98 vs. 0.97), #80 (0.91 vs. 0.89)] liver [3 cells; #59 (0.94 vs. 0.88), #70 (0.81 vs. 0.76), #82 (0.93 vs. 0.87)]

primary 2 54 4 primary 1 [4 cells; #23 (0.93 vs. 0.87), #41 (0.59 vs. 0.52) #48 (0.81 vs. 0.80), #52 (0.93 vs. 0.92)]

lymph node

82 1 primary 1 [1 cell; #56 (0.93 vs. 0.91)]

liver 64 0 -

adrenal 72 7 primary 1 [1 cell; #51 (0.40 vs. 0.38)]

liver [6 cells; #26 (0.98 vs. 0.96), #44 (0.95 vs. 0.92), #74 (0.93 vs. 0.91), #82 (0.74 vs. 0.73), #85 (0.94 vs. 0.92), #90 (0.95 vs. 0.92)]

(21)

Chapter 6

Referenties

GERELATEERDE DOCUMENTEN

peptide vaccination days: NKG2A relative

[r]

The author is not responsible for any losses which may result from the use or Distribution of this artwork as part of the xfig package, where xfig is part of a commercially

Om meer inzicht te krijgen in de aanwezigheid van aneuploïde cellen in het menselijk brein en de mogelijke rol van aneuploïdie in de ziekte van Alzheimer hebben we individuele cellen

Indeed, several recent reports on copy number variation and aneuploidy in normal brain cells using single cell sequencing have also emphasized the advantages of using whole

Tumor cell heterogeneity is important in the development and progression of cancer, and its detection using single cell sequencing will become a key diagnostic tool and play

(A) Micrograph of a pure population of epithelial PDAC cells, as obtained via contrast phase light microscopy (Original magnification 20×, scale bar = 50 μm).. (B) Micrographs of

(A) Western blot results show the expression of MMP-2 and MMP-9 proteins, both in the active (cleaved) and inactive (full-length) forms in PVA/G sponge, PEOT/PBT sponge and