• No results found

University of Groningen Exploring chaotic time series and phase spaces de Carvalho Pagliosa, Lucas

N/A
N/A
Protected

Academic year: 2021

Share "University of Groningen Exploring chaotic time series and phase spaces de Carvalho Pagliosa, Lucas"

Copied!
5
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

University of Groningen

Exploring chaotic time series and phase spaces

de Carvalho Pagliosa, Lucas

DOI:

10.33612/diss.117450127

IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document version below.

Document Version

Publisher's PDF, also known as Version of record

Publication date: 2020

Link to publication in University of Groningen/UMCG research database

Citation for published version (APA):

de Carvalho Pagliosa, L. (2020). Exploring chaotic time series and phase spaces: from dynamical systems to visual analytics. University of Groningen. https://doi.org/10.33612/diss.117450127

Copyright

Other than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons).

Take-down policy

If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.

Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons the number of authors shown on this cover page is limited to 10 maximum.

(2)

10

C O N C L U S I O N

We now summarize the main ndings on the research work de-scribed in this thesis, as follows.

Firstly, we have explored the relationship between entropy and the independence of phase states (Chapter 5). For this, we relied on the Statistical Learning Theory (SLT) framework (Section 5.2) to show some phase spaces (embedding with dierent parameters m and τ) led to better learning models after applying the same regression function. Further, as SLT requires the input space to be formed by independent-and-identically-distributed (i.i.d.) data, we empirically show that phase states have satised such an assump-tion after testifying generalizaassump-tion. For this study, it was assumed data to come from a controlled environment. Furthermore, we stud-ied the correlation of entropy with dierent types of embeddings, both numerically (Chapter 5) and visually (Chapter 8). Although we found that optimal embeddings do have a low entropy, the rela-tion was not one-to-one. For example, we found that very dierent embeddings from the known optimal one can also have low entropy levels. This raises the question of what is, actually, a good denition of an optimal embedding. Intuitively, one would say that determin-istic systems generally have well-designed structures in the phase space, as one state maps to a single other in the future. Conversely, stochastic processes tend to have phase states spread all over the phase space, such that such patterns are rarer to happen. However, as outlined above, we could not nd a one-to-one correspondence between the distance from an optimal embedding and the entropy level. This means, also, that using entropy as a criterion to com-pute the parameters of an optimal embedding is a very dicult, if even possible, task.

As part of our studies concerning chaos theory and dynamical systems, we wanted to prove the eectiveness of phase space mod-els. We have performed experimental analysis in the context of time series semi-supervised classication, where we have compared time series and phase space methods using self-learning methods (

Chap-ter 6). As shown there, our method based on state spaces yields

more accurate results than state-of-the-art methods. As such, this strengthens our belief that modeling dynamical systems via their (optimal or near-optimal) embeddings in phase space is a useful

proposition.

We next revisited the context of the SLT, aiming to come up with a set of criteria and methodology for qualitatively assessing

(3)

conclusion

how well learning is ensured by algorithms that attempt to detect concept drift in data streams (Chapter 7). We adapted the SLT to account for this more challenging type of data (as compared to time series drawn from a single phenomenon) and next evaluated sev-eral state-of-the-art concept-drift-detection algorithms. Saliently, our evaluation showed that no algorithm (from the studied ones) fully complies with our criteria, thus can ensure learning. This re-sult points out some limitations of existing concept drift detection methods (which should be overcome by future research), although it also provides a new theoretical framework to evaluate all such algorithms in a principled way.

On a more practical side, we examined the challenge of getting a visual insight into large high-dimensional datasets such as created by our time series. We proposed RadViz++, a visual exploration tool that overcomes several limitations of existing data visualiza-tion tools based on the radial metaphor (Chapter 8). We validated this tool on several real-world high-dimensional datasets. Next, we tried to visually explore (in another proposal) the set of phase spaces generated by various embeddings. Our exploration revealed several interesting (though, complex to interpret) patterns, and cor-related positively with the diculty of fully characterizing optimal embeddings by low entropy levels.

Finally, we turned back to the problem of estimating optimal em-beddings, and used articial neural networks to address this task

(Chapter 9). The proposed approach is simple to implement, fast to

execute, has only a few parameters, and yields values for the opti-mal embedding parameters m and τ which are very close to ground-truth values stated by the literature. Given these results, and the simplicity of our approach, we argue that exploring more rened deep-learning architectures is a promising way forward phase-space reconstruction.

Several future work is possible based on our results. From a theoretical viewpoint, it is interesting (and valuable) to further ex-plore the relationship between entropy and optimal embeddings. By dening what an optimal embedding is (which may be a problem-specic question to answer), we believe that stronger cor-relations with entropy can be found. This aspect can benet from rening and extending our phase-space visualizations to e.g., show not only how much two phase spaces are dierent, but what pre-cisely makes them dierent. Along the same lines, methods to vi-sually summarize large and complex trajectories in phase spaces are of interest. Also, ways to visualize trajectories in phase spaces having more than three dimensions, e.g., by the suitable use of dimensionality reduction, are an interesting extension to consider. At the practical endpoint, using dierent models fed by state-space

(4)

conclusion

features could improve the classication and prediction of time se-ries, better than with the current methods employed in this thesis.

(5)

Referenties

GERELATEERDE DOCUMENTEN

First, Takens' theorem stated nothing about the embedding pair (m, τ), only that a sucient phase space can be properly unfolded when the embedding dimension m is greater or equal

If the generating rule is guided by a stochastic process, e.g., a Normal distribution, the space has greater probability to be equally lled by states, leading the fractal dimension

The aim of selecting such systems was to create a consistent set of datasets that describes the behavior of dynamical systems for which ground truth is available in terms of

This chapter provided a detailed and, to our knowledge, comprehen- sive overview of existing methods designed to estimate the optimal embedding dimension m and time delay τ

There are several directions of possible future work based on the results presented in this chapter: i) adapting the joint probability distribution so one can rely on

Table 6: Case Study 3: Although positive and unlabeled series (espe- cially the ones generated from the sine function) present sim- ilar trends and recurrences, MDL-CRQA still

Complementarily, model g assumes that each data window may come from distinct but xed/unique probabil- ity distributions, so when this indicator function reports a drift, any

eral methods try to optimize anchor placement and how points are attracted to them ( Section 8.3.2 ). Yet, inconsistencies will eventu- ally occur, especially when the number