• No results found

Artificial neural network models for the evolution of assortative learning

N/A
N/A
Protected

Academic year: 2021

Share "Artificial neural network models for the evolution of assortative learning"

Copied!
2
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

University of Groningen

Artificial neural network models for the evolution of assortative learning Méndez Salinas, Emiliano

IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document version below.

Document Version Other version

Publication date: 2019

Link to publication in University of Groningen/UMCG research database

Citation for published version (APA):

Méndez Salinas, E. (2019). Artificial neural network models for the evolution of assortative learning. Poster session presented at Zoology 2019, Groningen, Netherlands.

Copyright

Other than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons).

Take-down policy

If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.

Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons the number of authors shown on this cover page is limited to 10 maximum.

(2)

λ – Vold

Box 2. The neural network approach

Associative learning is the process whereby an organism comes to associate one stimulus or event with other stimuli or events.

The Rescorla-Wagner Rule is arguably the most prominent model for explaining how the strength of these associations develop during learning.

Here we use evolving neural networks to address the question how natural

selection shapes associative learning and whether it will lead to learning patterns

that are similar to the Rescorla-Wagner Rule.

Input

from environment (internal and external)

Processing of information Output “Response” of the network. Vnew Hidden Nodes Input

Nodes OutputNode

Connection weights

- Evolve (inheritance + mutation) - Fixed during lifetime

Values change during lifetime depending on inputs and weights Values change during lifetime Vold λ

Figure 2. Comparison of the updating mediated by the evolved networks and the

Rescorla Wagner Rule. When the difference in estimates Vnew-Vold is plotted against the

difference λ-Vold between reward and old estimate, the Rescorla-Wagner Rule (gold)

produces a straight line with slope β. By plotting the same characteristics in one graph, the updating behaviour of different rules and networks can be compared.

Networks used in our simulations

Artificial Neural Network Models for the Evolution of Associative Learning

Trimmer et al. (2012, JTB. 302:39) approached the same question using genetic

algorithms and binary trees where learning rules of arbitrary complexity could evolve. We follow their framework but use the more realistic assumption that learning is mediated by a neural network. Their model can be conceptualized as bumblebees that sequentially sample flowers which can either have a nectar reward or not. Each time they experience reward (or not) they update their estimate V of

the probability that any given flower provides reward.

In our model, the updating of the probability estimate

V

in response to reward λ is not mediated by a learning rule, but by an artificial neural network (see Box 2).

The network has two input nodes (for the reward λ and the previous estimate of V, Vold) and one output node (whose value corresponds to the new estimate of V, Vnew). Information processing happens in-between and is governed by connections between nodes. Connections’ weights are genetically encoded and transmitted from parent to offspring (subject to small mutations). Individuals producing a good

estimate of the true probability of getting nectar have high fitness and thus produce

more offspring. In this way the population of networks evolves over the generations.

λ λ

λ λ

Emiliano Méndez Salinas*, Franjo Weissing, Magdalena Kozielska

MARM-group, Groningen Institute for Evolutionary Life Sciences, University of Groningen, The Netherlands

*Email contact: e.mendez.salinas@rug.nl

Introduction

Background and Model

In the context of Trimmer’s model, Rescorla-Wagner updating is given by:

Rescorla-Wagner Rule:

V

new

= V

old

+ β(λ – V

old

)

where λ is the reward (1 or 0) and β is the learning rate. The optimal value of β strongly reflects the number of learning events.

Trimmer et al. (2012) showed that the Rescorla-Wagner Rule readily evolves, even though there is a learning rule with better performance:

Optimal Rule:

V

new

= V

old

+ β(λ – 0.5)

Box 1. The ‘learning rule’ approach

Networks used in our simulations

Results

N1

N2

N3

N4

Main Findings and Conclusions

Network N1 evolves to behave and perform exactly as the Rescorla-Wagner Rule (Fig. 1 and Fig. 2).

Network N2 (N1 + constant bias) evolves to behave and perform as the

Optimal Rule (Fig. 1 and Fig. 2 at T3).

More complex networks (N3, N4) evolve more slowly than their simple counterparts (Fig. 1).

But these more complex networks do not perform better (Fig. 1), and they show the same updating behaviour as the simpler networks (Fig. 2 at times T2

and T3).

In line with Trimmer et al.’s results, networks that evolve to reach optimal performance, transiently behave and perform as the Rescorla-Wagner Rule. (In Fig. 2 N2 and N4 at times T1 and T2, respectively).

In a more demanding associative learning task, only some (even more complex) networks outperform the

Rescorla-Wagner Rule (networks and results not shown).

Es tima tio n Er ror

Figure 1. For the networks used, over the generations, the mean estimation error

(difference between the true and the estimated value of V) decreases and converges to an asymptotic value. Generations V ne w -V old T1 T2 T3 T1 T2 T3 Rescorla-Wagner Rule Optimal Rule Vold Vold Vold Vold Vnew Vnew Vnew Vnew

N1

N2

N3

N4

Referenties

GERELATEERDE DOCUMENTEN

Conclusions: Cavitation was shown to occur in UAI at clinically relevant ultrasonic power settings in both straight and curved canals but not around sonically oscillating

Dissimilar to the small loans subsample, the interaction variable shows a significant value at a 5% significance level in column (4) and (6) indicating that for

This thesis focuses on the release from proximity (RfP) as it was originally proposed by Clive Gamble and will attempt to establish a model with

Alternatively, if the governments neglect to do so, awareness needs to be raised for the impacts that the palm oil industry has on the environment and biodiversity, while the benefits

An earlier study of the author has indicated that the Burgers model precisely predicts the creep displacements of B270i glass in a wide molding temperature range, which corre-

Ook zijn monsters genomen van verschillende lagen veen in het Bargerveen in Drenthe waarin de populaties van bacteriën met behulp van moleculaire technieken zijn vastgesteld.. Er

Voor de ondiepe delen, voornamelijk bemonsterd door de Luctor, waren de dichtheden in het voorjaar voor veel soorten groter in het MVII gebied dan in de overige twee

In het noorden (sleuven 15-17) en oosten (sleuf 9) van het onderzoeksterrein bevinden zich enkele greppels die op basis van het verzamelde aardewerk kunnen