• No results found

Imitation dynamics in cournot games with heterogeneous players.

N/A
N/A
Protected

Academic year: 2021

Share "Imitation dynamics in cournot games with heterogeneous players."

Copied!
60
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

Imitation Dynamics in Cournot Games with

Heterogeneous Players

D. J. Lindeman (student number 5975417) October 28, 2013

Supervisors: Dr. M.I. Ochea and Prof. Dr. C.H. Hommes

A thesis submitted for the degree Master of Science in Econometrics

(2)

Master thesis in the field of Mathematical Economics University of Amsterdam

Faculty of Economics and Business Supervisor: Dr. M.I. Ochea

(3)

Abstract

This thesis is a study on the stability of the Cournot model under heterogeneous expectations. The underlying model is the one-shot Cournot game where all n-firms have to decide which heuristic to use to determine their production for the next pe-riod. Focus lays on the imitation heuristic in competition with the Cournot and/or rational heuristic. An evolutionary model is used where the fractions of firms that use a specific heuristic are endogenously updated. The endogenous updating cap-tures the idea that heuristics that perform better will be used more frequently. Five different models have been investigated analytically and for these models thresholds, in terms of the number of firms, on the stability have been derived. Moreover, the settings have been analyzed in terms of simulation. Main findings are that: (i) in the case when Cournot firms compete with imitators we found that the threshold on the number of firms that changes the system from stable to unstable is 7, (ii) in the case when rational firms compete with imitators, in the specific scenario of linear inverse demand and constant marginal cost, we found that the system is always stable regardless of the game and behavioural parameters. (iii) in the case when rational firms, Cournot firms and imitators compete we found that the stability de-pends on the evolutionary pressure and the the stability of the cheapest heuristic(s). When the cheapest behavioural rule is stable, the dynamics converge to a situation where most firms use this behavioural rule and all firms produce the Cournot-Nash equilibrium quantity. So having more information about the market does not neces-sarily lead to higher profits due to information costs. When the cheapest heuristic is unstable, complicated endogenous fluctuations may occur. In particular, when the evolutionary pressure is high or when the number of firms passes a certain threshold. Note that the nonlinearity causing this erratic behaviour comes from the endoge-nously updating of the fractions, because in our leading example the specifications were linear.

(4)

Contents

1 Introduction 1

2 Theoretical Framework 4

2.1 Production plans . . . 6

2.2 Instability threshold . . . 8

2.2.1 Cournot adjustment heuristic . . . 8

2.2.2 Imitation heuristic . . . 9

2.3 Population dynamics . . . 10

2.3.1 Discrete choice models - the Logit evolutionary dynamics . . . 11

3 Heterogeneity in behaviour in Cournot oligopolies 13 3.1 Cournot vs. Imitation firms . . . 13

3.1.1 Theory 1: Firms select their heuristic . . . 13

3.1.2 Theory 2: A large population game . . . 17

3.2 Rational vs. Imitation firms . . . 19

4 Evolutionary competition between two heuristics 23 4.1 Cournot versus Imitation firms . . . 24

4.2 Rational vs. Imitation firms . . . 30

5 Rational vs. Cournot vs. Imitation 36

6 Concluding Remarks 52

(5)

1

Introduction

Theocharis (1960) shows that, when firms compete on quantity using the Cournot (1838) adjustment process,1 the Cournot model becomes unstable if the number of firms in-creases. In fact, with linear demand and constant marginal costs, the Cournot-Nash equilibrium loses stability and bounded but perpetual oscillations arise already for a triopoly. For more than three firms oscillations grow unbounded, but they are limited once the non-negativity price and demand constraints bind. This is a remarkable result since unbounded oscillations is not what we encounter in practice.

Whereas Theocharis focused only on the Cournot adjustment process newer research extends to models of heterogeneous expectations.2 Hommes, Ochea and Tuinstra (2011) created a framework in which these heuristics compete in a quantity-setting. Each firm chooses a behavioural rule from a finite set of different rules, which are assumed to be commonly known. When making a choice concerning the behavioural rules, a firm takes the past performance of the rules, i.e., the past realized profit net of the cost associated with the behavioural rules to compare fitness. Both past performance and costs associated with the behavioural rules are publicly available. This implies that successful heuristics will continue to be used, while unsuccessful behavioural rules are dropped. This strategic behaviour thus causes the distribution of fractions of firms over a given set of behavioural rules to change per period.

Hommes, Ochea and Tuinstra (2011) focused on the Cournot heuristic in competi-tion with the Nash quantity or with racompeti-tional firms. Interestingly Huck, Normann and Oechssler (2002) discuss a linear Cournot oligopoly experiment with four firms. They do not find that quantities explode as the Theocharis (1960) model predicts, instead the time average quantities converge to the Cournot-Nash equilibrium quantity, although

1

Firms that display Cournot behaviour take the current period’s aggregate output of their competitors as a predictor for the next period competitors’ aggregate output and best-respond to that.

2

In models with heterogeneous expectations producers can have different heuristics to adjust their production.

(6)

there is substantial volatility around the Cournot-Nash equilibrium quantity. Huck, Normann and Oechssler (2002) find that a process where participants mix between the Cournot adjustment heuristic an imitating the previous period’s average quantity gives the best description of behaviour. Therefore we focus on competition of the imitation heuristic with the Cournot heuristic. Moreover, since classical economic theory assumes rationality, we investigate the dynamics in competition with this heuristic too. In total five models where imitators compete with Cournot and/or rational firms are investi-gated analytically.. The framework created by Hommes, Ochea and Tuinstra (2011) will gratefully be followed in order to do the analytics.

Our concern is, first of all, under what circumstances firms may want to switch between behavioural rules over time and second, once the Cournot-Nash equilibrium is reached whether all firms will keep producing the Cournot-Nash quantity or deviate.

Main findings are that, (i) in the case when Cournot firms compete with imitators that the threshold on the number of firms that changes the system from stable to unstable is 7, (ii) when rational firms compete with imitators, in the specific scenario of linear inverse demand and constant marginal cost, the system is always stable regardless of the game and behavioural parameters, (iii) in the case when rational firms, Cournot firms and imitators compete, the stability depends on the evolutionary pressure and the the stability of the cheapest heuristic(s). When the cheapest behavioural rule is stable, the dynamics converge to a situation where most firms use this behavioural rule and all firms produce the Cournot-Nash equilibrium quantity. So having more information about the market does not necessarily lead to higher profits due to information costs. In the case when the cheapest heuristic is unstable, complicated endogenous fluctuations may occur. In particular, when the evolutionary pressure is high or when the number of firms passes a certain threshold. Note that the nonlinearity causing this erratic behaviour comes from the endogenously updating of the fractions, because in our leading example the specifications were linear.

(7)

The remainder of this thesis is organized as follows, in Section 2 the theoretical framework is introduced, here the quantity and population dynamics will be explained extensively. In Section 3 the dynamics will be investigated under exogenous population dynamics whereas in Section 4 the stability of the system will be investigated under endogenous population dynamics. In the fifth Section the results of section four are combined and the stability of a system where rational, Cournot and imitators compete in one economy under endogenous fraction dynamics is investigated. Finally, we conclude in Section 6.

(8)

2

Theoretical Framework

Consider a finite population of firms who are competing on the market for a certain good, each discrete-time period all producers have to decide their production plans for the next period. However, instead of simultaneously choosing the supplied quantities directly, the firms act according to behavioural rules that exactly prescribe the quantity to be supplied. Before the evolutionary model is studied a brief review of the traditional, static Cournot model will be given.

Consider a symmetric Cournot oligopoly game, where qi denotes the quantity

sup-plied by firm i, where i = 1, ...n. Next to that let Q = Pn

i=1qi be the aggregated

production. Furthermore let P (Q) denote the twice differentiable, nonnegative and non-increasing inverse demand function and let C(qi) denote the twice differentiable

non-decreasing cost function, which is the same for all firms. For firm i the resulting profit function from the above described model is given by

πi(qi, Q−i) = P (qi+ Q−i)qi− C(qi), i = 1, ...n (1)

where Q−i=Pj6=iqj. Assume that the profit function of a firm is strictly concave in its

own output qi. The profit maximizing strategy of firm i, taking the quantity supplied by

the competitors as given, results in the well-known best-reply function for firm i, which is given by

qi = Ri(Q−i) = Argmax qi

[P (qi+ Q−i)qi− C(qi)].

Due to symmetry, all firms have the same best-reply function R(·). Moreover, the symmetric Cournot-Nash equilibrium quantity q∗ corresponds to the solution of

q∗ = R((n − 1)q∗).

(9)

exists. For simplicity assume that q∗ is the unique symmetric Cournot-Nash equilibrium strategy.3

In this thesis focus lays on the following specification of the Cournot oligopoly game which will be called the leading example. This is the original specification Theocharis (1960) used, where inverse demand is linear and marginal costs are constant. The inverse demand and cost function are given by

P (qi+ Q−i) = a − b(qi+ Q−i) and C(qi) = cqi, i = 1, ...n

respectively. First, in order to have a strictly concave profit function assume that b > 0. Furthermore, for strictly positive prices assume that Q < ab. For these specifications of the inverse demand function and cost function the reaction function is given by

qi = R(Q−i) = a − c 2b − 1 2Q−i= q ∗1 2(Q−i− (n − 1)q ∗ ). (2)

Note that if the other firms produce on average more (less) than the Cournot-Nash equilibrium quantity, firm i reacts by producing less (more) than that quantity.

Straightforward calculations show that in this case the Cournot-Nash equilibrium quantity, aggregated production, price and profit are equal to q∗ = b(n+1)a−c , Q∗ = a−cb n+1n , P∗= a+ncn+1 and π∗ = π(qi∗, Q∗−i) = b(n+1)(a−c)22.

Traditional Cournot analysis refers to a static environment. However, in a dy-namic setting the reaction function introduced above can be used to study the so called Cournot-dynamics where firms best-reply to their expectations

qi,t = R(Qe−i,t), i = 1, ...n

3

The Cournot duopoly game may also have asymmetric Cournot-Nash equilibria, but they do not correspond to equilibria of the evolutionary game when there is a single population. For the linear-quadratic specification of the Cournot oligopoly model specified below, there can indeed be asymmetric boundary equilibria, but they do not influence the dynamics of the evolutionary model.

(10)

where qi,tdenotes the quantity supplied by player i in period t. The symmetric

Cournot-Nash equilibrium where all firms produce q∗ is stable under the Cournot-dynamics if (n − 1)|R0((n − 1)q∗)| < 1.

Main interest is on how firm i decides to play q∗ and on top of that, what does firm i believe about Q−i when the production decision has to be made.

In the next Subsection the description of the quantity dynamics will be given. In Subsection 2.2 some local instability results for the general evolutionary system are discussed. In Subsection 2.3 the population dynamics will be discussed.

2.1 Production plans

In the Cournot oligopoly game the producers have to form expectations about opponents’ production plans. Based on this expectation firms decide how much to produce the next period. One approach is to assume complete information, i.e. rational firms with common knowledge of rationality. This implies that firms have perfect foresight about competitors’ aggregated production plan, i.e. Qe−i,t+1 = Q−i,t+1. This results in the

following production plan:

qi,t+1= R(Q−i,t+1), i = 1, ...n

Alternatively one may consider rules that require less information, for example Qe

−i,t+1=

Q−i,t. This results in the following production plan:

qi,t+1= R(Q−i,t), i = 1, ...n (3)

where firms expect that aggregated production in the next period equals current aggre-gated production. This is the so called Cournot adjustment heuristic.

It is a broadly supported idea that not all producers best-reply to their expectations. Experiments (Huck 2002) show that people often imitate others’ behaviour. A heuristic

(11)

that possibly seizes this production plan is the so called imitation-heuristic. Imitators belief that “everyone else can’t be wrong” and will therefore produce the average of the other players’ production in the next period, i.e.

qi,t+1=

Q−i,t

n − 1, i = 1, ..., n. (4)

Finally, Bosch-Dom`enech and Vriend (2003) test the importance of models of behaviour characterised by imitation of successful behaviour, that is to imitate the quantity which the firm with the highest profit in the current period produced, i.e.

qi,t+1= qj,t, i = 1, ...n, where Πj,t= M ax{Π1,t, ..., Πk,t}.

They find that the players do not rely more on imitation of successful behaviour in more demanding environments and explain the different output decisions as predominantly relate to a general disorientation of the players, and more specifically to a significant decrease of best responses.

In the next subsection we will investigate the dynamics under expectation rule (3) and (4) in greater detail.

(12)

2.2 Instability threshold

2.2.1 Cournot adjustment heuristic

If all firms use the Cournot adjustment heuristic (3), quantities evolve according to the following system of n first order difference equations

q1,t+1= R(q2,t+ q3,t+ . . . + qn,t), q2,t+1= R(q1,t+ q3,t+ . . . + qn,t), .. . qn,t+1= R(q2,t+ q3,t+ . . . + qn,t). (5)

Local stability of the Cournot-Nash equilibrium depends on the eigenvalues of the Jaco-bian matrix J of the system of equations (5), evaluated at that Cournot-Nash equilibrium q∗. This Jacobian matrix is given by

J |q∗ =          0 R0(Q∗−1) · · · R0(Q∗−1) R0(Q∗−2) 0 ... .. . . .. R0  Q∗−(n−1)  R0(Q∗−n) · · · R0(Q∗−n) 0          . (6)

Firms do not respond to their own previous production, therefore all diagonal elements are equal to zero. All off-diagonal elements in row i are equal to R0(Q∗−i), since individual production levels only enter through aggregate production of the other firms. Moreover, at the symmetric Cournot-Nash equilibrium we have Q∗−i = (n − 1)q∗ for i = 1, ..., n, thus all off-diagonal elements of (6) are equal to R0((n − 1)q∗). The Jacobian matrix (6) thus has n − 1 eigenvalues equal to −R0((n − 1)q∗) and one eigenvalue equal to (n − 1)R0((n − 1)q∗), which is the largest in absolute value. From this it follows directly

(13)

that the symmetric Cournot-Nash equilibrium is stable whenever

λ(n) ≡ (n − 1)|R0((n − 1)q∗)| < 1, (7)

where λ(n) is defined as the largest eigenvalue of the Jacobian, evaluated at the equilib-rium.

Leading example. From equation (2) it can easily be seen that R0(Q∗−i) = −12, meaning that if others’ aggregated output increases by one unit, the Cournot-Nash firms decrease their output by 12 units. From stability condition (7) it follows that the Cournot-Nash equilibrium is stable for this specification only when n = 2 and unstable when n > 3 (and neutrally stable, resulting in bounded oscillations, for n = 3). The reason for this instability is ‘overshooting’: if aggregated output is above (below) the Cournot-Nash equilibrium quantity, firms react by reducing (increasing) their output. For n > 3 this aggregated reduction (increase) in output is so large that the resulting deviation of ag-gregated output from the equilibrium quantity is larger in the next period than in the current, and so on.

2.2.2 Imitation heuristic

If all firms use the imitation heuristic (4), quantities evolve according to the following system of n equations q1,t+1= Q−1,t n − 1, q2,t+1= Q−2,t n − 1, .. . qn,t+1= Q−n,t n − 1. (8)

(14)

Local stability of the Cournot-Nash equilibrium with only imitation firms depends on the eigenvalues of the Jacobian matrix of the system of equations (8) evaluated at that Cournot-Nash equilibrium q∗. This Jacobian matrix is given by

J |q∗ =          0 n−11 · · · n−11 1 n−1 0 ... .. . . .. n−11 1 n−1 · · · 1 n−1 0          . (9)

Imitators only respond to other firms’ production and do not respond to their own production, therefore all diagonal elements are equal to zero. If one competitor increases current production by one unit, an imitator will increase next production with n−11 unit, therefore all off-diagonal elements are equal to n−11 . The Jacobian matrix (6) thus has n − 1 eigenvalues equal to −n−11 and one eigenvalue equal to (n − 1)n−11 = 1 which is the largest in absolute value. Therefore it follows immediately that the Cournot-Nash equilibrium is neutrally stable independent of n and system structure (price and cost function). The reason for this is that if one producer changes his production plan the economy will stabilize to a new equilibrium unequal to q∗ and will remain at this new equilibrium until one producer deviates again. In fact this system has infinitely many neutrally stable equilibria, namely if qi= q†∀i, the system is neutrally stable for all q†.

2.3 Population dynamics

In the previous sections it is explained how the supplied quantities evolve over time under the Cournot and the imitation heuristic. In this section it will be explained how the population fractions evolve over time. Let us first introduce the vector ηt which

has entries equal to ηk,t, which is the fraction of the population that uses heuristic k

at time t. Thus for every time t, ηt denotes the K-dimensional vector of fractions for

(15)

PK

k=1ηk,t= 1; 0 ≤ ηk,t ≤ 1 ∀k}. We will now describe how the fractions ηk,t evolve over

time. It is assumed that the choice of a behavioural rule is based on its past performance, capturing the idea that more successful rules will be used more frequently.

Evolutionary game theory deals with games played within a (large) population over a long time horizon. Its main ingredients are its underlying game, in this thesis the Cournot one-shot game, and the evolutionary dynamic class which defines a dynamical system on the state of the population. The evolutionary dynamical system depends on current fractions ηt and current fitness Ut. In general, such an evolutionary dynamic in

discrete time, describing how the population fractions evolve, is given by

ηk,t+1 = K(Ut, ηt) (10)

with Ut = (U1,t, ..., UK,t)0 the vector of average utilities and ηt = (η1,t, ..., ηK,t)0 the

factor of fractions. To make sure that the population dynamics is well-behaved in terms of dynamic implications we assume that K(·, ·) is continuous, nondecreasing in Uk,t, and

such that the population state remains in the K-dimensional unit simplex ∆K. In the next Subsection leading class of population dynamics will be explained in detail, the Logit evolutionary dynamics.

2.3.1 Discrete choice models - the Logit evolutionary dynamics

The Logit evolutionary dynamic is treated extensively in Brock and Hommes (1997). This Section contains a brief discussion.

In order to update the fractions we assume that average utility of all heuristics is publicly observable. Suppose that the observed average utility associated behavioural rule Hk takes the form

˜

Uk= Uk+

1 βk,

(16)

necessarily select the rule that yields the highest utility. The parameter β represents the evolutionary pressure. Notice that in the extreme case where β = 0 we have completely random behaviour: the noise is so large that observed average utility is equal for all behavioural rules. Each behavioural rule is thus chosen with equal probability: ηk,t =

1

K ∀k. In the other extreme case, when β → ∞ obscures and everybody switches to the

most profitable strategy each period. If the noise terms k’s are distributed according to

the extreme value distribution the evolutionary fraction dynamic results in the so-called multinomial Logit evolutionary dynamic, the following updating dynamic is given by

ηk,t+1= eβUk,t K X j=1 eβUj,t , k = 1, ..., K. (11)

The equilibrium fractions are given by

ηk,t+1 = eβ(Π∗−Tk) K X j=1 eβ(Π∗−Tj) , k = 1, ..., K (12)

In case of equal costs of the heuristics, equilibrium fractions are thus given by ηk∗= K1 ∀k, since production is equal and thus profits are equal. Note that the population dynamics remains in the interior of the unit simplex for finite β. This implies that in each time period all behaviour rules are present in the population and no behavioural rule will ever vanish (this is the so-called no-extinction condition). Furthermore, no new behavioural rules emerge from this model (this is the so-called no-creation condition).

In the leading examples we will focus on the Logit evolutionary dynamics. First of all because this dynamic is also used in Hommes, Ochea and Tuinstra (2011) and therefore creates the possibility to make a good comparison and furthermore because the Logit evolutionary dynamic has by definition nice regularity/continuity conditions (0 ≤ ηk ≤ 1).

(17)

3

Heterogeneity in behaviour in Cournot oligopolies

In this Section we study the Cournot game and introduce heterogeneity in production plans. In this Section we focus on competition between two heuristics. We relax this in Section five, where we study the competition between rational, Cournot and imitation firms. First we study competition between the Cournot and the imitation firms, with this as an example, two theories will be presented on how to model this heterogeneity in production. In the first theory the firms select their heuristic that completely describes how much to supply in the next period. They select heuristic k with probability ηk. In

the second theory n firms are randomly picked from a large population of firms in which a fraction ηk plays according to strategy k. Main difference is that the firms observe

under the second theory more outcomes and thus under the law of large numbers lets the production plans within a heuristic converge whereas in the first theory all firms (even the firms using the same heuristic) have different production plans, making the dynamics analytically untractable. After this extensive study of competition between Cournot firms and imitators, we introduce another model where rational firms compete with imitation firms. Since the dynamics are only tractable under theory 2, we will focus on this theory when studying this model. The assumption of fixed η for each period will be relaxed in section 4.

3.1 Cournot vs. Imitation firms

3.1.1 Theory 1: Firms select their heuristic

Suppose that each firm has to select each period which heuristic it will use to determine its production plan. This selection goes by the K-dimensional4 vector η. Remember that

PK

k=1ηk = 1 and 0 ≤ ηk ≤ 1 for k = 1, ..., n. We assume that with a fixed probability

ηk a firm will choose heuristic k. After each one-shot Cournot game firms ‘select’ again

4

Note that we actually only need the first K − 1 elements of the vector η because the last element is automatically defined by ηk= 1 −PK−1k=1 ηk, for notational convenience we will keep η in its full form.

(18)

which heuristic to use in the next period. Below a stylized example with heterogeneous heuristics will be discussed: Cournot vs. Imitation firms.

Suppose that with chance η a firm chooses the Cournot heuristic and in the next period best-replies to the other players’ current production qi,t+1 = R(Q−i,t).

Con-sequently with chance 1 − η a firm chooses the imitation heuristic and produces the average quantity it observes in the next period qi,t+1=

Q−i,t

n−1 . This leads to the following

stochastic system of equations.

q1,t+1=      R(Q−1,t) with chance η Q−1,t n−1 with chance 1 − η , q2,t+1=      R(Q−2,t) with chance η Q−2,t n−1 with chance 1 − η , .. . qn,t+1=      R(Q−n,t) with chance η Q−n,t n−1 with chance 1 − η . (13)

To determine the stability of this system at the Cournot-Nash equilibrium, the system has to be transformed into a deterministic system in order to be able to calculate the Jacobian. Suppose there are c firms using the Cournot heuristic, leaving n − c firms using the imitator heuristic. Without loss of generality the order of the equations can be changed such that the first c equations represent the Cournot-firms’ production plans, and the last n−c equations the imitators’ productions plans. This results in the following

(19)

deterministic system of equations q1,t+1= R(Q−1,t), .. . qc,t+1= R(Q−c,t), qc+1,t+1= Q−(c+1),t n − 1 .. . qn,t+1= Qn,t n − 1. (14)

The Jacobian of system (14) is given by

J |q∗ =                    0 R0(Q∗−1) · · · R0(Q∗−1) R0(Q∗−2) 0 · · · R0(Q∗−2) .. . . .. ... R0(Q∗−c) · · · R0(Q∗−c) 0 · · · R0(Q∗−c) 1 n−1 · · · 1 n−1 0 · · · 1 n−1 .. . . .. ... 1 n−1 · · · 1 n−1 0                    . (15)

The Jacobian of system (14) has complex eigenvalues which depend on n, c, and R0(Q∗−i), the system is stable if all eigenvalues are inside the unit circle. No analytical expression has been found to express under which circumstances the system is stable. The effects of changing n or c are clearly visible in figure 1, namely increasing the number of imita-tors in an economy stabilizes the system while increasing the number of Cournot firms unstabilizes the system, moreover increase in |R0((n − 1)q∗)| decreases the stability. Leading example. In figure 1 we see when the system is stable. The length of the largest eigenvalues (√Im2+ Re2where Im is the imaginary part of the largest eigenvalue

(20)

of n and c, for R0(Q∗−i) = −12. 0 2 4 6 8 10 2 3 4 5 6 7 8 9 10 11 0 1 2 3 4 Cournot firms Firms

absolute Largest eigenvalue

Figure 1: Stability of the model under theory 1 when Cournot firms compete with imita-tors.

Figure 1 confirms the earlier findings that an economy with only imitators, i.e. zero Cournot firms, is neutrally stable because the largest eigenvalue of the system evaluated at the Cournot-Nash equilibrium is one. It also shows that an economy with only Cournot firms is stable for n = 2, neutrally stable for n = 3 and unstable for n > 3. More interesting is the fact that an increase in the number of imitation firms stabilizes the economy. An economy with three Cournot players is neutrally stable but after entry of an imitator, this 4-firm economy becomes stable. Next to that an economy with four Cournot firms is unstable, but after entry of two imitators, this 6-firm economy becomes stable. For an economy with five Cournot firms we need at minimum five imitating firms to stabilize. If an economy has six or more Cournot firms an unreasonable number of imitators have to enter this economy to stabilize the Cournot behaviour.

(21)

compete with different firms. This creates a disadvantage because now the quantity dynamics is described by a system of n-equations. In the next section a theory will be explained in which the quantity dynamics (irrespective of the number of firms) can be expressed in a system of K-equations, the number of heuristics.

3.1.2 Theory 2: A large population game

In order to facilitate studying the aggregate behaviour of a heterogeneous set of inter-acting quantity-setting-heuristics we study the Cournot model as a population game. Consider a large population of firms from which in each period groups of n firms are sampled randomly and matched to play the one-shot n-player Cournot game. We assume that a fixed fraction of η of the large population of firms uses the Cournot heuristic and the others use the imitation heuristic. After each one-shot Cournot game, the random matching procedure is repeated, leading to new combinations types of firms. The dis-tribution of possible samples follows a binomial disdis-tribution with parameters n, and η. Below the example Cournot vs. Imitation firms will be discussed again but now under theory 2 of random matching.

Suppose that a fraction of η of the population of the firms uses the Cournot heuristic and observes the population-wide average quantity ¯qt and best responds to it, qt+1C =

R((n − 1)¯qt), where qCt is the quantity produced by each Cournot firm in period t.

Consequently a fraction of η firms of the large population makes use of the the imitation heuristic. Making use of the law of large numbers, the average quantity played in period t can be expressed as

¯

qt= ηqtC+ (1 − η)qIt.

Remember that imitation firms produce in the next period the by the other firms average produced quantity in the current period qIi,t+1= Q−i,t

(22)

we obtain Q−i,t

n−1 → ¯qtwhen n → ∞. Therefore we obtain the following quantity dynamics

qCt+1= R((n − 1)(ηqtC+ (1 − η)qtI)),

qIt+1= ηqtC+ (1 − η)qtI.

(16)

Note that this is a 2-dimensional dynamical system which dimension cannot be reduced. Furthermore the Cournot-Nash equilibrium is not the unique equilibrium of the imitation rule, in fact all quantities are. The Cournot-Nash equilibrium is, however, still the unique equilibrium quantity of the complete dynamical system.

Proposition 1 The Nash equilibrium,where all firms produce the Cournot-Nash quantity (q∗, q∗), is a locally stable fixed point for the model with exogenous fractions of Cournot and imitation firms if and only if

|1 − η + η(n − 1)(R0((n − 1)q∗)| < 1. (17)

Proof. It can easily be shown that the Jacobian matrix, evaluated at the Cournot-Nash equilibrium (q∗, q∗), is given by    (n − 1)ηR0((n − 1)q∗) (n − 1)(1 − η)R0((n − 1)q∗) η 1 − η   . (18)

The corresponding eigenvalues are λ1 = 0 and λ2 = 1 − η + η(n − 1)(R0((n − 1)q∗). Here

λ2 is the largest eigenvalue in absolute value. Thus the system is stable if |λ2| < 1, this

is the condition stated in the proposition.

Leading example. Here R0((n − 1)q∗) = −12 substituting this in equation (17) gives, after some simplification

n < 4 − η

η . (19)

(23)

n < 7. Next to that as found earlier, an economy with only cournot firms (η = 1) is stable if n < 3. Furthermore, an economy where close to all firms use the imitation heuristic, but some Cournot firms exist (η close to zero), the economy is always stable.

3.2 Rational vs. Imitation firms

In this section we focus on the dynamics when there is competition between rational and imitation firms. Remember that we will model this heterogeneity under theory 2 since this makes the dynamics analytically tractable. We set the fraction of rational firms equal to η. A fully rational firm is assumed to know the fraction of imitation firms. Moreover, it knows exactly how much all firms will produce. However, we assume that it does not know the composition of firms in its market (or has to make a production decision before observing this). The rational quantity dynamics therefore have the following structure

qR= Argmax

qi

E[P (qi+ Q−i)qi− C(qi)].

It forms expectations over all possible mixtures of heuristics resulting from randomly drawing n − 1 other players from a large population, of which each with chance η is a rational firm too, and with chance 1 − η is an imitator. Rational firm i therefore chooses quantity qi such that his objective function, its own expected utility

UtR(qi,t|qtR, qIt, η) = n−1 X k=0  n − 1 k 

ηk(1 − η)n−1−k[P ((n−1−k)qIt+kqRt +qt,i)qt,i−C(qt,i)],

(20) is maximized given the production of the other players and the population fractions. Here qtR is the symmetric output level of all of the other rational firms in period t, and qIt is the output level of all of the imitation firms. The first order condition for an optimum is characterized by equality between marginal cost an expected marginal revenue. Typically, marginal revenue in the realized market will differ from marginal

(24)

costs.

Given the value of qtI and the fraction η, all rational firms coordinate on the same output level qR

t . This gives the first order condition

δUtR(qi,t|qtR, qIt, η)

δqi,t

= 0,

which equals to:

n−1 X k=0  n − 1 k  ηk(1 − η)n−1−k× [P ((n − 1 − k)qIt+(k + 1)qtR) + qtRP0((n − 1 − k)qtI+ (k + 1)qtR) − C0(qRt)] = 0. (21)

Let the solution to equation (21) be given by qRt = HR(qtI, η), the full system of equations is thus given by

qt+1R = HR(qIt+1, η) = HR(ηqtR+ (1 − η)qIt, η)

qt+1I = ηqRt + (1 − η)qtI.

(22)

It is easily checked that if the imitators play the Cournot-Nash equilibrium quantity q∗, or if all firms are rational, the rational firms will play the Cournot-Nash equilibrium quantity, that is HR(q∗, η) = q∗, for all η and HR(qI, 1) = q∗ for all qI. Moreover, if a rational firm is certain it will only meet imitation firms (that is η = 0), it plays a best response to the currently average played quantity, that is HR(qtI, 0) = R((n − 1)qtI), for all qtI. In the remainder we will denote the partial derivatives of HR(q, η) with respect to q and η by HR

q(q, η) and HηR(q, η) respectively.

Proposition 2 The Nash equilibrium, where all firms produce the Cournot-Nash quantity (q∗, q∗), is a locally stable fixed point for the model with exogenous fractions

(25)

of rational and imitation firms if and only if

|ηHq(q∗, η) + 1 − η| < 1 (23)

Proof. In order to determine the local stability of the equilibrium (q∗, q∗) where all firms produce the Cournot-Nash quantity, we need to determine the eigenvalues of the Jacobian matrix of system (22), evaluated at the equilibrium. It can be shown that this Jacobian matrix is given by

J |q,q∗ =    ηHq(q∗, η) (1 − η)Hq(q∗, η) η 1 − η   , (24)

which has eigenvalues λ1 = ηHq(q∗, η) + 1 − η and λ2 = 0. Consequently the system is

locally stable when |λ1| < 1, this is exactly the condition stated.

Leading example. In the leading example the implicit function defining qtR(Eq. (21)) when using that

n−1 X k=0  n − 1 k  ηk(1 − η)n−1−k = 1 and n−1 X k=0  n − 1 k  ηk(1 − η)n−1−kk = (n − 1)η boils down to qt+1R = HR(qt+1I , η) = a − c b(2 + (n − 1)η) − (n − 1)(1 − η) 2 + (n − 1)η (ηq R t + (1 − η)qtI).

The system of equations for the leading example is given by

qRt+1= HR(qt+1I , η) = a − c b(2 + (n − 1)η) − (n − 1)(1 − η) 2 + (n − 1)η (ηq R t + (1 − η)qtI) qIt+1= ηqtR+ (1 − η)qtI (25)

(26)

the system is stable if |λ2| < 1. Since 0 ≤(n−1)(1−η)η2+(n−1)η < 1, this stability condition always

holds and the economy is always stable in the linear specification. In figure 2 this is graphically shown.

3 4 5 6 7 8 9 10 11 0 0.2 0.4 0.6 0.8 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 Firms Fraction Largest eigenvalue

Figure 2: Largest eigenvalue for the model rational vs. imitation firms. As one can see the largest eigenvalue decreases when the number of firms increases and when the fraction of rational players increases. Since an economy consisting of only imitation firms is neutrally stable, this model is stable for all combinations of η and n.

(27)

4

Evolutionary competition between two heuristics

In this Section we develop an evolutionary version of the model outlined in Section 3, i.e. relaxing the assumption that η is fixed. As before in ever period t, n firms play the n-player Cournot game. We now assume that the fractions of firms using a heuristic η evolves over time according to a general monotone selection dynamic, capturing the idea that heuristics that perform relatively better are more likely to spread through the population as explained in Section 2.3, Eq. (10), here it is explained that future fractions depend on current fractions and current fitness.

Under the assumption of random interactions, the fitness of heuristic k is determined by averaging the payoffs from from each interaction with weights given by the chance of that specific state minus the information cost of using the heuristic. Denoting with Πt the expected payoff vector in period t, its entries - individual payoff or fitness in

biological terms - of strategy 1 is given by:

Π1,t = F (q1,t, q2,t, ηt) = n−1 X k=0 (n − 1)! k!(n − 1 − k)!η k t(1 − ηt)n−1−kP ((k + 1)q1,t+ (n − 1 − k)q2,t)q1,t− C(q1,t), (26)

and with expected profits for heuristic 2 given by Π2 = F (q2, q1, 1 − η). If the population

of firms and the number of groups of n firms drawn from that population are large enough, average profits will be approximated well by these expected profits, which we will use therefore as a proxy for average profits from now on.

There might be a substantial difference in sophistication between different heuristics. As a consequence some heuristics may require more information or effort to implement than others. Therefore we allow for the possibility that heuristics involve information cost Ck ≥ 0, that may differ across heuristics. Fitness of a heuristic is then given by

(28)

We only use the realized profit to determine the fitness measure of a behavioural rule. The fitness measure can be generalized by weighting the utility of the past M periods, according to Tuinstra (1999) this yields similar results. We assume that the above fitness measures Uk are publicly observable.

Having the fitness measure we are ready to introduce the population dynamics. Let the fraction of firms using the first heuristic be given by η in period t. This frac-tion evolves endogenously according to an evolufrac-tionary dynamic which is an increasing function in the difference between the current fitness of the two heuristics and current fraction, that is

ηt+1 = K(U1,t− U2,t) = K(∆U1,t).

The map K : R → [0, 1] is a continuously differentiable, monotonically increasing func-tion with K(0) = 12, K(x) + K(−x) = 1, meaning that it is symmetric around x = 0, limx→−∞K(x) = 0 and limx→∞K(x) = 1

In the following two sections we will derive two dynamical versions of the two models discussed in Section 3 and investigate their stability. First we investigate the stability of the Cournot-Nash equilibrium for the model with endogenous fractions of Cournot and imitation firms and second we investigate the stability of the Cournot-Nash equilibrium for the model with endogenous fractions of rational and imitation firms.

4.1 Cournot versus Imitation firms

The dynamics in this section consists of three equations, two equations describing the quantity dynamics: the production of the Cournot firms and the production of the imita-tion firms. Next to that we need one equaimita-tion to describe the dynamics of the populaimita-tion fraction. The population and quantity dynamics look like the following system of three equations:

(29)

qt+1C = R((n − 1)(ηtqCt + (1 − ηt)qIt)),

qt+1I = ηtqCt + (1 − ηt)qIt

ηt+1 = K(∆Ut),

(27)

where ∆Ut = UC,t − UI,t. Note that this is a 3-dimensional dynamical system which

dimensions cannot be reduced. Furthermore, the Cournot-Nash equilibrium quantity q∗ is the unique equilibrium quantity of the complete dynamical system. Let η∗ be the unique equilibrium fraction such that η∗ = K(−C). Without specializing the population dynamics K(·) we have the result as stated in the proposition below.

Proposition 3 The Cournot-Nash equilibrium (q∗, q∗, η∗) is a locally stable fixed point for the model with endogenous fractions of Cournot and imitators where all firms produce the Cournot-Nash quantity, firms if and only if

η∗R((n − 1)q∗)(n − 1) − η∗ > −2. (28)

Proof. It can easily be shown that the Jacobian matrix of system 27, evaluated at the equilibrium (q∗, q∗, η∗) is given by J |q,q∗ =       (n − 1)η∗R0((n − 1)q∗) (n − 1)(1 − η∗)R0((n − 1)q∗) 0 η∗ 1 − η∗ 0 J31 J32 δK(∆Uδ t) ηt q∗,q∗       . (29) The eigenvalues of this Jacobian matrix are, independently of J31 and J32 given by

λ1 = η∗R((n − 1)q∗)(n − 1) + η∗− 1, λ2 = δK(∆Ut) δηt q∗,q∗ and λ3= 0. (30)

(30)

To our best knowledge of possible population dynamics δK(∆Ut)

δηt is positive but smaller

than 1. This holds for all population dynamics discussed in Section 2.3. Therefore, for the system to be stable we need

η∗R((n − 1)q∗)(n − 1) − η∗ > −2,

which is exactly the condition stated in the proposition.

Note that this is the same condition we derived in Section 3.1.2 where we fixed η. Leading example. In the equilibrium, when all firms produce the same quantity, prof-its are equal and therefore the equilibrium fraction simplifies to η∗ = K(−C). The equilibrium quantities are given by q∗. Here R0((n − 1)q∗) = −12, filling this in equa-tion (28) gives the stability condiequa-tion for the leading example. Thus the equilibrium (q∗, q∗, η∗) is stable when n < 4−ηη∗∗.

In Figure 3 the model is simulated under Logit-dynamics with intensity of choice parameter β, see Brock and Hommes (1997). Panel (a) depicts a period-doubling route to chaotic quantity dynamics as the number of firms n increases. The first period-doubling bifurcation is for n = 7 as calculated analytically. Panel (b) displays oscillating time series of produced quantity by the Cournot and imitation firms and the equilibrium quantity fraction q∗. As one can see the Cournot quantities are fluctuating more than the imitation quantities. The stabilizing effect of the imitation firms is here clearly visible, when Cournot firms produce more (less) then the Cournot-Nash equilibrium quantity, the imitation firms produce less (more) than the Cournot-Nash equilibrium quantity and therefore decrease the aggregated deviation from the equilibrium. Panel (c) displays the resulting Cournot profit differential ΠC− ΠI. Panel (d) displays the resulting oscillating

time series of the Cournot and imitation fractions. In Panel (e) a phase portrait is shown for the Cournot heuristic whereas in Panel (f) a phase portrait for the imitation heuristic is shown. In Panel (g) the largest Lyapunov exponent for an increasing number

(31)

of firms is shown and in Panel (h) the largest Lyapunov exponent is shown for increasing β. Game and behavioural parameters are equal set to: n = 10, a = 17, b = 1, c = 1, CC = 0, CI = 0, β = 0.05. Initial conditions are set equal to: qC

0 = 0.8, q0I = 0.8,

(32)

0 1 2 3 4 5 2 4 6 8 10 12 14 q I n

(a) Bifurcation diagram (qt, n)

0 0.5 1 1.5 2 2.5 3 3.5 4 10 20 30 40 50 60 qt time qC qI q*

(b) Time path of Cournot and imitiation quantities -15 -10 -5 0 5 10 20 30 40 50 60 Profit difference time

(c) Cournot profit differential

0.3 0.35 0.4 0.45 0.5 0.55 0.6 0.65 0.7 10 20 30 40 50 60 fraction time Cournot Imitation

(d) Time path Cournot fraction

0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 fraction qC

(e) Cournot phase plot

0.45 0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.6 0.8 1 1.2 1.4 1.6 1.8 2 fraction qI

(f) Imitation phase plot

-5 -4 -3 -2 -1 0 2 4 6 8 10 12 14

Largest Lyapunov Exponent

n

(g) Largest lyapunov Exponent

-0.2 -0.1 0 0.1 0.2 0.3 0 0.5 1 1.5 2 2.5 3 3.5 4

Largest Lyapunov Exponent

Beta

(33)

When the evolutionary pressure increases, the system evolves to an equilibrium dif-ferent from the Cournot-Nash equilibrium where the imitation firms produce more than the Cournot-Nash equilibrium whereas the Cournot firms produce less. Imitation profits are therefore much higher and as a consequence the complete population switches to the imitation heuristic.

The bifurcation diagram is plotted below once more under the same game and be-havioural parameters and initial conditions, the only difference is that now β = 3.

0

1

2

3

4

5

6

7

8

1

2

3

4

5

6

n

Figure 4: Bifurcation diagram (qt, n) with β = 3

When 1.7 < n < 2.8 the imitation firms produce more then the Cournot-Nash equilib-rium quantity while the Cournot firms produce less. This results in higher profits for the imitators and therefore the complete populations switches to imitators (η = 0). When 2.8 ≤ n ≤ 3.2 all firms produce the Cournot-Nash equilibrium quantity again, therefore profits and thus fractions are equal. When n > 3.2 The imitation firms produce again more then the equilibrium quantity while the Cournot firms produce less, exept when n is close to 3.65, then all firms produce the Cournot-Nash equilibrium quantity. Finally,

(34)

when n > 5.6 the imitation firms produce so much that the Cournot firms decide to produce nothing (qC = 0).

4.2 Rational vs. Imitation firms

As in the previous Section we need a 3-dimensional system to describe the dynamics of the model. The rational firms produce each period such that their expected profit is maximized whereas an imitator produces in the next period the currently average played quantity.

The rational quantity dynamics therefore have the following structure

qRt = Argmax

qi

E[P (qi,t+ Q−i,t)qi− C(qi,t)].

It forms expectations over all possible mixtures of heuristics resulting from randomly drawing n − 1 other players from a large population, of which each with chance ηt is a

rational firm too, and with chance 1 − ηt is a imitator. Rational firm i therefore chooses

quantity qi such that his objective function, its own expected utility

UtR(qi,t|qtR, qIt, ηt) = n−1 X k=0  n − 1 k 

ηtk(1 − ηt)n−1−k[P ((n−1−k)qtI+kqtR+qt,i)qt,i−C(qt,i)],

(31) is maximized given the production of the other players and the population fraction. Here qtR is the symmetric output level of each of the other rational firms in period t, and qtI is the output level of each of the imitator firms in period t. The first order condition for an optimum is characterized by equality between marginal cost an expected marginal revenue.

(35)

output level qtR. This gives the first order condition δUtR(qi,t|qtR, qtI, ηt) δqi,t = 0, which equals to n−1 X k=0  n − 1 k  ηkt(1 − ηt)n−1−k× [P ((n − 1 − k)qIt+(k + 1)qtR) + qtRP0((n − 1 − k)qtI+ (k + 1)qtR) − C0(qRt)] = 0. (32)

Let the solution to equation (32) be given by qR

t = HR(qtI, ηt), the full system of equations

is thus given by

qt+1R = HR(qIt+1, ηt+1)

qt+1I = ηtqtR+ (1 − ηt)qtI

ηt+1= K(∆Ut).

(33)

where ∆Ut= UtR− UtI. It is easily checked that if the imitators play the Cournot-Nash

equilibrium quantity q∗, or if all firms are rational, then the rational firms will play the Cournot-Nash equilibrium quantity, that is HR(q∗, η) = q∗, for all η and HR(qI, 1) = q∗ for all qI. Moreover, if a rational firm is certain it will only meet imitation firms (that is η = 0), it plays a best response to the currently average played quantity, that is HR(qtI, 0) = R((n − 1)qIt), for all qIt. In the remainder we will denote the partial derivatives of HR(q, η) with respect to q and η by HqR(q, η) and HηR(q, η) respectively.

Proposition 4 The Cournot-Nash equilibrium (q∗, q∗, η∗) is a locally stable fixed point for the model with endogenous fractions of rational and imitation firms, where all firms produce the Cournot-Nash quantity, if and only if

(36)

Proof. Since a dynamical system can only depend on lagged variables, we substitute the second and third equation into the first. This gives us the following system that depends only on lagged variables.

qt+1R = ψ1= HR(ηtqtR+ (1 − ηt)qIt, K(∆UtR))

qt+1I = ψ2= ηtqtR+ (1 − ηt)qtI

ηt+1= ψ3= K(∆UtR).

(35)

In the equilibrium all firms produce the Cournot-Nash quantity q∗, therefore profits are equal, hence the equilibrium fraction is given by η∗ = K(−C). In order to determine the local stability of the equilibrium (q∗, q∗, η∗) where all firms produce the Cournot-Nash quantity, we need to determine the eigenvalues of the Jacobian matrix of system (22), evaluated at the equilibrium.

The partial derivatives of ψ2 with respect to qRt , qtI and ηt, evaluated at the

equilib-rium are η∗, 1 − η∗ and 0 respectively.

Next, let us determine the partial derivatives of ψ3 with respect to qtR, qIt and ηt,

respectively. To that end, note that we can write the profit differential as

∆UtR= ΠRt − ΠIt − C =

n−1

X

k=0

Ak(ηt)Dk(qRt , qtI, ηt) − C,

with Ak(ηt) = n−1k  ηtk(1 − ηt)n−1−k, which does not depend upon qRand qI, and

Dk(qRt , qtI, ηt) =P ((k + 1)qR+ (n − 1 − k)qI)qR− C(qR)

− [P (kqR+ (n − k)qI)qI− C(qI)],

(36)

which depends upon ηt through qtR= H(qIt, ηt). Note that Dk(qtR, q∗, ηt) = 0, moreover

(37)

by δDk(qRt , qtI, ηt) δqR t (q∗,q) = [P0(Q∗)q∗+ P (Q∗) − C0(q∗)]Hq(q∗, η∗) = 0, δDk(qRt , qtI, ηt) δqI t (q∗,q) = −[P0(Q∗)q∗− P (Q∗) + C0(q∗)] = 0, δDk(qRt , qtI, ηt) δηt (q∗,q) = [P0(Q∗)q∗P (Q∗) − C0(q∗)]Hη(q∗, η∗) = 0.

The second equalities follows from the fact that P0(Q∗)q∗+P (Q∗)−C0(Q∗) = 0 is the first order condition of any firm in a Cournot-Nash equilibrium. Furthermore Dk(q∗, q∗, η) =

0 for all η by the first order condition for a Cournot-Nash equilibrium. Using this it follows immediately that:

δψ3 δqR t (q∗,q) = K0(−C) δ∆Ut δqR t (q∗,q) = K0(−C) n−1 X k=0 Ak(η∗) δDk(qtR, qIt, ηt) δqtR (q∗,q) = 0 (37) and δψ3 δqtI (q∗,q) = K0(−C) δ∆Ut δqtI (q∗,q) = K0(−C) n−1 X k=0 Ak(η∗) δDk(qtR, qIt, ηt) δqIt (q∗,q) = 0 (38)

(38)

and δψ3 δηt (q∗,q) = K0(−C) δ∆Ut δηt (q∗,q) = K0(−C) n−1 X k=0 [Ak(η∗) δDk(qtR, qIt, ηt) δηt (q∗,q) +δAk(η) δηt Dk(qR ∗ , qIt∗, η∗)] = 0. (39)

This leaves us to examine the partial derivatives of ψ1 with respect to qRt , qIt and ηt,

evaluated at the equilibrium.

δψ1 δqR t (q∗,q) = η∗HqR(q∗, η∗) +δK(∆Ut) δqR t Hη(q∗, η∗) = η∗HqR(q∗, η∗) (40) and δψ1 δqtI (q∗,q) = (1 − η∗)HqR(q∗, η∗) +δK(∆Ut) δqtI Hη(q ∗ , η∗) = (1 − η∗)HqR(q∗, η∗) (41) and δψ1 δηt (q∗,q) = (q∗− q∗)HqR(q∗, η∗) +δK(∆Ut) δηt Hη(q∗, η∗) = 0 (42)

(39)

Therefore the Jacobian matrix, evaluated at the equilibrium is given by J |q,q∗=       η∗HqR(q∗, η∗) (1 − η∗)HqR(q∗, η∗) 0 η∗ 1 − η∗ 0 0 0 0       . (43)

Which has eigenvalues λ1 = ηHq(q∗, η∗)+1−η∗, λ2 = 0 and λ3 = 0. Consequently the

system is locally stable when |λ1| < 1, this is exactly the condition stated in proposition

4. Note again the similarity with the condition in Section 3 where we fixed the fraction η.

Leading example. Since the stability condition is the similar to the condition derived in Section 3.2, the equilibrium (q∗, q∗, η∗) is stable for all n in this linear specification.

(40)

5

Rational vs. Cournot vs. Imitation

In this section we combine the ideas that we gathered in Section 4. We will investigate the dynamics when the three heuristics discussed before compete. As before every round n firms are drawn from a large pool of firms to play the one-shot Cournot game. From this large pool of firms a fraction ηRt plays according to the rational strategy in period t, a fraction ηCt plays according to the Cournot heuristic in period t and consequently the fraction of imitators in period t is determined by 1 − ηtR− ηC

t . As in Section 4 the

fitness of a heuristic is determined by the average payoff minus the information cost of using that heuristic. Again the average profits will be approximated by the expected profits but in contrast to Section 4 the distribution of states now follows a multinomial distribution instead of a binomial distribution. In general the average profit of a firm producing q1 and competing with other firms that produce either q2 or q3 given the

fractions η1 and η2 is stated below, in this average profit approximation the profit in

each state is weighted by the chance of this state.

Π1,t = F (q1,t, q2,t, q3,t, η1,t, η2,t) = X ∆ (n − 1)! k1!k2!(n − k1− k2− 1)! ηk1 1,tη k2 2,t(1 − η1,t− η2,t) n−k1−k2−1× (44) P ((k1+ 1)q1,t+ k2q2,t+ (n − 1 − k1− k2)q3,t)q1,t− C(q1,t),

The summation is over all possible combinations of k1 and k2, which stand for the

number of other firms producing q1 and q2 respectively, that is: ∆ = {k1, k2 ∈ I2 : 0 ≤

k1 ≤ n − 1; 0 ≤ k2 ≤ n − 1; 0 ≤ k1 + k2 ≤ n − 1}. Expected profits for heuristic 2 in

period t are given by F (q2,t, q1,t, q3,t, η2,t, η1,t), expected profits for heuristic 3 in period

t are given by F (q3,t, q2,t, q1,t, 1 − η1,t− η2,t, η2,t).

The complete dynamical system consists of five equations, three for the quantity dynamics and two to describe how the fractions evolve. As in all previous sections,

(41)

the Cournot firms play in the next period a best-response to the current aggregated output of the others, imitators play in the next period the average produced quantity by the others in the current period. Rational players produce every period the quantity that maximizes expected payoff given the fractions and production plans of all other firms (imitators, Cournot players but rational players too). The rational firms produce expectations over all possible mixtures of heuristics resulting from randomly drawing the n − 1 other players from the large population of firms. In this setting the rational objective function, its own expected utility is of the following form:

UtR(qi,t|x) =

X

fk1,k2(n − 1, η

R, ηC

P (k1qtR+ k2qtC+ (n − 1 − k1− k2)qIt + qi,t)qi,t− C(qi,t),

(45) with fk1,k2(n − 1, η R, ηC) = (n−1)! k1!k2!(n−k1−k2−1)!η R t k1 ηCt k2 (1 − ηRt − ηC t )n−k1−k2−1 and x =

qtR, qIt, qtC, ηRt , ηtC. The first order condition for an optimum of (45) is characterized by equality between marginal cost an expected marginal revenue.

Given the value of qtC qtI ηtR ηtC, all rational firms coordinate on the same output level qRt . Differentiating equation (45) with respect to qi,t gives the first order condition,

which is equal for all rational firms. This first order condition is given by:

δUtR(qi,t|x)

δqi,t

= 0

which equals to:

X ∆ fk1,k2(n − 1, η R, ηC [P ((k1+ 1)qRt + k2qtC+ (n − 1 − k1− k2)qtI)+ P0((k1+ 1)qtR+ k2qtC+ (n − 1 − k1− k2)qIt)qtR− C0(qRt )] = 0 (46)

(42)

Let the solution to this be given by qtR = HR(qtC, qtI, ηtR, ηCt ). The system of quantity dynamics is thus given by

qt+1R = HR(qCt+1, qIt+1, ηRt+1, ηCt+1)

qt+1C = R((n − 1)(ηRt qtR+ ηtCqCt + (1 − ηRt − ηCt )qIt)

qt+1I = ηRt qtR+ ηtCqCt + (1 − ηRt − ηCt )qIt

(47)

Note that rational player plays such that expected marginal revenue equals marginal cost at t + 1 and a Cournot firm plays such that its marginal revenue (of period t) equals marginal cost (at period t). Therefore the Cournot heuristic is a lagged version of the rational heuristic if and only if

P0((n − 1)(ηRqR+ ηCqC+ (1 − ηR− ηC)qI) + qC) = X ∆ fk1,k2(n − 1, η R, ηC)P0((k 1+ 1)qR+ k2qC+ (n − 1 − k1− k2)qI). (48)

Thus the Cournot heuristic is only a lagged version of the rational heuristic if the inverse demand is linear. In this specific case the analysis become easier because this gives the possibility to lower the dimension of the dynamical system.

It is easily checked that if the imitation and Cournot firms play the Cournot-Nash equilibrium quantity q∗, or if all firms are rational, the rational firms will play the Cournot-Nash equilibrium quantity, that is HR(q∗, q∗, ηRt , ηtC) = q∗, for all ηR and ηC and HR(qt+1C , qt+1I , 1, 0) = q∗ for all qC, qI. In the remainder we will denote by HqRR,

HqRC, HqRI, HηRR and HηRC the partial derivatives of HR(qC, qI, ηR, ηC) with respect to qR,

qC, qI, ηR and ηC respectively, evaluated at the equilibrium (q, q, q, ηR∗ηC∗), which

we will denote by x∗ in the remainder of this chapter for notational convenience. Now that we have the quantity dynamics we can turn to the population dynamics. These are related to the population dynamics from Section 4 but differ significantly since we are in a three heuristic environment now. The population dynamics, as in Section 4,

(43)

depend on relative fitness. Let the fraction dynamics be given by

ηR,t+1= KR(∆UtR, ∆UtC)

ηC,t+1= KC(∆UtR, ∆UtC).

(49)

Where ηt+1R is the fraction of rational firms in period t + 1 whereas ηt+1C is the fraction of Cournot firms in that period. With ∆UtR = ΠRt − CR− (ΠC

t − CC) we denote the

difference in average fitness of the rational and the Cournot heuristic and with ∆UtC = ΠCt − CC− (ΠI

t− CI) we denote the difference in average fitness of the Cournot and the

imitation heuristic. Note that KRand KC are R2→ [0, 1] are continuously differentiable functions where the difference in fitness of the rational and Cournot heuristics and the difference in fitness of the Cournot and imitation heuristic are used as input. The difference in fitness of the rational and imitation heuristic is not used as an input variable since this information is captured implicitly in the other two differences. Note that KR is a monotonically increasing function in the first and second element whereas KC is decreasing in the first element but increasing in the second element. Furthermore, KR(0, 0) = KC(0, 0) = 13. In the remainder of this chapter we denote K1R and K2R the partial derivatives of KR with respect to the first and the second element respectively and with K1C and K2C the partial derivatives of KC with respect to the first and the second element respectively.

(44)

of equations. The full system is given by: qt+1R = φ1 = HR(φ2, φ3, φ4, φ5) qt+1C = φ2 = R((n − 1)(ηtRHR(qtC, qtI, ηtR, ηCt ) + ηtCqCt + (1 − ηRt − ηC t )qIt) qt+1I = φ3 = ηtRqRt + ηtCqtC+ (1 − ηtR− ηtC)qtI ηt+1R = φ4 = KR(∆UtR, ∆UtC) ηt+1C = φ5 = KC(∆UtR, ∆UtC). (50)

Since a dynamical system can only depend on lagged variables we substituted φ2, φ3, φ4, φ5 into HR(·). In order to determine the local stability of the unique equilibrium x∗, we need to determine the eigenvalues of the Jacobian matrix evaluated at that equilibrium x∗.

It can easily be shown that the partial derivatives of φ3 with respect to qR, qC, qI, ηRand ηC, evaluated at the equilibrium are ηR∗, ηC∗, 1−ηR∗−ηC∗, 0 and 0 respectively.

To determine the partial derivatives of φ4 and φ5 we need to determine the partial derivatives of ∆UtR and ∆UtC. In accordance to Section 4.2 we can write the first profit differential as ∆UR=X ∆ Ak(ηR, ηC)Dk(qRt , qtC, qtI, ηR, ηC) − CR+ CC, with Ak1,k2(η R t , ηCt ) = (n−1)! k1!k2!(n−k1−k2−1)!η R t k1 ηtCk2(1 − ηtR− ηC

t )n−k1−k2−1, which does not

depend upon the produced quantities, and

Dk1,k2(q

R

t , qtC, qtI, ηtR, ηCt ) = P ((k1+ 1)qtR+ k2qtC+ (n − k1− k2− 1)qIt)qtR− C(qRt )

− [P (k1qtR+ (k2+ 1)qCt + (n − k1− k2− 1)qtI)qCt − C(qCt )].

(51)

(45)

Dk1,k2(q

, q, q, ηR

t , ηtC) = 0, ∀ ηR, ηC. Next to that the partial derivatives of Dk1,k2(x)

evaluated at the equilibrium are given by

δDk(qtR, qCt , qIt, ηR, ηC) δqtR x∗ = [P0(Q∗)q∗+ P (Q∗) − C0(q∗)]HqRR(x∗) = 0 δDk(qtR, qCt , qIt, ηR, ηC) δqC t x∗ = [P0(Q∗)q∗+ P (Q∗) − C0(q∗)](HqRC(x∗) − 1) = 0 δDk(qtR, qCt , qIt, ηR, ηC) δqI t x∗ = [P0(Q∗)q∗+ P (Q∗) − C0(q∗)]HqRI(x ∗) = 0 δDk(qtR, qCt , qIt, ηR, ηC) δηR x∗ = [P0(Q∗)q∗+ P (Q∗) − C0(q∗)]HηR(x∗) = 0 δDk(qtR, qCt , qIt, ηR, ηC) δηC x∗ = [P0(Q∗)q∗+ P (Q∗) − C0(q∗)]HηC(x∗) = 0. (52)

Where the second equalities follow from the fact that P0(Q∗)q∗+ P (Q∗) − C0(Q∗) = 0 is the first order condition of any firm in a Cournot-Nash equilibrium.

In this line of reasoning we can write the second profit differential as

∆UC =X ∆ Bk(ηt)Gk(qtR, qCt , qtI, ηR, ηC) − CC + CI, with Bk1,k2(η R t , ηCt ) as (n−1)! k1!k2!(n−1−k1−k2)!η R t k1 ηtCk2(1 − ηRt − ηC

t )n−1−k1−k2, which does not

depend upon the produced quantities, and

Gk1,k2(q

R

t , qCt , qtI, ηtR, ηCt ) = P ((n − 1 − k1− k2)qRt + (k1+ 1)qCt + k2qIt)qtC− C(qCt )

− [P ((n − 1 − k1− k2)qtR+ k1qtC+ (k2+ 1)qtI)qIt − C(qtI)].

(53)

Which depends upon ηR and ηC through qRt = HR(qtC, qtI, ηRt, ηCt ). Note that Gk1,k2(q

, q, q, ηR

t , ηtC) = 0, ∀ ηR, ηC. The partial derivatives of Dk1,k2(q

R

(46)

evaluated at the equilibrium are given by δGk(qtR, qCt , qIt, ηR, ηC) δqR t x∗ = [P0(Q∗)q∗+ P (Q∗) − C0(q∗)]HqRR(x∗) = 0 δGk(qtR, qCt , qIt, ηR, ηC) δqC t x∗ = P0(Q∗)q∗+ P (Q∗) − C0(q∗)(HqRC(x∗) − 1) = 0 δGk(qtR, qCt , qIt, ηR, ηC) δqtI x∗ = [P0(Q∗)q∗+ P (Q∗) − C0(q∗)]HqRI(x ∗ ) = 0 δGk(qtR, qCt , qIt, ηR, ηC) δηR x∗ = [P0(Q∗)q∗+ P (Q∗) − C0(q∗)]HηR(x∗) = 0 δGk(qtR, qCt , qIt, ηR, ηC) δηC x∗ = [P0(Q∗)q∗+ P (Q∗) − C0(q∗)]HηC(x∗) = 0 (54)

Where the second equalities follow from the fact that P0(Q∗)q∗+ P (Q∗) − C0(Q∗) = 0 is the first order condition of any firm in a Cournot-Nash equilibrium. Using this it follows immediately that the partial derivatives of φ4 are given by:

δφ4 δqR t x∗ = K1R(CC− CR, CI− CC) δ∆U R δqR t x∗ + K2R(CC− CR, CI− CC) δ∆U C δqR t x∗ = 0 (55) and δφ4 δqC t x∗ = K1R(CC− CR, CI− CC) δ∆U R δqC t x∗ + K2R(CC− CR, CI− CC) δ∆U C δqC t x∗ = 0 (56)

(47)

and δφ4 δqIt x∗ = K1R(CC − CR, CI− CC) δ∆U R δqIt x∗ + K2R(CC− CR, CI− CC) δ∆U C δqIt x∗ = 0 (57) and δφ4 δηtR x∗ = K1R(CC − CR, CI− CC) δ∆U R δηRt x∗ + K2R(CC− CR, CI− CC) δ∆U C δηRt x∗ = 0 (58) and δφ4 δηtC x∗ = K1R(CC− CR, CI− CC) δ∆U R δηCt x∗ + K2R(CC− CR, CI− CC) δ∆U C δηCt x∗ = 0. (59)

Furthermore, the partial derivatives of φ5 are given by

δφ5 δqRt x∗ = K1C(CC− CR, CI− CC) δ∆U R δqtR x∗ + K2C(CC− CR, CI− CC) δ∆U C δqtR x∗ = 0 (60)

(48)

and δφ5 δqCt x∗ = K1C(CC− CR, CI− CC) δ∆U R δqtC x∗ + K2C(CC− CR, CI− CC) δ∆U C δqtC x∗ = 0 (61) and δφ5 δqtI x∗ = K1C(CC − CR, CI− CC) δ∆U R δqIt x∗ + K2C(CC− CR, CI− CC) δ∆U C δqIt x∗ = 0 (62) and δφ5 δηtR x∗ = K1C(CC − CR, CI− CC) δ∆U R δηRt x∗ + K2C(CC − CR, CI− CC) δ∆U C δηRt x∗ = 0 (63) and δφ5 δηtC x∗ = K1C(CC− CR, CI− CC) δ∆U R δηCt x∗ + K2C(CC − CR, CI− CC) δ∆U C δηCt x∗ = 0. (64)

(49)

The Jacobian of the system, evaluated at the equilibrium x∗ is thus given by J |x∗ =             HqRR HqRC HqRI HηRR HηRC J21 J22 J23 J24 J25 ηR∗ ηC∗ 1 − ηR∗− ηC∗ 0 0 0 0 0 0 0 0 0 0 0 0             (65) with J21= (n − 1)ηR∗HqRRR 0 ((n − 1)q∗) J22= (n − 1)  ηR∗HqRC + ηC ∗  R0((n − 1)q∗) J23= (n − 1)  ηR∗HqRI + 1 − ηR∗− ηC ∗  R0((n − 1)q∗) J24= (n − 1)  HR x∗+ ηR∗HηRR − q ∗R0((n − 1)q) J25= (n − 1)  HR x∗+ η R∗HR ηC− q∗  R0((n − 1)q∗) .

This Jacobian has very complicated eigenvalues which cannot be expressed in a useful function, for this we have to look at the leading example.

Leading example. We know that the Cournot heuristic is a lagged version of the rational heuristic in this leading example since the inverse demand function is linear, therefore the dimension of the dynamical system can be reduced by one. Note that only the Cournot production is a lagged version of the rational production. The Cournot profits and resulting fractions are in general not lagged rational profits and fractions.

Referenties

GERELATEERDE DOCUMENTEN

The results show that the cultural variables, power distance, assertiveness, in-group collectivism and uncertainty avoidance do not have a significant effect on the richness of the

The present text seems strongly to indicate the territorial restoration of the nation (cf. It will be greatly enlarged and permanently settled. However, we must

De meeste effectgerichte maatregelen, zoals een verlaging van de grondwaterstand of een verhoging van de pH in de bodem, verminderen de huidige uitspoeling, maar houden de

Workshop participants had been asked to refl ect on the question whether, and if so how, affect could be incorporated within a rational choice model.. Perhaps it was Boerhaave’s

Het werkvak is niet altijd goed afgesloten, het is niet altijd duidelijk welke gedragsaanpassingen van fietsers verwacht worden en het bord 'fietser afstappen' wordt soms

According to the author of this thesis there seems to be a relationship between the DCF and Multiples in that the DCF also uses a “multiple” when calculating the value of a firm.

To test this assumption the mean time needed for the secretary and receptionist per patient on day 1 to 10 in the PPF scenario is tested against the mean time per patient on day 1

[r]