Fast implicit difference schemes for time-space fractional diffusion equations with the integral fractional Laplacian

(1)

University of Groningen

Fast implicit difference schemes for time-space fractional diffusion equations with the integral

fractional Laplacian

Gu, Xian-Ming; Sun, Hai-Wei; Zhang, Yanzhi; Zhao, Yong-Liang

Published in:

Mathematical methods in the applied sciences

DOI:

10.1002/mma.6746

IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document version below.

Document Version

Publisher's PDF, also known as Version of record

Publication date: 2021

Link to publication in University of Groningen/UMCG research database

Citation for published version (APA):

Gu, X-M., Sun, H-W., Zhang, Y., & Zhao, Y-L. (2021). Fast implicit difference schemes for time-space fractional diffusion equations with the integral fractional Laplacian. Mathematical methods in the applied sciences, 44(1), 441-463. https://doi.org/10.1002/mma.6746

Copyright

Other than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons).

Take-down policy

If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.

Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons the number of authors shown on this cover page is limited to 10 maximum.

(2)

DOI: 10.1002/mma.6746

R E S E A R C H A R T I C L E

Fast implicit difference schemes for time-space fractional

diffusion equations with the integral fractional Laplacian

Xian-Ming Gu

1,2,3

_{Hai-Wei Sun}

2

_{Yanzhi Zhang}

4

_{Yong-Liang Zhao}

5

1_{School of Economic Mathematics/}

Institute of Mathematics , Southwestern University of Finance and Economics, Chengdu, 611130, Sichuan, China

2_{Department of Mathematics, University}

of Macau, Avenida da Universidade, Taipa, Macao, China

3_{Bernoulli Institute for Mathematics,}

Computer Science and Artificial Intelligence, University of Groningen, Nijenborgh 9, P.O. Box 407, Groningen, 9700 AK, The Netherlands

4_{Department of Mathematics and}

Statistics, Missouri University of Science and Technology, Rolla, 65409-0020, MO, USA

5_{School of Mathematical Sciences,}

University of Electronic Science and Technology of China, Chengdu, 611731, Sichuan, China

Correspondence

Yong-Liang Zhao, School of Mathematical Sciences, University of Electronic Science and Technology of China, Chengdu, Sichuan 611731, China.

Email: ylzhaofde@sina.com

Communicated by: C. Cuevas

Funding information

Applied Basic Research Project of Sichuan Province, Grant/Award Number: 2020YJ0007; Fundamental Research Funds for the Central Universities, Grant/Award Number: JBK1902028; National Natural Science Foundation of China, Grant/Award Number: 11801463; National Science Foundation,

Grant/Award Number: DMS-1620465; Science and Technology Development Fund, Macau SAR, Grant/Award Number: 0118/2018/A3

In this paper, we develop two fast implicit difference schemes for solving a class of variable-coefficient time–space fractional diffusion equations with integral fractional Laplacian (IFL). The proposed schemes utilize the graded L1 formula for the Caputo fractional derivative and a special finite difference discretization for IFL, where the graded mesh can capture the model problem with a weak singularity at initial time. The stability and convergence are rigorously proved via the M-matrix analysis, which is from the spatial discretized matrix of IFL. Moreover, the proposed schemes use the fast sum-of-exponential approxima-tion and Toeplitz matrix algorithms to reduce the computaapproxima-tional cost for the nonlocal property of time and space fractional derivatives, respectively. The fast schemes greatly reduce the computational work of solving the discretized lin-ear systems from (MN3 ₊ _M2_N)_{by a direct solver to}_{(MN(log N + N}

exp))

per preconditioned Krylov subspace iteration and a memory requirement from 𝒪(MN2_{) to}_𝒪(NN

exp), where N and (Nexp≪) M are the number of spatial and

temporal grid nodes, respectively. The spectrum of preconditioned matrix is also given for ensuring the acceleration benefit of circulant preconditioners. Finally, numerical results are presented to show the utility of the proposed methods.

K E Y WO R D S

Caputo derivative, circulant preconditioner, fractional diffusion equations, integral fractional Laplacian, Krylov subspace solvers

M S C C L A S S I F I C AT I O N 65R20; 35R11; 65N06; 65F08

1 I N T RO D U CT I O N

In recent decades, fractional partial differential equations (FPDEs) have attracted growing attention in modeling phe-nomena with long-term memory and spatial heterogeneity arising in engineering, physics, chemistry, and other applied

(3)

sciences.1,2_{In physics, fractional derivatives are used to model anomalous diffusion. Anomalous diffusion is the theory of} diffusing particles in environments that are not locally homogeneous.3-6_{A physical–mathematical model to anomalous} diffusion may be based on FPDEs containing derivatives of fractional order in both space and time, where the subdif-fusion appears in time and the superdifsubdif-fusion occurs in space simultaneously.7,8_{On the other hand, although most of} time–space fractional diffusion models are initially defined with the spatially integral fractional Laplacian (IFL),9-12_many previous studies (cf., e.g., previous works4,5,13-15_{) always substitute the space Riesz fractional derivative}1_{for the IFL. In} fact, such two kinds of definitions are not equivalent in high-dimensional cases.11,15,16_{It means that the “direct” study of} time–space fractional diffusion models with the IFL should be worthily considered.

In this paper, we study an alternative time–space fractional diffusion equation (TSFDE) with variable coefficients in one space dimension

⎧ ⎪ ⎨ ⎪ ⎩ C 0D 𝛾 tu(x, t) = −𝜅(x, t)(−Δ)𝛼∕2u(x, t) + 𝑓(x, t), x ∈ Ω, t ∈ (0, T], u(x, t) = 0, x ∈ Ωc_{, t ∈ [0, T],} u(x, 0) = 𝜙(x), x ∈ Ω, (1.1)

where𝜅(x,t) > 0 denotes the diffusivity coefficients, Ω = (− l,l), Ωc₌_R_{⧵ Ω, and the initial condition 𝜙(x) and the source}

term f(x,t) are known functions. Meanwhile,C

0D

𝛾

t is the Caputo derivative

1_{of order}_{𝛾—that is,}

C 0D 𝛾 tu(x, t) = { ₁ Γ(𝛾−1)∫ t 0 1 (t−s)𝛾 𝜕u(x,s) 𝜕s ds, 0< 𝛾 < 1, 𝜕u(x,t) 𝜕t , 𝛾 = 1, (1.2)

and throughout the paper, we always assume that 0< 𝛾 < 1. Here, the fractional Laplacian (− Δ)𝛼/2is defined by9-11,17

(−Δ)𝛼∕2u(x) = c1,𝛼 P.V.∫ R

u(x) − u(x′₎

|x − x′|1+𝛼 dx′, 𝛼 ∈ (0, 2), (1.3)

where P.V. stands for the Cauchy principal value and |x − x′_{| denotes the Euclidean distance between points x and x}′_{. The} normalization constant c1,𝛼is defined as

c1,𝛼= 2𝛼−1𝛼Γ ( 𝛼+1 2 ) √ 𝜋Γ(1 − 𝛼∕2) (1.4)

with Γ(·) denoting the gamma function. From a probabilistic point of view, the IFL represents the infinitesimal genera-tor of a symmetric𝛼-stable Lévy process.11,16,18_{Mathematically, the well-posedness/regularity of the Cauchy problem or} uniqueness of the solutions of the TSFDE (1.1) has been studied in previous works.3,19-23

Due to the nonlocality, the analytical (or closed-form) solutions of TSFDEs (1.1) on a finite domain are rarely avail-able. Therefore, we have to rely on numerical treatments that produce approximations to the desired solutions; refer, for example, to previous studies1,12,24-26_{and references therein for a description of such approaches. In fact, utilizing the} suitable temporal discretization, most of the early established numerical methods including the finite difference (FD) method,8,27-29_{finite element (FE) method,}30,31_{and matrix (named it as all-at-once) method}13,32_{for the TSFDE (1.1) were} developed via the fact that the IFL is equivalent to the Riesz fractional derivative in one space dimension.14 _However, such a numerical framework cannot be directly extended to solve the two- and three-dimensional TSFDEs due to the IFL (−Δ)𝛼∕2u(x, 𝑦) ≠ −𝜕𝛼u(x,𝑦)

𝜕|x|𝛼 − 𝜕 𝛼_u(x_,𝑦)

𝜕|𝑦|𝛼 .11,15,16 Therefore, it will hinder the development of numerical solutions for TSFDEs

from the stated objective.

In order to remedy the above drawback, Duo et al33_{replaced the IFL in TSFDE (1.1) by the spectral fractional} Lapla-cian11,15 _{and presented a fast numerical approach that combines the matrix transfer method}15 _{with inverse Laplace} transform for solving the one-dimensional and multidimensional TSFDEs (1.1) with constant coefficients. Although the numerical results show that their proposed method converges with the second-order accuracy in both time and space vari-ables, the spectral fractional Laplacian on a bounded domain is also not equivalent to the IFL at all.11_{On the other hand,} Nochetto et al34_{used the Caffarelli–Silvestre extension to rewrite the TSFDE (1.1) with}_{𝜅(x,t) ≡ 𝜅 as a two-dimensional} quasi-stationary elliptic problem with dynamic boundary condition. Then, they established an FE scheme for solving the converted elliptic problem and showed that the numerical scheme cannot reach the error estimates of order𝒪(𝜏2 −𝛾₎

(4)

claimed in the literature. Later, Hu et al35,36 _{successively exploited the similar strategy with FD approximation for the} converted elliptic problem of one-dimensional and multidimensional TSFDEs (1.1) with𝜅(x,t) ≡ 𝜅. Nevertheless, the numerical results showed that such FD schemes often converge with the less than first- and second-order accuracy in time and space, respectively, even for TSFDEs with sufficient smooth solutions.

In fact, it is important to set up numerical schemes that utilize the “direct” discretizations of IFL for solving the TSFDEs (1.1). Moreover, the discretizations of (multidimensional) TSFDEs became a recently hot topic, with the main numerical challenge stemming from the approximation of the hypersingular integral (see, e.g., previous works10,37-45_{). Indeed, there} are some numerical schemes that utilize the temporal L1 formula46_{(or numerical Laplace inversion}24_{) and spatial FE} discretization10,26,41-43 _{for solving the (multidimensional) constant-coefficient TSFDEs (1.1).}7,25,26 _{Both the theoretical} and numerical results are reported to show that such numerical schemes are efficient to solve the (multidimensional) TSFDEs (1.1) with𝜅(x,t) ≡ 𝜅. In addition, there are some other kinds of time–space fractional diffusion models but related to TSFDEs (1.1), where the spatial (or temporal) nonlocal operator is a replacement for the IFL (or the Caputo fractional derivative). This is mainly because the nonlocal operators with suitable kernels can exactly embrace the IFL and the Caputo fractional derivative, respectively.9,10,47-49_{For such novel model problems, Guan and Gunzburger}47_{established a} class of numerical methods that unitized the𝜃 schemes and piecewise-linear FE discretization. Their fully discrete scheme is analyzed for all to determine conditional and unconditional stability regimes for the scheme and also to obtain error estimates for the approximate solution. Later, Liu et al48_{improved the idea of Guan and Gunzburger by giving the proof} of convergence behavior with(𝜏2−𝛾₊_h2_{). Meanwhile, Liu et al. considered the piecewise-quadratic FE discretization to} improve the spatial convergence rate. The efficient implementation based on fast Toeplitz-matrix multiplications50-52_of their proposed scheme is also reported. For space–time nonlocal diffusion equations, Chen et al49_{proposed a numerical} scheme, by exploiting the quadrature-based FD method in time and the Fourier spectral method in space, and showed its stability. Moreover, it is shown that the convergence is uniform at a rate of(𝛿 + 𝜎2₎_(where_{𝛿 and 𝜎 are the time and} space horizon parameters) under certain regularity assumptions on initial and source data. Even though these are several methods with linear solvers of quasilinear complexity, the implementation of the above methods is still complicated, especially the computation of entries of stiffness matrix in FE discretization or finding the modes in terms of expansion basis in spectral method (cf. previous works26,44,48,49_{). In particular, it is pointed out that “more than 95% of CPU time is} used to assembly routine” for their FE methods.43

On the other hand, most of the abovementioned methods overlook that the presence of the kernel (t − s)−𝛾 _{results in}

a weak initial singularity in the solution of Equation (1.1), so that approximation methods (e.g., L1 formula) on uniform

mesheshave a poor convergent rate and high computational cost.34,49,53,54_{In this work, we devote ourselves in developing} fast implicit difference schemes (IDSs) for solving the TSFDE (1.1); the direct scheme utilizes the simple FD discretiza-tion38_{and the graded L1 formula}54_{for approximating the IFL and the Caputo fractional derivative, where the nonuniform} temporal discretization can overcome the initial singularity. Due to the repeated summation of numerical solutions in the previous steps, the direct scheme always needs much CPU time and memory cost, especially for the larger number of time steps.55-57_{In order to alleviate the computational cost, the sum-of-exponential (SOE) approximation}58_{of the kernel} (t − s)−𝛾 _{in the graded L1 formula for Caputo fractional derivative can be efficiently evaluated via the recurrence method.} Thus, we can derive the fast implicit difference scheme. In particular, we revisit the matrix properties of the discretized IFL and prove the discretized matrix is a strictly diagonally dominant and symmetric M-matrix with positive diagonal elements (i.e., the symmetric positive definite matrix), which is not studied in Duo et al38_{Based on such matrix} prop-erties, we strictly prove that the fast numerical schemes for the TSFDE (1.1) are unconditionally stable and present the corresponding error estimates of(M−min{r𝛾,2−𝛾}₊_h2₎_{(h is the spatial grid size) under certain regularity assumptions on} the smooth solutions. To our best knowledge, there are few successful attempts to derive the efficient IDSs for solving the TSFDE (1.1) with rigorous theoretical analyses. This is one of the main attractive advantages of our proposed methods compared with the above mentioned methods.

In addition, the nonlocality of IFL results in dense discretized linear systems, which is the leading time-consuming part in practical implementations.26,43,44 _{Fortunately, the coefficient matrix of discretized linear systems enjoys the} Toeplitz-like structure;38-40_{it means that we solve the sequence of discretized linear systems in a matrix-free pattern,}29,52,59 because the Toeplitz matrix–vector products can be computed via fast Fourier transforms (FFTs) in(N log N) opera-tions. More precisely, we will adapt the circulant preconditioners50-52_{for accelerating the Krylov subspace solvers}51 _for the sequence of discretized linear systems. Moreover, the benefit of circulant preconditioners will be verified via both the-oretical and numerical results. It notes that fast schemes greatly reduce the computational work of solving the discretized

(5)

linear systems from(MN3₊_M2_N)_{by a direct solver to}_{(MN(log N+N}

exp))per preconditioned Krylov subspace iteration

and a memory requirement from𝒪(MN2_{) to}_𝒪(NN

exp) (see Section 2.1 for defining Nexp).

The contributions of the current work can be summarized as follows.

• We present two IDSs for solving the TSFDE (1.1) with nonsmooth initial data and such numerical schemes can be

easily extended to solve the multidimensional cases.

• Both the stability and convergence of these IDSs are rigorously proved via the discretized matrix properties.

• We provide the efficient implementation of fast IDSs with theoretical guarantee for reducing the computation and

memory cost deeply.

The rest of this paper is organized as follows. In Section 2, both direct and fast IDSs are derived for the TSFDE (1.1) in details, and their stability and convergence are proved by revisiting the properties of spatial discretized matrix. In Section 3, the efficient implementation based on fast preconditioned Krylov subspace solvers of the proposed IDSs are given and the accelerating benefit of circulant preconditioners is theoretically guaranteed via clustering eigenvalues around 1. Section 4 presents numerical results to support our theoretical findings and show the effectiveness of the proposed IDSs. Finally, the paper closes with conclusions in Section 5.

2 D I R EC T A N D FA ST I M P L I C I T D I F F E R E N C E S C H E M E S

In this section, we will establish two implicit difference schemes for solving the problem (1.1). Meanwhile, the stability, convergence, and error analysis of such difference schemes are investigated and proved in details.

2.1 Two implicit difference schemes

As mentioned above, we assume that the problem (1.1) has a solution u(x,t) such that || || | 𝜕k_u(x_{, t)} 𝜕tk || || |≤ c0t 𝛾−k_{, 0 ≤ k ≤ 3.} _(2.1)

Here and in what follows, ̂C and cj (j = 0,1,2) are positive constants, which depend on the problem but not on the

mesh parameters.54_{Let M}_{, N, r ∈}_N+_{(positive integers), h = 2l/N,x}

i= −l+ ih, tm=(m/M)rT, and𝜏m=tm−tm −1. We also consider the sets

Ωh= {xi|0 ≤ i ≤ N}, Ω𝜏= {tm|0 ≤ m ≤ M}, Ωh,𝜏 = {(xi, tm)|0 ≤ i ≤ N, 0 ≤ m ≤ M},

and let v = {vm

i |0 ≤ i ≤ N, 0 ≤ m ≤ M} be a grid function on Ωh,𝜏. We define the seth= {v|v = (v0, v1, … , vN−1, vN)}

of grid functions on Ωhand provide it with the norm

||v||∞= max 0≤i≤N|vi|.

For m≥ 1, we approximate the Caputo fractional derivative (1.2) by the L1 formula on the graded mesh, which can capture the weak initial singularity of (1.1):

C 0D𝛾tu(tm) ≈D𝛾tu(tm)≜ 1 Γ(1 −𝛾) [ a(_mm,𝛾)u(tm) − m−_∑1 k=1 (a(m,𝛾) k+1 −a (m,𝛾) k )u(tk) −a (m,𝛾) 1 u(t0) ] , where a(_km,𝛾)= 1 𝜏k∫ tk tk−1 ds (tm−s)𝛾 , k ≥ 1. (2.2)

The truncation error 𝜓m _{can be defined by} _𝜓m _≜C

0 D

𝛾

tu(tm) − D𝛾tu(tm). From Shen et al,54, Lemma 2.1 we obtain the

boundedness of the truncation error

|𝜓m_{| ≤ ̂Cm}−min{r(1+𝛾),2−𝛾}_{, m = 1, 2, … , M,} _(2.3)

if |u′′_{(t) |≤c}

(6)

On the other hand, it notes that the above graded L1 scheme always needs much computational cost in practical appli-cations due to the repeatedly weighted sum of the solutions of previous time steps. To reduce the cost, here, it is useful to develop the fast approximation of Caputo fractional derivative on a nonuniform temporal mesh.

Lemma 2.1 (Jiang et al58_{). Let}_{𝜖,𝛿, and T denote the tolerance error, cut-off time restriction, and final time, respectively.}

Then there exist Nexp∈N+and sj,wj> 0, j = 1,2, … , Nexpsuch that

|| || ||t −𝛾₋ Nexp ∑ 𝑗=1 w_𝑗e−s𝑗t|||| ||≤ 𝜖, for any t ∈ [𝛿, T],

where Nexp=((log 𝜖−1)(log log𝜖−1+log(T𝛿−1)) + (log𝛿−1)(log log𝜖−1+log𝛿−1)).

Based on Lemma 2.1, we set𝛿 = (1/M)r_T_{, and then the fast approximation of Caputo fractional derivative on a graded}

temporal grid can be drawn as follows (m≥ 1).

C 0D 𝛾 tu(tm) = FC_D𝛾 tu(tm) +(m −min{r(1+𝛾),2−𝛾}₊_𝜖) ≜ 1 Γ(1 −𝛾) [ b(_mm,𝛾)u(tm) − m−_∑1 k=1 (b(_k+m,𝛾)₁ −b(_km,𝛾))u(tk) −b(₁m,𝛾)u(t0) ] +(m−min{r(1+𝛾),2−𝛾}+𝜖), m = 1, 2, … , M, (2.4)

where the estimate of truncation error holds when |u′_{(t) |}_≤c

0t𝛾 − 1and |u′′(t) |≤c0t𝛾 − 2and b(_km,𝛾) = ⎧ ⎪ ⎨ ⎪ ⎩ Nexp ∑ 𝑗=1 w_𝑗1 𝜏k∫ tk tk−1e −s_𝑗(tm−s)_ds, _{k =}₁, 2, … , m − 1, a(_mm,𝛾), k = m. (2.5)

Moreover, we provide the information for approximating the IFL with extended Dirichlet boundary conditions in (1.1). According to the idea in Duo et al,38_{the approximation is given by}

(−Δ)𝛼∕2_h,𝜇u(xi) =C𝛼,𝜇h [(_N−₁ ∑ 𝓁=2 (𝓁 + 1)𝜈_{− (𝓁 − 1)}𝜈 𝓁𝜇 + N𝜈− (N −1)𝜈 N𝜇 + (2 𝜈₊_𝜅 𝜇−1) + _𝛼N2𝜈_𝛼 ) u(xi) − 2 𝜈₊_𝜅 𝜇−1

2 (u(xi−1) +u(xi+1)) − 1 2 N−1 ∑ 𝑗=1,𝑗≠i,i±1 (|𝑗 − i| + 1)𝜈− (|𝑗 − i| − 1)𝜈 |𝑗 − i|𝜇 u(x𝑗) ] , (2.6) where i = 1,2, … , N − 1, Ch

𝛼,𝜇=c1,𝛼∕(𝜈h𝛼)> 0 and the constant 𝜅𝜇=1 for𝜇 ∈ (𝛼,2), while 𝜅𝜇=2 if𝜇 = 2. Meanwhile, we

denote𝜈 = 𝜇 − 𝛼 for notational simplicity.

Lemma 2.2 (Duo et al38_{). Suppose that u(x) ∈}_s,𝛼

2(R)has the finite support on an open set Ω⊂R, and Equation (2.6)

is a finite difference approximation of the fractional Laplacian (− Δ)𝛼/2. Then, for any𝜇 ∈ (𝛼, 2), there is

||(−Δ)𝛼∕2_{u(x) − (−Δ)}𝛼∕2

h,𝜇u(x)||∞,Ω≤ ̃Chp (2.7)

with ̃C> 0 depending on 𝛼 and 𝜇. Here, p ∈ (0,2] would be determined via the regularity (i.e., the index s ∈N) of u(x).

Lemma 2.2 provides a direct discretization for the IFL that appeared in the TSFDE (1.1). At present, the spatial and temporal discretizations are ready for developing the numerical methods. Evaluating Equation (1.1) at points (xi,tm), we

have C 0D 𝛾 tu(xi, tm) = −𝜅(xi, tm)(−Δ)𝛼∕2u(xi, tm) +𝑓(xi, tm), (2.8) where 1≤ i ≤ N − 1,1 ≤ m ≤ M. Let u = {um

i }0≤i≤N,0≤m≤M be a grid function defined by

(7)

Using these notations and recalling Equation (2.4) along with Lemma 2.2, we can approximate Equation (1.1) at grid point (xi, tm) as follows: ⎧ ⎪ ⎨ ⎪ ⎩ FC_D𝛾 tUim= −𝜅 m i (−Δ) 𝛼∕2 h,𝜇Uim+𝑓 m i +R m i , 1≤ i ≤ N − 1, 1 ≤ m ≤ M, U₀m=U_Nm≡ 0, 0≤ m ≤ M, U0 i =𝜙(xi), 1≤ i ≤ N − 1, (2.9)

where the terms {Rm

i }are small and satisfy the inequality

|Rm

i | ≤ c2(m

−min{r(1+𝛾),2−𝛾}₊_hp₊_{𝜖), 1 ≤ i ≤ N − 1, 1 ≤ m ≤ M.}

We omit the above small terms and arrive at the following implicit difference scheme ⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩ 1 Γ(1−𝛾) [ b(_mm,𝛾)um_i − m−_∑1 k=1 (b(_k+m,𝛾)₁ −b(_km,𝛾))uk_i −b(₁m,𝛾)u0_i ] = −𝜅m_i (−Δ)𝛼∕2_h_,𝜇um_i +𝑓_im, 1 ≤ i ≤ N − 1, 1 ≤ m ≤ M, um₀ =um_N =0, 0≤ m ≤ M, u0 i =𝜙(xi), 1≤ i ≤ N − 1, (2.10)

which is named as the fast implicit difference scheme (FIDS). Similarly, we combine the graded L1 formula with Lemma 2.2 for deriving the following difference scheme

⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩ 1 Γ(1−𝛾) [ a(_mm,𝛾)um i − m−_∑1 k=1 (a(m,𝛾) k+1 −a (m,𝛾) k )u k i −a (m,𝛾) 1 u 0 i ] = −𝜅_im(−Δ)𝛼∕2_h,𝜇um i +𝑓m i , 1 ≤ i ≤ N − 1, 1 ≤ m ≤ M, um 0 =u m N =0, 0≤ m ≤ M, u0_i =𝜙(xi), 1≤ i ≤ N − 1, (2.11)

which is labeled as direct implicit difference scheme (DIDS).

At each time level, both FIDS (2.10) and DIDS (2.11) are the resultant linear systems, which can be solved by the direct method (e.g., Gauss elimination) with total computational cost(N3_{M + NMN}

exp)for FIDS and(N3M + NM2)for DIDS.

Note that, generally, Nexp< 10054,58and (if) M is very large, so that FIDS requires smaller computational cost than DIDS.

Moreover, FIDS only requires𝒪(N2_{+ NN}

exp) memory units rather than(N2+NM)for DIDS. In Section 3, we will further

reduce the computational cost of both FIDS and DIDS by means of matrix-free preconditioned iterative solvers.

2.2 The stability and convergence

In this subsection, we discuss the stability and convergence of the difference scheme for the problem (1.1). In order to analyze the stability and convergence, we rewrite the FIDS (2.10) into the matrix form

(m)_um₌ 1 Γ(1 −𝛾) [_m−₁ ∑ k=1 (b(_k+m,𝛾)₁ −b_k(m,𝛾))uk+b(₁m,𝛾)u0 ] +fm, (2.12) where(m) ₌ 1 Γ(1−𝛾)b (m,𝛾)

m I + K(m)Aand refer to Equation (2.13) for the definition of A, I is the identity matrix of order

N −1, K(m) ₌ _diag(_𝜅m 1, 𝜅 m 2 , … , 𝜅 m N−1), um = [u m 1, u m 2, … , u m N−1]⊤, f m = [𝑓₁m, 𝑓₂m, … , 𝑓_N−m₁]⊤, and b(mm,𝛾) > 0.54, Lemma 2.4

First of all, we revisit the properties of spatial discretization, which is not deeply studied in the original paper.38,39_{In fact,} the spatial discretization (2.6) of (− Δ)𝛼/2u(x,t) can be expressed in the matrix–vector product form (−Δ)𝛼∕2_h,𝜇um ₌ _Aum_,

(8)

where A = [aij]i,j = 1, … , N − 1is the matrix representation of the (discretized) fractional Laplacian, defined as a_i𝑗=Ch 𝛼,𝜇 ⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩ N−1 ∑ 𝓁=2 (𝓁+1)𝜈_{−(𝓁−1)}𝜈 𝓁𝜇 + N𝜈−(N−1)𝜈 N𝜇 + (2𝜈+𝜅𝜇−1) + 2𝜈 𝛼N𝛼, 𝑗 = i, −2𝜈+𝜅𝜇−1 2 , 𝑗 = i ± 1, −(|𝑗−i|+1)𝜈−(|𝑗−i|−1)𝜈 2|𝑗−i|𝜇 , 𝑗 ≠ i, i ± 1, (2.13)

where i,j = 1,2, … , N − 1. It is easy to see that the matrix A is a real symmetric Toeplitz matrix, which can be stored with only (N − 1) entries.38,50,51_{Moreover, we can give the following conclusions.}

Proposition 2.1. According to the definition of A, the following holds:

1) A is a strictly diagonally dominant M-matrix; 2) A is symmetric positive definite;

3) The absolute values of the entries aijaway from the diagonals decay gradually, that is, a11> |a12|> … > |a1,N − 1|

andlimN→∞|a1,N−1| = 0.

Proof. (1) Because A is a symmetric Toeplitz matrix, then the diagonal entries are equal to a11> 0. Moreover, it is not hard to see that aij< 0 (i ≠ j). So we conclude that A is an M-matrix60, p. 533and obtain

a11− ∑ 𝑗≠1 |a1𝑗| = a11− N−_∑1 𝑗=2 |a1𝑗| > Ch 𝛼,𝜇 [ 2𝜈+𝜅𝜇−1 2 + N𝜈− (N −1)𝜈 N𝜇 + 2𝜈 𝛼N𝛼 ] > 0, (2.14) and aN−1,N−1− ∑ 𝑗≠N−1 |aN−1,𝑗| = a11− N−_∑1 𝑗=2 |a1𝑗| > 0. (2.15)

Similarly, it follows that

aii− ∑ 𝑗≠i |ai𝑗| = a11− ∑ 𝑗≠i a_i𝑗 > Ch 𝛼,𝜇 [ N𝜈− (N −1)𝜈 N𝜇 + 2𝜈 𝛼N𝛼 ] > 0, i = 2, 3, … , N − 2. (2.16)

A combination of the aforementioned three inequalities verifies that A is a strictly diagonally dominant M-matrix. (2) Because A is a symmetric strictly diagonally dominant M-matrix and all its diagonal elements are positive, that is,

aii=a11> 0, then A is indeed a symmetric positive definite matrix.60, Corollary 7.2.3 (3) First of all, we rewrite the matrix A = Ch

𝛼,𝜇̃A = C𝛼,𝜇h [̃ai𝑗]i,𝑗=1, … ,N−1(cf. Equation 2.13). Meanwhile, it is easy to

note that a11> |a12|, then we find

|̃a12| − |̃a13| = 2𝜈+𝜅_𝜇−1 2 − 3𝜈−1 2𝜇+1 ≥ 4𝜈+𝛼∕2−3𝜈+1 2𝜈+𝛼+1 > 0. (2.17)

For j = 3,4, … , we set |j − i|= |j − 1|=k; thus, k≥ 2 and |̃a1𝑗| ∶= 𝑓(k) = k−𝛼 2 · [( 1 +1 k )𝜈 − ( 1 −1 k )𝜈] =[(𝜈 1 ) k−1−𝛼₊(𝜈 3 ) k−3−𝛼₊(𝜈 5 ) k−5−𝛼_{+ …}] ∽(k−1−𝛼), (2.18)

(9)

According to Proposition 2.1, if we define (C) = min 1≤i≤N−1 ( |Cii| − ∑ 1≤𝑗≤N−1,𝑗≠i |Ci𝑗| ) (2.19)

for any matrix C = [Cij]i,j = 1, … , N − 1, then it follows that(A) ≥ 0, which is helpful in the next context. The following properties of the operator(·) can be given as follows.

Lemma 2.3 (Lin and Ng61, Lemma 3_{). Let C}

1, C2 ∈R(N−1)×(N−1). Suppose both C1and C2have positive diagonal entries.

Then it follows that(C1+C2)≥ (C1) +(C2).

Lemma 2.4 (Lin and Ng61, Lemma 4_{). Let C ∈}_R(N−1)×(N−1)_{. Suppose}_{(C) ≥ 0. Then for any nonnegative diagonal matrix}

K ∈R(N−1)×(N−1)_{, it holds}_{(KC) ≥ (C) min}

1≤𝑗≤N−1K𝑗𝑗≥ 0.

Next, we exploit the above two lemmas to give the following estimation about the coefficient matrices (m) _of

Equation (2.12).

Theorem 2.1. For any b (m,𝛾) m

Γ(1−𝛾)> 0 and 1 ≤ m ≤ M, it holds min1≤m≤M

( 1 Γ(1−𝛾)b (m,𝛾) m I + K(m)A ) ≥ b(mm,𝛾) Γ(1−𝛾).

Proof. From Lemmas 2.3 and 2.4 and Proposition 2.1, we obtain

 ( 1 Γ(1 −𝛾)b (m,𝛾) m I + K(m)A ) ≥  ( 1 Γ(1 −𝛾)b (m,𝛾) m I ) +(K(m)A) ≥ b (m,𝛾) m Γ(1 −𝛾), 1 ≤ m ≤ M, (2.20)

from which the result follows.

Before proving the final result of this section on the unconditional stability and convergence property of the FIDS (2.10), we recall the following useful lemma.

Lemma 2.5. Suppose C ∈R(N−1)×(N−1)_satisfies_{(C) ≥ 𝜆 > 0. Then, for any y ∈}_RN−1_{, it holds}_𝜆||y||

∞≤ ||Cy||∞.

Proof. Because(C) ≥ 𝜆 > 0, then  (

1

𝜆C

)

≥ 1 and ||y||∞≤ ‖‖‖C_𝜆y‖‖‖ ∞,

61, Lemma 7_{which proves the above result.}

Theorem 2.2. The proposed FIDS (2.10) with𝜖 ≤ c1M𝛾is uniquely solvable and unconditionally stable in the sense that

||uk_|| ∞≤ ||u0||∞+ Γ(1 −𝛾) max 1≤s≤k ||fs_|| ∞ b(₁s,𝛾) , k = 1, 2, … , M, (2.21) where||fs||∞≤ max1≤i≤N−1|𝑓_is|.

Proof. It is easy to see that proving the unique solvability of FIDS (2.10) is equivalent to showing the invertibility of coefficient matrices(m)_{with each 1}_{≤ m ≤ M. By means of Theorem 2.1 and Lemma 2.5, it follows that}

||(m)_y_|| ∞=‖‖‖‖ ‖ [ 1 Γ(1 −𝛾)b (m,𝛾) m I + K(m)A ] y‖‖‖‖ ‖∞ ≥ b (m,𝛾) m Γ(1 −𝛾)||y||∞, ∀y ∈R N−1_, _(2.22)

where 1≤ m ≤ M. Therefore, (m) _∶ _RN−1 _→ _RN−1_{is clearly an injection for each 1}_{≤ m ≤ M, whose null space is} simply {0}. Hence,(m)_{s are nonsingular, which proves the unique solvability.}

On the other hand, we apply Equation (2.22) and the monotonicity of {b(m,𝛾)

k } (1≤ m ≤ M) 54, Lemma 2.4_{to obtain} b(_mm,𝛾)||um||∞≤ m−1 ∑ k=1 (b(_k+m,𝛾)₁ −b_k(m,𝛾))||uk||∞+b(1m,𝛾) [ ||u0_|| ∞+ Γ(1 −𝛾) b(₁m,𝛾) ||f m_|| ∞ ] , 1 ≤ m ≤ M.

(10)

Then the inequality (2.21) can be proved by the method of mathematical induction, which is similar to the proof of Shen et al54, Theorem 4.1_{We omit the details here.}

On the other hand, we replace the coefficients {b(m,𝛾)

k }with {a

(m,𝛾)

k }in Theorem 2.1 and Equations (2.12) and (2.22), and

then we can obtain the following conclusion, which is helpful to analyze the stability and convergence of DIDS (2.11).

Theorem 2.3. The proposed DIDS (2.11) is uniquely solvable and unconditionally stable in the sense that

||uk_|| ∞≤ ||u0||∞+ Γ(1 −𝛾) max 1≤s≤k ||fs_|| ∞ a(₁s,𝛾) , k = 1, 2, … , M. (2.23)

Proof. The proof of this theorem is similar to Theorem 2.3; we omit the details here.

From Theorems 2.1–2.3, we can see that both FIDS (2.10) and DIDS (2.11) are stable to the initial value𝜙 and the right-hand term f. Now, we consider the convergence of these two difference schemes.

Theorem 2.4. Let {Um

i |0 ≤ i ≤ N, 0 ≤ m ≤ M} and {U m

i |0 ≤ i ≤ N, 0 ≤ m ≤ M} be, respectively, the solutions of the

problem (1.1) and the difference scheme (2.10). If𝜖 ≤ min{c1M𝛾, T−𝛾∕2}, then ||em_|| ∞≤ 2Γ(1 − 𝛾)c2T𝛾(M−min{r𝛾,2−𝛾}+hp+𝜖), 1 ≤ m ≤ M, (2.24) where em i =U m i −u m

i , 0 ≤ i ≤ N, 0 ≤ m ≤ M, and p would be determined by the spatial regularity of u(x,t).

Proof. Writing the system (2.9) as ⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩ 1 Γ(1−𝛾) [ b(_mm,𝛾)Um i − m−_∑1 k=1 (b(m,𝛾) k+1 −b (m,𝛾) k )U k i −b (m,𝛾) 1 U 0 i ] = −𝜅(−Δ)𝛼∕2_h,𝜇Um i +𝑓m i +R m i , 1 ≤ i ≤ N − 1, 1 ≤ m ≤ M, Um 0 =U m N =0, 0≤ m ≤ M, U0 i =𝜙(xi), 1≤ i ≤ N − 1,

and subtracting Equation (2.10) from the above corresponding system, ⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩ 1 Γ(1−𝛾) [ b(_mm,𝛾)em i − m−1 ∑ k=1 (b(m,𝛾) k+1 −b (m,𝛾) k )e k i −b (m,𝛾) 1 e 0 i ] = −𝜅(−Δ)𝛼∕2_h,𝜇em i +Rm i , 1 ≤ i ≤ N − 1, 1 ≤ m ≤ M, em 0 =e m N =0, 0≤ m ≤ M, e0_i =0, 1≤ i ≤ N − 1, (2.25)

By means of Theorem 2.1 and the matrix analysis described above, it follows that ||em_|| ∞≤ Γ(1 − 𝛾) max 1≤s≤m ||Rs_|| ∞ b(₁s,𝛾) , m = 1, 2, … , M.

The rest of this proof is also similar to the work of Shen et al54, Theorem 4.2

Again, we employ the similar strategy to give the error analysis of DIDS (2.11) as follows.

Theorem 2.5. Let {Um

i |0 ≤ i ≤ N, 0 ≤ m ≤ M} and {U m

i |0 ≤ i ≤ N, 0 ≤ m ≤ M} be, respectively, the solutions of the

problem (1.1) and the difference scheme (2.11), then

||em_||

(11)

where em_i =U_im−um_i , 0 ≤ i ≤ N, 0 ≤ m ≤ M.

In practice, the value of𝜖 is sufficiently small such that the tolerance error in (2.24) can be negligible compared with the space and time errors. Then it also finds that the numerical errors for DIDS and FIDS are almost identical but the later is often faster (cf. Section 4). With the help of arguments in proving,38, Theorems 3.1–3.2_{it is not hard to make the convergence} results described in Theorems 2.4 and 2.5 more specific.

Remark2.1. For determining the value of p, it reads

• suppose that the solution of the problem (1.1) satisfies the condition (2.1) and u(x, ·) ∈ s,𝛼∕2(R)with (x, t) ∈

R× [0, T] and s ≥ 1, then solutions of FIDS (2.10) and DIDS (2.11) with 𝜇 ∈ (𝛼,2) converge to the exact solutions of Equation (1.1), respectively.

• for s = 1, 𝛼 ∈ (0,2) and 𝜇 ∈ (𝛼,2], the convergence rates of FIDS (2.10) and DIDS (2.11) are (at least)

(M−min{r𝛾,2−𝛾}₊_h1−𝛼 2 +𝜖 ) and ( M−min{r𝛾,2−𝛾}₊_h1−𝛼 2 )

, respectively; see Duo et al38_{and Duo and Zhang}39_for details.

• for s≥ 3 and 𝛼 ∈ (0,2), the convergence rates of FIDS (2.10) and DIDS (2.11) with 𝜇 = 2 or 𝜇 = 1 + 𝛼/2 are

(M−min{r𝛾,2−𝛾}₊_h2₊_{𝜖) and (M}−min{r𝛾,2−𝛾}₊_h2_{), respectively.}

Moreover, we will work out some numerical results for supporting the above theoretical convergence behaviors described in Section 4.

3 E F F I C I E N T I M P L E M E N TAT I O N BA S E D O N P R ECO N D I T I O N I N G O F T H E

D I F F E R E N C E S C H E M E S

In the section, we analyze both the implementation and computational complexity of FIDS (2.10) and DIDS (2.11) and we propose an efficient implementation utilized preconditioned Krylov subspace solvers. Noting that a(mm,𝛾) =b(mm,𝛾) > 0,

we start the efficient implementation from the following matrix form of these two implicit difference schemes at the time level 1≤ m ≤ M, which are given by Equation (2.12) and

(m)_um₌ 1 Γ(1 −𝛾) [_m−₁ ∑ k=1 (a(_k+m,𝛾)₁ −a_k(m,𝛾))uk+a(₁m,𝛾)u0 ] +fm, (3.1)

respectively. From Theorems 2.2 and 2.3, it knows that both Equations (2.12) and (3.1) have the unique solutions. In addition, it is meaningful to remark that Equations (2.12) and (3.1) corresponding to FIDS (2.10) and DIDS (2.11) are inherently sequential; thus, both of them are difficult to parallelize over time.

3.1 The circulant preconditioner

On the other hand, it is useful to note that the matrix–vector product(m)_v_{can be efficiently calculated by}

(m)_{v =} 1

Γ(1 −𝛾)b (m,𝛾)

m v + K(m)(Av), (3.2)

where v ∈RN−1_{is any vector and the Toeplitz matrix–vector Av can be implicitly evaluated via the FFTs in}_{(N log N)} operations. In other words, we can use a matrix-free method to compute(m)_v_{quickly. Based on such observations, the}

Krylov subspace method should be the most suitable solver for Equation (2.12) or Equation (3.1) one by one. However, when the coefficients and the order of integral fractional Laplacian are not small, then the coefficient matrices(m)_will

be increasingly ill-conditioned (cf. Section 4). This fact deeply slows down the convergence of the Krylov subspace method, while the preconditioning techniques are often used to overcome this difficulty.29,50-52_{In the literature on Toeplitz systems,} circulant preconditioners always played important roles.50,51 _{In fact, circulant preconditioners have been theoretically} and numerically studied with applications to fractional partial differential equations for recent years; see, for instance, previous studies.29,40,52

(12)

In this work, we design a family of the Strang preconditioners50 _{for accelerating the convergence of Krylov subspace} solvers. More precisely, the circulant preconditioners are given for Equation (2.12) and/or Equation (3.1) as follows:

(m)₌ 1 Γ(1 −𝛾)b (m,𝛾) m I +𝜅(m)s(A) = F∗ [ 1 Γ(1 −𝛾)b (m,𝛾) m +𝜅(m)Λ ] F, (3.3)

where F and F* _{are the Fourier matrix and its conjugate transpose, respectively, and the scalar}_𝜅(m) ₌ 1

N−1 ∑N−1

i=1 𝜅im.

Meanwhile, s(A) = F*_Λ_F_{is the Strang circulant approximation}50,51 _{of the Toeplitz matrix A and the diagonal matrix Λ} contains all the eigenvalues of s(A) with the first column: cS = [a11, … , a1,⌊N+₂1⌋, a1,⌊N₂⌋, … , a12]⊤ ∈ RN−1. Therefore, the matrix Λ = diag(FcS) can be computed in advance and only one time during each time level. Besides, as(m)are the

circulant matrices, we observe from Equation (3.3) that the inverse matrix–vector product z = [(m)_]−1_v_{can be carried} out in(N log N) operations via the (inverse) FFTs. In a word, we exploit a fast preconditioned Krylov subspace method with only(N) memory requirement and (N log N) computational cost per iteration, while the number of iterations and the computational cost are greatly reduced.

To investigate the properties of the proposed preconditioners, the following lemma is the key to prove the invertibility of(m)_{in Equation (3.3).}

Lemma 3.1. All eigenvalues of s(A) fall inside the open disk

{z ∈R∶ |z − a11| < a11}, (3.4)

and all the eigenvalues of s(A) are strictly positive for all N.

Proof. First of all, because the matrix A is symmetric, then s(A) is also symmetric and its eigenvalues should be real. All the Gershgorin disk of the circulant matrix s(A) are centered at a11with radius

rN =2 ⌊N+1 2 ⌋ ∑ 𝓁=2 |a1,𝓁| < a11. (3.5)

The above inequality holds due to the expression of Equation (2.13), where the expression of a11contains exactly the sum of 2|a1,𝓁| (𝓁 = 2,3, … , N − 1). In conclusion, all the eigenvalues of s(A) are strictly positive for all N.

According to Lemma 3.1, it means that s(A) is a real symmetric positive definite matrix. Moreover, the invertibility of circulant preconditioners(m)_{(3.3) can be given for all m = 1, 2, … , M as follows.}

Lemma 3.2. Let𝛼 ∈ (0, 2). The preconditioner (m)_{is invertible and}

‖‖ ‖ ( (m))−1‖‖ ‖2< 1 b(mm,𝛾) Γ(1−𝛾)+𝜅(m)·1≤k≤N−1min [Λ]k,k . (3.6)

Proof. According to Lemma 3.1, we have [Λ]k,k> 0. Noting that a(mm,𝛾)> 0 (or b

(m,𝛾) m > 0) and 𝜅(m)> 0, we have [ 1 Γ(1 −𝛾)b (m,𝛾) m +𝜅(m)Λ ] k,k > 0 (3.7)

for k = 1,2, … , N − 1. Therefore,(m)_{is invertible. Furthermore, we have}

‖‖ ‖ ( (m))−1‖‖ ‖2= 1 min 1≤k≤N−1 [ 1 Γ(1−𝛾)b (m,𝛾) m +𝜅(m)Λ ] k,k < 1 b(mm,𝛾) Γ(1−𝛾)+𝜅(m)·1≤k≤N−1min [Λ]k,k .

(13)

3.2 Spectrum of the preconditioned matrix

In this subsection, we study the spectrum of the preconditioned matrix, which can help us to understand the convergence of preconditioned Krylov subspace solvers. For convenience of our investigation, we first assume that the diffusion coef-ficient function𝜅(x,t) ≡ 𝜅(t), then Equation (2.12) or Equation (3.1) will be a sequence of real symmetric positive definite linear systems, where the coefficient matrices reduce to(m) ₌ 1

Γ(1−𝛾)b

(m,𝛾)

m I +𝜅(m)Acorresponding to Equations (2.12)

and (3.1), respectively. The preconditioned CG (PCG) method51_{should be a suitable candidate for solving such linear} sys-tems one by one. Moreover, the spectrum of the preconditioned matrix((m))−1_(m)_{is available for both Equations (2.12)}

and (3.1) at each time level m, so we take Equation (2.12) as the research object in the next context. Throughout this subsection, we rewrite Equation (2.12) into the following equivalent form:

̃ (m)_um₌ 1 Γ(1 −𝛾)Ch 𝛼,𝜇 [_m−₁ ∑ k=1 (b(m,𝛾) k+1 −b (m,𝛾) k )u k₊_b(m,𝛾) 1 u 0 ] + 1 Ch 𝛼,𝜇 fm, (3.8) where ̃(m) ₌ b(mm,𝛾) Γ(1−𝛾)Ch 𝛼,𝜇

I +𝜅(m)̃A and its corresponding circulant preconditioner (m) _{reduces to ̃}_(m) ₌ b(mm,𝛾)

Γ(1−𝛾)Ch 𝛼,𝜇

I + 𝜅(m)_{s( ̃}_A)_{, which is still invertible (cf. Lemma 3.2). Moreover, we assume that M and r are properly chosen, depending on}

N, such that𝜂_N(m)_,M,r ≜ b

(m,𝛾) m

Γ(1−𝛾)Ch

𝛼,𝜇 in (3.8) is bounded away from 0; that is, there exist two real numbers “̌𝜂” and “ ̂𝜂” such that

0< ̌𝜂 ≤ 𝜂(_Nm)_,M,r≤ ̂𝜂, ∀N and m = 1, 2, … , M − 1. (3.9)

We add a subscript N to each matrix to denote the matrix size. Under the above assumption in (3.9), the matrix ̃(m)_,

K(m)_,_𝜂(m)

N,M,r, and ̃(m)are independent of m, and we therefore simply denote them as ̃N−1,𝜅 (constant), 𝜂N,M,r(constant), and ̃N−1, respectively. Now the coefficient matrix ̃(m)in (2.12) becomes

N−1=𝜂N,M,rI +𝜅 ̃AN−1 = [𝜂N,M,r+𝜅(̃a11−𝜚)]I + ⎡ ⎢ ⎢ ⎢ ⎢ ⎣

𝜚 𝜅 ̃a12 … 𝜅 ̃a1,N−2 𝜅 ̃a1,N−1

𝜅 ̃a12 𝜚 𝜅 ̃a12 … 𝜅 ̃a1,N−2

⋮ 𝜅 ̃a12 𝜚 ⋱ ⋮

𝜅 ̃a1,N−2 … ⋱ ⋱ 𝜅 ̃a12

𝜅 ̃a1,N−1 𝜅 ̃a1,N−2 … 𝜅 ̃a12 𝜚 ⎤ ⎥ ⎥ ⎥ ⎥ ⎦ ≜ [𝜂N,M,r+𝜅(̃a11−𝜚)]I + GN−1, (3.10)

where we set|̃a12| < 𝜚 < ̃a11(without loss of generality), GN−1 = [̃gi𝑗](N−1)×(N−1)= [̃g|i−𝑗|](N−1)×(N−1), and̃gi𝑗 =𝜅 ̃ai𝑗.

To study the spectrum of the preconditioned matrix((m))−1_(m)_{, we first introduce the generating function of the}

sequence of Toeplitz matrices {GN}∞_N=₁:

p(𝜃) =

∞ ∑

k=−∞

̃gke𝜄k𝜃, (3.11)

wherẽgkis the kth diagonal of GN = [̃gi−𝑗]N×Nand𝜄 =

√

−1. The generating function p(𝜃) is in the Wiener class50,51_{if and} only if

∞ ∑

k=−∞

|̃gk| < ∞. (3.12)

For GN −1defined in (3.8), we have the following conclusion.

Lemma 3.3. Under the above assumptions, it finds that p(𝜃) is real valued and in the Wiener class.

Proof. For convenience of our investigation, we can rewrite p(𝜃) for the matrix GNdefined in (3.8) as

p(𝜃) = ̃g0+2 ∞ ∑ k=1 ̃gkcos(k𝜃) = 𝜚 − 2 ∞ ∑ k=1 (−̃gk)cos(k𝜃). (3.13)

(14)

TABLE 1 The L∞- and L2-norm of errors, temporal convergence orders for solving Example 1 with𝛼 = 1.5,

N(M) =⌈2Mmin{r𝛾,2−𝛾}∕2_{⌉, 𝜇 = 1 +}𝛼

2,𝜅𝜇=1, and s = 3

DIDS (2.11) FIDS (2.10)

(r,𝜸) M Err_∞ Rate Err2 Rate CPU(s) Err∞ Rate Err2 Rate CPU(s)

(1, 0.8) 28 _6.734e−3 _– _5.974e−3 _– _0.068 _6.734e−3 _– _5.974e−3 _– _0.051

29 _3.639e−3 _0.888 _3.173e−3 _0.913 _0.182 _3.639e−3 _0.888 _3.173e−3 _0.913 _0.103

210 _1.972e−3 _0.884 _1.698e−3 _0.902 _0.619 _1.972e−3 _0.884 _1.698e−3 _0.902 _0.204

211 _1.108e−3 _0.832 _9.450e−4 _0.845 _2.712 _1.108e−3 _0.832 _9.450e−4 _0.845 _0.702

(2, 0.5) 27 _4.363e−3 _– _3.827e−3 _– _0.026 _4.363e−3 _– _3.827e−3 _– _0.028

28 _1.964e−3 _1.152 _1.692e−3 _1.177 _0.047 _1.964e−3 _1.152 _1.692e−3 _1.177 _0.059

29 _9.497e−4 _1.048 _8.124e−4 _1.059 _0.163 _9.497e−4 _1.048 _8.124e−4 _1.059 _0.152

210 _4.542e−4 _1.064 _3.934e−4 _1.046 _0.732 _4.542e−4 _1.064 _3.934e−4 _1.046 _0.473

(3, 0.8) 27 _1.473e−3 _– _1.304e−3 _– _0.028 _1.473e−3 _– _1.304e−3 _– _0.045

28 _5.980e−4 _1.301 _5.251e−4 _1.312 _0.074 _5.980e−4 _1.301 _5.251e−4 _1.312 _0.096

29 _2.459e−4 _1.282 _2.136e−4 _1.297 _0.287 _2.459e−4 _1.282 _2.136e−4 _1.297 _0.276

210 _1.037e−4 _1.246 _8.954e−5 _1.255 _1.056 _1.037e−4 _1.246 _8.954e−5 _1.255 _0.834

Abbreviations: DIDS, direct implicit difference scheme; FIDS, fast implicit difference scheme.

Because it knows that 0< −̃gk+1< −̃gk, limk→∞(−̃gk) =0 and

|(−̃gk)cos(k𝜃)| < (−̃gk)

with the series ∞ ∑

k=1

(−̃gk) being convergent, the series ∑∞k=1(−̃gk)cos(k𝜃) converges to a real-valued function for

∀𝜃 ∈ [− 𝜋, 𝜋], which also implies that p(𝜃) is real valued. According to Proposition 2.1 and its proof, it is not hard to note that limk→∞|̃gk| = 0. Therefore, it follows that∑∞_k=−∞|̃gk| < ∞, which completes the proof.

In fact, Lemma 3.3 ensures the following property that the given Toeplitz matrix GNcan be approximated via a circulant

matrix well.

Lemma 3.4. If p(𝜃), the generating function of GN, is in the Wiener class, then for any𝜖 > 0, there exist N′and M′> 0,

such that for all N> N′_,

GN −s(GN) =UN+VN, (3.14)

where rank(UN)≤ M′and||VN||2< 𝜖.

Now we consider the spectrum of ( ̃N−1)−1̃N−1−Iis clustered around 1.

Theorem 3.1. If𝜂N,M,rsatisfies the assumption (3.9), for any 0< 𝜖 < 1, there exists N′and M′> 0 such that, for all N > N′,

at most 2M′_{eigenvalues of the matrix ̃}_

N−1− ̃N−1have absolute values exceeding𝜖.

Proof. With the help of Lemma 3.4, we note that

̃

N−1− ̃N−1=𝜅 ̃A − 𝜅s( ̃A) =GN−1−s(GN−1) =UN−1+VN−1.

(3.15)

Because both VN −1and UN −1are real symmetric with||VN −1||2< 𝜖 and rank(UN −1)< M′, hence the spectrum of

VN −1 lies in (−𝜖,𝜖). By the celebrated Weyl's theorem, we see that at most 2M′eigenvalues of ̃N−1− ̃N−1have absolute values exceeding𝜖.

At this stage, we can see from Lemma 3.2 that

||( ̃N−1)−1||2= 1 min 1≤k≤N−1 [ 𝜂N,M,r+ ̃Λ]_k_,k < 1 𝜂N,M,r ≤ 1 ̌𝜂, (3.16)

(15)

TABLE 2 The L∞- and L2-norm of errors, temporal convergence orders for solving Example 1 with𝛼 = 0.5,

N(M) =⌈2Mmin{r𝛾,2−𝛾}∕2_{⌉, 𝜇 = 1 +}𝛼

2,𝜅𝜇=1, and s = 3

DIDS (2.11) FIDS (2.10)

(r,𝜸) M Err_∞ Rate Err2 Rate CPU(s) Err∞ Rate Err2 Rate CPU(s)

(1, 0.8) 28 _2.222e−3 _– _1.908e−3 _– _0.058 _2.222e−3 _– _1.908e−3 _– _0.045

29 _1.240e−3 _0.842 _1.060e−3 _0.847 _0.176 _1.240e−3 _0.842 _1.060e−3 _0.847 _0.096

210 _6.942e−4 _0.837 _5.918e−4 _0.841 _0.589 _6.942e−4 _0.837 _5.918e−4 _0.841 _0.196

211 _4.022e−4 _0.787 _3.420e−4 _0.791 _2.654 _4.022e−4 _0.787 _3.420e−4 _0.791 _0.558

(2, 0.5) 27 _1.608e−3 _– _1.318e−3 _– _0.023 _1.608e−3 _– _1.318e−3 _– _0.032

28 _8.214e−4 _0.969 _6.694e−4 _0.977 _0.046 _8.214e−4 _0.969 _6.694e−4 _0.977 _0.061

29 _4.171e−4 _0.978 _3.405e−4 _0.975 _0.181 _4.171e−4 _0.978 _3.405e−4 _0.975 _0.147

210 _2.112e−4 _0.982 _1.718e−4 _0.987 _0.747 _2.112e−4 _0.982 _1.718e−4 _0.987 _0.454

(3, 0.8) 27 _4.389e−4 _– _4.389e−4 _– _0.029 _4.389e−4 _– _4.389e−4 _– _0.041

28 _1.869e−4 _1.232 _1.845e−4 _1.250 _0.072 _1.869e−4 _1.232 _1.845e−4 _1.250 _0.101

29 _8.001e−5 _1.224 _7.874e−5 _1.228 _0.281 _8.001e−5 _1.224 _7.874e−5 _1.228 _0.273

210 _3.492e−5 _1.196 _3.439e−5 _1.195 _1.073 _3.492e−5 _1.196 _3.439e−5 _1.195 _0.826

TABLE 3 The L∞- and L2-norm of errors, spatial convergence orders for solving Example 1 with𝛼 = 1.5,

M(N) =_⌈(N∕2)2∕ min{r𝛾,2−𝛾}_{⌉, 𝜇 = 1 +}𝛼

2,𝜅𝜇=1, and s = 3

DIDS (2.11) FIDS (2.10)

(r,𝜸) N Err_∞ Rate Err2 Rate CPU(s) Err∞ Rate Err2 Rate CPU(s)

(1, 0.8) 23 _3.889e−2 _– _3.802e−2 _– _0.009 _3.889e−2 _– _3.802e−2 _– _0.008

24 _8.668e−3 _2.166 _7.760e−3 _2.293 _0.031 _8.668e−3 _2.166 _7.760e−3 _2.293 _0.030

25 _1.972e−3 _2.136 _1.698e−3 _2.193 _0.587 _1.972e−3 _2.136 _1.698e−3 _2.193 _0.256

26 _4.561e−4 _2.112 _3.846e−4 _2.142 _20.161 _4.561e−4 _2.112 _3.846e−4 _2.142 _1.727

(2, 0.5) 23 _3.865e−2 _– _3.789e−2 _– _0.003 _3.865e−2 _– _3.789e−2 _– _0.005

24 _8.625e−3 _2.164 _7.734e−3 _2.293 _0.010 _8.625e−3 _2.164 _7.734e−3 _2.293 _0.016

25 _1.964e−3 _2.135 _1.692e−3 _2.192 _0.045 _1.964e−3 _2.135 _1.692e−3 _2.192 _0.068

26 _4.542e−4 _2.112 _3.834e−4 _2.142 _0.736 _4.542e−4 _2.112 _3.834e−4 _2.142 _0.457

(3, 0.8) 23 _3.766e−2 _– _3.791e−2 _– _0.002 _3.766e−2 _– _3.791e−2 _– _0.004

24 _8.349e−3 _2.174 _7.710e−3 _2.298 _0.006 _8.349e−3 _2.174 _7.710e−3 _2.298 _0.009

25 _1.889e−3 _2.144 _1.682e−3 _2.197 _0.015 _1.889e−3 _2.144 _1.682e−3 _2.197 _0.032

26 _4.351e−4 _2.119 _3.801e−4 _2.145 _0.108 _4.351e−4 _2.119 _3.801e−4 _2.145 _0.152

where s(𝜅 ̃A) = F∗̃ΛF and the diagonal matrix ̃Λ contains all the eigenvalues of s(𝜅 ̃A). Meanwhile, we employ the fact that ( ̃N−1)−1̃N−1−I = ( ̃N−1)−1UN−1− ( ̃N−1)−1VN−1, (3.17)

then we have the following corollary.

Corollary 3.1. If𝜂N,M,rsatisfies the assumption (3.9), for any 0< 𝜖 < 1, there exists N′and M′> 0 such that, for all N > N′,

at most 2M′_{eigenvalues of the matrix ( ̃}_

N−1)−1̃N−1−I have absolute values exceeding𝜖.

Thus, the spectrum of ( ̃N−1)−1̃N−1is clustered around 1 for N that is large enough. It follows that the convergence rate of the PCG method is superlinear; refer to Chan and Ng50 _{and Ng}51 _{for details. Based on such observations, the} preconditioner(m) _{is fairly predictable to accelerating the convergence of PCG for solving both Equations (2.12) and}

(3.1) at each time level m = 1,2, … , M well, respectively; refer to numerical results in the next section.

Besides, although the theoretical analysis in Section 3.2 is only available for handling the model problem (1.1) with time-varying diffusion coefficients, that is,𝜅(x,t) ≡ 𝜅(t), the preconditioner (m)_{is still efficient to accelerate the}

conver-gence of nonsymmetric Krylov subspace solvers for Equation (2.12) and/or Equation (3.1) corresponding to the problem (1.1). The variable diffusion coefficients and nonsymmetric discretized linear systems make it greatly challenging to the-oretically study the eigenvalue distributions of preconditioned matrices ((m)₎−1_(m)_{, but we provide numerical results}