/
Автор: Lorentzen L. Waadeland H.
Теги: mathematics algebra exact sciences
ISBN: 0-444-89265-6
Год: 1992
Текст
STUDIES IN COMPUTATIONAL MATHEMATICS 3
editors: C. BREZINSKI and L. WUYTACK
C NTINUE
FR Tl NSWITH
PPLIC Tl NS
Lisa LORENTZEN
Haakon WAADELAND
NORTH-HOLLAND
CONTINUED FRACTIONS
WITH APPLICAl IONS
STUDIES IN
COMPUTATIONAL MATHEMATICS 3
Fditors:
C.BREZINSKI
Univerity of I Me
Villeneuved'Ascq, France
L. WUYTACK
University of A n twerp
Wilrijk, Belgium
HH
NORTH-HOLLAND
AMSTERDAM -LONDON -NI?VVYORK -TOKYO
CONTINUED FRACTIONS
WITH APPLICATIONS
Lisa LORENTZEN
Division of Mathematical Sciences
University of Irondheim
Trondheim, Norway
Haakon WA ADELAND
Department of Mathematics and Statistics
University of Irondheim
Trondheim, Norwav
L992
NORTH-HOLLAND
AMSI LRDAM • LONDON • NEW YORK -TOKYO
FI SEVICR SCIliNC'F PUBI ISHERS B.V.
Sara Burgerhartsiraat 25
P.O. Box 211.1000 AH Amsterdam,The Netherlands
ISBN:() 444 W265 6
© 1992 HIscvicr Science Publishers B.V. All rights reserved.
No part of this publication may be reproduced, stored in a retrieval system oi transmitted in anv form or
by any means, electronic, mechanical, photocopying, recording or otherwise, without the prior written
permission of the publisher, lilsevicr Science Publishers B.V.. Copyright & Permissions Department.
P.O. Box 521, 1000 AM Amsterdam. The Netherlands.
Special regulations for readers in the U.S.A. —This publication has been registered with the Copviight
Clearance Center, Inc. (CCC), Salem. Massachusetts. Information can be obtained from the CCC about
conditions under which photocopies of parts of this publication may be made in the U.S.A. All other
copyright questions, including photocopying outside of the U.S. A., should be referred to the publisher.
No responsibility is assumed by the publisher for any injuiv and/or damage to persons or property as a
matter of products iiabilitv negligence or otherwise, or from am use or operation of any methods,
products, instructions or ideas contained in the material herein.
This book is printed on acid-free paper.
Printed in The Netherlands
Preface
The name Shortly before this book was finished we sent out a number
of copies of Chapter I, under the name "A Taste of Continued Fractions".
Now, in the process of working our way through the chapters on a last
minute search for errors, unintended omissions and overlaps, or other
unfortunate occurrences, we feel that this title might have been the right
one even for the whole book. Tn most of the chapters, in particular in
the applications, a lot of work has been put into the process of cutting,
cancelling and "non-writing". In many cases we are just left with a
"taste", or rather a glimpse of the role of the continued fractions within
the topic of the chapter. We hope that we thereby can open some doors,
but in most cases we are definitely not touring the rooms.
The chapters Each chapter starts with some introductory informa-
information, "About this chapter". The purpose is not to tell about the contents
in detail. That has been done elsewhere. What we want is to tell about
the intention of the chapter, and thereby also to adjust the expectations
to the right (moderate) level. Each chapter ends with a reference list,
reflecting essentially literature used in preparing that particular chap-
chapter. As a result, books and papers will in many cases be referred to
more than once in the book. On the other hand, those who look for a
complete, updated bibliography on the field will look in vain. To present
such a bibliography has not been one of the purposes of the book.
The authors The two authors are different in style and approach.
We have not made an effort to hide this, but to a certain extent the
VI
creative process of tearing up each other's drafts and telling him/her to
glue it together in a better way (with additions and omissions) may have
had a certain disguising effect on the differences. This struggling type
of cooperation leaves us with a joint responsibility for the whole book.
The way we then distribute blame and credit between us is an internal
matter.
The treasure chest Anybody who has lived with and loved contin-
continued fractions for a long time will also have lived with and loved the
monographs by Perron, Wall and Jones/Thron. Actually the love for
continued fractions most likely has been initiated by one or more of
these books. This is at least the case for the authors of the present
book, and more so: these three books have played an essential role in
our lives. The present book is in no way an attempt to replace or com-
compete with these books. To the contrary, we hope to urge the reader to
go on to these sources for further information.
For whom? We are aiming at two kinds of readers: On the one hand
people in or near mathematics, who are curious about continued frac-
fractions; on the other hand senior-graduate level students who would like
an introduction (and a little more) to the analytic theory of continued
fractions. Some basic knowledge about functions of a complex variable,
a little linear algebra, elementary differential equations and occasionally
a little dash of measure theory is what is needed of mathematical back-
background. Hopefully the students will appreciate the problems included
and the examples. They may even appreciate that some examples pre-
precede a properly established theory. (Others may dislike it.)
Words of gratitude We both owe a lot to Wolf Thron, for what
we have learned from him, for inspiration and help, and for personal
friendship. He has read most of this book, and his remarks, perhaps most
of all his objections, have been of great help for us. Our gratitude also
extends to Bill Jones, his closest coworker, to Arne Magnus, whose recent
death struck us with sadness, and to all other members of the Colorado
continued fraction community. Here in Trondheim Olav Njastad has
Vll
been a key person in the field, and we have on several occasions had a
rewarding cooperation with him.
Many people, who had received our Chapter I, responded by sending
friendly and encouraging letters, often with valuable suggestions. We
thank them all for their interest and kind help.
The main person in the process of changing the hand-writ ten drafts
to a camera-ready copy was Leiv Arild Andenes Jacobsen. His able
mastering of I?Tj\X, in combination with hard work, often at times when
most people were in bed, has left us with a great debt of gratitude. We
also want to thank Arild Skj0lsvold and Irene Jacobsen for their part
of the typing job. We finally thank Ruth Waadeland, who made all the
drawings, except the I^T^X-made ones in Chapter XI.
The Department of Mathematics and Statistics, AVH, The University
of Trondheim generously covered most of the typing expenses. The rest
was covered by Elsevier Science Publishers.
We are most grateful to Claude Brezinski and Luc Wuytack for urging
us to write this book, and to Elsevier Science Publishers for publishing
it.
Trondheim, December 1991.
Lisa Lorentzen Haakon Waadeland
Contents
Introductory examples 1
1 Definition and basic concepts. Convergence 3
1.1 Prelude to a definition 3
1.2 Formal definition. Convergence. Notation 7
2 Some examples 10
2.1 The very best 10
2.2 A differential equation 13
2.3 An expansion of a function 14
2.4 A log-expansion 17
3 More examples 18
3.1 Ilypergeometric functions 18
3.2 From power series to continued fractions 21
3.3 From continued fractions to power series 22
3.4 One fraction, two series 23
3.5 The length of an elliptic orbit 26
4 Three classical convergence theorems 30
4-1 Sleszyriski-Pringsheim's Theorem 30
4.2 Van Vleck's Theorem 32
4.3 Worpitzky's Theorem 35
5 Convergence once again 37
5.1 Critical remarks on convergence 37
5.2 Modified approximants 38
5.3 Another concept of convergence 41
5.4 Another concept of continued fraction 44
5.5 Computation of approximants 44
Problems 46
Remarks 50
References 52
IX
II More basics 55
1 Tails of continued fractions 56
1.1 Tails 56
1.2 Tail sequences 59
1.3 Some properties of linear fractional transformations . 62
1.4 Speed of convergence. Truncation error bounds .... 63
1.5 More about general convergence 66
2 Transformations of continued fractions 69
2.1 Generating a continued fraction from a sequence ... 69
2.2 Equivalence transformations 72
2.3 The Bauer-Muir transformation 76
2.4 Contractions and extensions 83
Problems 86
References 91
III Convergence criteria 93
1 Two classical results 94
1.1 The Stern-Stolz divergence theorem 94
1.2 Continued fractions with positive elements 96
2 Periodic continued fractions 101
2.1 Introduction 101
2.2 Classification of linear fractional transformations . . . 101
2.3 Convergence of periodic continued fractions 104
2.4 Thiele oscillation 105
2.5 Tail sequences 106
3 Techniques to prove convergence 108
3.1 Convergence sets 108
3.2 Value sets 110
3.3 Value set techniques I. A posteriori truncation error
bounds 114
3.4 Value set techniques II. A priori truncation error bounds 116
3.5 Value set techniques III. The Hillam-Thron theorem . 119
3.6 Value set techniques IV. The Stieltjes-Vitali theorem . 123
3.1 Smaller value sets for truncation error bounds . . . .125
4 Convergence results 126
4.1 Two useful lemmas 126
4.2 Parabola Theorems 130
4.3 S-fractions 138
4.4 Oval theorems 141
XI
5 Limit periodic continued fractions 150
5.1 Definition 150
5.2 Finite limits, loxodromic case 150
5.3 Finite limits, parabolic case 157
5.4 Finite limits, elliptic case 159
5.5 Choice of approximants 160
5.6 Continued fractions K.(an/1) where an —> 00 169
5.1 Analytic continuation 174
Problems 177
Remarks 182
References 184
IV Continued fractions and three-term recurrence rela-
relations 189
1 Three-term recurrence relations 191
1.1 The structure of the solution space 191
1.2 Approximants for periodic continued fractions in closed
form 194
1.3 Linear independence of two solutions 196
1.4 The adjoint recurrence relation 197
1.5 Recurrence relations in a field F 200
2 Convergence of continued fractions 201
2.1 Pincherle's theorem 201
2.2 Auric's theorem 206
3 Tail sequences once more 209
3.1 Connection to recurrence relations 209
3.2 Minimal solutions and value sets 211
3.3 Tails and convergence 212
4 An application to linear recurrence relations 218
4.1 Forward stability of recurrence relations 218
4.2 A method for computing minimal solutions 220
5 Some generalizations of continued fractions 224
5.1 Introduction 224
5.2 G-continued fractions 225
5.3 Generalized (or vector valued) continued fractions . . 228
Problems 230
Remarks 235
References 237
Xll
V Correspondence of continued fractions 241
1 The nonned field (L,||-||) 242
1.1 Introducing the normed field 242
1.2 Correspondence at z — oo 243
1.3 Properties of the normed field (L, || -||) 246
2 Classification of continued fractions 248
2.1 Criteria for correspondence 248
2.2 Terminating continued fractions 251
2.3 Why classifications? 252
2.4 C-fractions 252
2.5 When does f(z) have a regular C-fraction expansion? 256
2.6 Algorithms for producing corresponding continued frac-
fractions 259
3 Pincherle's and Auric's theorems in (L, || ¦ ||) 265
3.1 Interpretation 265
3.2 A link between correspondence and classical conver-
convergence 270
3.3 Tails and correspondence 274
4 Branched continued fractions 274
4.1 A simple example 274
4.2 Approximants 277
4-3 Another example 279
Problems 281
Remarks 284
References 286
VI Hypergeometric functions 291
1 The hypergeometric functions 2^1 292
1.1 Why and how 292
1.2 A special case 296
1.3 Choice of approximants 298
1.4 Other continued fraction expansions 304
2 Confluent hypergeometric functions 311
2.1 Notation 311
2.2 iFi(b;c;z) 312
2.3 2F0(a,b;z) 316
2.4 oi51! (c; z) 317
3 Basic hypergeometric functions 318
3.1 Definition 318
Xlll *
3.2 2<Pi(a,b)c;q;z) 320
4 Continued fractions bo -f K(ari/6n) where an and 6ri are
polynomials inn 322
4.1 Introduction 322
4.2 Some special cases 322
Problems 326
Remarks 328
References 329
VII Moments and orthogonality 331
1 Orthogonality and continued fractions 332
1.1 Three examples 332
1.2 Moment sequences and moment functionals 338
1.3 Favard's theorem and Jacobi fractions 345
2 Gaussian quadrature 348
2.1 A quadrature formula 348
2.2 An example 351
3 Moment problems 353
3.1 The Stieltjes moment problem 353
3.2 Connection to continued fractions 356
Problems 361
Remarks 363
References 365
VIII Pade approximants 367
1 Classical Pade approximants 369
1.1 A creative problem 369
1.2 Pade approximants 374
1.3 Normal tables. Block structure 379
1.4 Connection to continued fraction expansions 382
1.5 A convergence result 385
2 Generalizations and extensions 386
2.1 Two-point Pade table 386
2.2 Pade type approximants 389
2.3 Multivariate Pade approximants 389
Problems 391
Remarks 392
References 393
XIV
IX Some applications in number theory 397
1 Some basics on regular continued fractions 399
1.1 The Euclidean algorithm 399
1.2 Representation of positive numbers by regular contin-
continued fractions 402
1.3 Best approximation 408
2 Some diophantine equations 410
2.1 Linear diophantine equations 410
2.2 Pell's equation 413
3 Factoring integers 418
3.1 Introduction 418
3.2 Fermat factorization 420
3.3 Factor bases 423
3.4 A lemma on continued fractions 427
3.5 The continued fraction factoring algorithm 428
Problems 435
Remarks 437
References 439
X Zero-free regions 441
1 Zero-free regions for certain sequences of polynomials . . . 442
1.1 Introduction 442
1.2 An application of Van Vleck's theorem 448
1.3 An application of the parabola theorem 451
1.4 The Stieltjes case 453
1.5 The case when an ? R 456
1.6 A fundamental recurrence formula 460
1.1 Chain sequences 462
1.8 Two theorems on zero-free regions 464
2 Stable polynomials 468
2.1 Introductory remarks 468
2.2 Polynomials with real coefficients 470
2.3 Polynomials with complex coefficients 472
Problems 474
Remarks 477
References 479
XI Digital filters and continued fractions 481
1 Filters and their representation 482
XV
1.1 Some introductory examples 482
1.2 Digital filters 484
1.3 Stable filters 489
1.4 Graph representation of filters 493
2 The Schur algorithm 501
2.1 An old algorithm 501
2.2 Schur fractions and digital filters 505
3 Model reduction 508
3.1 General remarks 508
3.2 Stable filters with rational transfer function 509
Problems 514
Remarks 518
References 519
XII Applications to some differential equations 521
1 Second order linear equations 523
1.1 Introduction 523
1.2 An "almost" Euler-Cauchy equation 531
1.3 Two further examples 535
2 Riccati equations 540
2.1 General Remarks 540
2.2 An old example 544
2.3 A new example 547
Problems 554
Remarks 556
References 557
Appendix. Some continued fraction expansions 559
1 Introduction 560
2 Mathematical constants 561
3 Elementary functions 563
3.1 Introduction 563
3.2 The exponential function 563
3.3 The general binomial function 564
3.4 The natural logarithm 566
3.5 Trigonometric and hyperbolic functions 568
3.6 Inverse trigonometric and hyperbolic functions .... 569
3.1 Continued fractions with simple values 571
4 Hypergeometric functions 573
XVI
4.1 General expressions 573
4-2 Special examples with 0^1 575
4.3 Special examples with 2^o 576
4.4 Special examples with \F^ 578
4.5 Special examples with 2^1 580
4.6 Some simple integrals 582
4-7 Gamma function expressions by Ramanujan 584
5 Basic hypergeometric functions 593
5.1 General expressions 593
5.2 Two general results by Andrews 594
5.3 ^-expressions by Ramanujan 595
References 598
Subject index 601
Chapter I
Introductory examples
About this chapter
We have often been asked questions, by students as well as by established
mathematicians, about continued fractions: what they are and what
they can be used for. Sometimes the questions have been raised under
circumstances where a quick answer is the only alternative to no answer:
in the discussion after a talk or lecture, by a cup of coffee in a short
break, in an airplane cabin or on a mountain hike. In responding to
these questions we have very often been pleased by the sparks of interest
we have seen, indicating that we had managed to transmit a glimpse of
new and apparently appealing knowledge. In quite a few cases this led
to a further contact and "follow-up activities".
This introductory chapter is to a large extent inspired by the questions
we have received and governed by the answers we have given. There
is of course a great danger: A quick answer is often a wrong answer.
It may (and even ought to) tell the truth and nothing but the truth,
but it definitely does not tell the whole truth. This may lead to false
guesses. This danger is in particular great in cases where observations
and experiments are used to create and support guesses, such as in
Section 2 of the present chapter. But we still wanted to keep this (often
2 Chapter I. Introductory examples
non-accepted, but highly necessary) aspect of mathematics as part of
the introductory chapter. We have tried to reduce the danger partly by
the way things are phrased, partly by indicating briefly how wrong such
guesses can be, and finally by referring to a more careful treatment later
in the book.
We decided to include, already in the introductory chapter, on the one
hand three classical convergence theorems, on the other hand some of the
newer thoughts on convergence and computation of continued fractions.
Definition and basic concepts. Convergence 3
1 Definition and basic concepts. Convergence
1.1 Prelude to a definition
Let {tn} be a sequence of complex numbers. When we talk about the
series
oo
*n=*l+*2 +••- + *« + ¦¦-, A.1.1)
we have in mind the sequence {Tn} of partial sums
n
or recursively defined
= Tn -f
Convergence of the series A.1.1) means convergence of {Tn} to a complex
number T, in which case we write
oo
X>n=T. A.1.2)
n=l
Similarly we are familiar with infinite products
oo
II Pn = P\ • V2 ¦ • -Pn ' • * , A.1.3)
n=l
where all pn are complex numbers ^0. {Pn} is the sequence of partial
products
n
k=l
or recursively defined
Pn+l = Pn ' Pn+l ¦
Convergence of the infinite product A.1.3) means convergence of the
sequence {Pn} to a complex number P ^ 0, in which case we write
oo
J[Pn = P- A-1.4)
n=l
Chapter I. Introductory examples
Let {an} be a sequence of complex numbers ^ 0, and let {/rt} be the
sequence from C = C U {00} given by
/i = «i ? h = — , h =
1 -f ao
and generally
*
Jn —
do
Similarly to what we have for sums and products this also leads to
a concept, having to do with the nonterminating continuation of the
process, in this case the concept of a continued fraction, constructed
from a sequence {an} of complex numbers, all ^ 0:
CO fli
K K/l) = . A.1.5)
l tt
Convergence of A.1.5) means convergence of the sequence {fn} of ap-
proximants. We shall also accept convergence to 00.
Example 1 For the continued fraction
oo
6
1 + 7—
we find
t 6 t 42
Definition and basic concepts. Convergence 5
It is easy to prove, by induction (see Problem 5, with x = 2, y = —3)
that generally
(-3)" -2"
Jn D
From this it follows that the continued fraction converges to 2.
O
Quite similarly we can construct, from any sequence {bn} of complex
numbers, a continued fraction
oo 1
K A/M = , A.1-6)
n=l 1
or from two sequences, {an} and {bn} of complex numbers, where all
an ^ 0, a continued fraction
00 «i
K K/6n) = . A.1.7)
6 °
&3 +
A.1.5) and A.1.6) are obviously special cases of A.1.7). In the partic-
particular case when in A.1.6) all bn are natural numbers we get the regular
continued fraction, well known in number theory, the one coming from
the Euclidian algorithm. We shall look at regular continued fractions in
Example 2 of the present chapter, and more seriously in Chapter IX.
Let us take a look at the common pattern in the three cases: series, prod-
products and continued fractions (and other constructions for that matter).
In all three cases the construction can be described in the following way:
We have a sequence {<j>k} of mappings from C into C. By composition
we construct a new sequence {$n} of mappings
= <t>\ , $n = #n-l O <j)n = (j)i O <jJ O • • • O <j)n . A.1.8)
6 Chapter I. Introductory examples
In all three cases there is a fixed complex number c, by means of which
convergence is defined: as convergence of {<?n(c)} for that particular c.
(There is a difference in the question of whether or not convergence to
0 or oo is counted as convergence or not.)
For series we have
and the partial sums are
$n@) = 4>i o fa o • - • o (j)n(Q) = tx + t2 + •¦• + *„,
i.e. here we have c = 0.
For products we have
fa(w) =
and the partial products are
i.e. here we have c = 1.
For continued fractions A.1.7) we have
k + w
and the approximants are
¦n@) =
a
ti
n
i.e. here we have c = 0.
Remark: If all ak ^ 0 the mappings fa are all non-singular linear
fractional transformations. Hence <^jTl all exist. We shall later make
much use of this property.
Definition and basic concepts. Convergence
1.2 Formal definition. Convergence. Notation
The following definition is due to Henrici and Pfluger, see for instance
[Henr77, p. 474] (for a slightly different version):
Definition A continued fraction is an ordered pair
(({«»}, {*»}). {/»}) , A-2-1)
where {fln}i° and {&n}o° are given sequences of complex numbers, an
0, and where {/„} is the sequence of extended complex numbers, given
by
n = 0,1,2,3,---, A.2.2)
where
So(w) = so(w), Sn(w) = 5n_i (sn(w)), n = 1,2,3, - - - , A.2.3a)
sn(w) = -—-—, n = l,2,3,---. A.2.3b)
See also [JoTh80, p. 17].
The continued fraction algorithm is the function K mapping a pair
({ari}, {bn}) onto the sequence {/n} defined by A.2.2) and A.2.3). Here
the numbers an and bn are called the nth partial numerators and de-
denominators. A common name is element. The number
Sn@) = bo+ ~ A.2.4)
5
8
Chapter I. Introductory examples
is called the nth approximant. Several more convenient ways of writing
the approximants are introduced in the literature on continued fractions.
We shall here use:
A.2.4')
and more generally
c / \ * . ai . a'2
Sn(w) = bo+ —+ —
6, b2
t
bn
Convergence of a continued fraction to an extended complex number /
means convergence of {/«} to /, in which case we write
/ = bo + — , ~r
a
n
n H
or
A.2.5)
A.2.5')
We even use the notation on the right-hand side when we discuss con-
continued fractions more generally, regardless of convergence properties.
In computing numerically the value of a continued fraction the approx-
approximants, in particular those of high order, are essential. The first ones
are:
/o = &o »
Gb{ -f a\
-I- a2
A straightforward computation without cancellation leads to fractions
for the approximants:
A.2.6)
If we define
B~-\
—
1 '
0
>
—
1
A.2.7)
the following is easily proved by induction:
An + vlri
A.2.8)
Definition and basic concepts. Convergence
where the recurrence relation
9
An '
= bn
' An-l '
+ an
Bn-2
A.2.9)
for n = 1,2,3,... holds. The proof is left as an exercise. Observe that
5n, being a composition of non-singular linear fractional transformations
sk(w) =
is itself a non-singular linear fractional transformation. We have in par-
particular
Bn-\
We shall here call An and Bn nth canonical numerator and denominator
(sometimes just numerator and denominator). An important property
of the numbers An and /?„, is the determinant formula
n
AnBn.x - An_KBn = (-
A-2.10)
k=i
The proof is straight forward use of the recurrence relation A.2.9), and
is left as an exercise.
Let it finally be mentioned that we also may go "in the opposite di-
direction", i.e. from given sequences {An} s^d {Bn} to a continued frac-
fraction A.2.5'). Actually, any pair ({;4n},{f?n}) with the initial condition
A.2.7) determines uniquely a continued fraction A.2.5') with
Jn - .. ,
provided that
AnBn-X - An-xBn ? 0, n = 0,1,2,
(Theorem 7 in Chapter II).
10 Chapter I. Introductory examples
2 Some examples
2.1 The very best
Example 2 From the equality
follow the equalities
2 + 2 + (>/2 - 1) 2 + 2 + 2 +(v/2-1)
1 1 1 1
2 + 2 + 2 + 2 + (V5-1)'
and so on, as long as we want to. Since obviously
. 11 1 1
for any length of the row of dots, it seems to be a good idea to take a
look at the approximants of the regular continued fraction
We find
111
+ 2 + 2 + 2+.
•¦i-
1+11 =
2 + 2
1+2 + 2 + 2
2 + 2 + 2 + 2 ~~
1 + 1 I l I I -
^ 2 + 2 + 2 + 2 + 2
1.5
7
5 ~~
99
70
1
= 1.4
= 1.4166... ,
= 1.41379... ,
= 1.4142857... .
These fractions seem to approach v2 pretty quickly. Already the fifth
one, the last one listed, has an error less than .00008. We phrase tlus
Some examples 11
observation in the following way: The fractions seem to be very good
rational approximations to the irrational number y/2.
O
This is a good place for a warning: Identities such as the ones we have
studied here, which led us into the temptation of studying 1 + K{\/2),
in the hope of getting good approximations to y/2, may just as well be
a dead end. (Actually, the normal thing is that it goes wrong.) The
equality
1 V
will for instance lead to
but the continued fraction (still) seems to converge to y/2. See also Sub-
Subsection 3-4- Such identities are special cases of more general recurrence
formulas, which will play a crucial role later in this exposition.
But in the present case everything goes well. Our numerical observations
are matched by mathematical reality: In fact, it can be proved, that the
fractions obtained are not only very good approximations to \/2, but the
very best, in the following sense: If p/q is one of the fractions obtained
in the way described above, and if m/n is a fraction such that
771
yfi--
n
o
>
i.e. m/n is a better approximation to y/2 than p/q, then n > q. Slightly
rephrased: In order to find a better approximation to y/2 than p/q, we
have to increase the denominator, (p,q,m,n are all positive integers.)
We shall not go into the proof of this here, merely present a geometric
illustration (also without proof) of this way of approximating an irra-
irrational number:
Let a be a positive irrational number (in our case y/2). We shall let the
number a be represented by the ray y = ax from the origin and into
the first quadrant of a cartesian coordinate system. Let furthermore
12
Chapter I. Introductory examples
the lattice point (n, m) represent the fraction m/n. Then the following,
illustrated in Figure 1, can be proved: Assume that there is a nail in
every lattice point, and a rubber band fastened in the points A,0) and
@,1). Stretch the rubber band with a pencil following the ray y = ax
(or y = \/2x in our case). Then the corners of the polygon, in the
order they turn up, are the lattice points corresponding to the rational
approximants of the regular continued fraction for a. Observe e.g. on
Figure 1. the points A,1), B,3), E,7), corresponding to the fractions
1/1, 3/2, 7/5. Inside the polygon, i.e. where the ray is, there are no
lattice points, showing the "bestness" of the fractions 1/1,3/2, 7/5 (and
17/12,41/29, • • •) as approximants for y/2.
This way of illustrating approximations was told to one of the authors
by Viggo Brun in lectures at the University of Oslo [Brun50j, but it
goes way back, and different people have been given credit for it. Any
positive irrational number has a unique representation by a continued
fraction of this type. See remark 2 at the end of this chapter.
Some examples 13
2.2 A differential equation
Example 3 From the differential equation
y = V + y"
follow by differentiation the equalities
y' =
yK ' -
and hence, assuming that we do not divide by 0:
y1 = 2Jt7l7"
y"
= 2 +
y(n+\)/y(n+2) '
From this it follows that
2/7
Tl+l
This suggests to look at the continued fraction
2 + 2+---+2+.--
We "know" from Example 2 that this converges to \/2 + l> which suggests
that
y'
or
y
14 Chapter I. Introductory examples
from which it would follow that
This is actually a solution of the given differential equation.
O
There is of course no good reason to use this "method" for the present
differential equation, since there exists a perfectly good, simple method
taught in elementary calculus classes. Furthermore, the continued frac-
fraction method (even after it is properly established) produces only a par-
particular integral, not the general solution. But there are cases, where this
idea leads to non-trivial results, and where the method may represent an
alternative to (or at least a supplement to) existing power series meth-
methods for solving Unear ODEs. See [Khov63], [Steen73], [Waad83] and the
references there. See also Chapter Xll of the present book.
2.3 An expansion of a function
Example 4 From the identity
x
follow, as it did in Example 2, the identities:
x x
\/l + x - 1 = - -
This suggests to look at the approximants of the continued fraction
xx x
2 + 2+-..+ 2+-.-'
i.e. at the rational functions
x xx 2x x x x x2 + Ax
2' 2 + 2 ~ z + 4' 2 + 2 + 2 = Ax + 8 '
and so on. We refer to the warning in Subsection 2.1.
Some examples
15
Continued fraction expansions are less known than power series expan-
expansions. In the present case the Taylor series expansion at 0 is
oo
- 1 =
I, L L n JLo 0|
* = 2* - 8* + 16* - 128* +
It converges for \x\ < 1 and diverges for \x\ > 1. The approximants of
the series are the partial sums
1
-
2
1
So = -X X ,
2 2 8 '
and so on. Observe that the series approximants are polynomials, whereas
the continued fraction approximants are rational functions.
Let us make an experiment: We compute the two types of approximants
for a certain x— value to see what happens. We choose x = .96, in which
case the value of the function is exactly .4. In the table below some power
series approximants (sn) and some continued fraction approximants (/n)
are listed, all correctly rounded in the 4th decimal place:
n
fn
1
.4800
.4800
2
.3648
.3871
3
.4201
.4022
4
.3869
.3996
5
.4092
.4001
6
.3932
.4000
7
.4053
.4000
This of course does not prove anything, but it suggests that in some
cases the continued fraction may be better (converge faster) than the
power series expansion. (It is, however, only fair to say, that such a
comparison, based merely upon the order n of the approximant, does
not always give the correct picture. Essential in the comparison is the
resources needed, usually the time.)
Even more flattering for the continued fraction expansion is the choice
x = 3. In this case it does not make sense to compute power series
approximants, since we know that the power series diverges. In the next
table the first seven continued fraction approximants are listed, correctly
rounded in the 4th decimal place. Keep in mind that the value of the
function is 1.
16
Chapter I. Introductory examples
n
fn
1
1.5000
2
.8571
3
1.0500
4
.9836
5
1.0055
6
.9982
7
1.0006
The table suggests that the continued fraction expansion converges to
the right value for x = 3, i.e. for a value where the power series diverges,
in which case the continued fraction is better than the power series also
in that respect. (The next three approximants are .9998, 1.0001 and
1.0000.)
O
In Chapter 111 it will be proved that the continued fraction in Exam-
Example 4, with real x, converges for all x ? [—l,oo) and diverges for all
x ? (—oo, — l). For complex x it will be proved, that the continued frac-
fraction expansion converges in the whole plane, except on the ray (—oo, — 1)
of the negative real axis, and to the right value. In Figure 2 we have
convergence of the continued fraction expansion in the whole plane, ex-
except on the strongly indicated ray, whereas the power series expansion
converges inside the dotted circle and diverges outside of it.
\
\
p
Figure 2.
Some examples 17
For later use in the present chapter we rewrite the identities at the
beginning of this example in the following form:
flc-1 x/A
x/A x/A x/A
11 ' B.3.1)
1 + 1 H h 1 + (-s/T+aB - l)/2
The approximants are the former ones, divided by 2.
2.4 A log-expansion
Example 5 An example, related to the previous one, but less trivial,
is the expansion
or more precisely
1 /1 , \ z azz a'iZ
logfl -f z) — —
SV ^ ; 1+1 + 1
where log here shall mean the principal value of the natural logarithm,
and where for all k > 1
k k
2Bib
Right now we shall not worry about how one gets this, only use it for
some experiments, to compare it to the power series expansion
z2 z* zA
log(l + z) = z-j + — - — + .-..
Let us take the "worst" example, z = 1. The series then converges to
log 2, but very slowly. The first seven continued fraction approximants
are listed in the table below, correctly rounded in the 5th place. The
value is log 2 = .69314718, correctly rounded in the 8th place.
18
Chapter I. Introductory examples
n
fn
1
1.00000
2
.66667
3
.70000
4
.69231
5
.69333
6
.69312
7
.69315
In order to get the series approximation sn with the same accuracy we
need n > 105.
Let us also here try a z- value where the series does not converge, for
instance z — 3. In the next table the first 7 continued fraction approx-
imants are listed, all correctly rounded in the 4th place. The value is
log 4 = 2 log 2 = 1.38629436, correctly rounded in the 8th place.
n
fn
1
3.0000
2
1.2000
3
1.5000
4
1.3636
5
1.3973
6
1.3837
7
1.3874
This table also suggests convergence to the right value.
_O
We shall later see, that the present continued fraction expansion for
log(l -f z) converges for all complex z, except on the ray (—oo,-1]
of the negative axis, whereas the power series expansion converges for
\z\ < 1 and diverges for \z\ > 1. An illustration would look like Figure
2.
3 More examples
3.1 Hypergeometric functions
Example 6 The hypergeometric functions
ab z a(a+ 1NF+ I) z2
+\
C.1.1)
where a, b, c are complex numbers, and c not 0 or a negative integer,
are of great importance in several applications. Many of the well known
More examples 19
special functions are special cases of C.1.1). Tf we assume that also a
and 6 are different from 0 and the negative integers, the series C.1.1) is
an infinite power series whose radius of convergence is 1. The following
formal identities can be established from C.1.1) by comparing the power
series on both sides term by term:
F(a,6;c;z) = F(a,b + 1; c + 1; z)
a + 1,6 + 1; c + 2; z)
F(a,6 + l;c+l;z) = F(a + 1, 6 + 1; c + 2; z)
zF(a+ l,6+2;c + 3;
Assuming that we avoid zeros in the denominators, this can be rewritten
in the following way:
F(a, b\c\z)
F(a,b+l;c+l;z)
- 1 +
- 1 +
—a(<
c
F(a,b
F(a+1,
-F + 1
(c +
F(a + 1,
(c
+
6
)(
1
,6
«
: +
i;
)(i
+
-6)
1)
c4
1; c
- a
c +
1; c
z
i;
'I
2)
-' +
^)
2;z)
l)z
2;z)
F(a+l,6+2;cH 3;
Observe that the denominator on the right-hand side of the first equality
is equal to the left-hand side of the second one. Furthermore, the de-
denominator on the right-hand side of the second equality coincides with
the left-hand side of the first one, if in the former a is replaced by a + 1,
6 is replaced by 6 + 1 and c by c + 2 in all places. Hence, by repeatedly
increasing the first two parameters by 1 and the third one by 2 we are
lead to a continued fraction in a similar way as we have seen in Subsec-
Subsection 2.2. The continued fraction, already studied by Gauss [Gaussl3],
[Gauss 14], is of the form
1 -f- 1 -J-' • •-{- L -J-
where
,(° + ").(c:^ + "> , . = 0,1,2,-, C.1.2')
(c + 2n)(c -f 2n + 1)
20 Chapter I. Introductory examples
(b -f n)(c — a + n) ,
K A ' 71=1,2,3,--.. C.1.2")
-f 272 - 1)(C + 272)
We shall see later, in Chapter VI, that the sequence of approximants
converges to a meromorphic function in the whole plane, except on the
ray z > 1 of the positive real axis, and that this function is
F(a,b;c;z)
(or its ineromorphic continuation, if we regard the function as primarily
defined by its power series).
Observe that
<zn-> --. C.1.3)
The continued fraction C.1.2) thus is an example of what is called a
limit periodic continued fraction. The tails of the continued fraction
"look more and more like" the continued fraction expansion for
2 > C-L4)
(see B.3.1)). We shall later use this property to improve the procedure
of computing the values of the continued fraction C.1.2). The idea is to
replace the actual tails of C.1.2) by C.1.4) in forming the approximants.
This means, that we, instead of using 5n@) as approximants, use 5n (?/;),
with it? as in C.1.4). Recall that STI(w) is given by the formula in A.2.8).
is called a modified approximant.
Later in this exposition we shall see more of hypergeometric functions
and their expansions. The reasons for including this particular example
here are: 1. To emphasize the connection between three term recurrence
relations (such as the formal identities for the hypergeometric series or
the differential equations in Subsection 2.2.) and continued fractions.
2. To put the example from Subsection 2.4 into a more general context:
The log-function is a special hypergeometric function:
Three-term recurrence relations will be the topic of Chapter IV.
O
More examples 21
3.2 From power series to continued fractions.
We have seen, that continued fractions of the form
CL\Z
1 + 1 H + i H
in some cases are of advantage compared to power series expansions both
as far as speed of convergence and domain of convergence are concerned.
Hence it is of interest, and sometimes useful, to go from a power series
to this particular continued fraction. We shall illustrate the most prim-
primitive way of doing this (in [CuWu87] called the method of successive
substitutions) by using the log-series as an example:
z2 z3 zl
log{l + z) = lu(z) = *-_ + ---
«¦> -
This leads to the identity
/,(*)'
1 + k{z) '
z
where l\(z) is a uniquely determined power series starting with a term
cz. We recognize the start of the expansion from Subsection 2.4, and the
process could have been continued to any length (depending upon how
22 Chapter I. Introductory examples
many terms we start with in the log-series). It is of course not obvious,
that we will get a continued fraction of the form
d\Z (I2Z 0>nz
1 _|_ 1 +...+ 1 +...'
since we possibly may run into an ^B), starting with a term czk with
k > 1. It is known, however, that this will not happen in the present
case. This procedure gives a partial answer to the question mentioned
in Subsection 2.4, how we can get the continued fraction expansion for
log(l 4- 2). But by this procedure, as presented here, we do not get
general formulas for an, let alone anything about convergence. In con-
conclusion we may say, that the present method by far is not the best
practical method, but it indicates a possible bridge from power series to
a continued fraction expansions.
3.3 From continued fractions to power series
Sometimes we want to go in the opposite direction, i.e. from continued
fraction to power series. We shall use our continued fraction expansion
for log(l -f 2) as an example. Here are the first (classical) approximants
with their power series expansions at 0:
First approximant:
- = 2 -f O22 + O23 + 0z
Second approximant:
Z2 23 ZA
1 + 2/2 2_ 4 8
Third approximant:
2 _ z2 2^ 2
. 2/2 ~z~Y + y~9'
+1 + 2/6
More examples 23
Fourth approximant:
_2 ~3 _4
A* & >C
z/2 2 3 4
The underlined terms coincide with terms in the power series for log(l -f
z). Observe that the agreement increases with the order of the approxi-
mants. It can be proved (and it will, in Chapter V) that this continues.
It is called correspondence between the power series and the continued
fraction expansion. A proper definition will be given later.
3.4 One fraction, two series
We shall now look at a very special example, which however will prove
very useful later.
Example 7 The identity
1-Z
used repeatedly leads to the identities
z z z z
z—- , z =
I-z+l-z + z1 I -z + l -z + l-z + z'
and generally
z z
1-Z+1-Z+---+1-Z
On the other hand, the identity
leads to
1 - z + \ - z-\ +1 - z - 1
24 Chapter I. Introductory examples
Inspired by these two identities we look at the continued fraction
z z z
I- z + 1 - z-\ +l-z-\
For simplicity we assume z to be different from all roots of unity, in
which case the approximants fn are:
z z(l + z)
~ \-Z~ 1 - 22
z z
_ z(l - z) z(l - z2)
1 — z 4-1 — z 1 — z -\- z2 1-f- z'3
z z z z(l 4- zA
h -
JJ 1-z + l-z + l-z 1-z1
By induction the following formula is easily established:
fn =
See Problem 5, with x = z, y = — 1.
We shaD make tauo types of observations on the approximants, one on
convergence, one on correspondence. We distinguish two cases:
lim fn = z
n —> oo
/„ = z + (—z)n+1 + higher powers of z
lim /„ = -1
n —> oo
/„ = — 1 + (—z)~n -f higher powers ofz
The way these observations will be phrased within the analytic theory
of continued fractions is as follows:
The continued fraction
-? — — C.4.2)
More examples 25
a) converges in \z\ < 1 to z, and corresponds at 0 to the series
z + Oz2 + O23 + • ¦ • ,
b) converges in \z\ > 1 to — 1, and corresponds at 00 to the series
_O
This example shows that one and the same continued fraction expansion
may converge to two different functions in two different regions and
correspond to two different series at two points (here 0 and 00).
This trivial example has its non-trivial relatives, where one continued
fraction simultaneously represents two different analytic or meromorphic
functions by convergence and correspondence.
We conclude this section with another trivial remark (still in Example
7), which also has its non-trivial analogues. It has to do with replacing
the classical approximants fn = 5n@) by some modified approximant
Sn(w). The interesting w-values here are z and —1:
Case 1: If 5n@) is replaced by Sn(z), all (classical) approximants will
be replaced by z. This implies two things: The convergence to z
in IzI < 1 is accelerated (bull's eye, the value is hit right away),
and the convergence to z is extended also to the region \z\ > 1,
i. e. we have an analytic continuation of the limit function in
\z\ < 1 to the whole plane.
Case 2: If 5rt@) is replaced by 5n(—1), all classical approximants will
be replaced by —1. This implies two things: Acceleration of con-
convergence to —1 in \z\ > 1 (again bull's eye), and the convergence
to —1 is extended also to 0 < \z\ < 1, i.e. we have an analytic
continuation of the limit function in \z\ > 1 to the whole plane,
minus the origin.
26 Chapter I. Introductory examples
The continued fraction C.4.1) is a special case of a T-fraction, named
after W. J. Thron. We shall get back to T-fractions later in the book.
3.5 The length of an elliptic orbit
Example 8 We shall look at the computation of the circumference L
of the ellipse
?- + L. = lJ a>6>0, a>0. C.5.1)
The well known arc length formula leads to an elliptic integral. One way
of finding approximate values for it is to use power series. By using the
arc length formula one easily proves the following, which is a well known
formula, see for instance [Il{itte55]:
t t2 t3 25J4 \
+ + + + j C.5.2)
1 22 2r> 28 214
where
* = f<2—rl - C.5.2')
\a + bj
For the general coefficient an explicit formula exists, and also the con-
convergence properties are known. We shall leave out both here. Observe
that the formula is exact for a circle, a = 6, i.e. t = 0.
One way of finding approximate values for L is to truncate the series at
different places. We shall, however, use a quite difFerent approach:
We transform the series into a continued fraction the way it is shown in
Subsection 3.2. The start of the continued fraction is in this case
i/4 -t/16 -3t/16 -31/16
1 i . ^o.O.oJ
The first approximants are
MO = i,
More examples 27
... t/4 -*/16 -3*/16 64 -
/3@ - 1 + T+~T"+~T^-
t/A -t/16 -31/16 -3J/16 _ 256 - 48* - 21*2
17+ I + I + I ~ 256 -112* + 3i2
These rational functions are what we later will learn to know as Pade
approximants to the series in the //-formula.
The Pade approximants determined above give us a sequence of approx-
approximate formulas for L:
We shall not include any discussion on the accuracy of the formulas,
merely mention some points to indicate it: The formula with n = 2,
simple as it is, has an error less than 3 mm for an ellipse with size and
eccentricity as the orbit of the planet Mercury. The formula with n = 3
has for the same ellipse an error roughly = 1/10 of the wave length of
blue light. In the "flat" case, which is likely to be the "worst" case
(t — 1), the exact value of L is 4a. The approximate formulas give in
this case:
= 3.1416a
= 3.9270a
7ra/2(l) = 3.9794a
7ra/:J(l) = 3.9924a
7ra/,(l) = 3.9964a
The factors are all correctly rounded in the 4th decimal place.
We shall conclude this example by presenting two approximate formulas
obtained in a different way, namely by using a modified approximant
for the continued fraction C.5.3). We have already touched upon the
concept in the Subsections 3.1 and 3.4-
28 Chapter I. Introductory examples
In the continued fraction C.5.3) we pretend that all partial fractions
from the second one are equal to
1 '
i.e. we replace the continued fraction C.5.3) by
4/4 -f/16 -*/16 -*/16
+ T+ 1 + 1 + I +.-."
(This is of course only an experiment, and we have no guarantee that
it will lead to a good approximation.) This continued fraction may be
written
1 + -^- , C.5.4)
1 -+- w
(which, in standard notation, is S\(w)), where w is the value of the
continued fraction
-t/16 -
1 + 1 + 1 +
i.e. (see B.3.1))
Since
= 3 - V4 - t,
we get the formula
L « ?r(a + 6)C - \/4-0 . C.5.5)
This formula was first found by Ramanujan [Rama57]. For an ellipse of
si/e and eccentricity as the orbit of Mercury this formula has an error
less than 2 mm. For the degenerate case ("flat" case) it gives the value
3.9834a.
The next formula also uses a modified approximant C.5.4), but with an-
another w, based upon the observation that the continued fraction C.5.3)
has two equal partial fractions
More examples 29
in a row, and in front of them
-t/16
1 '
If we pretend that all subsequent partial fractions are
-3S/16
1 '
(again merely an experiment) we have in C.5.4)
w
_ 1 /-3*/16 -3^/16 -3*/16 -3*/16 \
~ 3 V 1 + 1 + 1 + 1 +•• J '
which has the value
-3J/4-1).
(See B.3.1).) This suggests the approximate formula
t/4
or, rewritten in a nicer form
L % ir(a + b) A 4 , ) • C.5.6)
This seems to be the best of the approximate formulas mentioned in this
section. For the "flat" case it gives the value 3.9984a.
This formula also is due to Ramanujan, although it is not known how
he established it. It is assumed, though, that the method shown in the
present section was the one he used [AlBe88].
O
30 Chapter I. Introductory examples
4 Three classical convergence theorems
4-1 Sleszyriski-Pringsheim's Theorem
Theorem 1 The continued fraction K.{a,n/bn) converges if for all n
fl. D.1.1)
D.1.2)
D.1.2')
Under the same condition
1/-1 <
holds for all approximants fn} and
for the value of the continued fraction.
Proof : We first prove D.1.2) by induction. For any n > 1 we have
a
n
bn
a
n
On I +
which proves D.1.2) for n = 1. Next, for any n > 2
On/bn
which establishes D.1.2) for n = 2.
Assume that for some /z, 1 < k < n
has the property /A
(*)
< 1. Then
\ak\ + 1 - \fn
(*)
< 1
Three classical convergence theorems
31
Hence, by induction on /z,
- I f (o)
l/nl = |/<
< 1
To prove the convergence of K(an/^«) we observe that the determinant
formula A.2.10) gives
B
n
i-l
Hence the convergence is established as soon as we have proved the
convergence of the series
oo
E
n-\
BnBn-.\
D.1.3)
From the recurrence formulas A.2.9) we have, for n > 1
\Bn\ = |6nUn_, f an^n-2| > |6n||i?n-i| - |an||2?ri_2
and hence
From this it follows that
n
and the general term in D.1.3) thus satisfies
\Bn-\
We therefore find that D.1.3) converges absolutely, and that the nth
partial sum has absolute value less than or equal to
\Bn
Hence the series D.1.3) converges. D.1.2') is now a simple consequence
of D.1.2). ¦
32 Chapter I. Introductory examples
Remark: If D.1.1) holds, then it follows from our proof that also
< 1 for all |u>| < 1 and Sn(w) —> f locally uniformly for \w\ < 1.
Example 9 Let z be a complex number, and assume that all |an| < 1.
Then the continued fraction
oo an
K —
71=1 Z
converges for all \z\ > 2. In the special case when an = 1 for all n we
find for the value f(z) :
From this it follows that
In Chapter III we will discuss periodic continued fractions more gener-
generally. The continued fraction K(l/z) is a special case. So are also the
continued fractions in the Subsections 2.1 and 2.2. Here in Example 9
the branch of the square root is to be chosen such that f(z) —> 0 when
z —> oo, i.e. such that
/ 2 \ 2
+ 4 = z f 1 + — + - - •) = z + - + ¦ • • ,
and hence
(The • • • mean higher powers.of z~x.)
O
4.2 Van Vleck's Theorem
Theorem 2 (Van Vleck's Theorem) Let 0 < ? < 7r/2, and let bn
satisfy
-|+* <arg6n< |-? D.2.1)
Three classical convergence theorems 33
for all n. Then all approximants o/K(l/6n) are finite and in the angular
domain
< - - e . D.2.2)
Furthermore, the sequences {/2m} and {/2m+i} converge to finite values.
If (and only if), in addition
00
? K\ = 00 , D.2.3)
then K(l/6n) converges.
Partial proof: We shall here restrict ourselves to a proof of the first
part of the theorem, i.e. that the approximants all satisfy D.2.2). The
proof is closely related to the first part of the proof of Sleszynski-
Pringsheim's Theorem, i.e. the proof of the statement on the location of
the approximants fn.
A crucial point in the present case is the following observation for the
angular domain Ve described in D.2.1) and D.2.2):
ve D.2.4)
for all 6fc E Ve. This follows immediately from the fact that the sum of
two elements in Vc also is in Ve, and that
weve=> - ev?.
From D.2.4) it follows by induction as in the previous section that D.2.2)
holds for all n. ¦
This argument is a special case of a basic type of argument in conver-
convergence theory for continued fractions. We shall return to this later, and
also to the rest of the proof of Van Vleck's Theorem.
34 Chapter I. Introductory examples
Remark: If D.2.1) holds, then also the approximants Sn(w) of K(l/6n)
are finite and E V? if w E Ve. Moreover {Sn(w)} converges locally uni-
uniformly to the value of K(l/6n) in Ve. The convergence is also uniform
with respect to the actual choice of bn from compact subsets of V?.
Example 10 It follows immediately from Van Vleck's Theorem that a
regular continued fraction always converges. (As defined in Subsection
1.1 and illustrated in Subsection 2.1 a regular continued fraction is a
continued fraction K{l/bn), where all bn are natural numbers. Obviously
then all 6r, are in any Ve with 0 < e < tt/2, and also ? |6n| = J] bn = oo.
O
Example 11 It follows immediately from Van Vleck's Theorem that
any periodic continued fraction K(l/6n) where all bn have positive real
part will converge. We hope that the following two-periodic continued
fraction will serve as an example. The continued fraction to be studied
here is
1 1 1 1 1 1
Since we know that it converges, it is rather easy to find the value / it
converges to. It must satisfy the equation
_ 1
1-2 + /
i.e.
This quadratic equation has the two roots
/ - Q
2&ai 2
Since the real part of / has to be positive, we find
.O
Three classical convergence theorems 35
4-3 Worpitzky's Theorem
Theorem 3 (Worpitzky's Theorem) Let for all n > 1
aTl\ < i . D.3.1)
Then K(an/1) converges. All approximants fn are in the disk
<\, D-3.2)
and the value f is in the disk \w\ < ^.
Proof: Let
a 1 a-} an
1 + 1 +---+ 1 +¦
D.3.3)
be such that |an| < 1/4 for all n. It is easily seen, that the sequence of
approximants for the continued fraction
2a, 4a, Aa, 4^
2 + 2 + 2 +•••+ 2 +••- v '
is exactly the same sequence as the sequence of approximants for D.3.3).
Since \an\ < 1/4 for all n, we have
2>|4on| + l,
and from Sleszynski-Pringsheim's Theorem it follows that the continued
fraction D.3.4), and hence D.3.3) converges. If the continued fraction
D.3.4) is multiplied by two, which means to replace the first partial
fraction by 4di/2, we find from Sleszynski-Pringsheim's Theorem that
all approximants have absolute value < 1, and hence all approximants
of D.3.3) have absolute value < 1/2. From the convergence it then
follows, that the value of the continued fraction is in the disk \w\ < ^.
This concludes the proof of Worpitzky's Theorem. (This is essentially
Sleszynski's proof.) ¦
36 Chapter I. Introductory examples
Remark: Again the convergence of Sn(w) is uniform with respect to
{aTl} and w for \aTl\ < 1/4 and \w\ < 1/2.
We shall now, through an example, indicate how the knowledge of a set
where the values must be, can be used in the computation of continued
fraction values.
Example 12 Let
— — D.3.5)
1+1+1 + 1+-.. ^ }
be a continued fraction where all an have absolute value < 1/4. What
can be said about the value of the continued fraction?
From Worpitzky's Theorem it follows that the value of the tail
a.i a. j an
1 + 1 +¦•¦+ 1 +•¦•
is in the disk \w\ < 1/2. The linear fractional transformation (l.f.t.)
1/8
w —> —¦
1 + w
maps the disk \w\ < 1/2 onto the disk
1
w--
and the l.f.t.
-1/4
w —* —
maps this disk onto the disk
14
(This is established by using standard methods for mapping disks by
linear fractional transformations, or simply by computing, in each case,
the intersections with the real axis together with the knowledge that
the images are disks.) Observe how quickly we reach good values. By
taking —14/65 as an approximate value, the error is < 1/65, regardless
of which continued fraction D.3.5) we have, if it satisfies \an\ < 1/4 for
all n > 3.
O
Convergence once again 37
5 Convergence once again
5.1 Critical remarks on convergence
We return to some of the thoughts from the very first section in the
present chapter. When a series
oo
converges, the nth tail goes to 0 when n goes to oo:
oo
lim V tk = 0. E.1.1)
k=n+\
The nth approximant of a series is obtained by removing the nth tail,
or, phrased differently: by replacing it with its limit (which is 0), i.e.
the nth approximant is
n
Tn =
When a product
oo
converges, the nth tail goes to 1 when n goes to oo:
oo
lim J[ Pk = l. E.1.2)
n —> oo AA
k=n+\
The nth approximant of a product is obtained by removing the nth tail,
or, phrased differently: by replacing it with its limit (which is 1), i.e.
the nth approximant is
n
Pn =
n
The nth approximant of a continued fraction is also obtained by cutting
off the tail, i.e. the nth approximant is
n B/r.
K 7T'
38 Chapter I. Introductory examples
but for continued fractions this does not mean to replace the tail by its
limit. Usually this limit does not exist at a//, and if it exists, it is 0 only
in very special cases. A continued fraction where the limit exists, is the
one in Example 1,
where all the tails, including the continued fraction itself, have the value
2. This raises the questions:
1) In computing the value of a continued fraction other sequences
{Sn(wn)} may be better than {5n@)}. (Look back to Subsections
2.3, 34 and 3.5.)
2) Perhaps the concept of convergence of continued fractions should
not have been tied to the sequence {5rt@)}, but to some {Sn(wn)}.
We shall consider these questions in the rest of the chapter.
5.2 Modified approximants
The word has been used earlier, and in the Subsections 3-4 and 3.5 we
have seen examples indicating that in some cases sequences {Sn(w)} or
even {Sn(wn)} may be better than {5n@)} in the computation of the
value of a continued fraction. Here are two more examples:
Example 13 For the continued fraction
oo 30 + 0.9n /r « ,\
K —- , E.2.1)
the tails look more and more like
oo 30 ,
K T, 5.2.2)
if we let them start further and further out. One can prove, that E.2.2)
converges to the positive root of the quadratic equation
30
it —
1 '
Convergence once again
39
i.e. to 5. (Problem 2, the hint in Problem 6 and a little more.) This
suggests, that in the computation of E.2.1) the sequence {?^E)} may
turn out to be better than {5n@)}. (The convergence of E.2.1) is not
hard to prove directly, but after Chapter III it will be trivial. For the
time being we take convergence for granted.) The following table indi-
indicates strongly that this is in fact true. In the table C stands for classical
approximants, i.e. 5n@), whereas J stands for modified approximant, in
the present case 5nE). See [JaWa84], [ThWa82].
n
1
2
3
4
35
36
37
38
39
40
41
85
86
87
88
89
C
30.9000
0.97139
15.6770
1.85765
5.10127
5.07160
5.09631
5.07571
5.09288
5.07857
5.09049
5.08507
5.08506
5.08507
5.08507
5.08507
J
5.15000
5.03667
5.12176
5.05762
5.08507
5.08506
5.08507
5.08506
5.08507
5.08507
5.08507
5.08507
5.08507
5.08507
5.08507
5.08507
-O
Example 14 For the continued fraction
3 + 1/12 4 + 3/22 3 + 1/32 4 + 3/42 3
the tails "look more and more like"
3 4 3 4
1/5
+¦
E.2.3)
E.2.4)
40
and
Chapter I. Introductory examples
E.2.5)
4 3 4 3
1 + 1 + 1 + 1+..."
We take convergence for granted in all cases, since it will be obvious after
Chapter III. The value of the continued fraction E.2.4) is the positive
root of the quadratic equation
3
u = ,
which is 1. The value of the continued fraction E.2.5) is the positive
root of the quadratic equation
4
u =
u
which is 2.
In the table below {5n@)} is compared to {Sn(wn)}, where w-2k — 1 and
w2k-i = 2.
71
J
2
3
4
5
6
7
8
9
10
11
12
13
23
24
25
26
C-app.
4.00000
0.69565
1.85580
1.00775
1.38927
1.12527
1.25252
1.16637
1.20885
1.18033
1.19451
1.18502
1.18975
1.18739
1.18738
1.18738
1.18738
J-app.
1.33333
1.18519
1.20055
1.18752
1.18941
1.18745
1.18777
1.18740
1.18746
1.18738
1.18740
1.18738
1.18738
1.18738
1.18738
1.18738
1.18738
Convergence once again 41
The table indicates strongly that {Sn(wn)} converges to the same value
as {Sn@)}, and faster.
O
From Example 13 and Example 14 it seems (and it will later be proved)
that with {wn} properly picked {Sn(wn)} converges to the value of the
continued fraction, and faster than {^(O)}. What "properly picked"
means will be discussed later. We easily can make "improper choices":
Take any sequence {/3tl} of extended complex numbers, and choose
This is possible, since all Sn are non-singular. Then we have
n
This shows, that we can make {5n(w;ri)} converge to anything we want,
or diverge, regardless of the convergence behavior of the continued frac-
fraction itself.
Let it finally be mentioned, that in some cases attempts to compute the
value of some continued fraction K(an/&n) lead to a sequence {Sn(wn)}
which converges to something we have reason to believe is the value
of the continued fraction, whereas {5n@)} may be hard to handle, or
even to get hold of. In such cases one needs results about going from
convergence of {Sn(wn)} to convergence of the continued fraction, and
to the same value (which, as we just have seen, is not true in general).
5.3 Another concept of convergence
We shall first look at an example [Jaco86]:
Example 15 For the 3-periodic continued fraction
1 I — - i Zl
l+I-i- i +I+T+T+-.-
42 Chapter I. Introductory examples
it is not hard to prove, by using the recurrence relations A.2.9) and
induction, that for all n > 1
A'ta-2 = 2n, ^3n-l = 2n, A[in = 0,
-2, ?3n = 1.
Hence the approximants An/Bn are
2" 2"
i3n-l = 2n+l 2 ' ¦*Jn ~ '
_ 3 »
from which
Hm /3n_2 = - , lim /3n_, = - , lim fMl = 0
n —> oo 2 « —*¦ °° 2 " —*¦ oo
immediately follow. This shows that the continued fraction diverges.
For the modified approximants Sn(wn) we find from A.2.8)
2n + t«3W.2 ¦ 0 11
when 72 —> cxd, if the sequence {ti?3n_2} is bounded.
2" + tU3n-1 • 2"
1 + W3n-1
2A + ^3n-i) - C«;3n_i + 2J"" * 2
when n —> oo, if the sequence {tu3n_i} is bounded away from — 1.
0 + w3n • 2" u^1
1 + «>3nBn+1 - 2) 2 - wan + A - 2 • ti;3nJ-» " 2
when 7i —* oo, if the sequence {w^n} is bounded away from 0.
Hence we have, in this example, that limn —> oo 5r,@) does not exist,
whereas
for all {wn} bounded away from 0,-1, oo. This example suggests strong-
strongly, more so than earlier considerations, that the definition of convergence
of continued fractions is "wrong", since the continued fraction in this
example "ought to converge".
O
Convergence once again 43
This example (together with other observations) has led to a new def-
definition of convergence [Jaco86]. In the definition we use the chordal
distance d(z,w), which is defined by
if w and z are both finite, whereas
1
d(w, oo) =
This is a metric very much used in the theory of functions of one complex
variable, in particular in cases where the point at infinity is not supposed
to play a special role, different from the role of other points. The name
comes from the fact, that d(w, z) is the length of the chord between the
images of w and z on a sphere by a suitable stereographic projection.
Definition K.(an/bn) is said to converge generally to an f ?C if there
exist two sequences {vn} and {wn} in C such that
liminf d(vn,wn) > 0 E.3.1)
and
Sn(vTl)= Urn Sn(wn) = f. E.3.2)
n —r oo n —*¦ oo
We shall see later, that / is unique. (If not, the definition would not
make sense.) One property follows directly from the definition: If a
continued fraction converges in the ordinary (classical) sense to /, we
have
lim Sn@)= lim Sn(oo) = f.
n —* oo n —r oo
Since 0,0,0,... and oo, oo, oo,... are two sequences satisfying E.3.1), it
follows that ordinary convergence to / implies general convergence to /.
Thus the new concept includes the classical one, and picks up additional
cases, for instance the one in Example 15.
More important is, that it in many cases is easier to apply. One ad-
advantage is, that we do not need to worry about {5n@)}. Once we have
proved E.3.2) for two sequences satisfying E.3.1) we are through (as far
as general convergence is concerned).
44 Chapter I. Introductory examples
5.4 Another concept of continued fraction
The formal definition of a continued fraction was presented in Subsec-
Subsection 1.2, and the sequence {?^@)} played a crucial role in the definition.
Recently there has been an increased use of and emphasis on modifica-
modifications. Out of this has grown the concept of modified continued frac-
fractions, obtained by replacing the sequence {/n}, fn = 5n@), by {gn},
gn = Sn(wn). Of course the notation must then contain {?#„}. This ap-
appeared in print at first in [BaJo86], and the modified continued fraction
was there written
An abbreviated notation is
K(an,bn,wn).
We obviously have, with reference to notation introduced earlier:
«n}, {6n},
K(an,6n,0) = K(an/6n),
K(an,l,0) = K(an/l).
In working with modified continued fractions the classical one is some-
sometimes referred to as the reference continued fraction.
The earlier mentioned problem of going from convergence of Sn(wn) to
convergence of 5n@) can now be expressed as a problem of going from
convergence of modified continued fractions to convergence of ordinary
continued fractions.
5.5 Computation of approxirnants
To compute
a
n
"
, + by +• •.+ bn + wn ~ Bn + Bn.
there are several algorithms. We shall only mention the two obvious
ones:
Convergence once again 45
1. The forward recurrence algorithm consists of computing An and
Dn by the recurrence relation A.2.9).
2. The backward recurrence algorithm starts at the other end by
setting
tn = wn
and then work backwards by setting
+ tk
for k = n, n — 1,..., 1. Then Sn(wn) = to (or &o + to).
The first method has the advantage that if you have found Sn(wn), you
can easily find 5n+i(tyn+i)» whereas you must start again from scratch
in the second method. On the other hand, the backward recurrence al-
algorithm is in general more stable. (Why will become evident in Chapter
IV.) The computations in this book are done by means of the backward
recurrence algorithm.
46 Chapter I. Introductory examples
Problems
A) Use the identity
y/b- 1
2 i . Vs-\
1 ^ 2
to produce a continued fraction by the procedure of Example 2.
Compute the first 7 approximants fn and compare the values to
- l)/2. Prove that
where Fo = 1, FL = 1, F2 = 2, F3 = 3, F, = 5, and generally
Fn+i =Fn + Fn-l forn> 1.
(The sequence {Fn} is the sequence of Fibonacci numbers, and the
ratio (\/o — l)/2 = .61803... is the golden ratio.)
B) Prove the following: For any real a, if the continued fraction
a a a
T+T+T+-..
converges, then it converges to one of the roots of the equation
a
x —
1 + x
Use this to prove that the continued fraction diverges for all a <
-1/4.
C) Assume that we know that the continued fraction
1 1 1
1 + 1-j l-\
converges. Prove that it then converges to (\/5 — l)/2.
D) Find a particular integral of the differential equation
y = 22/' + Zy"
by using the "method" in Subsection 2.2.
Problems 47
E) Let x and y be complex numbers, \x\ ^ |y|, and let fn be the
nth approximant of the continued fraction
—xy
-(« + y) + -(a: + y) + ~{x + y) +• • - *
Find a formula for /„ in terms of x and y.
F) Assume that we know that the continued fraction in Example 4,
xxx
converges to \/l + 2 — 1 for positive 2-values. Use this to find \/5
with an error < 5 • 10" "*J. (Hint: Observe — and prove — that
{f'2n} and {/2n+i} are monotone. Take a; = 1/4.)
G) Compute for a suitable n the first n approximants of the continued
fraction in Problem 6 for x = 1 — 2z. Try to guess in advance
what the sequence will converge to, and compare to the computed
approximants.
(8) Assume that we know that the log(l + z)-expansion of Subsection
2.4 converges to log(l + z) for positive z-values. Use this to find
log 5 with an error < 5 • 10~3. (Hint as in Problem 6 with z = 4.)
(9) Compute for a suitable n the first n approximants of the continued
fraction in Subsection 2.4 for z — i — 1. Try to guess what the
sequence of approximants converges to.
A0) Use Example 6 in Subsection 3.1 to establish the expansion of
log(l + z) of Subsection 2.4-, assuming that the continued fraction
expansion C.1.2) of
F(a, 6; c; z)
is established. (Hint: Take a = 1,6 = 0, c = 1, and replace z by
A1) Let a be real and not a positive integer. With F as in Subsection
3.1, prove that
F(-a,l;l;-z) =
48 Chapter I. Introductory examples
Under the same assumptions as in Problem 10 find a continued
fraction of the form
1 + 1 + 1 +...
for 2A -j- z)n by using Subsection 3.1.
A2) Let a be as in A1). Use the procedure of Subsection 3.2 to trans-
transform the power series expansion of z(\ -f z)a at 0 to a continued
fraction of the form
b\z b2z 63Z
1 + 1 + 1 +...
(Of course only the start, for instance up to and including b\z/l.)
A3) Use the procedure of Subsection 3.2 to transform the power series
expansion at 0 of ez — 1 to a continued fraction of the form
b\z
1 + 1 + 1 H
(Compute 61,62,6;j, b,\.)
A4) Use the procedure of Subsection 3.3 to find the first terms of the
power series expansion at 0 corresponding to the regular C-fraction
z_ -z/2 z/6 -z/6
1+ 1 +T+ 1 +...
A5) Let a be a positive number. For which values of a does the con-
continued fraction 'K^2i(l/n~nL) converge, and for which values does
it diverge? (Hint: Use Van Vleck's Theorem.)
A6) Use Worpitzky's Theorem to prove the following:
a) The value of any continued fraction
1/4 -1/4 a3 a,
~T+ 1 +T+T+...'
with |arl| < 1/4 for all n must lie in the disk
2
w
5
Problems
49
b) The value of any continued fraction
z/4 ct2 a:i
T+T+T+--.'
with \an\ < 1/4 for all n must lie in the disk
1
- 6
A7) For the continued fraction in Example 15 of Subsection 5.3 com-
compute Sn(wn) for the following values of wn. In which cases do we
have convergence?
a) wn = 1,
b) wT> = n,
c) wn = 2("
d) wn = 1/n,
e) wn = 2-"/3.
50 Chapter T. Introductory examples
Remarks
1. For those who want to go deeper into the analytic theory of con-
continued fractions we refer to the three standard monographs in
the field: the classical text-book by Perron [Perr54], Wall's book
[Wall48], with its introduction to some of the new ideas, upon
which the modern theory is built, and finally the most modern
exposition, by Jones and Thron from 1980 [J0TI18O]. In Henrici's
3 volume work on Applied and Computational Complex Analysis
[Henr77] a large portion of Volume 2 is devoted to analytic theory
of continued fraction. Khovanskii's book [Khov63] contains some
interesting applications. References to further books and papers in
the field are found in the texts above. As for the history of contin-
continued fractions we refer to a recent book by Brezinski [Brez91], but
also to the texts mentioned, in particular to the book [JoTh80] by
Jones and Thron, which contains many interesting comments on
the historic development of concepts, methods and applications.
2. To most people (meaning mathematicians) continued fractions are
most closely associated with Number Theory, for instance in con-
connection with diophantine equations of degree 1 or 2. The continued
fractions used there are mostly regular continued fractions. One
very important thing in the Theory of Numbers is the connection
between the Euclidean algorithm and the terminating regular con-
continued fraction expansion of rational numbers. In the present expo-
exposition we shall not say much about continued fractions in number
theory, except for a small chapter on some applications in number
theory (Chapter IX). We also refer to [Perr54].
3. After Example 4 and Example 5 in Subsections 2.3 and 2.4, indi-
indicating superiority of continued fractions over power series expan-
expansions, both as far as domain of convergence and speed of conver-
convergence are concerned, it is only fair to say, that this is not always
the case. Sometimes it is the other way around. One trivial exam-
example, which has its non-trivial analogues, is Example 7 in Subsection
3.4- There the power series z + O.z2 + O.z** + • • • is transformed
into the continued fraction C.4.1). The series converges to z in
the whole plane. The convergence is the "fastest possible", since
all partial sums = z. The continued fraction converges to z in the
Remarks 51
open unit disk \z\ < 1, but more slowly, and for \z\ > 1 it even
converges to "something wrong", namely to —1. (It can be proved,
that it diverges on the circle \z\ — 1, except for z = — 1, where it
converges to — 1.) On the other hand, we shall see in Chapter VI
that the continued fraction in Example 6 always "wins over the
hypergeometric series".
4. For references and comments connected to the three classical the-
theorems in Section 4 we refer essentially to the book by Jones and
Thron [JoTh80]. Let it be mentioned, though, that Worpitzky's
Theorem was proved already in 1865, but remained unknown to
workers in the field until Pringsheim rediscovered it more than 30
years later. It was not until 1905, through Van Vleck, that Wor-
pitzky got credit for it. Part of the reason may be the way it was
published, (in an annual report from the school where Worpitzky
was teaching, [Worp65]), but there may be other more significant
reasons, see [JaTW89]. Theorem 1 usually carries the name of
Pringsheim. However, as pointed out to us by W. J. Thron, J.
Sleszynski is the right one to give credit, since he already proved
the theorem in 1888, see [Sles89].
5. Already Hamel [Hamell8] raised the question about the concept
of convergence for continued fractions.
References
[AlBc88]
[BaJo86]
[Brez91]
[Brun50]
[CuWu87]
[Gaussl3]
[Gaussl4]
G. Almkvist and B. Berndt, Gauss, Landen, Ramanu-
jan, the Arithmetic-Geometric Mean, Ellipses, ir, and the
Ladies Diary. Amer. Math. Monthly A988), 585-608.
C. Baltus and W. B. Jones, A Family of Best Value Re-
Regions for Modified Continued Fractions, "Analytic Theory
of Continued Fractions II", Lecture Notes in Mathematics
1199 (ed. W. J. Thron), Springer-Verlag, Berlin A986),
1-20.
C. Brezinski, "History of Continued Fractions and Pade
Approximants", Springer Series in Computational Mathe-
Mathematics, 12, Springer-Verlag, Berlin A991).
V. Brun, "Forelesninger over Kjedebr0k", Universitetet i
Oslo A950).
A. Cuyt and L. Wuytack, "Nonlinear Methods in Numer-
Numerical Analysis", North-Holland Mathematics Studies, Ams-
Amsterdam A987).
C. F. Gauss, Disquisitiones generates circa seriem infini-
*x* etc,
Commentationes Societatis Regiae Scientiarium Gottin-
gensis Recentiores 2 A813), 1-46; Werke, Vol. 3 Gottingen
A876), 134-138.
C. F. Gauss, Methodus Nova Integralium Valores per Ap-
proximationem Inveniendi, Commentationes Societatis Re-
52
References
53
[Hamell8]
[Henr77]
[Hiitte55]
[Jaco86]
[JaTW89]
[JaWa84]
[JoTh80]
[Khov63]
[Perr54,57]
[Rama57]
giae Scientiarium Gottingensis Recentiores 15 A814), 39-
76; Werke 3 Gottingen A876), 165-196.
G. Hamel, Uber einen limitarperiodischen Kettenbruch,
Archiv der Math, und Phys. 27 A918), 37-43.
P. Henrici, "Applied and Computational Complex Analy-
Analysis", Vol. 2, Wiley, New York A977).
Hiitte, "Des Ingenieurs Taschenbuch", 28. Aufl. 1, Wilhelm
Ernst & Sohn, Berlin A955), Seite 139.
L. Jacobsen, General Convergence for Continued Fractions,
Trans. Amer. Math. Soc. 281 A986), 129-146.
L. Jacobsen, W. J. Tliron and H. Waadeland, Julius Wor-
pitzky, his Contributions to the Analytic Theory of Con-
Continued Fractions and his Times, "Analytic Theory of Con-
Continued Fractions III", (ed. L. Jacobsen), Lecture Notes in
Mathematics 1406, Springer-Verlag, Berlin A989), 25- 47.
L. Jacobsen and H. Waadeland, Modification of Contin-
Continued Fractions, "Pade Approximation and its Applications,
Bad Honnef 1983", (H. Werner and H. J. Biinger eds.) Lec-
Lecture Notes in Mathematics, Springer-Verlag, Berlin 1071
A984), 176-196.
W. B. Jones and W. J. Thron, "Continued Fractions: An-
Analytic Theory and Applications", Encyclopedia of Mathe-
Mathematics and its Applications, 11, Addison-Wesley Publish-
Publishing Company, Reading, Mass. A980). Now distributed by
Cambridge University Press, New York.
A. N. Khovanskii, "The Application of Continued Fractions
and Their Generalizations to Problems in Approximation
Theory", P. Noordhoff, Groningen A963).
0. Perron, "Die Lehre von den Kettenbriichen", Vol. 1, 2
3. Aufl., B. G. Teubner, Stuttgart A954, 1957).
S. Ramanujan, "Notebooks", Vol. 2 Tata Institute of Fun-
Fundamental Research, Bombay A957). Now distributed by
Springer-Verlag. l
54 Chapter I. Introductory examples
[Sles89] J. V. Sleszyriski, Zur Frage von der Konvergenz der Ket-
tenbruche (in Russian), Mat. Sbornik 14 A889), 337-343,
436-438.
[Stecn73] A. Steen, Integration af LineeEre Differentialligninger af
Anden Orden ved Hj&lp af Kjdsdebr0ker, Kobenhavn
A873).
[ThWa82] VV. J. Thron and H. Waadeland, Modifications of Contin-
Continued Fractions, a Survey, "Analytic Theory of Continued
Fractions", Proceedings 1981, (W. B. Jones, W. J. Thron
and H. Waadeland eds.), Lecture Notes in Mathematics
932, Springer-Verlag, Berlin A982), 38-66.
[Waad83] II. Waadeland, Differential Equations and Modifications
of Continued Fractions, some Simple Observations, "Pade
Approximants and Continued Fractions", Proceedings
1982, (H. Waadeland and H. Wallin eds.) Det Kon-
gelige Norske Videnskabers Selskabs Skrifter, Trondheim
1 A983), 136-150.
[Wall48] II. S. Wall, "Analytic Theory of Continued Fractions", Van
Nostrand, New York A948).
[Worp65] J. Worpitzky, Untersuchungen u'ber die Entwickelung
der monodromen und monogenen Funktionen durch Ket-
tenbruche, Friedrichs-Gymnasium und Realschule Jahres-
bericht, Berlin A865), 3-39.
Chapter II
More basics
About this chapter
Continued fractions are defined by means of linear fractional transfor-
transformations. Such transformations have very nice properties. One way to
take advantage of this is to introduce the concept of tail sequences. (The
beautiful mapping properties of linear fractional transformations will be
exploited in Chapter III.)
Continued fractions can also be described by means of linear recurrence
relations. This will be treated at length in Chapter IV, but already now
we observe the following from formulas A.2.6)-A.2.9) in Chapter I: If
{An} and {Bn} are given, satisfying certain conditions, then the ele-
elements {an} and {bn} of the continued fraction can be determined. This
is the background for the useful transformations of continued fractions
to be presented in this chapter, transformations that allow us to change
the coefficients of a continued fraction without altering its approximants
(too much).
55
56 Chapter II. More basics
1 Tails of continued fractions
1.1 Tails
The Nth tail of the continued fraction 60 + K(an/^«) is the continued
fraction
oo an ayv+i Qn+2 Qn+'a f \
n=N+\ bn O/V + l -fOA'+i + 0vV+3+* * *
for AT G Nrj. Here and in the rest of the book No is the notation for the
set of non-negative integers; i.e. No = NU{0}. There are several reasons
to study such tails, and one is described in the following theorem.
Theorem 1 The following three statements are equivalent.
A) 6q + K(an/&n) converges/converges generally.
B) A.1.1) converges/converges generally for an N G Nq.
C) A.1.1) converges/converges generally for all N G No-
Proof : C => B and C => A follow trivially.
A => C: Let &o + 'K(an/bn) converge to /. That is, its approximants
fn = «Sn@) converge to /. Let N G No be chosen arbitrarily, and let
i ' = 5;, ^@) denote the approximants of A.1.1). Then
so that fn = SjS,\fi\r+n). The convergence of /r\ ' as n —> oo follows
therefore from the fact that 5^' is a linear fractional transformation,
and therefore is a bijection of C onto C, since all an ^ 0. (This will be
discussed in Chapter III.)
The proof for general convergence uses the same idea, where we use the
analogue
Siv+n(w) = Stw(S^ }(w)) = 5/v o S^ '(w) A.1.3)
Tails of continued fractions
57
to A.1.2).
C: This can also be proved in a related way.
Example 1 We shall prove that the continued fraction K(an/l) con-
converges if an —> 0.
Since an —> 0 there exists an N ? No such that \an\ < 1/4 for all n >
N. Hence, the JVth tail of K(an/l) converges by Worpitzky's theorem,
Theorem 3 in Chapter T.
Observe that we can no longer conclude that the value / of K(otn/1) is
in the closed disk |/| < 1/2. This is only true for the value /W of its
Nth tail, |/W| < 1/2 (and for /(*) for all k > N). The value / may
even be infinite.
O
We have used An and Bn to denote the canonical numerators and de-
denominators of 60 + K(an/bn), and Sn(ivn) and fn = 5Tl@) to denote
its approximants. For its iVth tail we use An , Bn , Sn (w^vv+n) and
fy ' = Sn @) to denote the corresponding quantities. This notation
will be used throughout the book.
Just as for ATn Bn and Sn we have
n + A {
G7)fTv)
for n = 1,2,3,..., and
o - - • o sN+n(w), A.1-4)
where A^ = 1, A^ = 0, B™ = 0, B
= 1 and
r aw 1
Bw
n
— D/V-t-n
/I
B{N)
An-2
. ^n-2 .
N+n
A.1.5)
A.1.6)
In addition one can also prove:
58 Chapter II. More basics
Lemma 2 With the notation just introduced the following equalities
hold:
W = aN+lB forN>0, n > 0, A.1.7)
=bN+lB^ +aN+2BZ2) forN>0, n > 1 , A.1.8)
N+n
TT / x
11 I ajJ '
Proof : A.1.7) is trivially true for n = 0 and n — \. Hence it holds
for all n, since by A.1.5)
A^ = bN+nA{nN_\ + aN+nA[N_\ for all n > 1
and
?<*?'' = bN+nBtV] + a/v+n^3+'' for all n > 2 .
A.1.8) holds trivially for n = 1 and n = 2. By induction on n we find
that it holds for all n, since by A.1.5)
= bN+l \bN+nB^_2 +ajv+n#n_3 J
forn>3.
Finally, it follows from A.1.6) that A.1.9) holds for k = 0 since
1. For fe = 1 we get by A.1.5) that
B
Tails of continued fractions 59
which proves A.1.9) for this value of k. By the same process we find
that
n-l An-lnn+k
for * = 2,3,4,...
which is the same recurrence relation as A.1.5) for \ B, \
x ' I * JJt=
Hence A.1.9) follows.
1.2 Tail sequences
Assume that &o + K(an/&n) converges. Let /W G C denote the value of
its Nth tail A.1.1) for N = 0^1,2,.... Then we find from A.1.l) that
/ = 60 + /(") is the value of 60 + K(an/bn) and
N = 0,1,2,.... A.2.1)
(A.2.1) is to be interpreted in the obvious way if /(^) or /(N+1) is infi-
infinite.) We say that {/^}w=o ^s ^ne "ri&t toil sequence [or the convergent
continued fraction 6q + J&(an/bn).
More generally, we say that a sequence {tn}?JLu °f elenients from C is a
tail sequence for 60 + K(an/bn) if
*"-' = h ^f =sn(tn) for n= 1,2,3,.... A.2.2)
This means in particular that
t0 = Sl(tx) = a, o «2(t2) = ... = «! o ... o an(tn) - 5<0)(*n) A.2.3)
L-l
for all n, and thus that {5A (^))}^Lo ^s a ^a^ sequence (uniquely
defined) for every *o € C. Observe that if {tn}™-o is a tail sequence for
bo + K(an/&n)> then {*n}^L^ is a tail sequence for its A^th tail A.1.1).
60 Chapter II. More basics
Theorem 3 Let {tn} be a tail sequence for 60 -f K(an/bn). Then
for all n.
A.2.4)
Proof : Inverting A.2.3) gives tn = sn^ o snl_l o • • • o sx '(to) which is
equal to s~l os~[{ O---O5J osq1F0 + to) = 5~1F0 + to)- From A.2.2)
we find that
-»(„,)= _6*+ — = -(&* +t^t) forA;> I. A.2.5)
*• to L {w)j
This proves the last equality in A.2.4).
Remark: Observe that if {tn} and {tn} are two tail sequences for
bo + K(ari/6n) with tk = t^ for one index fc, then tfl = tn for all ?7 by
A.2.4) since all Sk are bijections of C onto C.
We shall see later that the tail sequence {S^^oo)} plays a special role
in our theory. We define
hn = -S~x(oo) for n = 0,1,2,... , A.2.6)
which gives h.Q = oo, h\ =b\, and
h,. = -^~ = 6n 4 T^^^1^ ,r for n = 1,2,.... A.2.7)
(Observe that 5~'(oo) = 5n (oo) since so(oo) = oo> regardless of
the value of 6n.) This sequence {hn} is called the critical tail sequence
of 60 + K(a«/^rt) (although it is strictly speaking {—hn} which is a tail
sequence).
Example 2 In Example 15 in Chapter I we saw that the 3-periodic
continued fraction
an 2 1 -1 2 1 -1 2
K i 1 + 1+T+1+I+ i
Tails of continued fractions 61
converges generally to / = 1/2, but diverges in the classical sense since
5:jn@) = 0 for all n. We shall see that the tail sequence {tn} of K(a«/l)
with ?o = 0 is 3-periodic, why this has any connection to the convergence
behavior of K(ctnA)> and compute this tail sequence. We shall also see
whether K(ctn/l) has any other 3-periodic tail sequences.
Let t0 = 0. By A.2.3) we have tn = si^' («„) = 5~1@) since 60 = 0.
Since 53n@) = 0 for all n, this means that <3n = 0 for all n. Prom A.2.2)
with n replaced by 3n, we find that ?3n_ | has a value independent of n
since {an} is periodic and all bn = 1. Similarly also t^n-z has a value
independent of n, so {tn} is 3-periodic.
We may say that {tn} reveals the "trouble" we have with the convergence
of fn = 5ri@). For every tail sequence we have &o -f to = Sn(tn). So if
{tn} is not a right tail sequence, we must stay away from tn (wn ^ tn)
when we choose approximants Sn(wn) for our continued fraction, at least
from some n on. Otherwise we destroy the convergence to /. Our {tn}
is not the right tail sequence since to ^ / = 1/2. And we did not stay
away from {tn} when we chose our approximants Sri@). For every third
index n we have wn = tn = 0.
With to = 0 the terms of {tn} are
2
?<J = t:in = 0, t\ — *3n+i = Sj (to) = ~ 1 + - = CX),
t'i = *jn+2 = S2 {t\) = —i-\ = — 1 for n = 1,2,3,....
CX)
The right tail sequence {f^} must also be periodic since every third
tail of K(an/1) ls identical to K(an/l) itself. It is given by
l=_2 forn=1>2>3>
These two sequences are the only 3-periodic tail sequences of K(ct«/l),
since {tn} is a 3-periodic tail sequence if and only if tn = tn+^; that is,
62 Chapter II. More basics
if and only if tn is a solution of the equation
tn = S3 (^n+.O = S3 (tn).
Tliis is a quadratic equation which has at most two solutions.
O
1.3 Some properties of linear fractional transformations
Both a continued fraction bu + J?(an/bn) and its tail sequences are closely
tied to linear fractional transformations
t(w)= aW^~ ad-bc^O. A.3.1)
(See for instance A.1.4) and A.2.2).) Such transformations t(w) are
bijective mappings of C onto C with very nice properties:
a) t(w) maps (generalized) circles on the Riemann sphere C onto
(generalized) circles on C.
b) Let C be a (generalized) circle on C. Then t maps points symmet-
symmetric with respect to C onto points symmetric with respect to t(C).
(The reflection property.)
c) The cross ratio is invariant under linear fractional transformations;
that is, if W|, w<2, w^ and w4 are four distinct points in C, then
- t{w2))(t(w3) -
- w3)(w2 - u>.i) {t{wi) - t(w-j))(t(w2) - t(w.\)) '
(If one of the points w^ or its image t(wk) is equal to infinity, then
A.3.2) has the standard meaning.)
Sometimes it is of advantage to use the chordal metric d(w\, w-i) as
defined in Subsection 5.3 of Chapter I. The cross ratio is still invariant
under t if we define it by the chordal metric:
, w2) • d(w3, w,\) d(t(wi)it(w2)) • d(t(w3),t(wi)) .
— A.3.3)
d(t(wi),t(w3))-d(t(w2),t(wl))
for wi,W2,W3,W\ G C all distinct.
Tails of continued fractions 63
Speed of convergence. Truncation error bounds
Convergence properties are often important in applications of continued
fractions. Not only the existence of a limit, but also how fast this limit is
approached by the approximants fn (or more generally Sn(wn)). Hence
it is important to have estimates for this speed of convergence. We
distinguish between a priori truncation error bounds
|/-/n|<An, A.4.1)
where An > 0 is a bound we can find in advance, before we start com-
computing the approximants, and a posteriori truncation error bounds
\f-fn\<Mn\fn -/n-l|, A.4.2)
where the bound Mn\fn — fn-i\ can be determined only after we have
computed (at least) the approximants fn and fn-\. A priori bounds
A.4.1) can be used to determine, in advance, the index n we need in
order to obtain a desired accuracy. This saves work in the sense that
one only needs to compute fn for this particular index. A posteriori
bounds A.4.2) work more like a stopping criterion. One computes ap-
approximants /|, /a, fa,... until the right hand side of A.4.2) is sufficiently
small. Sometimes the a posteriori bounds are more accurate, so we can
stop at a lower value of n than indicated by the a priori bound.
To find a posteriori truncation error bounds, and to compare speed
of convergence of {Sn(wn)} for different sequences {w>n}> we shall use
A.3.2) with t = Sn. (This idea was presented by Thron [Thron89].)
If we choose Wi = 0,^2 = /^ ,w>3 = 00, and w<i = —hn (notation as
in A.1.2) and A.2.6)), and we require that these four points in C are
distinct, then A.3.2) reduces to
0 - fi^ Sn(Q) - SMi^) fn - fn+k
5n@) - 5B(oo) - /„ - /„_, •
Now, our four chosen points Wk are distinct if and only if the four points
f°r k = 1,2,3,4 are distinct; i.e. if and only if
/n,/rl_i and fn+k are distinct and finite. A.4.4)
64 Chapter II. More basics
Hence
f
fn+k ~ fn = 7^ (/„ - /„-.) if A.4.4) holds, A.4.5)
fl J + K
and thus
\fn+k ~ fn\ <Mn\fn -/n-l|
if f?n' and hn can be estimated properly to give a bound Mn. If
fn+k —*¦ / as k —>oo and Mn is independent of k, this gives us a poste-
posteriori truncation error bounds
\f - fn\ < Mn\fn - fn^\. A.4.6)
Example 3 We shall find a posteriori truncation error bounds for a
continued fraction K(cn/l) with all \cu\ < g < 1/4. (Tool: Worpitzky's
theorem in Subsection 4-3 of Chapter [ states that K.(a,n/l) converges
and has approximants |/n| < 1/2 if all \an\ < 1/4.)
All tails of K(cn/1) satisfy Worpitzky's theorem, so \fn '| < 1/2 for all
n and N. However, since /„ ' = -Sy\r+i(/ri_i ) = cn+i/A + /„_
we really have
= 2g
1-1/2
for all N > 0 and n — 1,2,3, The critical tail sequence
., . cn cfl_i c2
can be regarded as 1 4- (approximant of continued fraction K(rfn/1) with
all \dTl\ < g). Hence, \hn\ > 1 — 2g for n = 1,2,3,.... Assume that
/„, fn-\ and fn+k are distinct. (We shall see later that this is really so.)
Then by combining the above we find from A.4.5) that
2g
\fn+k — fn\ < ~ 7~l/« ~" fn-[ \ >
1 - 4g
and thus
Tails of continued fractions
65
This is a useful bound. To find the value of K@.2em/l) with an error
less than 0.05, we compute approximants fn for n = 1,2,3,... until
. 2'.°"!j/» - /-iI = 2|/» - /"-iI < 0-05
1 — I • \J.Z
for some index n, and then we use / ~ /„ for this n.
n
1
2
3
fn
0.1081 + zO.1683
0.1484 + zO.1541
0.1567 +i0.1462
2|/n - fn-l
0.0856
0.0229
Hence we can use / ~ /;j ~ 0.16 + z'0.15. We shall return to the question
of a priori truncation error bounds in Chapter III.
O
fjf1 W2 =
Next we choose W\ — f^ \w2 = wn,wx = 0 and W\ = —hn in A.3.2).
If these are distinct; i.e. if /„+&, Sn(wn) and fn are distinct and finite,
then
r(n) - ^/»)@ + K) _ (fjrt - wn)hn _ fn{.k - Sn{wn)
^ — 0)(wTl + hn) fl (wu + hn) Jn+k — Jn
This formula is useful for comparing modified approximants Sn(wn)
to the approximants /„ = 5n@). If fn+k —> / ^ ex) as fc —> oo, and
\hn/fk{wn + hTl)\ is bounded by some constant M, then we get in
particular that
/ - Sn(wn)
-w
0 if
f-fn
That is, Sn[wn) converges faster to / than /„. This idea was used in
Example 13 and 14 in Chapter I.
To compare various approximants Sn[un),Sn(vn) and S,,(wn) of 6o +
~K(an/bn) one can choose the four points vn, un, wn and — hn in A.3.2)
(if these are distinct) to get
{Vn ~ Un)(wrl + hn) _ Sn[vn) ~ Sn{un) A4 8)
[vn - wn)(un + hn) Sn{vn) - Sn[wTl) '
66 Chapter II. More basics
For later use we record that the chordal metric version of A.4.8) is
-hn) _ h. + \Sn(un)\2 d(Sn(vn),Sn[un))
d(vn, wn)d(un, - hn) V 1 + \Sn(wn)\2 d(Sn[vn), Sn[wn)) '
But when are the approximants /„, fn+k distinct? Assume that /„ =
fn+ki i.e. that 5M@) = 5n(/^n'). Then we have /^ = 0. So, if we can
ascertain that /^ / 0, then fn / fn+k- Since /, ' — an+i/bn+\ /Owe
always have /„ / /n+i- We shall return to this later.
1.5 More about general convergence
In Subsection 5.3 of Chapter I we introduced the concept of general con-
convergence of continued fractions b{) -f J<L(an/bn) due to Jacobsen [Jaco86].
(See E.3.1)-E.3.2) in Chapter I.) We also found that if 60 + K(an/bn)
converges to a value / in the classical sense, then it converges generally
to /, and we saw an example of a continued fraction which converges
generally but not in the classical sense. (Example 15 in Chapter I.) But
what about the following questions?
A. Why do we require a common limit for two sequences of modifying
factors {vn} and {wn} in the definition of general convergence?
B. If bo-\-K(an/bn) converges generally to /, for which sequences {un}
will limSn(un) = /?
C. Is the value / of a generally convergent continued fraction really
unique?
We shall look at some answers.
A. Assume that we have the following information about a continued
fraction 6o -f- K(an/6n): Iim5n(wn) = / for some sequence {wu} from
C. What can we then say about fc0 + K(an/^n)? Jf we do not have any
additional information we can say nothing! In fact, let 6(J -f ~K(aTl/bn)
be an arbitrarily chosen continued fraction, and let {qn} be an arbitrary
Tails of continued fractions 67
sequence of numbers from C, converging to /. Then the choice wn =
ST^(qn) for all n gives the approximants Sn(wn) = qn —> / for 60 +
K(an/6n).
Hence, it would not suffice to require convergence of just one sequence
{5n(u>n)} in the definition. We need more. Common limit for two
sequences is one way of doing this. Another possibility is demonstrated
in Theorem 4 to come.
B. Let 60 + *K[an/bn) converge generally to /. When will Sn(un) —* /?
It is easy to see that if {un} is a tail sequence for 60 + K(ari/&n) with
u0 / /- 60, then Sn(un) = b{) | «o / /• A deeper result is that
Iim5n(ttn) = / if {un} "stays far enough away asymptotically" from
one such tail sequence:
Theorem 4 The continued fraction 6q + H(an/bn) converges generally
to f if and only if limSn(un) = / for every sequence {un} from C such
that
liminf d(un. —hn) > 0 if f ^ 00 ,
--*~ v A.5.1)
Tl * OO
liminf ct(un, —ATl/An-[) > 0 if f = 00 .
n —> 00
Proof : The if-part follows from the definition of general convergence.
To prove the only if-part we let
Iim5n(vn) = limSn(iun) = / A.5.2)
where
liminfd(vn,it;ri) > 0; A.5.3)
i.e. 60 + K(an/^n) converges generally to /. Let {un} satisfy A.5.1), and
assume first that / ^ 00. We then know that from some n on, say n > Ny
we have vn 7^ wn, un ^ —hn, Sn(vn) 7^ 00 and Sn(wn) 7^ 00, where the
two last statements are equivalent to vn ^ —hn and wn ^ —hn. It
suffices to prove that if {njtlfcLi is the subsequence of N where n > N
and un ¦? vni un ^ wn for all n = njt, then
Urn Srik(unk) = f . A.5.4)
k —> ex)
68 Chapter II. More basics
For n = rik we can use A.4.9). The left side of A.4.9) stays bounded as
k —> oo. Since d(Sn{vn), Sn(wn)) in the denominator of the right side
of A.4.9) approaches 0 as k —» oo [n = n^) and Sn(wn) —* f z/L oo, we
therefore need that d(SH(un), Sn(vn)) in the numerator also approaches
0 when n = n^ and A; —» oo. This proves A.5.4).
The case / = oo is not much different. {- hn} is no longer dangerous,
but -An/An-X = 5-!@) is! ¦
C. The uniqueness of / is a simple corollary of Theorem 4:
Corollary 5 Let 6<j -f K(an/6W) converge generally to f and to g. Then
Proof : Let {vn} and {wn} be such that A.5.2) and A.5.3) hold, and
let
Sn(gn) = g A.5.5)
where
\im'mfd(pn,qn) > 0.
Assume first that / ^ oo. For each n define
_
Pn if rf(Pm -hn) > d(qn, -hn),
Qn otherwise .
Then A.5.1) holds, and thus Sn(un) —> f. On the other hand Sn(un) —> g
by A.5.5). Hence f — g.
if / = oo we repeat the argument with hn replaced by An/An-\. *
Thus having answered our three questions A, B and C, a new question
springs to the mind: How can it be that we only have to stay "sufficiently
far away" from one particular tail sequence when all tail sequences {tn}
with t[) ^ / — 6t) are dangerous choices for wnl There can only be one
answer to this:
Transformations of continued fractions 69
Theorem 6 Let 6o+K(a,i/6n) converge generally to f, and let {tn} and
{tn} be tail sequences for &o + K(an/6TI) with to ^ f — 60 andl^ ^ f — bQ.
Then
iimd(tn,tn) = 0. A.5.6)
2 Transformations of continued fractions
2.1 Generating a continued fraction from a sequence
In the previous chapter we saw examples of how a continued fraction
6o + K(an/6n) can be derived to represent a number / or a function f(z).
The hope was that the continued fraction would converge to /, i.e. that
its sequence of approximants would converge to /. Tn fact, our interest
was in the behaviour of the approximant sequence, not in the continued
fraction itself. The continued fraction was just an intermediate step, as
it is for any such limiting process.
Still we prefer to study the continued fractions because we have some-
something to gain by doing so. For instance, the convergence criteria for {/n}
in Chapter I were all based on the elements of the continued fraction.
And in Example 13 and 14 in Chapter I, these elements helped us to
choose favorable approximants.
Indeed, one might ask the question: Given a sequence {/rt}, which con-
continued fraction 60 + K(&«/&«) has this as an approximant sequence, if
such one exists at all?
Theorem 7 The sequences {An}^L_l and {Bn}™___l of complex num-
numbers are the canonical numerators and denominators of some continued
fraction 60 + K(an/&n) if and only if
0 B.1.1)
for alln E N. If B.1.1) holds, then 6o+K(a«/6n) is uniquely determined
70 Chapter II. More basics
by
b0 = Ao, 6, = B{ , a, = Ai - A^BX ,
an = — — , btl = forn>2. B.1.2)
Proof : If b(J+'K(an/bn) is given, then B.1.1) holds by the determinant
formula A.2.10) and the initial conditions A.2.7) in Chapter I. If {An}
and {Bn} are given, satisfying B.1.1), then an and bn are solutions of
the system
bnAn-X -f arli4n_2 = An , .
of linear equations. The determinant of this system is An_j ^ 0. Hence
the solution B.1.2) is unique. ¦
Example 4 We shall find the continued fraction 6u + K(an/6n) which
has An = n2 and Bn — n2 -f 1 for n = 0,1,2,
Using Theorem 7 we get
An = AnBn-i - BnAn-i = 2n - 1
and
AnJ5rl_2 ~ ^n^n-2 = 4fl - 4
which means that
bo = 0, 6, = 2, a, = 1,
- 1 4n - 4
6
Hence, the continued fraction
1 -3/1 -5/3 -7/5 -9/7 -11/9
2+ 4/1 + 8/3 +12/5 + 16/7+ 20/9 +•••
has canonical approximants n2/(n2 + 1), and converges to 1.
O
Transformations of continued fractions 71
If we only have given the sequence {/n}JJLo of approximants, the contin-
continued fraction &o + K(an/&n) is no longer unique. One way to use Theorem
7 is then to choose
Bn = 1, An = fn if fn ? oo ,
*n = 0, An = l if/n =
for n = 0,1,2, To emphasize that {/n} shall be approximants of the
type /„ = 5n@), we shall call them classical approximants in contrast
to the (modified) approximants Sn[wn).
Corollary 8 The sequence {/n}?L0 from C is a sequence of classical
approximants for some continued fraction 60 -f K(an/&n) if and only if
oo and fn ^ /„_] /orn = 1,2,3,... . B.1.5)
Proof : Let 60 + K(an/bn) be given. Then, by B.1.1)
f1 =L ^
(This holds also if Bn or /?n_i are equal to 0, since two consecutive J9n's
can not be equal to 0 by the difference equations B.1.3).)
If {/„} is given, we define {An} and {/?„} by B.1.4), and the result
follows from Theorem 7. ¦
Example 5 We shall find a continued fraction &o + K(an/6n) which has
classical approximants /„ = n2/(n2 + 1) for n = 0,1,2,
With the choice B.1.4) for An and Bn we find from Theorem 7 that
&o -f K(an/&n) has classical approximants /„ if &o = /o = 0,6i = l,aj =
f\ - /o = 1/2 and
-i Bn-l)(n2-4n+5)
^;:^^ .2,3,4,....
n-i ~ fn-2 Bn - 3)(n2 + 1)
O
72 Chapter II. More basics
Example 6 The infinite product ni?() Pk has partial products fn —
\Xk-oPk- Let all pk ^ 1,0, oo. We shall find a continued fraction 6o -f
~K(atl/bn) which has classical approximants {/„}.
With the choice B.1.4) for An and Bn we find by Theorem 7 that 60 =
/o = Po,&i = l,aj = f\ - fo = Pu(pi ~ 1) and
— /n In-1 Pn-\\Pn -j f o Q A
an = — — — lor n = 2,3,4,...,
/n-1 — Jn-2 Pn-l ~ 1
, fn ~ fn-2 PnPn-] ~ 1 f n o a
bn = 7 ~ = — for n = 2, 3,4,
Jn-1 — Jn-2 Pn-\ ~ 1
Hence the continued fraction
, Pu{p\ ~
- 1) - (P2P3 "
has classical approximants fn = YYi=:oPki an^ a^ its canonical denomi-
denominators Bn are equal to 1.
O
2.2 Equivalence transformations
Definition We say that two continued fractions are equivalent if they
have the same sequence of classical approximants.
We write 60 + K(an/bn) w d0 + K(cn/dn) to express that 6{) + K(an/bn)
and c?u + K(cn/rfM) are equivalent. Let the canonical numerators and
denominators be denoted by An and /?„ for 6u + K.(an/bn) and by Cn
and jD71 for do + K(cr,/dn). If we require that all Cn = An and Z)n = BTl1
then it follows from Theorem 7 that the two continued fractions are
identical; that is, cn = an and dr, = bn for all n. So that has no point.
We have required too much. What we can do, and shall do, is to require
that An/Bn = Cn/Dn for all n.
The idea of equivalent continued fractions is due to Seidel [Seid55] who
also proved:
Transformations of continued fractions 73
Theorem 9 60 + K(an/bn) « d0 + ~K(cn/dn) if and only if there exists
a sequence {rn} of complex numbers with r*o = l,rn ^ 0 for all n E N,
such that
du = 60 , cn = rnrn_ian , dn = rnbn for aline N . B.2.1)
Proof : Let AtnBn be the canonical numerators and denominators of
60 + K(an/6n). Then 60 + K(an/6n) w dQ -f K(cn/dn) if and only if
there exist numbers rn ^ 0 such that the canonical numerators Cn and
denominators X)n of d$ -f K(cn/dn) can be written
n
C_, = 1, ?>_i = 0, Cn = An I] »"*, Dn = Bnl[rk B.2.2)
for all n. Since Z)o = ^u = 1 we need 7*0 = 1. From Theorem 7 it follows
then that d0 -f K(cn/dn) is given by B.2.1). ¦
Remarks:
1. The concept of equivalence is tied to the classical approximants.
Tf 60 4- K(an/bn) w d0 + K(cn/dn) by the relations B.2.1), then
S»(wn) = Tn(rnwn) for n = 0,1,2,... , B.2.3)
where Sn(w) are approximants of 6o + K(an/6n), and Tri(iy) are
approximants of d() + K(cn/rfn).
2. Tf {*„} is a tail sequence for 6o + K(an/^?i)» then {tnrn} is a tail
sequence for rfo+K(cn/rfn), where &o+K(an/&n) and rfo
are as in Theorem 9.
Example 7 The continued fractions in Example 4 and 5 in the previous
subsection are equivalent since they have the same sequence of approx-
approximants. To derive the one in Example 4 from the one in Example 5 we
use
n2 + l
7-0 = 1, T-, =2, rn = for n = 2,3,4,
(n- IJ + 1
74 Chapter II. More basics
An even simpler equivalent continued fraction can be obtained from the
one in Example 4 by using ru = 1, r^ = 1, rn = 2n — 3 for n — 2,3,4,
We get
1 ll -5-1 -7-3 -9-5 -11-7
2+ 4 |- 8 + 12 + 16 + 20 +•••
which therefore also has approximants n2/(n2 + 1) —» 1.
O
Example 8 The continued fraction in Example 6 also has a simpler,
equivalent form. The choice Tq = 1, r\ = 1, rn = pn-\ — 1 for n > 2
leads to
1 - P1P2 — 1 — P2P3 — 1 — PzPi — 1
which therefore also has approximants fn = nJJ=u Pk-
O
Example 9 We shall prove that K(^i2/3n) converges.
An equivalence transformation with rn = 1/n for n = 1,2,3,... brings
K(w2/3n) over to the form
Zl!~I 2/1 3/2 4/3 5/4 6/5
K
which converges by the Sleszynski-Pringsheim theorem, Theorem 1 in
Chapter I.
O
The following two equivalence transformations are of particular interest:
Corollary 10
A. 60 + K(an/6n) ^ b0 + K[l/dTl) where
dn = &nlla* forn = 1,2,3,.... B.2.4)
Transformations of continued fractions 75
B. Ifbn^O for alln>\, then bQ -f K(an/bn) « 60 + K(cn/1), where
cl = ^-, cn = . a.n forn = 2,3,4,.... B.2.5)
Remarks:
1. The transformation in A can always be performed. The elements
dn have the structure
d\ = bi • — , d2 = bi— , ds — 03 , CL4 = 0.1
ay a Cttt
2. The transformation in B can only be applied if all bn ^ 0, since
otherwise cn would not be a well denned complex number. Com-
Combined with Worpitzky's theorem in Subsection J^.3 in Chapter I,
it shows for instance that every continued fraction 60 + K(an/^»)
with |ai/&i| < 1/4 and |an/6n6n_i| < 1/4 for all n > 2 converges
to a finite value.
Proof :
A: Use Theorem 9 with
rt
*n = 11 fi~ for all n > 1
• : Use Theorem 9 with rn = l/6n for all n > 1.
Example 10 We shall see that the continued fraction K(C0+@.9)")/l)
in Example 13 of Chapter I converges.
76 Chapter II. More basics
By Corollary 10A it follows that K(C0 ¦+ @.9)n)/l) % K(l/dn) where
d\ — > —
30 + 0.9 31
30 + 0.9
d
30 + @.9J >
30 +
3 ~
30 + @.9J
30 + @.9J _1_
0.9)C0 + @.9)J) > 31
_ C0 + 0.9)C0+@.9):*)
4 C0 + @.9J)C0 + @9)')
In fact, we find d>n > 1 and d2n+i > 1/31 for all n, so ]T<in =
Hence the convergence follows by Van Vleck's theorem, Theorem 2 in
Chapter I. We also get that the value of the continued fraction is finite.
O
2.3 Fhe Bauer-Muir transformation
Definition The Bauer-Muir transform of a continued fraction 6q +
K.(atl/bn) with respect to a sequence {wn} from C is the continued frac-
fraction do + I<L(cn/dn) whose canonical numerators Cn and denominators
Dn are given by
^T1' n"-=H°'+JB B.3.1)
for n = 0,1,2, , where {An} and {Bn} are the canonical numerators
and denominators of b{) -f K(ari/^n)-
This transformation dates back to the 1870's [Bauer72], [Muir77]. What
the Bauer-Muir transformation does, is to give a continued fraction do ¦+
K(cn/dri) whose classical approximants Tn@) are equal to the modified
approximants Sn(wn) of 60 + K(ar»/^n)- With this notation we have:
Theorem 11 The Bauer-Muir transform ofbo~\-'K(an/bn) with respect
to {wn} front C exists if and only if
3 forn = 1,2,3,... . B.3.2)
Transformations of continued fractions 77
// it exists, then it is given by
t . , A] C2 C3
&o + ™o + t— , x , 7~ , B.3.3)
where
cn = an_iqn-i , dn = bn + wn - wn^2qn-i , 9, = An+l/An . B.3.4)
Proof: Let {Cn} and {?>„} be given by B.3.1). Then {Cu} and
{Dn} are canonical numerators and denominators of a continued fraction
do+KtcnKOifandonlyifC-! = A) = l,#-i = 0 and An = CnJDn_i-
DnCn-\ ^ 0 for all n > 1. (See Theorem 7.) The initial conditions for
Cn and Dn are satisfied. Moreover
An = CnDTl-i — DnCn-\
= (An + An_i«;
where the first factor is different from 0 by the determinant formula
A.2.10) in Chapter I and the second factor is equal to An in B.3.2). This
proves the existence part of Theorem 11. The elements of do + K(cTl/dn)
follows now from B.1.2). ¦
We shall see examples of three different applications of the Bauer-Muir
transformation.
Example 11 We shall see later (Theorem 28 in Chapter 111) that the
limit periodic continued fraction K(C0 + @.9)n)/l) has critical tail
sequence hn —*¦ 6 and right tail sequence /(n) —¦> 5 as n—»oo. Using
this here, we shall prove that 5nE) converges faster to the value / of
K(C0 + @.9)")/l) than 5n@), in the sense that
>0 asn->oo, B.3.5)
78 Chapter II. More basics
and find a Bauer-Muir transform of K(C0 + @.9)")/1) with respect to
wu = 5.
We proved in Example 10 that K(C0 + @.9)n)/l) converges to a finite
(ri\
value /. Indeed, by the same type of argument we find that all /^ } ^ oo
and all /'") ^ oo for this continued fraction. Since
f(n) _ fl»+1 ,
4 " 6 fiT
we therefore also have that all fjf1' ^ 0, /M ^ 0. This means that
all approximants /n, fn+k a^e distinct and finite (see remark at the end
of Subsection l-4)t an(^ B.3.5) follows from A.4.7). (This is consistent
with our observations in Example 13 of Chapter I.)
From Theorem 11 we get
An = 30 + @.9)n - 5E + 1) = @.9)",
which means that the Bauer-Muir transform is
da + K(cn/du)
0^ C0 + 0.9H.9 C0 + @.9JH.9
+ 6
= O +
6 + 6-0.9-5 + 6-0.9-5 +•••
0.9 27+@.9J 27 + @.9K 27+ @.9)l
6 + 1.5 + 1.5 + 1.5 +-
Idea: This continued fraction has the same structure as K(C0 + @.9)n)/
1), and for the same reasons we know that the modified approximants
Tn(w) of du + K(cn/dM) converge faster to / than Tn@) if w is the
positive root of the quadratic equation
27
w = i.e. w = 4.5 = 5 • 0.9.
1.5 + w
Hence, let us replace the first tail of dn + K(cn/dn) by its Bauer-Muir
transform with respect to wn = 4.5. This time we get
An = 27 + @.9)"+1 - 4.5A.5 + 4.5) = @.9)B+l,
Transformations of continued fractions
79
so the result is
0.9 @.9J
5 +
B7 + @.9J) • 0.9 B7 + @.9K) • 0.9
6 + 4.5 + 1.5 + 4.5 + 1.5 + 4.5 - 0.9 • 4.5 + 1.5 + 4.5 - 0.9 • 4.5+- - •
0.9 @.9J 24.3 + @.9K 24.3 +@.9)' 24.3 + @.9M
+ 10.5+ 6 + 1.95 + 1.95 + 1.95 +• - •
B.3.7)
Again we can repeat the process, this time with wn — 4.5-0.9 = 4.05 and
so on. Each time we get a continued fraction converging faster than the
previous one. It can be proved that this leads to the continued fraction
5 +
0.9 @.9J @.9)'* @.9I @.9)r>
10.5+10.05 + 9.645+9.2805+ L +•
B.3.8)
where
bn = 6 + 5 • @.9)n for n > 1.
The table below gives the first classical approximants for the given
continued fraction and for the continued fractions B.3.6), B.3.7) and
B.3.8).
n
1
2
3
4
•
•
¦
16
17
18
•
•
38
39
40
•
•
•
86
87
88
K(C0+@.9)n)/l)
30.90000
0.97139
1.56770
1.85765
•
*
*
4.59286
5.53149
4.73830
*
*
•
5.07571
5.09288
5.07857
*
•
5.08506
5.08507
5.08507
B.3.6)
5.15000
5.03667
5.12176
5.05762
¦
•
¦
5.08418
5.08573
5.08457
•
•
•
5.08506
5.08507
5.08507
•
•
•
B.3.7)
5.08571
5.08463
5.08536
5.08486
•
¦
5.08506
5.08507
5.08507
•
•
•
B.3.8)
5.08571
5.08506
5.08507
5.08507
*
•
•
80 Chapter II. More basics
In every column we have stopped when the computed value has reached
the accuracy of the table. (To determine this accuracy, we have used
a theorem for continued fractions with positive elements which will be
proved in Chapter TTI, Theorem 2.)
Let it finally be mentioned, that the continued fraction B.3.8) is even
better than the table shows. With more figures in the approximants of
order 3 and 4 we actually find for the value /:
/i ^ 5.085066164 < / < 5.085066199 « /3 .
O
Example 12 One can prove that the approximants Sn(—6) of K(C0+
@.5)")/l) converge, but not to the value / of the continued fraction.
But if we try to compute Sn(—6) from the continued fraction, we have a
problem. Small inaccuracies in the input or computation will have the
effect that our computed sequence still converges to /. The computation
of 5n(—6) in this way is unstable. (See the table.) How can we find a
more stable method to compute Sn(—6)? This problem is important for
(for instance) analytic continuation by the method to be described in
Chapter III.
We find the Bauer-Muir transform of K(C0 + @.5)")/l) with respect to
wn = —6. Since
An = 30 + @.5)n - (-6)A - 6) = @.5)" ,
this transform d,Q -f- ~K(ctl/dn) is given by
(U> C0 + 0.5) -0.5 C0 + @.5J) • 0.5
+ 5 5FH5 -5-(~6)-0.5 +-¦¦
0J5 15 + @.5J 15 + @.5K
" -5+ -2 + -2 +-..'
and its classical approximants Tn@) (which can be computed stably)
are exactly 5n(—6).
We observe that the correct "table-value", -6.06220, of limSn(-6) is
taken on for Tri@) for all n > 19.
Transformations of continued fractions
81
n
1
2
3
•
•
•
12
13
14
15
¦
¦
•
18
19
20
•
•
161
162
163
Sn(- 6)
-6.09999
-6.03962
-6.07580
•
-6.06215
-6.06218
-6.06229
-6.06208
•
•
¦
-6.06247
-6.06189
-6.06257
•
•
•
5.05858
5.05859
5.05859
Tn@)
-6.03960
-6.07582
-6.05404
•
•
•
-6.06228
-6.06215
-6.06223
-6.06218
•
¦
•
-6.06221
-6.06220
-6.06220
•
•
•
-6.06220
-6.06220
-6.06220
Observe also, that the computed values of Sn(—6) for n = 13 and 14 are
pretty close to the value —6.06220. By increasing n, however, the values
"take off" and in the long run (n > 162) approach the value 5.05859 of
limSn@).
O
Example 13 The continued fraction
K(an(z)/1) where
z + n
n(n — 1)
+ n)/n) ls equivalent to
0 as n —> oo
Hence, K((z + n)/n) converges to some value f(z) for all z G C by the
argument of Example 1. What does this function f(z) look like? What
is the value of K((z + n)/n)?
Let us assume that the modified approximants 5n(l) of K((z + n)/n)
also converge to the same value f(z). (It is possible to prove this, as
82 Chapter II. More basics
we shall see later. It is for instance a consequence of the parabola
theorem, Theorem 20 in Chapter III.) Then its Bauer-Muir transform
d[) -f- "K.(cn/dn) with respect to wn = 1 must also converge to f(z). To
determine d{) -f K(cn/dn) we find that
\n — z -f- n — X(n + 1) = z — 1 for all n.
Hence
cn z-\ z+X z + 2 z + 3
which looks similar to K((^ + n)/n). Indeed, the first tail g^x\z) of
do + JC(cn/dn) is such that
z z z -f 1 z + 2 z—1 + n
K
n
Since c?0 + K(cM/dn) also converges to /(^), tills means that
which is a functional equation for /(z). If we let z = X in this equation,
we find that
i
7(o)+T -J
since /@) must be a positive number. Hence, we can at least obtain the
value of f(k) for all k G N. For instance
1+14
21 ?36
?:
—+ 4 IO + 5
13 73
_O
Transformations of continued fractions 83
2.4 Contractions and extensions
We shall call do + K(cn/rfn) a contraction of 6q + ]&.{an/bn) if its classical
approximants {gn} form a subsequence of the classical approximants
{/„} of 60 + K(an/bn). We call b0 + K(an/6n) an extension of d0 +
K(cn/dn) in this case. Also this idea is due to Seidel [Seid55] although
Lagrange had some special cases already in 1774-76 [Lagr74], [Lagr76].
We call in particular do + K(cn/d,) a canonical contraction of 60 -f-
K(ajbn) if
Ck = Ank1 Dk = Bnk forfc = 0,1,2,..., B.4.1)
where Cn,DniAn and #n are canonical numerators and denominators
of du -f K(cn/dri) and 6q + K(an/&n) respectively. To derive a general
expression for a canonical contraction we can use Theorem 7 combined
with formula A.1.9) in Lemma 2 (with TV = 0). Rather than considering
the general case we shall restrict ourselves to some important special
cases.
Theorem 12 The canonical contraction of bo -f J<i{an/bn) with
Ck = A2k, Dk = B2h for k = 0,1,2,...
exists if and only if b2k 7^ 0 for k = 1,2,3,..., and is then given by
, b2ai a>2a;ibi/b2 04A56G/ki /2 .
b2b\ -f- a2—ax + b^b.{ + a^b/b + 66 + b/b
^_0 is a ^az7 sequence for bo + K(ari/^n) ti/ttfe a// ?„ ^ oo;
» —^i^2» —^3^4, • • • is a tail sequence for B-4-2).
Proof : From Theorem 7 we find that the canonical contraction has
elements
do = Co = Au = b0 ,
d\ = D{ = B2 = 626i + a2 ,
cj = Ci - C0D1 = A2 - A0B2 = 62aj ,
84 Chapter II. More basics
cn = -Arl/An_!, where
An = CnDn-\ — DnCn-\ = A2nB2n-2 ~ #2n^2/i-2
2n-L 2n-l
= ~D[in-^ n (-«>) - -**» n (-
by formula A.1.9), and finally dn = Pn/An_i, where
Pn = CnDn-2 — DnCn-2 = A2n-#2n-4 — -^2
2n-3
= -42-:i) n (-«>
J=l
This proves B.4.2). Let {tn} be a tail sequence for &o + (/)
with all tn ^ cxd. Then an = tn^\{bn + tn) for all n. To see that
^o, — t\t2i —*:i^4» ••• is a tail sequence for d0 + K(cn/dn) given by B.4.2)
it suffices to prove that
C\ = tQ(di — t}t2), Cn = —t2n-
for n — 2,3, This follows by straight forward computation using the
substitution an = ?n_iFn + tn). m
A contraction of this kind, where the approximants are the even-num-
even-numbered approximants of 6o + K(an/&n) ls often called the even part of
fco + K(an/^n). By an equivalence transformation, B.4.2) can be written
in the form
-f
a6a76864
- 64(a6 + 6566) -f- a566 - 66(a« + 676s) + 076s
which is more widely used (but which is no longer canonical). If {tn};
tn ^ 00 is a tail sequence for 6U + K(ari/6n), then t0, —tit2, —62^3^1,
—6.jM6? —tehl*,... is a tail sequence for B.4.3). This follows from
Remark 2 to Theorem 9.
Transformations of continued fractions 85
Theorem 13 The canonical contraction of 60 -f 1<L(arl/bn) with Co =
A\/Bi, Dq = 1 and
for k = 1,2,3,...
exists if and only if b2k+i ^ 0 /or k = 0,1,2,..., and is then given by
6i(a3 + b2b
070369/6
6567 + 0067/65 —
B.4.4)
//{/„} 25 a tail sequence for 6q + K(ari/6n) wiJ/i a// in 7^ oo, then
—t{)ti/b\, —61^2*37 " ^4*5; —^6^7? • • • *s a ^a^ sequence for B.4-4)-
The proof follows the same lines as the proof of Theorem 12 and is
omitted. Contractions such as B.4.4) which have the odd-numbered
approximants of 60 + J?.(a>n/bn) as classical approximants, are called odd
parts of 60 + K.(an/bn). An equivalence transformation changes B.4.4)
to the form
61 6l(a3 + 6263) + 0263
03016561 05006763
(z.4.5)
- 6:j(a5 + 6165) 4- 0,465-65@7 4- 6G67) 4- a667--
By Remark 2 to Theorem 9, the tail sequence —tyti/bx, — 61 ?2^) — *'i*5> • ¦ •
of B.4.4) transforms into the tail sequence — ?0*1/61, —&i?'2*3» —63^,^5,...
of B.4.5).
86 Chapter II. More basics
Problems
A) Prove that K(an/l) converges if an —> — 0.2.
B) Prove that K(an/bn) converges if an —> 2 and all |6n| > 4.
C) Let {tn} be a tail sequence for K(an/&r») with all tn ^ oo, and
let An and Bn denote its canonical numerators and denominators.
Show that then
n
B^tn = JJ F^ + tk) , (l)
n
and
where fn — An/Bn and hn = Bn/Brl-i.
D) Given the periodic continued fraction
_2-42-42-4 \
" 4+ i +4+ i +4+ 1 4---./ "
(a) Find the periodic tail sequences of K(an/6n).
(b) Find the first ten terms of its critical tail sequence {hn}.
(c) Show that
~ ^ h>2n 5: 1 and h-in+x > 6 for all n > 2.
(d) Use the results from a) and c) and the formula C) in Problem
3 to prove that K(an/&n) converges to 1.
(e) Use formula A.4.5) to find a posteriori truncation error bounds
for K(an/bn).
Problems 87
(f) Use Theorem 6 to determine the asymptotic behavior of {hn}.
(g) Compute the 10 first approximates of the types 5n@), Sn[tn)
and Sn(tn) for the continued fraction
2 + 0.5 4 + @-5)'2 2 + @.5):* 4 + @.5L 2 + @.5M
4- 1 + 4 - 1 + 4
where {tn} and {tn} are the periodic tail sequences of
K(on/6n) from a). Compare these sequences of approxi-
mants.
E) We return to the 3-periodic continued fraction
ori_211211.2
KT ~ I+1T+1TT+1
from Example 15 in Chapter I and Example 2 in this chapter.
(a) Find the first 5 terms of its critical tail sequence.
(b) Determine the critical tail sequence of its first tail
1 1. 2 1 1 2
T—T+T+T—T+Th—
Compare this to the results in Example 2.
(c) Determine the asymptotic behavior of the critical tail se-
sequence in (a).
(d) Explain the convergence result
lim 5tl(turi) = - for all {wn} bounded away from 0, — 1, oo
for K(on/1) by means of Theorem 4. (This result was proved
by another method in Example 15 in Chapter I.)
F) (a) Use the method of Example 3 to compute the value of
K((-0.2+ @.4)")/l) with an absolute error less than 0.05.
(b) Which approximants Sn(wn) would you choose to compute
G) (a) Show that if K.(a/b) is a 1-periodic continued fraction which
converges generally to /, then it converges to / also in the
classical sense.
88 Chapter II. More basics
(b) Show that if K(an/1) is a 2-periodic continued fraction which
converges generally to /, then it converges to / also in the
classical sense.
(c) Give an example (other than the one in Problem E)) of a
periodic continued fraction which converges in the general
sense but not in the classical sense.
(8) Prove that the continued fraction
Po+ ~T ., ,2 ., ,3 where all pk f. 0
1 -1 + pi — 1 + p$
has canonical approximants An/Bn with
n = 0,1,2,
(9) Let N be a fixed natural number. Show that the canonical con-
contraction of fc0 + K(««/&«) with
Cn = An,Dn = Bn forn = 0,1,2,...,N - 1
and
Cn = An+U Dn = Bn+] for n = N, N + 1, ^V + 2,...
exists if and only if 6^+1 ^ 0, and show that then it is given by
bo + P-
-\ \- 6yv_
4-
Show further that if {tn}'?L() is a tail sequence for 6q +
with tjv ^ ex) and ?/v+i 7^ °°} then
<o, ?1,..., ?;\_i, — ivv^+ii ^Ar+2> ^N+3? • • • is a tail sequence for this
contraction.
A0) Given 6q + K(a«/6n) with critical tail sequence {/&„} such that all
/in ^ 0. Prove that its equivalent continued fraction
°
has all canonical denominators equal to 1 for n > 0.
Problems 89
A1) Let 60 + K(an/&n) have classical approximants 5n@) = /n, let
N G N, N > 2 and let g 6 C be chosen such that
AN - BNg
Prove that
a/v-i fl/v /? ai\+i/p aN+z
is an extension of 60 + K(an/^n) with classical approximants
/n for n = 0,1,..., N — 1,
<7 for n — N ,
(This idea can be found in [Perr57, p. 15] and [J0TI18O, p. 43].)
A2) The Khovanskii transform of ~K(an/l) is given by
2a2 - 1 - 1 + 2a3 + 2a.j - 1 - 1 + 2a5
O-ln a'2n+\
_. . ._ 1 — 1 -j-
See also [Khov63, p. 22]. Prove that if both K(an/1) and its
Khovanskii transform converge (in the classical sense), then they
converge to the same value.
A3) Prove that the following continued fractions converge.
(a)
(b)
(c) i
A4) (a) Find the Bauer-Muir transform of
2 . . 22 22 42 42 62 62
z - 1 + —
1 +Z2 - 1+ 1 +22 - 1+ 1 +Z2 - 1 +•
with respect to
- — k if n is odd,
n(z + 1) + ^C + 2z - z2) if n is even.
90 Chapter II. More basics
(b) Assume that the continued fraction in (a) and its Bauer-Muir
transform converge to the same value f(z) for z > 1. Find a
functional equation for f(z).
References
[Bauer72] G. Bauer, Von einem Kettenbruch von Euler und einem
Theorem von Wallis, Abh. der Kgl. Bayr. A lead, der Wiss.,
Miinchen, Zweite Klasse, 11 A872), 99-116.
[Jaco86] L. Jacobsen, General Convergence of Continued Fractions,
Trans. Amer. Math. Soc. 294, no. 2 A986), 477-485.
[JoTh80] W. B. Jones and W. J. Thron, "Continued Fractions: An-
Analytic Theory and Applications", Encyclopedia of Mathe-
Mathematics and its Applications, 11, Addison-Wesley Publish-
Publishing Company, Reading, Mass. A980). Now distributed by
Cambridge University Press, New York.
[Khov63] A. N. Khovanskii, "The Application of Continued Fractions
and their Generalizations to Problems in Approximation
Theory", P. NoordlioffN. V., Groningen, The Netherlands
A963).
[Lagr74] J. L. Lagrange, Additions aux Elements d'Algebre d'Euler,
Lyon A774).
[Lagr76] J. L. Lagrange, Sur I'usage des fractions continues dans le
calcul integral, Nouveaux Mem. Acad. Sci. Berlin 7 A776),
236-264; Oeuvres, 4 (J. A. Serret, ed.), Gauthier Villars,
Paris A869), 301-322.
[Muir77] T. Muir, A Theorem in Continuants, Phil. Mag., E) 3
A877), 137-138.
91
92 Chapter II. More basics
[Perr57] O. Perron, "Die Lchre von den Kettenbrikhen" Band 2, B.
G. Teubner, Stuttgart A957).
[Seid55] L. Seidel, Bemerkungen uber den Zusammenhang zwischen
dem Bildungsgesetze eines Kettenbruches und der Art des
Fortgangs seiner Ndherungsbruche, Abh. der Kgl. Bayr.
Akad. der Wiss., Miinchen, Zweite Klasse, 7:3 A855), 559.
[Thron89] W. J. Thron, Continued Fraction Identities Derived from
the Invariance of the Crossratio under Linear Fractional
Transformations, "Analytic Theory of Continued Fractions
III", Proceedings, Redstone 1988, (L. Jacobsen ed.), Lec-
Lecture Notes in Mathematics 1406, Springer-Verlag, Berlin
A989), 124-134.
Chapter III
Convergence criteria
About this chapter
Applications of continued fractions are often tied to their possible con-
convergence. It is therefore important to have convergence criteria which
are easy to check and which cover large classes of continued fractions.
Rather than discussing a large variety of such criteria we shall empha-
emphasize how one can derive them. This means in particular that only the
best known and/or the widest applicable convergence theorems will be
presented here. For a more complete list we refer to [JoTh80].
The methods we use are based upon some very nice mapping properties
of linear fractional transformations. Some of these methods will also
lead to truncation error estimates. In this respect we have taken the
attitude that relatively simple and easy to use bounds are often to be
preferred to more complicated but slightly tighter ones.
93
94 Chapter III. Convergence criteria
1 Two classical results
1.1 The Stern-Stolz divergence theorem
In Chapter I we presented three classical convergence theorems. We
shall now see two more. The first one is in fact a divergence theorem.
It dates back at least to the 1860's [Stern60], [Stolz86]. We state it in a
slightly more general form:
Theorem 1 (The Stern-Stolz Theorem) The continued fraction
bo + K(l/6n) diverges generally if J2 \bn\ < oo. In fact
lim A2n+P = PP / oo, lim B2n+P = QP ^ oo C1-1-1)
n —r oo n —r oo
for p = 0,1, where
= i. A-1.2)
Remarks:
1. We say that 6() + K(aTl/bn) diverges (generally) if it fails to con-
verge (generally) in C. Since general convergence is a slightly
wider concept than classical convergence, it follows that general
divergence is a slightly stronger property than divergence in the
classical sense.
2. From A.1.1) it follows that
lim S2nM= n[)llW
—>oo lnK ' Qo +
n—>oo
A.1.3)
That is, the even and odd parts of K(l/6n) converge in the classical
sense to Pu/Qo and P\IQ\. However, by A.1.2) these limits are
distinct, so the continued fraction itself diverges. It even diverges
generally.
Two classical results 95
Here is the place for a little reflection. The even part of K(l/bTl)
converges and thus converges generally. Still it follows from A.1.3)
that the limit of S2n(w) is totally dependent on the choice of w.
How can this be? Does not this violate Theorem 4 in Chapter II?
Please note that {5^@)} is the sequence of classical approximants
{Tn@)} for the even part of K(l/67l), whereas S2n(w) / Tn(w) for
w / 0. In fact, if we consider the canonical even part of K(l/6n)
as described in Theorem 12 in Chapter II, then S2n(w) = TTl(wn)
if and only if
A2n + A2n-\W A2n + A2n
B2n + B2n-jw ~ B2n + B2n-2wn '
that is, if and only if
wn = - for all n.
Hence, the classical convergence of the even part of K{l/bn) im-
implies general convergence of the even part, but not convergence of
S2n(w) to a value independent of w.
3. An equivalence transformation does not change the classical ap-
approximants of a continued fraction. Hence, from Corollary 10A in
Chapter II it follows that K(an/&n) diverges in the classical sense
if
OO n
Ebn TI ak
n= 1 &:= 1
The series in A.1.4) is called the Stern-Stolz series of K(an/bn). It
is invariant under equivalence transformations of K(an/6n). The
general divergence of ~K(an/bn) also follows easily.
<oo. A-1.4)
Proof of Theorem 1: It suffices to prove A.1.1)-A.1.2). {An} and
{Bn} are solutions of the recurrence relation
Xn = 6nXn_i + Xn-2 for n = 1,2,3,.... A.1.5)
By induction it follows that any such solution satisfies
-tU \XQ\} • (\bx\ + 1)(|62| + 1) ¦ - .(|6n| + 1).
96 Chapter HI. Convergence criteria
Hence {ATl} and {BTl} are bounded under our conditions. This means
that Yl^nAn-\ and Y^bnBn-\ converge absolutely. Since Xn — Xn-.2 =
6nXn_i by A.1.5), we get for instance
« n
jrt2n — / j \J^lm ~~ ^12»« —2/ — / v
m=l rn=l
and similar expressions for A2n+i-> ^2n a^d B2n+i- This proves A.1.1).
A.1.2) follows then since by the determinant formula (see formula A.2.10)
in Chapter T)
A2n+\B2n - A2nB2n+i = 1 for all n.
1.2 Continued fractions with positive elements
Let K(an/6n) have all an > 0 and bn > 0. Then
5,@) = fL>0,
0<52@) = -^.<Jl- =
since (a2/b>) > 0. Furthermore
53@) = ^-a— > ^^ = 52@)
since
Moreover
*3
and so on. We get:
Two classical results 97
Theorem 2 Let all the elements an and bn of K(arl/6n) be positive.
Then
52@) < 5,@) < 5G@) < ¦ • • < 55@) < S:,@) < 5^0). A.2.1)
Remarks:
1. We find from A.2.1) that {52«@)} is a bounded, monotonely in-
increasing sequence. This implies that {5^@)} converges to a finite
value L[). Similarly {S2n+i@)} decreases monotonely to a finite
value L\, and Lq < L\. Hence, both the even and odd parts of
~K(an/bn) converge to finite values.
2. If we know that K{an/bn) itself converges, then A.2.1) can be used
to estimate its value /. By setting
/-/„' = \(S2n+, @) + 52rl@)) A.2.2)
(the average value of the two approximants), we know that the
error is bounded by
- fn\ < \(S2n+l{0) - 52n@)). A.2.3)
Example 1 In Example 5 in Chapter I we used a continued fraction to
estimate the value of log 2:
The first seven approximants /„ = 5n@) were given in a table. The
oscillatory character of {5n@)} is consistent with A.2.1). In particular
we get
log2 « i(/7 + /6) ± \{f7 - /6) « 0.693135 ± 0.000015
which agrees with the correct value of log 2.
O
98 Chapter III. Convergence criteria
Knowing Theorem 2 makes it easy to prove the second classical result
due to Seidel [Seid46] and Stern [Ster48]:
Theorem 3 (The Seidel-Stern Theorem) Let all the elements bn of
n) be positive. Then K(l/6n) converges if and only ifYl^n = °°-
Remarks:
1. From Remark 1 to Theorem 2, we know that if K(l/&u) converges,
then it converges to a finite value, and if it diverges, then its even
and odd parts still converge to finite values.
2. An equivalent formulation of Theorem 3 is that K(an/^n) with all
an > 0,6n > 0 converges if and only if its Stern-Stolz series A.1.4)
diverges to oo.
Proof: If ?6n < oo, then Jji(l/bn) diverges by Theorem 1. Let
^2bn = oo. To prove that then ~K(l/bn) converges, it suffices to prove
that
@) - S2,,@) = ?=±i - ^ = \ - 0. A.2.4)
Since
Bn = bnBn-\ + Bn-2 f°r w = 1,2,... and D-\ = 0 , Bq = 1,
it follows that all B2n > B2n-2 > — • > B{) = I and B2n+i > B2n-\ >
- - • > B\ — b\, so that
B2n > b2nbi + B2n-2 > •' > (&2n + hn->
and
B2n+l > b-in+l ' 1 + B2n-\ > • • • > b2n+l +
The divergence of J2 bn now proves A.2.4).
Two classical results 99
Example 2 In Example 2 in Chapter I we used the continued fraction
1 - - -
2 + 2 + 2 + --.
to compute approximations to y/2. This can be done since the continued
fraction converges by Theorem 3, and its value must be the positive
solution of the equation / = 1+1/A+/); i.e. / = \/2. The approximants
fn — 5n@) were computed for n = 1,2,..., 5. They show the oscillation
property described in A.2.1). In particular we get f.\ = 41/29 < y/2 <
f5 = 99/70, i.e.
± 777^ ~ 1.41404 ± 0.00025
4060
7^77 ± 777^
4060 4060
which agrees with the value y/2 = 1.41421356
O
If we use modified approximants Sn(wn), this useful oscillation property
sometimes gets lost. When is it preserved?
Example 3 In Example 14 in Chapter I we used the classical approx-
approximants 5n@) and the modified approximants Sn(wn) where w-2k = 1,
w2k+\ = 2, for the continued fraction
3 + 1/12 4 + 3/22 3 + 1/32 4 + 3/42 3 + 1/52
1 + 1 + 1 + 1 + 1 +•••"
The table for fn = 5n@) displays the oscillation property A.2.1). Also
the table for Sn(wn) shows the same property, but only up to and in-
including n = 5. In this case we have
3 + 1/12 ^ g/ 3 + 1/12
1 + 2 -""i""/- 4 + 3/22
because
= 2 4 + 3/22 a2
1 + 1 1 + w2 '
100 Chapter HI. Convergence criteria
and
= =T^ >***> = 7^4+ »/*
2
o -r l / o
because
4 + 3/22 a2
- 2
1 + 2
and so on. But S\(wa) > 5
O
If we follow the same line of argument as in Example 3 we find:
Theorem 4 Let all the elements an and bn of~K(an/bn) be positive and
let all wn > 0. Then
S2{w2) < 5.| (w.|) < S6(wG) <•¦< Sn(ws) < 53(^j) < ?,(«;,) A.2.5)
if
wn < ¦ and wn < ^— A.2.6)
t>n+2 + W^
/or a// n G N.
Remarks:
1. A.2.6) holds trivially if all wn = 0.
2. If A.2.6) holds with the opposite inequality signs, then A.2.5)
holds with the opposite inequality signs.
3. In Example 3 the second inequality in A.2.6) fails to hold for r? = 4,
since wl{ = 1 and the righthand side = C + l/25)/C + 1/24).
Periodic continued fractions 101
2 Periodic continued fractions
2.1 Introduction
A continued fraction 6o + K(an/&n) is called periodic with period length
k 6 N, or k-periodic for short, if the sequences {an}S?=i and {Aj}^! of
its elements are all k-periodic; i.e. if
a*p
B.1.1)
for some N G N(). We say that the period begins at n = N -f 1. We
have already seen several examples of periodic continued fractions.
The approximants of 60 + K(an/&n) w^n property B.1.1) can be written
SN+kn+P{w) = SNo T? o Tp(w) B.1.2)
where Syv is as used earlier, and
*
7i = 1,2,..., k. B.1.3)
a
The convergence behavior of 6o + K(an/6n) depends therefore on how
the linear fractional transformation Tf. behaves under iterations.
2.2 Classification of linear fractional transformations
Linear fractional transformations
t(w) =
C* \JU | \JL
are often classified according to how iterations behave asymptotically;
i.e. to what happens to
tn(w) = toto---ot(w) n times B.2.2)
as n —* oo. If tn(w) —> x, then clearly x must be a fixed point of t; i.e.
t(x) = x. Unless t is the identity function t(w) = w (i.e. a = d ^ 0,6 =
102 Chapter III. Convergence criteria
c = 0), it can have at most two distinct fixed points, since they have to
be solutions of the equation
ex2 + (d - a)x - b = 0 . B.2.3)
We allow x = oo as a fixed point. From B.2.1) we see that x = oo is a
fixed point for t if and only if c = 0.
Basis for the classification is the following result:
Theorem 5 Let t be a linear fractional transformation B.2.1) with at
most two fixed "points x and y.
A. If x — y (only one fixed point) then
lim tn(w) = x for allw EC. B.2.4)
n —* oo
B. Ifx^y and
\cx + d\ = \cy + d\
0,
[2.2.0)
a\ = \d\
then t"(w) diverges (by oscillation) for all w
C. Ifx^y and
0,
[2.2.b)
a\ / |rf| ifc = 0 ,
then
lim tn(w) = x for allw ^y. B.2.7)
n —* oo
(If c = 0 then x = oo if \d\ < \a\ and y = oo if \d\ > \a\.)
For the proof of Theorem 5 we refer to text-books on complex analysis.
These three different types of linear fractional transformations are given
Periodic continued fractions 103
special names. We say that t is parabolic if it has only one fixed point
as in Theorem 5A, elliptic if it is as in Theorem 5B and loxodromic if it
is as in Theorem 5C. It is also common to say that t is hyperbolic if it is
loxodromic with
{ex + d)/(cy + d) > 0 ^,
B.2.8)
a/d>0 ifc = O.
(In some books one has chosen to say that t is not loxodromic if it
is hyperbolic.) Note that these three possibilities: parabolic, elliptic
and loxodromic, are the only ones we have in addition to the identity
transformation. Note also that t is parabolic (and not elliptic) if cx + d =
cy -\- d and c^0orifa = d^0, c = 0 and
We also say that the fixed point x of t is attractive if tn(w) —> x for all
w different from a point y, and repulsive if tn(w) —> p ^ x for all w ^ x.
Properties
1. Let i be a fixed point of t. Then x is also a fixed point of t ].
That is, t and t~l have the same fixed points. However, if x is
an attractive (repulsive) fixed point of ?, then x is a repulsive
(attractive) fixed point of t~l. In particular this means that the
classification (parabolic/elliptic/loxodromic) off is invariant under
inversion.
2. We say that a linear fractional transformation tc is conjugate (or
similar) to t if there exists a linear fractional transformation p such
that
tc -pot op . B.2.9)
Since then ?" = p o tn o p, it follows that our classification is
invariant under conjugation.
If x is a fixed point for 2, then p(x) is a fixed point for tc in B.2.9).
And if x is attractive (repulsive) for t, then p(x) is attractive (re-
(repulsive) for tc.
104 Chapter III. Convergence criteria
2.3 Convergence of "periodic continued fractions
By combining B.1.2) and Theorem 5, we find our main result in this
section. We refer to B.1.1)-B.1.3) for notation.
Theorem 6 Let 60 -f K(flri/^n) ^e a k-periodic continued fraction satis-
satisfying B.1.1).
A. IfTf- is parabolic, then 60 + K(an/bn) converges to Sn(x), where
x is the fixed point o
B. If Tk is elliptic or the identity transformation, then fc0 + K(an/^n
diverges generally.
C. If Tk is loxodromic, then 60 -f K(an/67J) converges generally to
Sjsj(x), where x is the attractive fixed point ofT^. It also converges
to Sjv(x) in the classical sense if
TP=l,2,...,k, B.3.1)
where y is the repulsive fixed point of Tk.
Example 4 Tn Example 2 we proved that the 1-periodic continued frac-
fraction
1 I I
2 + 2 + 2+-.¦
converges to / = \fi. This agrees with Theorem 6 since T\{w) = 1/B +
w) is loxodromic with attractive fixed point x = y/2 — 1.
O
Example 5 In Chapter I, Subsection 2.3, we promised to return to the
continued fraction
x x x x
2+2 + 2 + 2H
and prove that it converges to / = y/l + x — 1 for all x in the cut plane
C \ X, where the cut L is the real ray (—oo, —1). This follows now
Periodic continued fractions 105
easily from Theorem 6 since Ti(w) = x/B + w) is parabolic for x = — 1,
elliptic for x ? L and loxodromic otherwise, and since / is an attractive
fixed point for rl\ when T\ is parabolic or loxodromic when we choose
the principal branch for the square root; i.e. 3ft f yl-j-ajj > 0.
O
2.4 Thiele oscillation
Let us return to condition B.3.1) in Theorem 6C. Tf Tk is loxodromic
and Tp@) = y for some p ? {1,2, ...,&}, then 60 + K(an/6n) diverges
in the classical sense although it converges generally. This follows easily
since then
for all n G N , B.4.1)
whereas SW+Jfcn+m@) —> Sj\f(x) as n —> 00 for all m such that Tm@) ^ 2/.
This phenomenon is called Thiele oscillation, due to Thiele [Thie79] who
was the first one to point out that this thing could happen.
In Example 2 in Chapter II this phenomenon was connected to properties
of tail sequences for 60 + K(an/6n). Let us do so here too. Since 7^ is
loxodromic, it has two distinct fixed points x = x^ and y =
Moreover, since
rl
- TrloTkoTj{w), B.4.2)
and thus is a conjugate of 7),., we know by Property 2 in Subsection
2.2 that also T^ is loxodromic. Let xW and yW be the attractive and
^ y
repulsive fixed points of TJfK Then the right tail sequence {/(")} of
fco + K(an/^n) is periodic, looking like
aj(°) aj(O aj^) x^ - x^ x^ ajC*) -(o) T2 4 3^1
from some n on. The second periodic tail sequence {tn} ofb() + 'K(aTl/bn)
looks like
@ (*-i) (*)(«) (l) v(*-0 «
106 Chapter III. Convergence criteria
from some n on. If Tp@) = y = y(°\ then
Tl»\0) = T~' o Tk o Tp@) = T"' (») = 0;
i.e. j/(p) = 0. Hence we have the following alternative characterization of
Thiele oscillation: 2\ loxodromic and y(p) = 0 for some p G {0,1,..., fc —
Or, since j/^ = 0 if and only if y&+l) = oo: 6o + K(an/&n) diverges
by Thiele oscillation if and only ifTf. is loxodromic and y(f)+l) = oo for
some p ? {0,1,..., A: — 1}.
It is worth noticing that Thiele oscillation can never occur if k = 1 or if
A; = 2 and 6; = 6? = 1. (See Problem 7 in Chapter II.)
Example 6 The periodic continued fraction
2 112 112
T-fi—T+T+i —T
in Example 15 in Chapter I and Example 2 in Chapter II, diverges by
Thiele oscillation. Its periodic tail sequences are
! 3 -HI 3 -2 1
2" 3'2' ' 3'2'""
and
0, oo, —1,0, oo, —1,0, oo, —1,0,....
The first one is the right tail sequence. The second one reveals the Thiele
oscillation.
O
2.5 Tail sequences
For simplicity we let H(an/bn) be a A;-periodic continued fraction where
the period begins at n = 1. That is, B.1.1) holds with N = 0. Then
K(an/&n) has the periodic tail sequences B.4.3) and B.4.4). Combining
tills knowledge with Theorem 6 in this chapter, we immediately find:
Periodic continued fractions
107
Theorem 7 Let "K(an/bn) be as described above. Let {tn} C C be a
tail sequence for ~K(an/bn).
A.
(= Sk) is parabolic with fixed point x, then
lim tkn+p = T~](x) =
n —f oo
-l. B.5.1)
B. If Tk is loxodromic with attractive fixed point x and repulsive fixed
point y, and tu ^ x, then
forp = 0,1,..., k - 1. B.5.2)
lim
n —> oo
Example 7 The 4-periodic continued fraction
1.13211321
T+T-T-i-fT+T-T-T+T+
converges generally to the attractive fixed point x
transformation
2 4 +
B.5.3)
¦
1 of the loxodromic
^/x 113
Ti(w) = - -
¦|V ; 1 11
The repulsive fixed point of Tj is y = —4. Hence
5i
Therefore, every tail sequence {tn} with fu ^1 satisfies
n —> oo
ifp=l,
9
5
2
I 3
if p = 3.
Tn particular this is the case for tn = —hn where {hn} is the critical tail
sequence for B.5.3).
Note also that since all y(p) ^ oo, we have no Thiele oscillation, so
B.5.3) converges to x = 1 also in the classical sense.
O
108 Chapter III. Convergence criteria
3 Techniques to prove convergence
3.1 Convergence sets
Convergence criteria for continued fractions are often given in terms of
convergence sets O C C X C: If (an,bn) G fi for all n, then J<L(an/bn)
converges. Examples of such sets are the Sleszynski-Pringsheim set
H = {(a, b) G C x C; |6| > \a\ + 1} C.1.1)
(Theorem 1 in Chapter I) and the Worpitzky disk
il = E X {1} where E = {a G C; \a\ < 1/4} , C.1.2)
(Theorem 3 in Chapter I). Observe that here {1} means the one-point
set consisting of only the element 1. This use of symbols is not consistent
with the use of {•} for sequences, but will be used in a few places where
the context prevents confusion. A conditional convergence set fi is a set
fiCCxC such that: If (an,6n) 6 Q for all n, then "K(an/bn) converges
if and only if its Stern-Stolz series diverges to oo, i.e.
?
n
k=\
k
= oo. C.1.3)
The Van Vleck sector
H = {1} X G where G( = V( = Ib G C; |arg6| < | - e| C.1.4)
for an 6 > 0 is an example of a conditional convergence set, (Theorem 2
in Chapter I). Another example is Q = {(a, 6) GCxC;a>0,6> 0},
(Theorem 3 and the subsequent Remark 2 in this chapter).
A uniform convergence set U is a convergence set to which there corre-
corresponds a sequence {Aa} of positive numbers converging to 0 such that
|?n+m@) - Sn@)| < AM for all m, n G N C.1.5)
for every continued fraction K(an/6n) from H; i.e. for every continued
fraction K(an/6n) with all (anibn) G H.
Techniques to prove convergence 109
All these types of convergence sets refer to classical convergence. For
general convergence we shall use the terms general convergence sets,
conditional general convergence sets, and uniform general convergence
sets with respect to some set W C C, where
\Sn+m(wn+m) - Sn(wn)\ < Xn for all m, n G N and wk e W.
C.1.6)
For continued fractions K(an/1), a convergence set Q can always be
described as Cl — E x{l}. For short we say that E C C is a convergence
set for continued fractions K(an/1) ifn = 2?x{l}isa convergence set.
Similarly, we say that G C C is a convergence set for continued fractions
K(l/6n) if H = {1} X (? is a convergence set. For instance, the Van
Vleck sector Gc in C.1.4) is a conditional convergence set for continued
fractions
Sometimes we need the more general notion of a sequence {Qn}^_, C
C X C of convergence sets: Tf (anjbn) ? Qn for all n, then K.(an/bn)
converges.
Example 8 The 3-periodic continued fraction
2 112 11 _ an
T_t_I_T_t_T+T_7+..." K T
from Example 6 converges generally, but diverges in the classical sense.
Hence {Hri} where Qn = {an} X {1} is a 3-periodic sequence of general
convergence sets, but not a sequence of convergence sets.
O
A 2-periodic sequence {Qn} of convergence sets is determined by the
pair (Q 1,0,2)- We say that (fii,^) 1S a Paip °f twin convergence sets,
or, for short, that Q1? Q2 a*e twin convergence sets.
Without loss of generality we shall restrict ourselves to continued frac-
fractions 60 + K(an/6n) with 60 = 0 in this section.
110 Chapter III. Convergence criteria
3.2 Value sets
To determine whether a continued fraction K(an/6n) converges or not,
we have the following tool:
Definition. We say that {K,}^L0 is a sequence of value sets for
K(an/bn) if all Vn CC;Kn/0 and
sn{Vn) = . a"r C Vn-! forn = 1,2,3,... . C.2.1)
bn -f Vn
(In the literature {Vn} is often referred to as pre value sets.) The im-
importance of value sets lies mainly in the fact that they contain values of
approximants of JC(cLn/bn):
Theorem 8 Let {Vn} be a sequence of value sets for H(an/bn). Then
Kk '(wn+k) = ? Vn C.2.2)
for all wn+k 6 Vn+ki for w = 0,1,2,... and k = 1,2,3,
In particular S^w^) = S^. (w^) G Vq if Wk G V/fc. Theorem 8 is a simple
consequence of C.2.1) and the fact that S^ = sn+i o sn+2 ° • • ¦ ° Sn+k-
If 0 G Vn for all n, then S^\o) G Vn for all k by Theorem 8. That
is, Vn contains all the classical approximants fk of J^(an/bn). In this
case we say that {Vn} is a sequence of classical value sets for K(an/&n)-
The reason for this is that historically the emphasis has mostly been on
classical approximants, and thus, when one referred to value sets or value
regions, one always meant sets containing the classical approximants.
See for instance [JoTh80, p. 64]. (For information on the historical
development of this concept we refer to the section of remarks at the
end of this chapter.)
Following the classical ideas, we say that V is a value set for K(an/6n)
if {Vn}, where all Vn = V, is a sequence of values sets for K(an/^n)- If
Techniques to prove convergence 111
{Vn} is 2-periodic such that V-m — Vo and V*2n+i = V\ for all n, we say
that T'o, Vi are twin value sets for J<L(an/bn).
Assume that K(an/6n) converges generally to some value / G C. Will
then / G Vo? Here and in the rest of the book A denotes the closure
of the set A in C. We shall try to avoid confusion with the complex
conjugate z of a complex number z.) The answer is of course Yes if
Sn(un) —> / f°r some sequence {wn} from {Fn}; i.e. wn € Vn for all n.
According to Theorem 4 in Chapter II it suffices that we can find two
sequences {pn} and {qn} from {Vn} such that
hm'mf d(prnqn) > 0,
where d(u, v) denotes the chordal metric on the Riemann sphere. In
that case we can always construct a sequence {un} from {Vn} such that
Sn(un) ~* /• We just use the same technique as in the proof of Corollary
5 in Chapter II. This proves the first two statements in the following
theorem. We shall return to the proof of Theorem 9C after having seen
some examples.
Theorem 9 Let {Vn} be a sequence of value sets for the generally con-
convergent continued fraction K(an/^n) such that Vq ^ C and
liminf diam^V,,) > 0, where
n->°° C.2.3)
61amd(Vn) = sup{d(u,v); u,v G Vn} .
Then:
A. The value f of K(an/6n) is contained in Vy.
B. The value /(n) of the nth tail ofj?(an/bn) is contained in Vn.
C. Hmn _> oc Sn(wn) = f for every sequence {wn} such that wn G Vn
and
liminf distd(wn,dVn) > 0, where
"-*00 C.2.4)
dist d(wn, dVn) = inf{d(wn, v); v G dVn} .
112 Chapter III. Convergence criteria
Here dVn denotes the boundary of the set Vn, (and Vn denotes the closure
of Vn in C).
Example 9 The unit disk
V = U = {weC; \w\ < 1} C.2.5)
is a (classical) value set for the Sleszynski-Pringsheim convergence set
C.1.1). This can be seen from the proof of Theorem 1 in Chapter T (or
by direct verification).
We know that every continued fraction from the Sleszynski-Pringsheim
set C.1.1) converges to some value / ? V since 0 ? V and thus 5"n@) ? V
for all 72. So in this example Theorem 9A, B does not bring anything
new. From Theorem 9C we can conclude that not only the classical
approximants of such a continued fraction from C.1.1) converge to the
value /, but
Bm Sn(w) = f for all w ? V . C.2.6)
From Stieltjes-Vitali's theorem, to be presented later (Subsection 3.6),
we can conclude that the convergence in C.2.6) is uniform on compact
subsets of V. This was mentioned in a remark to the Sleszynski/Prings-
heim theorem in Chapter I.
The Worpitzky disk C.1.2) has the value set
V = {w ? C; H < 1/2}. C.2.7)
Hence, every continued fraction K(an/1) from the Worpitzky disk has
approximants Sn(w) converging locally uniformly to a constant function
in this V. This was also mentioned in Chapter 1.
O
Example 10 A value set for the Van Vleck sector C.1.4) is
V(, = Gc = lw e C; |arg™| < | - e\ . C.2.8)
(See the proof of Theorem 2 in Chapter I.) Since fc/1+i ? G( =>> l/6M+i ?
V<, we find from Theorem 8 that S^\l/bn+k+i) = 5^,@) = /?? ? V(
Techniques to prove convergence 113
for all n > 0 and k > 0. Hence every convergent continued fraction
K(l/6n) from Gc converges to a value in Vcy and A and B in Theorem 9
are obvious. From Theorem 9C we find that if ~K(l/bn) is a convergent
continued fraction from GCi then C.2.6) holds with V = V(.
O
Other examples where Part A and B are no longer obvious will come
later.
Proof of Theorem 9C: Let t0 E C \ Vu. Then t0 ^ /, and the
tail sequence {tn}; tn — S~l(to) is not a right tail sequence. Since
Sn{Vn) C V() it follows that Vn C S'^Vo) and thus that
S (C \ Vo) C C \ Vn . C.2.9)
Hence tn G C \ Vn for all n. All tail sequences {tn} which are not the
right tail sequence of K(an/&n)? have the same asymptotic behavior in
the sense that d(tn, in) —> 0. (See Theorem 6 in Chapter II). Hence {wn}
stays "sufficiently far away" from all such "dangerous" tail sequences in
the sense of Theorem 4 in Chapter II, and thus Sn(wn) —» /. ¦
Value sets are in no way unique. This is illustrated by the following two
examples:
Example 11 Let {?n}^o ^e a *a^ sequence for K.(an/bn). Then
{V),}J^_0, where Vn is the one-point set containing ?„, is a sequence of
value sets for K.(an/bn).
O
Example 12 We shall find some value sets for the 3-periodic continued
fraction in Example 8. Combining results in Example 2 in Chapter II
with results in Example 11 above, we find that
}, forn = 0,1,2,...
114
Chapter III. Convergence criteria
is a sequence of value sets. Similarly
W3n = {0}, W3n+i={oo},
is another such sequence. Still another one is given by
for n = 0,1,2,...
UAn =
C;
w- -
<
-j
U3n+i = {we C; |ti; - 3| < 1}
; + -
O
5.5 Va/we se? techniques I. A posteriori truncation error bounds
In Formula A.4.5) of Chapter II we proved that if /n, /n_i and /„+& are
distinct and finite, then
fn+k - fn =
7^
+ fk
" ~ /n-l)
C.3-1)
for 7i, ^ G N. But when are /n?/n-i and /n+jt distinct and finite? The
following theorem gives a very simple criterion:
Theorem 10 Let {Vn} be a sequence of classical value sets for J^(an/bn)
such that oo ^ Vn for all n. Then all the approximants fn ofK.(an/bn)
are distinct and finite.
Proof : Since /„ € V^ an(l °° i K)» it follows that /„ ^ ex) for all n.
That /„ ^ fn+k is a consequence of the determinant formula: Assume
that fn = fn+k for some k > 1; i.e. that Srl@) = 5n(/^n)). Then 0 =
f1' (which rules out the case k — 1), and thus fjen_l ' = Sn+iifk ) =
fjf (which rules out the case k — 1), and thus fje_l = Sn+iifk ) =
-bn+\ -\-an+l I fl = oo which is impossible since fj^_l ' G Vri+i whereas
Techniques to prove convergence 115
To derive useful a posteriori truncation error bounds from C.3.1) we
want to estimate the factor /^ /(hn + fj. )• We shall show how this
can be done by means of value sets in the special example where the
continued fraction has the form K(an/1) and has a bounded classical
value set V. Then /^n) G V for all n and fc, and by A.2.7) in Chapter II
1 + 1 H hi
forallneN. C.3.2)
Example 13 Let E = {w G C; \w\ < g(l-g)} and V = {w e C; \w\ <
g} for a positive number g < 1/2. Then V is a classical value set for E
since 0 G V and
a
1 + w
For g = 1/2 this is exactly the Worpitzky situation.
Let g < 1/2. Since hn e 1 + V hy C.3.2) and /f/l) G V\ we then have
that |/fn)| < ^ and \hrl + /^n)| > 1 - 2g, and thus by C.3.1)
|/-/n|<T-Vl/n-/n-l| fom = 1,2,3,... .
1 2y
In Example 3 in Chapter II we also considered continued fractions
K(cn/1) with \cn\ < M < 1/4. (The notation differed slightly.) We
found that
2M
Our new truncation error bound is slightly better since M = g(\ — g)
implies that g = A - \/l - 4M)/2 so that
1 - Vl - 4M 4M 2M
- AM 2vT -4MA + Vl -4M) 1 - 4M
For M = 0.2 we have for instance
—— = 0.382 ,
1-4M
Numerically this improvement is not much worth, of course. As is often
the case, we can find reasonable truncation error bounds by using rough
116 Chapter III. Convergence criteria
estimates, and we gain very little by careful refinements (unless we can
pull in some new factors going to zero. We shall return to this point in
Section 5).
O
The crucial points are really that /M = lim* —> oo fl ^ ^ an<^ tnat
hn ? 1 + V. If we turn to the more general situation where {Vri}^.0 is
a sequence of value sets for K(an/^n)> we still have that /(") ? Vn if the
Vris are "large enough" in the sense of Theorem 9. But we loose control
over
hn = bn + 7 - —. C.3.3)
On-l +0n-2-\ h Oi
We find, though, that if (Vo, Vi) is a pair of classical twin value sets for
K(an/1), then h2n ? 1 + Vi and /i2n+i ? 1 + Vo for n > 1.
The same idea also works if V is a classical value set (or Vy, V\ are
classical twin value sets) for a continued fraction of the form K(l/6n).
Then sn(V) = l/Fn + V) C V and (l/6n) ? V for all n and therefore
hn € bn-\-V C 1 fV. Hence, if V is bounded, say |w| < M for all w ? V,
and
infflz + 2/|; a; ? V, y ? 1/F} = rf > 0 , C.3.4)
then
M
|/n+m - /n| < "T^" ~ /n-1 I • C.3.5)
3.4 Value set techniques II. A priori truncation error bounds
Let {VnJJJLo be a sequence of value sets for K.(an/bn). Then it follows
from the definition C.2.1) that if wTl ? Vn then
Sn(wn) € Sn(Vn) = 5n_,(«B(Vrn)) C 5n_,(Vn_l), C.4.1)
so that Kn = Sn(Vn) forms a sequence of nested closed sets, V0D K\ D
K-2 3 '" •• This sequence will therefore converge to a non-empty set K.
Assume first that K contains only one point, K = {/}. (The limit point
case.) Then / ? Kn for all n, and
|/-5n(ti;n)|<diain(jRrn)-0 \twneVn, C.4.2)
Techniques to prove convergence 117
so that Sn(wn) —» /. If now liminf diam(/(V/i) > 0, where the diameter
is measured by the chordal metric as in C.2.3), then J<i(an/bn) converges
generally to /. Bounds for diam(/fn) can be used as a priori bounds for
the truncation error |/ — Sn(wn)\.
To keep the computation simple one often chooses Vn to be circular disks
on the Riemann sphere C. The following lemma may then be of help:
Lemma 11 Let D be a circular disk with center at c and radius r, and
let -b?D. Then
is a circular disk with center ca and radius ra given by
(b + c)a r\a\
|6 +
The proof is a simple exercise in mapping theory for linear fractional
transformations and is left out here.
Example 14 Let E and V be as in Example 13 for a fixed g, 0 <
g < 1/2. Then Kn = Sn(V) = s{ o 5, o ••• o sn(V) C V when all
Sk(w) — a/t/(l "I w) ai*d CLk € ^- By Lemma 11, Dnn = sn(V) is a
circular disk with center and radius given by
Furthermore Dnn_i = sn^\(Dnn) is a circular disk with center cn^n-
and radius
- g)
'n,n—
9
- 7-2
and so on. Observe that since Dn^ C V it follows that \critk\ + ^«,fc ^ ^>
and thus that |1 ¦+- cn^\ > 1 — |cri,A;| > 1 — <7 + ^rt,A:j which again means
that
cB,*|2 - r'lk > A - g + r-n,,J - <A: = A - gf + 2A - y)rniifc.
118 Chapter III. Convergence criteria
Therefore
- g) rTlykg{l - g)
2(rniib)max '
,K fmax
where (rn?fc)mtIvC is a positive number such that rntk < (^n,A:)/»ttx- This
means that
1 ff 1 1/2 1
r
2/6 10 '
V2 = JL
< < =
~ 10 l-y +2/10 "* 101/2 + 2/10 14'
and so on, and thus the radius Rn of Kn = 5n(V") is bounded by
Rn = ^n,i < 2 for n = 1,2,3,
Since therefore Rn —> 0, we have proved that E is a convergence set
(which we already knew from Worpitzky's theorem), and we have proved
that
1/ - 5n(«;)| < diam (Kn) = 2Rn < —-— for w ? V . C.4.4)
2n + 1
Tf g < 1/2 we can do even better. Then
r .
0 - ffJ + 2A-ff)rB*l-/B>
where g/(l — g) < 1. By the same argument we get R\ < g2/(l + g) and
- Sn(w)\ < 2Rn < -^- (yzt) ^oiweV. C.4.5)
If g < 1/2, the bound C.4.5) approaches 0 faster than C.4.4) and is
thus better.
O
Techniques to prove convergence 119
Example 15 We want to compute the value of
0.2em 0.2el 0.2e2j 0.2e:jl
K
1 1 + 1 + 1 +¦
with an error less than 0.05, as we did in Example 13, but this time we
want to determine the index n for 5n@) in advance by means of the a
priori truncation error bound C.4.5). We use g(l — g) = 0.2 so that
g - A - \/0^2)/2 < 0.28, and we require
29 ( 9 \ < 0.1225-0.39"-1 < 0.05
V f 9 \
1 + 9 \l-g)
which holds already for n — 2.
-O
3.5 Value set techniques III. The Hillam-Thron theorem
If the sequence of nested sets {Kn} in Subsection 3.4 converges to a
larger set K instead of a one-point-set {/}, the question about con-
convergence of {Sn(wn)} is a little more tricky. But the very fact that
limdiam(jfiTn) = d > 0 can have implications which lead to convergence
of the continued fraction. This is demonstrated in the following theorem
due to Hillam and Thron, [HiTh65]:
Theorem 12 (The Hillam-Thron theorem) Let V C C be an open
circular disk with 0 G V. If V is a value set for the continued fraction
K.(an/bn), then J!i(an/bn) converges.
Notice that both the Sleszynski-Pringsheim theorem and the Worpitzky
theorem are simple consequences of the Hillam-Thron theorem.
To prove Theorem 12 we shall use a value set technique, but not di-
directly on V. We shall rather prove the following lemma which has wider
applications:
120 Chapter 111. Convergence criteria
Lemma 13 Let U = {z E C; \z\ < 1} be the unit disk, and let 0 < k <
1. Le< {?„} 6e a sequence of linear fractional transformations such that
tn(U) C 17 and |*n(oo)| < k for alln, C.5.1)
and let Tx = i,,Tn = T^o^ = *! o*2 o---oin for all n. Then {Tn(w)}
converges locally uniformly in U to a constant function T(w) = c E U.
Also this can be found (in a slightly weaker form) in [HiTh65]. In this
setting U plays the role of the value set and {Tn} the role of the continued
fraction. We still have
Kn = Tn(U) C Tn_,(G) = Kn-x C - • • C & C.5.2)
where #„ now are circular disks, and we distinguish between the limit
point case where diam(iifn) —> 0 and what we can call the limit circle
case where diarn(jRTn) —* d > 0.
Proof of Lemma 13: Let Cn and Rn be center and radius of Kn for
each n. If Rn —> 0 then the lemma is trivial, so assume that Rn —» R > 0.
The nestedness C.5.2) then ensures that Cn —> C, the center of K. The
most general linear fractional transformation which maps U onto U is
given by
w — o.
S(w) — e1^ — where lu G R, a ? U.
1 — aw
(Here a denotes the complex conjugate of a.) Hence, we can write
7»l Qi
TJw) = Cn + Rne'"n z-^ for n = 1,2,3,..., C.5.3)
1 — anw
where all o;n E R and qm € 27. The plan now is to develop two in-
inequalities which together will prove our lemma. The first one is a result
of the condition tu(oo) E U, which leads to Tn(oo) = Tn-\[tn(oo)) E
Tn-i(U) = Kn-i\ that is, Tn(oo) = Cn-\ + Rn-\zn for some zn E <7.
Since therefore
- Cn = ^ = Crj_i + i2ri_|Zn - Cn ,
Techniques to prove convergence 121
we have
Rn
and thus
= \(Cn-\ — Cn) + Rn-\Zn\ < (Rn-l — Rn) + Rn-l , C.5.4)
2|aH| 1 - \an\
= 1
#n_t 1 +|an| l + |an
which gives us the inequality
n
1 — \ctj\
where 6j = —^ < 1. C.5.5)
The second inequality depends on the stronger condition that if kn =
tn(oo), then |A:n| < k for all n. We get
Tn+1(oo)-Tn(oo) = Tn
an(l - Q
so that
C.5.6)
- 2,,(oo)| < J^-YZy < —k(l - \an\
by use of C.5.4), and thus the inequality
« d m — I
|Tn+m(oo) - Tn(oo)| < ^ E A - ^
C.5.7)
Since J?n —» R > 0, it follows by C.5.5) that Yl $j < °°- Hence, by
C.5.7) {Tn(cx))} is a Cauchy sequence and thus converges to a value
c e K. Since by C.5.3)
\Tn(w) - Tn(oo)| = fln
w-an L Rn 1 - la
+ —
- anw an
2
\a
n
r=rr> C-5.8)
where \an\ —> 1 (since 6n —> 0), this actually proves that Tn(w) —> c
locally uniformly in C \ dU in the limit circle case. (dU is the boundary
of U\ i.e. the unit circle.) ¦
122 Chapter HI. Convergence criteria
Proof of Theorem 12: Let 7 and p be the center and the radius of
dV, and let t(w) = pw + 7 so that t(U) — V. Then tn = t o sn o t
maps U into G and
tn(oo) = r1 o «n(oo) = rJ(o) = --
for all n, where |7/p| = k < 1 since 0 E V. Hence Tn(u>) converges
locally uniformly in U to a constant function T(w) = c ? U. Now
Tn = ti o ?2 o • • • o tn — t~* o S\ o t o t o 52 o t o • • • o t~l o sn o t
= t~* o s\ o s-2 o > > - o sn o t = t" o 5n o t. C.5.9)
Hence Sn(w) converges for all w ? t(U) = V to the constant function
= / where r'(/) = q i.e. / = t(c) eV. m
We can also prove a slightly more general result by means of Lemma 13:
Theorem 14 Let {Kj^Lu be a sequence of value sets for K{aTi/bn)
consisting of open circular disks with centers 7n and radii pn such that
\jn/Pn\ < k for all n for some k < 1. Then "K(an/bn) converges to a
value / G Vu-
Remark: Since 0 € Fn for all n, it follows that {Vn} are indeed clas-
classical value sets for K.(an/bn).
Proof : Let rn(w) = jn -f pnw so that rn(U) = Vn for all n. Then
tn = r~_|, O5n orn maps U into {/ and tn(oo) = ^,@) = -fn-i/Pn-i-
The result follows by the same line of argument as Theorem 12 since
Tn = t\ O ?2 ° " " * ° tri
= Tq* OS] O Ti OT,"' O 5-2 O T2 O • • • O T~\ O Su O Tn C.5.10)
= T~loSnOTn.
Theorem 14 and the Hillam-Thron theorem illustrate an important point
in the value set techniques to derive convergence theorems. One picks
Techniques to prove convergence 123
"nice" value sets {Vn}. And then K(an/bn) converges if all sn have
the "right mapping properties" sn(Vn) C Vn-\. Collections of such
continued fractions K(an/&«) can then be described by convergence sets
{nn}.
We shall return to this in Section 4.
3.6 Value set techniques IV. The Stieltjes-Vitali theorem
Theorem 15 (Stieltjes-Vitali's theorem) Let {fn} be a sequence of
holomorphic functions in a region D, such that
(i) there exist two points a, 6 ? C such that fn(z) ^ a,/n(z) ^ b for
all n and all z ? D, and
(ii) {fn{z)} converges to finite values for every z in an infinite set
ACD which has at least one point of accumulation in D.
Then {fn{z)} converges locally uniformly in D to a holomorphic func-
function.
We recall that the word "region" here (as it is generally in this book)
is used in the strict sense: open, connected set. This theorem is a
consequence of MontePs theorem for normal families. For the proof
we refer to text books on functions of a complex variable, for instance
[Hille62, p. 248-251].
The idea of application of this theorem is best explained if V is a classical
value set for an element region fi; i.e. Q C C x C is an open, connected
set and a/(b + V) C V for all (a, 6) ? Q. Assume that we know that
every continued fraction K(a«/6«) from a subset Qq C Q converges. We
can then introduce an auxiliary variable z such that
(an(z),bn(z)) ? n0 if-z? A,
(an(z),bn(z)) € n if z€D, C.6.1)
124 Chapter III. Convergence criteria
to get fn{z) € V for z ? Q, where fn is assumed to be holomorphic. So
if there exist two points a, b ? C which are not contained in V, then
limn —> oo fn(z) exists for all z ? D.
We can also follow the same idea if V is a non-classical value set for H.
Then we use modified approximants f*(z) = Sn(w,z) for some w 6 V
and consider general convergence.
In Chapter I we presented Van Vleck's convergence theorem in Theorem
2. We promised to return to the proof. We shall prove this result now
by means of the Stieltjes-Vitali theorem, by extending the convergence
result in Theorem 3 for continued fractions with positive elements. The
proof is taken from [JoTh80, p. 89].
Proof of Van Vleck's theorem: We want to prove that Gc given by
C.1.4) is a conditional convergence set. We know that V^ = Gc is a value
set for Gt. (See C.2.8).)
Let K(l/6n) be an arbitrarily chosen continued fraction from G(. If
\bn\ < oo, then K(l/fcn) diverges, so assume that ^ |fcn| = oo. Let
Pn = arg(fc,t) and dn(z) = \bn\ei0nZ for all n. C.6.2)
Then dn(z) <E G(/2 if |arg(|6n|c?/3»z)| < tt/2 - c/2; i.e. if \/3
it 12 — e/2 where |/3n| < tt/2 — e. Hence
dn{z)€Gf/2 UzeD = {zeC;\M(z)\ < ^f^} • C.6.3)
Moreover
dn(z) > 0 if z ? A = {z ? C; &{z) = 0} . C.6.4)
Since
Y, dn{>) = E \K\e-p"^ > e-^W ? |6n| = oo ; * 6 A ,
it follows from Seidel-Stenrs theorem, Theorem 3, that ~K(l/dn(z)) con-
converges for z 6 A; i.e. that the classical approximants fn{z) of K.(l/dn(z))
converge for z ? A. By Stieltjes-Vitali's theorem it follows therefore that
K(l/^n(^)) converges for all z (E D. In particular it converges for z — 1.
Hence ~K(l/bn) converges since 6n = dn(l). ¦
Techniques to prove convergence 125
The idea in this subsection can be extended to sequences {Vn} of value
sets for sequences {Hn} of element regions.
3.1 Smaller value sets for truncation error bounds
If we want to use value sets to estimate truncation error bounds, as we
did in Subsection 3.3 and 3.4, we really want these sets to be "as small
as possible" to obtain best possible bounds. In some cases we can then
use:
Lemma 16 Let {Un} and {Wn} be two sequences of value sets for
K(an/6n). If Vn — Un fl Wn / 0 for all n, then {Vn} is also a sequence
of value sets for K.(an/bn).
Proof: If an/(bn + Un) C ?/"„_! and an/(bn + Wn) C Wn-U then
an/(bn + Unf\ Wn) C Un-i n Wn-i- ¦
Lemma 17 Let W^W\ be twin value sets for K(an/6n). Then:
A. If all bn = 1 and
Vb = Wb\(-l-Wi)^0, Vl = W, \(-l-Wb)^0, C.7.1)
then Vq,V\ are also twin value sets for K(an/1).
B. If all an = 1 and
Vo = Wo\ (-1/Wx) + 0 , Vi = Wx \ (-1/Wo) / 0, C.7.2)
Vqi V\ are also twin value sets for K(l/6n)-
Proof: A: In view of Lemma 16 it suffices to prove that Uq = C \
(-1 -Wl)&ndUi=C\ (-1 - Wo) are value sets for K(an/l); i.e. that
«2n(C/b) = . ,2r!r C C/", and s2n+i{U\) = 2"+l C C/"o,
126 Chapter III. Convergence criteria
or, since sn is bijective, that
*2n{C\U0)DC\Ul and j2n+l(C \ Ux) D C \ Uo ,
i.e.
5-1-W^o and
i.e.
-i - iy, d ^(-i - w0) = -
and
But this follows directly from the fact that Wq, Wi are twin value sets
for K(aTl/l).
B: This part can be proved in a similar way, observing that if s(w) =
1/F + w), then s~l(w) = -b + 1/w. m
4 Convergence results
4-1 Two useful lemmas
Value set techniques give results on general convergence for continued
fractions. If the value sets do not contain the classical approximants
from some n on, it may be difficult or even impossible to prove classical
convergence. This is not so if K.(an/bn) has the form K(an/1) or
and {Vn} is either 1-periodic or 2-periodic and bounded.
Lemma 18 Let (Vb,Vl) be a pair of bounded twin value sets for the
generally convergent continued fraction K(an/6n), and let Vq (or V\)
contain at least two points. Then K(an/bn) converges in the classical
sense if either all an = 1 or all bn ~ 1.
Convergence results 127
Proof : Let / be the value ofK(an/6n), and observe that both Vq and
V\ must contain at least two elements since si(Vi) C Vq and s>(Vo) C V{.
It follows therefore from Theorem 9A that / G Vq and thus / / 00. For
a given i0 G C we recall from Theorem 3 in Chapter II that the tail
sequence {tn}™=0 for K(an/6n) is given by
' I4'1'1)
o) -
Assume first that all 6n = 1 and choose to ? — 1 — Vi,to / /. Then,
by D.1.1), tn 6 —1 — V(n+i)moj2 f°r all n, and is thus bounded. From
properties of tail sequences of generally convergent continued fractions
(Theorem 6 in Chapter II) it follows therefore that limsup|/in| < oo.
This in turn means that liminf rf(oo, — hn) > 0 so that Iim5n(oo) = /
by Theorem 4 in Chapter II. This proves the classical convergence since
5n(oo) = Sn_i@).
Assume next that all an = 1, and choose to 6 — 1/V\,to ^ /. Then
tn e -bn - V(n)morf2 ^ -l/V(n+])mod2 by D.1.1). Hence liminf \tn\ > 0.
By the same argument as above we now get that liminf \hTl\ > 0, so that
liminf <2@, — hn) > 0 and thus lim5n@) = /. ¦
This first lemma gave a method to conclude classical convergence from
general convergence. The next lemma shows a method to conclude clas-
classical convergence from the convergence of the even and the odd part of
a continued fraction. It is due to Lane and Wall, [LaWa49].
Lemma 19 Let ~K(an/bn) have finite approximants fn = 5n@) satisfy-
satisfying
oo oo
Yl \f-2n+2 ~ f2n\ < OO and ]T |/2n+l ~ /2n-l | < OO . D.1.2)
n=l n=l
Then K(an/^n) converges if and only if its Stem-Stolz series
oo
E
n=l
n
bn
Lk
k=l
D.1.3)
diverges to oo.
128 ChapteT III. Convergence criteria
Remarks:
1. The condition D.1.2) implies that XX/2n+2 — /2n) and
/2n-i) converge to finite values; i.e. that fm —* Lq ^ oo and
/2n+i —» L\ / oo. When D.1.2) holds, we say that the even and
odd parts of K(art/&n) converge absolutely.
2. From the proof of the Sleszynski-Pringsheim theorem in Chapter
I, we find that
oo
A a OO ,-m
OO
?n|J
that is, if the unit disk is a value region for K(an/6n), then
converges absolutely.
Proof of Lemma 19: Since the approximants fn and the Stern-Stolz
series D.1.3) are invariant under equivalence transformations, we may
assume that K(a«/&n) has the form K.(l/bn). We use the standard
notation /„ = An/Bn and hn = —S~[(oo) = Bn/Bn-i. Since /n ^ oo,
it follows that Bn / 0 and thus hn ^ oo,0. We shall see later (Formula
C.3.4) in Chapter IV combined with Formula A.1.7) in Chapter II) that
then An can be written
ft
(;-*;) ft h
A:=l \j=2 j=k+\
for all n > 1, where the empty sum is 0 and the empty product is 1. Since
#! = 6, we have Bn = Bx YYj^Bj/Bj-i) = 6, n"=2 hj> ^d therefore
itP where P, =
and
6; —
V
1A
l/ftj-
Convergence results
129
By Theorem 1, the Stern-Stolz theorem, we know that K(l/6n) diverges
if 2 l^n| < oo. So, assume that J^ l^nl = oo. We want to prove that
{fin} and {/2n+i} have a common limit Lo = L\\ i.e. that
fn+l — fn — —T
0.
D.1.7)
By D.1.2) and D.1.5) we find, using D.1.6), that
oo
OO
El,
|/2n+2
n=l
OO
OO
n=\
< OO,
and similarly
OO OO
| hn+1 — fin- 1 | =
n=\
n=\
2n+l°2n+l
< OO
Now, if J^l^nl = oo and Xll^n^rJ < oo, then Pn —> 0 which proves
D.1.7). Hence we only need to prove that Yl |^n| = oo. We have
fc-1
so that
h'2n-2 =
D.1.8)
and similarly
b>
w| < °°» then
D.1.9)
< oo by D.1.8)-D.1.9), a contradiction.
Hence
130 Chapter III. Convergence criteria
Example 16 In Remark 2 to the Seidel-Stern theorem, Theorem 3, we
noted that "K(an/bn) converges if and only if its Stern-Stolz series D.1.3)
diverges to oo if all an > 0 and 6n > 0. How does this relate to Lemma
19?
We shall first prove that the even and odd parts of K(an/6n) converge
absolutely when all an > 0 and 6n > 0. From Theorem 2 we know that
{/2n} is a bounded, monotonely increasing sequence. Hence f2n —> Lq ^
oo and
oo oo
/ , |/2»+2 — I2n\ = /^{Jln+l ~ Jin) = ^0-
n=l n=l
Similarly, {/2n+i} is bounded and monotonely decreasing, so /2M+1 —>
L\ / 00 and
00 00
2^ l/2n+l - fln-\ I = - 2^(/2n+l ~ /2n-l) = ~L\ + /l ¦
Hence, by Lemma 19, I?(an/bn) converges if and only if D.1.3) diverges
to 00, i.e. Q = {(a, 6) € C x C; a > 0 and 6 > 0} is a conditional
convergence set.
O
4.2 Parabola Theorems
The value set techniques described in this chapter are all based on map-
mapping properties of the linear fractional transformations sn and 5n. The
convergence set or the element set is the set of all (an, 6n) which give s t
the wanted mapping properties. Since linear fractional transformations
map circles and lines into circles and lines, it is nice to work with value
sets that are circular disks, the exterior of circular disks or halfplanes.
In this subsection we shall work with value sets that are halfplanes.
The parabola theorem is a convergence theorem for continued fractions
K(an/l):
Theorem 20 (The parabola theorem) Let a be a fixed, real num-
number, -tt/2 < a < 7r/2. Then:
Convergence results 131
A. The parabolic region
Pa = ia? C; \a\ - »(ac"?2a) < \. cos2 a\
; 0 < r < 2 "^ ° \ D.2.1)
1 - cos 0 J
= jc2 <E C; |S(ce-'a)| < ^ cos a|
25 a conditional convergence set for continued fractions K(an/1).
B. The even and odd parts of continued fractions K(a«/1) from Pa
converge to finite values.
C. The half plane
Va = \w e C; $l(we-ia) > -^cosaj D.2.2)
w a value set for Pa.
D. If an 6 Pa for alln and Yl(n\an\)~l — °°; then {Sn(w)} converges
uniformly in Va to a value f. In fact,
forallweVa. D.2.3)
< „
I cos a
E. The set Ea^i = {a ? Pn; \a\ < M} is a uniform convergence set
for continued fractions K(an/l).
Remarks:
1. The boundary of Pn is a parabola with axis along the ray arg z —
2a, focus at the origin and vertex at — (l/4)el2a cos2 a. It intersects
the real axis at z = —1/4. (See Figure 1.)
132
Chapter HI. Convergence criteria
Figure 1.
2. Since oo ^ Fa and 0 G Va, it follows that all approximants Sn@) =
An/Bn of a continued fraction from Pa are finite. In particular all
3. It follows from Part 13 that if K(an/1) from Pa converges, then it
converges to a finite value.
4. In Part C we actually prove the stronger result that a/(l + Va) C
Va if and only if a ? Pa-
5. The truncation error bound D.2.3) is also valid in Part E.
6. The parabola theorem generalizes the Worpitzky theorem, since
the Worpitzky disk E = {a ? C; \a\ < 1/4} C PQ.
Convergence results
133
Sketch of proof: We shall first prove part C and B, and then use
these to prove part A. Finally we look at part D and E.
C: The mapping s(w) = a/(l + w) maps Va onto the closed circular disk
\
Figure 2.
with center at 7a = (ae~ia)/ cos a and radius pa = |a|/cosa. This disk
is contained in Va if and only if 7a G Va and 7a has a distance da to dV&
such that da> pa. Now
ii ~*~ ~— f*c\^ f\ —I— mi I ^v & I
2
where da > 0 if and only if 7a G Va. (See Figure 2.) So da > pa if and
only if
,—tot
- cos a + tfi I e
2 V cos a
cos a
i.e. if and only if a (E Pa.
B: We define the linear fractional transformation
—1 + ela cos a — w
l(w) = TTT^ •
D.2.4)
134 Chapter HI. Convergence criteria
Then t maps the closed unit disk U onto V^ with t(oo) = —1 and
t(—1) = oo. Let K(an/1) be from Pa and let
>-1
— t ° S2n-l ° S2n °t(w) for 71 = 1, 2, 3, . . . . D.2.5)
Then, tn(U) = t~x o «2n_, o s2n(FQ) C r1 o s2n_,(Fa) C r
J7, and tn(oo) = t~x o s2n_, o s2n(-l) = *"' ° s2n-i{oo) =
— 1 + e"* cos a = k ? G. It follows therefore by Lemma 13 that Tn(w) =
t\ o?2 o • • -o?n(w) converges locally uniformly in U to a constant function
T(w) = ceU. Since
Tn = tj o t2 o • • • o tn
— (t~x o s\ o 52 o t) o (t-1 o 53 o s4 o t) o • • • D.2.6)
we therefore have that 52n(if) —> t(c) = Xq for all w ? ?
and in particular 52n@) —* Lq. To see that Xo 7^ 00, we observe that
S2n@) G ^(V^)^ ^2n C s,(Fa) for all_ n, so that L0
Further, 00 ^ 5|(^) since $7'(oo) = -1 ^ Fa.
Similarly one can also prove that 52n+i@) —> L\ ^ 00.
A: It follows by Theorem 1 that if K(an/1) converges, then its Stern-
Stolz series diverges. We need to prove that if K(an/1) diverges, then
the Stern-Stolz series converges. As before we let Kn = Sn{Vn) so that
Kn+i Q Kn Q • • • ^ si(Fa), where S\[V(k) is bounded. If diam(iTn) —> 0,
then K{an/1) converges, so assume that diam(.K') > 0 where K —
lim Kn. Then the radius Rn of Tn(U) = f'o S-2n o t(U) must converge
to an R > 0; i.e. we have the limit circle case for {Tn} in D.2.6). We
shall prove that
00
? |52n+2@) - 52n@)| < 00. D.2.7)
Since {Tn} satisfies the conditions of Lemma 13, it follows by C.5.6)
and C.5.8) that ? |Tn+,(oo) - Tn(oo)| < 00, and thus by C.5.8) that
?|!Tn+iM - Tn(w)\ < 00 for \w\ ? 1. In particular "?\Tn+1(w) -
Tn(w)\ < 00 for w — — 1 -f el"cosa. D.2.7) follows therefore, since
5,4@) = t o Tn(-1 + e.in cos a) ? «i(Fa) which is bounded.
Convergence results 135
In a similar way we prove the second condition in D.1.2), and the result
follows from Lemma 19.
D: It is possible to prove by elementary methods that the radius Rn of
the circular disk Kn = Sn(Va) satisfies the inequality
Rn< n , |ai|/cOS^ forn = 1,2,3,.... D.2.8)
([Thron58].) This proves Part D.
E: This follows directly from D. ¦
The parabola theorem is in several ways the queen among the conver-
convergence theorems for continued fractions K.(an/1). It is best in several
respects. In particular one can not enlarge the conditional convergence
set Pa, not even by adding just one point, without destroying that prop-
property; i.e. if a $¦ PQ, then {a} U Pa is not a conditional convergence set,
[Lore].
Example 17 The continued fraction
sc xxx sc/2 sc/4 sc/4 sc/4
was discussed in Chapter I, Example 4. Let
^arg(z) if |arg(z)| <tt,
a = <
0 if I arg(sc)| = 7r.
Then sc/4 ? Pa if | arg(sc)| < ?r, and thus K(sc/2) converges. If sc < 0,
then sc/4 ? Po if and only if |sc| < 1. Hence K(sc/2) converges in the cut
plane D = {sc ? C; | arg(l + sc)| < tt} U { — 1}, just as proved in Example
5 in Subsection 2.3, and illustrated in Figure 2 in Chapter I.
From D.2.3) we further get the a priori truncation error bound
X\ / tt / COS
cos a / +\ \ (j — l)|sc
3=2
136
Chapter III. Convergence criteria
which shows that K(sc/2) converges locally uniformly in D. We can not
expect that this bound is the best possible, since we have not taken into
account the special periodic character of K(*c/2).
O
Example 18 In Example 5 of Chapter I we presented the continued
fraction expansion
Let
anz
1
, where
71
2Bn-
a
z
iI
= <
2/2
» O>2r
0
2/6 22/6
1 + 1
n
1+1 ~ 2Brc
SM ^ 1
if|
22/10
+ 1 +
+ 1) f°r
arg(z)| <
argB)| =
32/10
1
71= 1
^¦»
+ ¦••
, 2,3,
Then anz G Pn if |arg(,z)| < 7r. If |arg(,z)| = 7r, then anz G Po if and
only if \anz\ < 1/4. Hence, anz G Pn for all n > 2 (with our choice
of a) if z is in the cut plane D\ = {z ? C; | arg(>z -f 1/2I < tt}, and
thus ~K(anz/l) converges for all z G i^i • We can even conclude that this
convergence is uniform on compact subsets of Di, since for every z ? D\
there exist a neighborhood Bz and a permissible a such that Bz C Pai
and the result follows therefore from Theorem 20E. (Keep in mind that
every compact subset of D \ can be covered by a finite number of such
neighborhoods Bz.)
O
We can improve the result in Example 18 if we let Va vary with 71 as in
the following theorem by Jones and Thron, [Thron58], [JoTh68]:
Theorem 21 (The parabola sequence theorem) Let a be a fixed
number, —ir/2 < a < ir/2, 0 < go < 1 and 0 < gn < 1 for n — 1,2,3,
Then:
Convergence results 137
A. The parabolic regions
P«,n = {a G C; \a\ - ^R(ae't2n) < 2<7n_1(l - <?n)cos2a} D.2.9)
for n = 1,2,3,... form a sequence of conditional convergence re-
regions for continued fractions
B. The even and odd parts of continued fractions K(an/1) from {Pa,n}
converge to finite values.
C. The half planes
V^n = {weC; ft(we-ifx) > -gn cos a} D.2.10)
for n = 0,1,2,... form a sequence of value sets for {Pa,n}-
D. If an G Pa,n for aM nf then
If I
\f - Sn
— 71
- /
for all w G V^>n and n G N, where
m=0k=\
E. If
OO m
En — {a G P(x,ni \a\ < M} is a uniform sequence of conver-
convergence sets for continued fractions K(an/1).
(Part A, B and C follow from [JoTh68, Theorem 5.1] with tpn = a and
Pn = 9n cos a for all n. Part D and E follow from [Thron58].) The proof
follows the same pattern as the proof of Theorem 20. Notice that also
here all canonical denominators Bn ^ 0 for continued fractions ~K{an/l)
from {jPa,n} since oo
138 Chapter III. Convergence criteria
Example 19 Let K.(anz/1) and a be as in Examle 18. If z is real and
negative, we now have that anz G Po,« if and only if \anz\ < flrM_i A —</«).
Let us choose
g2n-\ = n/Bn - 1), g2n = 1/2 for n = 1,2,3,... .
Then an = yn_i(l — <7n) for all n > 2. Hence K(an2/1) converges
locally uniformly in the cut plane D = {z G C; | arg(,z + 1)| < 7r} to a
holomorphic function.
O
4.3 S-fractions
S-fractions, or Sticltjes fractions, are continued fractions of the form
K.(auz/1), where all on > 0 and z is a complex variable. Example 17
and 18 were studies of some special S-fractions. The parabola theorems
are well suited to prove convergence of such continued fractions. But we
also have:
Theorem 22 The S-fraction ~K(anz/l) where all an > 0, has the fol-
following properties.
A. Its even and odd parts converge locally uniformly in D = {z ?
C; |arg(,z)| < tt} to holomorphic functions.
B. It converges to a holomorphic function in D if and only if the
Stern-Stolz series
OO n
e n Ki(-o"~k+l D-3.1)
n=l A.— 1
o/K(an/l) diverges to 00.
C. It diverges for all z € D if the series D-3.1) converges.
This result is due to Stieltjes [Stie94], but it is also a corollary of The-
Theorem 21. In some cases we obtain better results by using the parabola
theorems, such as in Example 17, 18 and 19.
Convergence results 139
A very nice consequence of Theorem 22 is that an S-fraction converges
to a holomorphic function in D if and only if it converges at a single
point z G D.
Henrici and Pfluger [HePf66] have proved the following a posteriori trun-
truncation error bounds for S-fractions:
Theorem 23 (The Henrici-Pfluger truncation error bounds)
Let —7r < 6 < 7r. Then the sector
V = {weC;0< sgn@) • arg(w) < |0|} D.3.2)
is a value set containing the classical approximants of the S-fraction
~K(anz/l), where all an > 0 and arg(,z) = 0. If this S-fraction converges
to f(z), then
where fn{z) denotes the nth classical approximant of~K(anz/l).
Proof : anz/(l + V) C V, since for w G V
arg
which lies between 6 and 0. Since anz G V, it follows therefore by
Theorem 8 that fn(z) G V for all n > 1.
Let J4Tri = 5n(F) where Sn — S\ o s2 o • • • o sn and 5^(w) = Ofez/A +
Then JRTn is bounded by the circular arcs 5n(R+) and Su(Ro), where
Rq is the ray arg(w) = 6. The truncation error bound D.3.3) is derived
by a careful study of these convex, lens-shaped, closed regions Kn. For
details we refer to [IIePf66]. ¦
The value set V in D.3.2) can also be used to derive a priori truncation
error bounds for S-fractions J?(anz/l), [GrWa83]:
140 Chapter HI. Convergence criteria
Theorem 24 (The Gragg-Warner bounds) Let K(anz/l) be an S-
fraction with z = rel2cc for \a\ < 7r/2 and all an > 0. Then
A
\fn+m(z) ~ fn(z)\ < 2—— || ' D.3.4)
cos a ?^2 y/\ + 40*77 cos2 a + 1
/or n > 2 and m > 1.
Example 20 Let us once again turn to the continued fraction expansion
anz _ z z/2 z/6 2z/6 2z/lO 3z/lO
K~T~ I+T"+T"+"T"+ i + i +...
of log(l + z). (See Example 18 and 19.) This is an S-fraction, and it
converges, not only for | arg(,z)| < ?r, but for | arg(,z + l)| < 7r by Example
19.
We want to compute log(l + z) for z > 0. Then, by D.3.3),
- fn(z)\ < \fn(z) ~ fn-l(z)\ foTZ>0
which is consistent with results from Subsection 1.2. From D.3.4) we
get the a priori bounds
which for z — 1 is
2v/lT2-l y/l + 2/3-1^/1
+ 2 + 1 yi + 2/3 + 1 v/1 + 4/3 + 1
+ 4/5-1 VI + 4an - 1
+ 4/5 + lVl + 4an + 1
Hence |/A) - /3A)| < 0.0681, |/A) - /,A)| < 0.0142, |/A) - /5A)| <
0.0021 and so on. Compare these estimates to the table of the first
approximants /«A) of J?(anz/l) in Chapter I, Example 5.
O
Convergence results 141
4.4 Oval theorems
As pointed out in Subsection 3.7, we want as "small" value sets as
possible for a given continued fraction to derive as good truncation error
bounds as possible. For this reason, although we are happy with the
parabola theorem, we also want other value set results for continued
fractions K(an/1).
In the first theorem in this subsection, we let
? C; \w-C\ < R}, CeC, R>0 D.4.1)
be the value set, and we consider continued fractions K(an/1) where
an/(l + V) CV for all n. Since V is bounded, we can not allow — 1 G V,
so a necessary condition is |1 + C\ > R.
Theorem 25 (The oval theorem) Let C G C with $l(C) > -1/2
and 0 < J2 < |1 + C| be given. Then
E = {ae C; |a(l + C) - C(\l + C\2 - R2)\ + J2|a| < R(\l + C\2 - R2)}
D.4.2)
is a convergence set for continued fractions K(an/1), and
V = {w e C; \w - C\ < R}
is a value set for E. Moreover
- Sn(*>)\ < 2fl,,7^, \M» "l D.4.3)
M =
: w G V
> D.4.4)
J
for every continued fraction K(«n/1) from E andw ? V, where f is the
value o/K(fitn/l).
142
Chapter III. Convergence criteria
Remarks:
1. We shall see from the proof of Theorem 25 that the condition
3?(C) > —1/2 is necessary for the existence of an o 6 C, a ^ 0,
such that a/(l + V) C V.
2. We shall also see that o/(l + F)CF if and only if a G E.
3. The oval theorem also gives some new convergence criteria in the
sense that not every E is contained in some parabolic region Pa
from the parabola theorem. (See [JaTh86].)
4. If M < 1, then D.4.3) implies that E is a uniform general con-
convergence set with respect to V. M is always less than 1 if R <
-plane
Figure 3.
5. The boundary dE of E is called a Cartesian oval. If C = 0 it is
the circle which bounds
E = {a e C; \a\ < R(l - R)} , 0 < H < 1,
Convergence results 143
a convergence set known from Example 13 in Subsection 3.3. if
C/ 0 we define
4 = C(l + C) (l - ^) . D.4.5)
Then ?7 can be written
|<1ff}. D-4.6)
From this we see that the Cartesian oval dE is symmetric about
its axis through 0 and d, and that it is some kind of a "weighted
ellipse" with foci at a = d and a = 0 and vertices at
D47)
and
One can also prove that E contains the circular disk with center
at d and radius |v2 — d\ and that E is contained in the closed
circular disk with center at d and radius \v^ — d\. (See Figure 3
and [JaTh86].)
6. The oval theorem generalizes the Hillam-Thron theorem, Theo-
Theorem 12, for continued fractions K(an/1), since 0 ? V is no longer
required.
We shall not prove the fact that E is a convergence set in general. For
that we refer to [LoRu]. We shall prove the rest of Theorem 25, though,
in addition to the points mentioned in Remark 1 and 2.
Proof of Theorem 25: By Lemma 11 it follows that a/(I + V) is a
circular disk with center and radius given by
" ~
r
- B? ' "~ \l + C\2 - B? "
144
Chapter III. Convergence criteria
This disk is contained in V if and only if \C — ca\ + ra < R\ i.e. if and
only if a ? E.
Next we shall prove that E ^ 0 if and only if $t(C) > -1/2. From
Remark 5 it follows that E — 0 if and only if C ^ 0 and R and C are
chosen such that
If - H + ttv^ICI < i?r D-4-10)
is impossible for every F C. The left side of D.4.10) attains it minimum
for ( = 1- So D.4.10) is impossible if and only if R/\l + C\ > R/\C\; i.e.
< \C\ i.e. »(C) < -1/2.
Finally, to prove D.4.3), we observe that the circumference of #„ =
Sn{V) is given by
27rRn= f \S'n{w)\dw, D.4.11)
where S'n(w) denotes the derivative of Sn and Rn is the radius of Kn.
Since Sn = S\ o S2 o • - ¦ o sn we get by the chain rule that
s'n(w) = «ik
D.4.12)
where K;njU =
G ^ for fe < n and
——- = rbh=Lm D.4.13)
A + Wn,kY 1 + Wn,k
Combining D.4.12) D.4.13) we get D.4.3) since
n
= n
n-\
n
¦M
n-\
so that by D.4.11)
Convergence results 145
Example 21 How do the truncation error bounds D.4.3) turn out for
the S-fraction expansion of log(l + z) that we studied in the previous
examples? Again we choose z — \. Then ~K(anz/l) is a continued
fraction with positive elements such that 1/6 < anz < 1.
Of course we can not expect the bounds D.4.3) to be as good as the
Gragg-Warner bounds in Example 20 or the practical bounds we get
from Theorem 2 for continued fractions with positive elements. Those
types of upper bounds are "designed" for the particular S-fraction (or
continued fraction with positive elements) in question. The bounds
D.4.3) hold for all continued fractions from E.
On the other hand it is useful to get an idea of how well D.4.3) is
doing compared to these more specialized bounds. Since 1/6 < an < 1,
we want to choose C and R such that the element set E is symmetric
about the positive real axis with vertices V\ = 1/6 and v^ — 1. From
D.4.7)- D.4.8) we find, if 0 < R < C,
v, = A + C + R){C -R) = l, v2 = {l + C- R){C + R) =
b
i.e. R = 5/12 and C = (\/l45 - 6)/12, and
C + R x/145-1
M = = -7= « 0.479.
1 + C + fl ^145+11
Hence
1/A) - /»A)| < \ f^Srr) @.479)"-' « 0.706- @.479)"-'.
o?\vl45f 1/
That is, |/A) - /:l(l)| < 0.16, |/A) - /,A)| < 0.078, |/A) - /5A)| <
0.037, and so on.
O
Again we can do better if we allow V (and thus E) to vary with n:
Theorem 26 (The oval sequence theorem) Let Cn G C and 0 <
Rn < |1 + Cn\ be given for n = 0,1,2,... such that
\Cn-v\Rn < \l + Cn\Rn-l forn = 1,2,3,... . D.4.14)
146
Then
Chapter III. Convergence criteria
Vn = {w e C; \w-Cn\ < Rn} forn = 0,1,2,.
D.4.15)
is a sequence of value sets for
En = {a?C',\{)
< Rn-X(\l + Cn\2-R2n)} for n = 1,2,3,
If K.(an/1) is a continued fraction from {En} and all
n)\ < 2R0
D.4.16)
G Vkt then
Mk D.4.17)
where
= max
w
}¦
D.4.18)
The proof follows the same lines as the proof of Theorem 25 and will be
left out.
Example 22 How can we find better truncation error estimates for the
S-fraction expansion K(anz/l) of log(l -f z) by using D.4.17)? Again
we let z = 1. To keep things simple, we want to choose all Cn = C and
let Rn vary. Since
= 1, a2k —
1
k
1
4
and an —> 1/4, it seems reasonable to choose C such that C(l + C) = 1/4;
i.e. C = (y/2 — l)/2, and to choose Rn such that, except for ai, the
elements a^A- and «2A:+i are alternatingly close to the right vertex
Cn\ - Rn)ei2<Xn ,
where 2arl = arg(Cn_i(l + Cr,)) = 0, and the left vertex
D.4.19)
= <
if «„_
D.4.20)
Convergence results 147
A simple (but not the best) choice is for instance
Ro = 1.06 and Rn = for n = 1,2, 3,... .
4ri — 0.2
We then get
Mk - ~ — ~ ^= for A = 1,2,3,... ,
l + C + #/t (v5+l)Jfe + 0.4 ' ' ' '
and thus
0.5
That is, \f-S:i(w:i)\ < 0.20, \f - S4{wA)\ < 0.044, \f - S5(w5)\ < 0.009,
and so on.
O
Remarks:
1. If {Cn} and {Rn} are chosen such that liminfn —> ^ Rn/(l-\-\Cn\) >
0 and the right side of D.4.17) diverges to 0 as n —> oo, then
D.4.17) shows that {En} is a uniform sequence of general conver-
convergence sets with respect to
2. If all Cn = C and Rn = R, then Theorem 26 reduces to parts of
the oval theorem. The condition D.4.14) is then equivalent to the
condition 9ft(C) > —1/2 in the oval theorem.
3. If all C2n — Co,C2n+i = C\,R2n = #u>^2n+i = -fti> then (Vo, V
is a pair of twin value sets for the twin element sets (Eq,Ei).
Condition D.4.14) now implies that the centers Co, C\ must satisfy
the inequality
A = |1 + C0||l + Ci | - \C0Ci | > 0. D.4.21)
It is no longer sufficient that 0 < Rn < |1 -f Cn\. D.4.14) imposes
a second condition on the radii. The choice
Rx = |i + C| + |Cbi D'4-22)
where 0 < fi < A is one possibility.
148 Chapter HI. Convergence criteria
4. For period lengths k > 2, the corresponding expressions for A and
Rn are
A=fl\l + Cj\-f[\Cj\>0, D.4.21)
and
a A--2 m fc-2
Ji where AA_,= ? IIl^^l II \Cj\
71 m=-l j=U J=m+1
D.4.22)
and the other Ans are determined from A/t-i by cych'c shifts. For
instance, for k — 3 the three An are given by
Ao =
A, = |C2Co| + |1 -f C2||C0| + |14- C2III + Co|,
A2 =
Example 23 We want to find a priori truncation error bounds for the
continued fraction K(an/1) given by
3+ l/l2 4 4- 3/22 3 + 1/32 4 + 3/42 3 + 1/52
1 + 1 + 1 + 1 + 1 +¦¦•
in Example 14 of Chapter I. To keep things simple we choose Cm — 1 and
C2n+1 = 2 for all 7i, since then C271-1 A + C2n) = 4 and C2n(l + <?2n+1) =
3. Since we want an to the left of the vertex w2in given by
+ Cn - Rn) ,
(see D.4.19)) it follows that
R'2n = -——, Rin+i = ~——- for n = 0,1,2 .
In + 1 In + 1
is a possible choice. We get
C, -1 i?2t- ! 4t-l
= 1,2,3,
Convergence results 149
and
Co + R2k
which lead to the truncation error bounds
1
|/-S2n(ti/2n)| < 2
1
\f - S2n+l(w2n+l)\ < 2
2 - « 7 *=
2ti+1
1 2n
4 A:=l
2n+ 1
for wn G Vn- For instance
- S6(u/6)| < 0.24 , |/ - 57(«;7)| < 0.062 ,
which is not so very impressive compared to the table in Example 14 in
Chapter 1, of course. Still this type of bound is useful.
O
The oval theorem and the oval sequence theorem are results on continued
fractions of the form ~K(an/l). The only reason for this restriction was
to keep expressions like D.4.16) and D.4.17) as simple as possible. For
continued fractions ~K(an/bn) the oval sequence theorem takes the form
Theorem 27 Let Cn G C and Rn > 0 be given for n = 0,1,2,
Then {Vn} given by (J^.J^.15) is a sequence of value sets for
+ Rn\a\ < Rn-i{\b + Cn\2 - R2n)} forn = 1,2,3,....
D.4.23)
a«/&n) is a continued fraction from {fln} and all w^ € Vk, then
\Sn+m(wn+m) - Sn(wn)\ < 2^0M^V ft
where
Mk — max
( r~^— i w G Vk\ . D.4.25)
150 Chapter III. Convergence criteria
5 Limit periodic continued fractions
5.1 Definition
A continued fraction K.(an/bn) is said to be limit periodic with period
length fe, or limit fc-periodic for short, if its sequences {an} and {bn} of
elements are limit fe-periodic, i.e. the limits
bkn+p = K for p - 1,2,..., k E.1.1)
n —> oo ^ H n —*¦ oo
exist in C. One only has to cast a glimpse at the appendix of this book
to see that most of the continued fraction expansions of special functions
that are in use are indeed limit periodic.
Limit periodic continued fractions have been widely studied. Not only
for their importance, but also for their nice properties. They resemble
periodic continued fractions, both by appearance and behavior in several
aspects. This connection to the well understood periodic continued frac-
fraction is very useful. We shall use it to prove properties of limit periodic
continued fractions and to choose approximants for them.
5.2 Finite limits, loxodromic case
We assume in this subsection that all the limits in E.1.1) are finite.
Consider first the case where also all a* ^ 0. Since convergence of a
continued fraction is a property which always depends on terms far out
in the continued fraction, and since a limit periodic continued fraction
looks more and more like the corresponding periodic continued fraction
a* a? aT, af. a\ d!X
6* 6T -|" b% -f** • *-f~ b*L-\-b1^ -\- b?y-\-• • •
it is to be expected that the convergence/divergence of ~K(au/brl) is
closely tied to the convergence/divergence of E.2.1). And, as we saw
in Subsection 2.3, Theorem 6, this convergence is determined by the
Limit periodic continued fractions 151
classification of the linear fractional transformation
a?,
E.2.2)
and by the fixed points x^ and y^ of
p+2 +• • •+ Dp+fc + w
E-2.3)
If a* = 0 for some p, then E.2.1) is no longer a continued fraction by
our own definition. But if T^ is well defined, then K.(an/bn) may still be
a convergent continued fraction.
Let us first study limit 1-periodic continued fractions K.(an/bn) of loxo-
loxodromic type: that is, an —> a\ ? C, 6r, ->fcjGC and T\(w) = aJJt/F'J-f w)
is loxodromic if a\ ^ 0, and b\ ^ 0 if a\ — 0. In the latter case Ti is
a singular transformation, Tx{w) = 0 for w / —6*. We say that x = 0
is the attractive fixed point of (the singular transformation) T\ in this
case, and that y = —b*{ ^ x is the repulsive fixed point of T\.
Theorem 28 Let ~K(an/bn) be limit 1-periodic of loxodromic type, where
T\ has attractive fixed point x and repulsive fixed point y. Then:
A. ~K(an/bn) converges to a value f ? C.
B.
lim tn = ( X *'-? ~ ' E.2.4)
12/ lJzoT J
n —> oo
for every tail sequence tn = Sn l(to) of JS.{cLn/bn).
C. Let all bn = 1 and dn = sup{|am — a*|; m > n} for n = 1,2,3,
Ifd2 < A2/4 where A = |1 + x\ - \x\, then
n-\
n
- 4d
n+, J
152
Chapter III. Convergence criteria
for \w — x\ < Rtlf where
Ux
- U2)
x\ + \x
E.2.7)
= max
E.2.8)
Proof : A: We want to use Theorem 27 with all Vn — V and all
Qu — J7. As center for V we choose the attractive fixed point x. (This
is possible since x ^ oo under our conditions.) To make sure that
(aro&ri) ? ^} at least from some n on, we require that (a[,b\) be con-
contained in the interior of $7; i.e.
- x{\b\ + z|'2 - #2)| + R\a{\ < R(\b1 + x\2 - R2),
where R is the radius of V. Since a* = icF* + x) this can be written
x
- R.
Hence, the choice R = A-fi where A = |6, + x\ - \x\ and A/2 < fi < A
is fine. By Theorem 27 we then find that
2R
for 71?^ € V for all j, where N € N(j is chosen so large that (anybn) E
for all 7i > N and
<
+ x\-R
x\
<
A/2
as k —> oo. This proves that {.S^ (w^N+n)} is a Cauchy sequence and
thus converges to a value /W E V. This value is independent of the
Limit periodic continued fractions 153
actual choice of w^+n ? V. Hence the Nth tail of K{an/bn) converges
generally to /(;V). The classical convergence of this tail follows then
by Theorem 9. Finally Theorem 1 in Chapter TI gives the classical
convergence of the continued fraction K.(aTl/bn) itself.
B: Let first t0 = f. Then tn is the value of rath tail of K{an/bn) for
every n ? No- Therefore tn ? V for all n > N by the argument in the
proof of part A. Since fi can be chosen arbitrarily close to A (such that
R is arbitrarily close to 0), this proves that tn —> x.
Next we consider the case to ^ f. We shall use that
(see Theorem 3 in Chapter II). Let fi, A/2 < fi < A, be arbitrarily
chosen, and let N — N(fi) be an integer such that (an, 6n_i) ? fi for all
n> N. This is possible since also (an, 6n_i) —> (a*, 6f) in the interior of
fi. Further, let tyy be chosen such that — F;v + tjv) ? V and tyv ()
Since by E.2.10)
- (bN+n + tN+n) = , E.2.11)
we then have that — (bn + tn) ? V for all n> N. Hence {tn} has all its
limit points ? —{b\-\-V). By Theorem 6 in Chapter II we know that
all tail sequences of a generally convergent continued fraction have the
same asymptotic behavior, except the right tail sequence {/^'}. Hence
every tail sequence {trt} with t0 ^ f has all its limit points in —(b\ + V).
Since \i can be chosen arbitrarily close to A so that the radius R of V
is arbitrarily close to 0, we find that limn —> oo tn = — F* + x) if to ^ /.
Now x and y are the two solutions of the quadratic equation T\(w) = w.
Hence x -f- y = —b*{, and so tn —> — FJ + x) = y.
C: As in Part A we let Vn be circular disks with centers at Cn = sc, but
this time we shall vary the radii Rn. Moreover, since we want to derive
truncation error bounds for K(an/1)» we want (an, 1) ? fi for all n > 1;
that is
On(l + SB) - SB(|1 4- X\2 - R2n)\ + Rn\On\ < Rn-l{\l + *? ~ K) E.2.12)
154 Chapter III. Convergence criteria
for n — 1,2,3, Since an — a\ -f- en where |en | < dn and a^ = x(l +
the choice for i?o and i?n as given in the theorem works fine. Inserted
into D.4.24) this gives E.2.5). ¦
An alternative way to prove Theorem 28A is as follows: Since b*{ ^ 0 un-
under our conditions, we can transform some tail of K.(an/bn) to a contin-
continued fraction of the form K.(an/l) where cn —> a\/b]2. (For such equiva-
equivalence transformations, see Corollary 10B in Chapter II.) The convergence
then follows from the parabola theorem, Theorem 20. We chose to use
Theorem 27 since then the proof can be generalized to limit fc-periodic
continued fractions with period lengths k > 1.
Theorem 29 Let K(an/6n) be a limit k-periodic continued fraction of
loxodromic type with finite limits E.1.1). Let x^ and y(p) denote the
attractive and repulsive fixed points of TJf' for p = 0,1,..., k. If all
oo, then:
A. K(ttn/6n) converges generally to a value / G C.
B. K(an/^n) converges in the classical sense if all y^v' ^ oo.
C
n
JEW iftU = f
forp — 1,..., k .
y{p) if to ¦? f
E.2.13)
Remark: K(an/^n) is limit A-periodic of loxodromic type if either T^ is
a loxodromic linear fractional transformation or a singular, well defined
transformation Tf,(w) = c for all w ^ Wq for elwq^c. We then say that
x = c is the attractive fixed point of Tfc and that y = w^ is the repulsive
fixed point of
We shall not prove Theorem 29 in detail, but the idea is to use Theorem
Limit periodic continued fractions 155
27 with Ckn+p — Cp = a^ as centers of Vkn+p* Hence we need that
k-\
3=0
k-l
*u)\ - n
3=0
Fortunately E.2.14) holds in our case:
Lemma 30 Let x^ ^ oo be a fixed point ofTg' (as given in E.2.3))
for p = 0,1,..., k, chosen such that x(p~l) = flp/F* + x^) for p —
19 h
JL * ^»* • ¦ • • A/ •
A. If x^ is the attractive fixed point of T^ and y(p) ^ x^ is the
repulsive fixed point, then E.2.14) holds.
B. If Tf~ is parabolic, then
k-\ k-\
j=o j=o
C. IfTk is elliptic, then A = 0 fA as given in E.2.14)) but E.2.15)
does not hold.
Proof : A: Assume first that Tk is non-singular (i.e. all a* ^ 0). Then
ifc is a loxodromic linear fractional transformation, which can be written
According to Theorem 5C we have
and if J3J_j = 0 then a(°) = oo if |BJ| < |^_J and y = oo if
_,|. We have assumed that x^0' ^ oo.
156 Chapter III. Convergence criteria
Let first i??_, "=/=¦ 0. Then z(°) and y are finite solutions of the quadratic
equation Tk(w) = w. Hence x^ + y = (AjT^j — ???)/???_, which means
that
«l + m-,V = iift-i - **-i*@) • E-2.17)
In Problem 3 in Chapter II you were asked to prove that if {tn} is a tail
sequence for a continued fraction K(an/bn) and all tn ^ oo, then
n n
Bn + Bn^tn = Y[{bk + tk) , ^n ~ ^n^O = IK"^) ¦ E.2.18)
k=Q
The periodic sequence sc(°),scA)J...Ja:(*:-l)Ja!(*) = x^\x^ is a tail se-
sequence with all x(p> ^ oo for the periodic continued fraction K(g?/6?
given by E.2.1). Hence, by E.2.17),
fc-i
and
fc-i
p=0
Hence E.2.16) is equivalent to E.2.14) in this case.
Now let J9j_! = 0. Then \B'k\ > |AJ_J and «(°) = AJ/(BJ - AJ_,). By
E.2.18) with *„ = a;(n) we thus have
j=0 j=U
and E.2.14) follows since |BJ| > |A?_J.
Finally assume that a* = 0. Then a^) = 0. Hence E.2.14) follows
since 6; + a;(n) ^ 0 for all n. (If 6* + z(n) = 0 for an n € {1,..., A;},
then ?(""*') = a*i/(fc*t 4- ai^) = oo, since T& is weU denned and thus, in
particular, a* ^ 0.)
The results B and C follow by the same kind of arguments. ¦
To prove Theorem 29C we can use that the classification of linear frac-
fractional transformations 2\ are invariant under inversion. (See Property
Limit periodic continued fractions 157
1 in Subsection 2.2.) Theorem 29B follows then easily from part A and
C, using Theorem 4 in Chapter II.
In applications it is often functions of a complex variable z which are ex-
expanded in limit periodic continued fractions K.(an(z)/bn(z)) with poly-
polynomial elements an(z) and bn(z). Theorem 29 gives a domain D where
H(an(z)/bn(z)) converges, but we can only conclude pointwise conver-
convergence. However, Theorem 29 was based on Theorem 27, where the
bound D.4.24) can be used to prove uniform convergence.
Theorem 31 Let K{an(z)/bn(z)) with polynomial elements an(z) and
bn(z) be limit k-periodic. Let D C C be an open set where K.(an(z)/bn(z))
satisfies the hypotheses of Theorem 29 for all z ? D. Let F(z) be
the value of K{an(z)/bn(z)) in D. Then the (general) convergence of
K{an(z)Ibn(z)) in D is uniform on compact subsets C C D where
oo ^ F(C).
The proof is based on the fact that a compact set C which is contained
in a union (J«5i ®n °f open sets is also contained in a union of a finite
number of these sets Dn.
5.3 Finite limits, parabolic case
Also in this section we consider limit ^-periodic continued fractions
K{an/bn) where the limits E.1.1) are finite, but now we assume that
either T*. given by E.2.2) is a (non-singular) linear fractional transforma-
transformation of parabolic type, or T^ is singular (a* = 0 for some p ? {1,..., k})
and well defined for all w ^ c for a c ? C, with T(w) = c for all w ^ c.
We say that K(an/6n) is limit fc-periodic of parabolic type.
The situation is then substantially different from the loxodromic case.
Periodic continued fractions of parabolic type converge generally (see
Theorem 6), but they are in a way on the border line between the pe-
periodic continued fractions of loxodromic type which converge generally
and the ones of elliptic type which diverge generally. This is reflected
158 Chapter III. Convergence criteria
by the limit fc-periodic continued fractions of parabolic type. They may
converge or diverge generally depending on how the elements (an,6n)
approach their limit points E.1.1).
Example 24 The continued fraction K(an/1) is limit 1-periodic of para-
parabolic type if and only if a\ = limn —> qq an = —1/4. Let first an —
— 1/4+ €n where all en > 0, en —> 0. Then an G ^b in the parabola
theorem, and K(an/1) converges.
If an = ( — 1/4) — l/Drc2 — 1) for all rc, then one can prove that K(an/1)
diverges generally.
On the other hand, if
foraUri'
°n = 44Dng-l)
then an G Po,n in the parabola sequence theorem with gn — (n+l)/Bn-f-
1), and thus K(an/1) converges. In fact, /^n^ = —gn = — (n -\- l)/Bn +
1) is the right tail sequence of K(an/1) in this case, so the continued
fraction converges to — g^ = — 1.
O
The parabola theorems, the oval theorems etc. are useful tools for deter-
determining whether a given limit ^-periodic continued fraction of parabolic
type converges. From these we can find "safe directions" in which
(a>n,bn) may approach the limit points without disturbing the conver-
convergence (such as an > —1/4, an —*• — 1/4 in Example 24). We can also
find conditions for "safe speed in unsafe directions" (such as in E.3.1)).
The next result belongs to this latter category. It describes the border-
borderline between the convergent and divergent continued fractions K(a«/1)
where an —> — 1/4 monotonely from the "unsafe direction" an < —1/4.
It is due to Jacobsen and Masson, [JaMa90]. We use the notation
log0rc =n, logmn = logflog^.! n) form = 1,2,... E.3.2)
for the natural logarithms, and let Lk(n) denote the product
n = rc(l°grc)(k)g2 n) • • • (logfc n). E.3.3)
rn=0
Limit periodic continued fractions 159
Theorem 32 Let p ? No be a fixed number. The limit J-periodic con-
continued fraction K(an/1) converges if there exists an N ? N such that
p
-2
-i)|<^Dr»r E-3.4)
/nom some n on. All tail sequences of K.(an/l) then converge to x =
— 1/2. The limit J-periodic continued fraction K(an/1) diverges gener-
generally if
an = - \ - E DI») - d /(Up(n)J E.3.5)
from some n on with d > 1.
Sketch of proof: K(an/1) converges if a tail of K(an/1) converges.
A tail of K(an/1) converges if a/v+n ? Po,n from some n on, where Po,n
are the parabolic element regions in the parabola sequence theorem.
a>N+n ? Po,n from some n on if
+ + E-3-6)
2 4n 4nlogn 4Lp(n)
from some n on. ¦
Observe that the case E.3.1) is covered by E.3.4) with p = 0, and the
case an = ( — 1/4) — l/Dn2 — 1) is covered by E.3.5) with p = 0 and
d — 4. Similar results can be obtained for limit ^-periodic continued
fractions of parabolic type with period lengths A: > 1.
5.4 Finite limits, elliptic case
If I* in E.2.2) is an elliptic transformation, we say that K.(an/bn) in
E.1.1) is limit fc-periodic continued fraction of elliptic type. Such con-
continued fractions may also converge or diverge generally depending on
how (anibn) approach their limit points. In [Gill73] it is proved that
K(art/l)j where limn _* oo an = a\ < —1/4, may converge or diverge. It
always diverges if an —> a\ fast enough; i.e. if \an — a\\ < Crn for some
positive r < 1.
160 Chapter HI. Convergence criteria
5.5 Choice of approximants
Let K(an/bn) be limit fc-periodic of loxodromic type. Since
lim f(kn+P) — v(p) fnr n — 0 1 h — 1 /c; ^ 1 "\
n —r oo
by Theorem 29C, it seems reasonable to use the approximants
Sfcn+P(a;(p)). Actually, we find that if / ^ oo and all x^ ^ 0, oo, then
lim , V ,^ = lim " ^ w, n =0 E.5.2)
since hkn+P —* — y^ asn-^oo and y^ / x^p\ Here we have used for-
formula A.4.7) in Chapter II and Theorem 29C.) We say that Skn+p(xW)
converges faster to / than Sfcn+p@), or that the modification Wkn+p =
xW accelerates the convergence of K(an/6n).
In E.5.2) we used our information E.1.1) to choose approximants for
J?(an/bn). What if we also know that the limits
n —> oo
lim 7+p+1 = rp forp= l,2,...,Jfc E.5.3)
exist, where
+ wn), E.5.4)
what then? Can one use this information to find even better approxi-
approximants Sn(wh.') for K(an/^n)? The answer is yes, under mild conditions.
Let us demonstrate how this works for the case where the period length
is k = 1. We get:
Theorem 33 Let K(an/^n) be a limit 1-periodic continued fraction of
loxodromic type, with finite limits atl —> a| ^ 0 and bn —-»¦ tf cmdf wif/i
finite value f. Further let {wn} be a sequence from C such that en —
wn — /G^ ^0 as n —*¦ oOj and let 6n be given by E.5.4)- Then:
A.
— r s ? inn tri_i_]/tn — / . io.o.oi
n —>¦ oo --¦¦-- n —> oo
Limit periodic continued fractions 161
B. If limn —> oo 6n+\ /6n = r, then limn —> oo ?n/en_i = b\ + x + rz.
C. // limn _> oo 6n+i /?„ = r / 0, oo, Z^en
= 0,
km
,,—>00 / - Sn(wTl)
where Wn ' = wn -\- 6n+\ /(b\ + x -\- rx).
Proof : A: We know from Theorem 28B that /M —»¦ x ^ 00. Since
a\ ^ 0 we also have x / 0. Assume first that en+i /en —> r. Then
cn-, + €n} , E.5.6)
where the expression in the parentheses { } converges to b*x -\-x+xr =: d
as ?2—> oo. This expression d is ^ 0 since It]* -f x\ > \x\ by Lemma 30
and \r\ < 1. Hence lim^n+i/^n = Iim6n/€n_i = r.
Assume next that 6n+\/Sn —> r. From E.5.6) we obtain the equality
Sn+l _ Cn f6n+l+/(w+l)+/(w)Cn+l/€n+gn+ll ,- -
«n "€„_! \ bn + /(») + /(»-!)?„/€„_, +?„ /' ^'^
that is
tn
which means that {/(rt^€n+i/€n} is a tail sequence for a continued frac-
fraction K(cn/dn) where
'n
162 Chapter III. Convergence criteria
and
J L i "Il-fl .TIM—II L* i
«n = o«+i + ^n+i r—r ; -* 6X + jb - jcr =:
Or
¦Tl
Since T^(w) = c*/(rfj + iu) is loxodromic or singular with attractive
fixed point ra? and repulsive fixed point —(b\ + jb), where |6^ -f e| > \rx\,
it follows from Theorem 28 that /^n~l^€n/en_| —> ra;. This proves that
?n/cn_i —> r since x ^ 0, oo.
B: This is a direct result of E.5.6) and the subsequent remark.
C: We have as in E.5.2) that
W™ hn+wn
f-Sn{wn) n
where (hn + wn) —+ (—y + x) ^ 0, oo and where
0
by Part B.
In a way one might say that Theorem 33 describes a device to improve
approximants in the sense that the new approximants converge faster
to the value / of the continued fraction K.(an/bu) than the old ones.
As a starting point we need to have approximants {Sn(wn)} working
better than {Sn@)} in the sense of E.5.2).The process works under
mild conditions if the asymptotic side condition E.5.3) holds.
Example 25 The Stieltjes fraction
22z 2?z 42z b2z
JT^ 5j JT^ 9-11
converges to a holomorphic function f(z) for | arg(z + 1)| < ?r since
an —> 1/4. For such values of z this S-fraction is limit 1-periodic of
loxodromic type. We have d\(z) — z/4 and b\ = 1, and thus x = x(z) =
T
+7- l)/2 and y = y(z) = (-y/l + z ~ l)/2, where Jft(vTT^) > 0.
Hence we can choose the approximants
Sn(v>{n})) where w?\z) = x{z) = {yfiT~z - l)/2.
Limit periodic continued fractions 163
These approximants converge faster to f(z) than Sn@) in the sense of
E.5.2). With this choice we find that
= anZ _ ? =
Hence 6^{/6n —> 1, so by Theorem 33C the approximants Sn(wn (z))
where
@/ \
converge even faster to f(z). Suppressing the variable z we have
and thus
(lix
Bn+l)Bn+3)A + +Bn+3)Bn+5)
Bn + l)Bn + 3)Bn + 5) \ Bn + 3)A + 2e)
so also %^i/ft ~+ 1. Hence Sn(wn ) where
converge even faster again. We can continue the process. In this example
it is possible to prove that S^\ /6n —> 1 at every step m ? No, and
therefore
at every step m. In Table 1 we show the first approximants Sfl@), Sn(x),
Sn(wn ) and Sn(w}? ) for K{arl/l) (i.e. z = 1).
The table stops when we have reached the correct value with 8 digits, and
this value is repeated for all larger indices. Of course, in this example
we would always choose the approximants 5n@) or Sn(x) since they
164
Chapter HI. Convergence criteria
n
1
2
3
4
5
6
7
8
9
Sn@)
1.267
1.2121
1.2213
1.21970
1.219971
1.219924
1.2199320
1.2199306
1.2199308
•
•
•
Sn(x)
1.221
1.21984
1.219941
1.2199296
1.2199309
1.2199308
•
•
1.219993
1.2199258
1.21993125
1.2199307
1.2199308
•
*
•
1.2199356
1.2199305
1.2199308
•
*
•
Table 1: Example 25.
converge so fast anyway. But for values of z close to the ray | arg(z+l)| =
7r where the continued fraction diverges, or with \z\ large, there is much
to be gained by this method.
O
Example 26 Let us once more consider the S-fraction
K
anz
2*/6 2z/10
1+ 1 + 1 +¦
which converges to log(l + z) for | arg(l + z)\ < n. Also here a\(z) — z/4
and 6j = 1, so the approximants Sn(x) converge faster than 5n@) to
the value f(z) = log(l + z) in the sense of E.5.2). With this choice we
find for n > 2
= anz -
= anz -
x
)
= anz - z/4 =
1) if n is even,
if n is odd.
This means that limn
/?„ ' = —1, so by Theorem 33C the
Limit periodic continued fractions
165
n
1
2
3
4
5
6
7
8
9
10
Sn@)
0.667
0.700
0.6923
0.69333
0.693122
0.693152
0.6931464
0.69314733
0.693147158
0.693147185
Sn(x)
0.707
0.6948
0.6933b
0.693177
0.693152
0.69314783
0.69314728
0.693147196
0.693147183
0.693147181
0.6921
0.69309
0.693137
0.6931464
0.69314703
0.693147168
0.693147178
0.693147180
0.693147181
0.693147181
Table 2: Example 26.
approximants Sn(wn ) where
x — z/4(n -f 1)
x 4- z/4n
converge even faster to log(l + z). Now we find
if n is even,
if n is odd,
z2/\6n2
if n is odd.
Hence
n —* oo c@
=
ft1
- 8a?
if n is even,
if n is odd.
166 Chapter HI. Convergence criteria
To continue the process we therefore need results similar to Theorem 33
for limit 2-periodic situations. We shall return to this later. In Table 2
we have compared the values of 5n@), Sn(x) and Sn(wn ) for z = 1.
O
For the limit fc-periodic situations one can prove a similar result.
Theorem 34 Let K(«n/^«) be a limit k-periodic continued fraction of
loxodromic type with finite limits E.1.1), finite value f and finite and
nonzero attractive fixed points x^ for TJf' . Further let {wn} be a se-
sequence from C such that en — (wn — f^) —¦ 0 as n —¦ oo, and let 6nbe
given by E.5.4)- Then:
A. If for an m G {1,.. •,
lim €kn+p+l =3 c C /orp = m,m-l, E.5.8)
and sp ^ ~(bp+l + x(p+l))/x(p) for at least one of the indices p =
7?2, m — 1, then
..
B. If
11-rvi *^ — -f ?~ m * ¦f-j-i<y> «yx —. Illy m 1 I K h I II I
™-*°° Okn+p
then
f'~ ' ~' ' } = 0,1,2,..., k - 1.
E.5.11)
C. If E.5.10) holds, then
km ——-^—r = 0, E.5.12)
-*oo f - Sn(wn)
Limit periodic continued fractions
167
where
A) _
E.5.13)
and sp is given by the equations E.5.9) for m — 1,2,..., k.
To simplify the notation we have used that b*n+k = 6*4, aj(m+*) =
x^m\ tm+k = tm and sm+k = srn . The proof of this result follows
the same lines as the proof of Theorem 33. (It can be found for the
special cases all au = 1 or all bn = 1 in [JaWa88] and [JaWa90].)
Example 26 continued. If we regard 'K(anz/l) as a limit 2-periodic
continued fraction, then our observation
lim
n —> oo
ffi
(I)
= to = t> —
2n
z-Sx'
n —> oo
'2n+2 _ f _
(i \ — *1 —
z-8x
2n+l
n
1
2
3
4
5
6
7
Sn(™i2))
0.693170
0.693159
0.69314740
0.69314728
0.693147183
0.693147182
0.693147181
Table 3: Example 26.
gives by E.5.9)
z - 8e -
Si = — =
- 8ar - 8a?2
Sx2
168 Chapter HI. Convergence criteria
and thus
lim
when
- 82 - 8ar2
( _ g _
The first approximants Sn(wn ') are given in Table 3. The value is
repeated for all n > 7.
O
Another question is: How can we improve the convergence of limit pe-
periodic continued fractions of parabolic type? These continued fractions
often converge very slowly, if they converge at all, so the question is very
relevant. Let us look at an example:
Example 27 The continued fraction K(on/1) where
an = -- + — for n = 1, 2, 3,...
4 on
is limit 1-periodic of parabolic type. It converges to a finite value / by
virtue of the parabola theorem, Theorem 20, since all an € Pw This
value is in fact / = —0.172160, correctly rounded to 6 decimal places.
Two other properties can also be derived from this fact that all an are
real and G Po- The first one is that /^ is real and G Vq for all n and k\
i.e. /|. > — 2 f°r a^ n
Sn@) = ^tty- < 0 for all n ,
1 4- P '
1 + Jn-\
and
5n@) > 5n+i@) for all n.
Limit periodic continued fractions 169
Hence {5n@)} is a decreasing sequence of negative numbers converging
to /. The other property is that the right tail sequence of K(an/l)>
{/(")}, converges to — ?. (Clearly, all /(n) are negative and > —1/2 by
the arguments above. If /* is a limit point of {/(")}, then so is /*,
where /* = (-l/4)/(l + /*). Hence, the set of limit points for {/M}
must contain the tail sequence {t"t} of the periodic continued fraction
K(( — l/4)/l) which begins with JJ = /*. The only such tail sequence
which is contained in Vq is the right tail sequence, where all <* = —1/2.)
This suggests that Sn( —1/2) is a good choice for the approximants of
K(an/1). Since /(n) > —1/2 for all n, this in turn means that
/ = $„(/(">) > 5B(-i/2)
for all n, so Sn(Q) > f > 5n( —1/2), which is a useful truncation error
bound. On the other hand, since also hn —*¦ 1/2 (by a similar argu-
argument), we do not have high expectations to the speed of convergence of
Sn(—1/2) as compared to 5n@). (See the formula in E.5.2).)
We shall estimate /H a little better than just using /(n) % -1/2. The
idea was suggested by Gill in another context, [Gil 180]. We have
/(») =
an+2 an+A
1 + 1 + 1
1 + 1 -f 1
1 1
2 + ^T1
Therefore Sn(wn) ought to converge faster to / than 5rt@) and 5n( —1/2).
Table 4 gives the first approximants of the types ^(O), Sn( —1/2) and
K(an/1).
O
5.6 Continued fractions K(art/J-) where an —> oo
Let K(an/1) have elements an —> oo. Then K(a«/1) may converge or
diverge depending on how {an} approaches infinity. For instance, if
170
Chapter III. Convergence criteria
n
1
2
3
4
•
*
17
18
19
20
5n@)
-0.125
-0.154
-0.164
-0.168
¦
•
•
-0.172147
-0.172150
-0.172153
-0.172155
5n(-l/2)
-0.25
-0.20
-0.184
-0.1780
¦
•
•
-0.172176
-0.172171
-0.172169
-0.172166
-0.167
-0.1704
-0.1714
-0.1718
•
•
•
-0.172160
-0.172160
-0.172160
-0.172160
Table 4: Example 27.
all an > 0, then K(a«/l) converges if and only if its Stern-Stolz series
diverges. (See Theorem 3 and the subsequent Remark 2.)
Evidently the techniques of Subsections 5.2- 5.5 can not be applied here.
So, how can we find good approximants and truncation error bounds?
We shall illustrate some ideas in the following examples.
Example 28 Does the continued fraction K(an/1)> where an = inn,
converge or diverge? The even part of ~K(an/i) is
- 1 + a.-? + a.x - 1 + a5 + aG -
4-5«
1-2-1-3z
i 2-3z
+ 5f-6-1-7z+ 8-
4-5i 6 • li
-1 -1 + D - 3z) - 1 - F - 5z) -1 + (8 - 7i) '
(See Theorem 12 and formula B.4.3) in Chapter IT.) This continued
fraction Kv is equivalent to a continued fraction of the form K(cn/1),
where
-Bn - 2)Bn -
(-l)nBn - 2 - Bn -
n - Bn -
Limit periodic continued fractions 171
for n > 3. (See Corollary 10B in Chapter II.) Since
—4z i 1
- if = 2'
we find that K(cn/l) is limit 1-periodic of elliptic type. One can prove
that K(cn/l) diverges by methods presented in Chapter IV.) Hence also
K(an/1) diverges.
O
Example 29 Let a-2n-i = ^rc2 and a-zn — n for all n G N. Will then
K(an/1) converge or diverge? The even part of K(an/1) is
_ il* 1 • z2'2 2-z32 3 ¦ i42 Cn
e~ 1
+ 1-1 +z22 + 2-l + z32 + 3-l+z42-f 4
where
—i(n — l)n2
G2 -f «(w - l)J)(n + 1 + in1)
Hence cn —¦ 0 as n —> 0 and thus K(cn/1) converges. By the same type
of argument we find that the odd part of K(a7J/l) converges. But we
can not say whether K.(an/1) itself converges or not, unless we can prove
that (fn+i — fn) —* 0 or not.
O
Example 30 Let an = n for all n G N. Then we know by Theorem 3
that K(fln/1) converges. But the convergence of {5n@)} is slow. Can
we find better approximants?
The even part of K(ra/1) is
K =!
2'3 4'5 ~k
1 + 2-1 + 3 + 4-1 + 5 + 6-.
where C\ = 1/3, ci — —1/4 and
-Bn- 2)Bn-
A + 2n - 3 + 2n - 2)A + 2n - 1 + 2n)
72-1/2 11 1
= — H > — - as 72 —> oo .
472 4 071 4
172 Chapter III. Convergence criteria
We recognize the continued fraction from Example 27. The second tail
of that one is identical to the second tail of K(cn/1). Hence we can
use the same modification wn = —A/2) + l/B>/2G2 + 1)) to compute
approximants STl(wn) of K(cn/l).
O
Example 31 Another approach for finding good approximants for
K(n/1) is the following: Since art = n is not so very different from
an+i, we guess that the value of the nth tail
\ an+i an+2 an+3 n + 1 n + 2 n + 3
1 + 1 + 1 +•-- 1 + 1 + 1 +-
of K(w/1) is not so very different from
wn =
an+l an+i an+\ y/1 + 4a/l+1 - 1 y/4n + 5-1
1 + 1 + 1 +..- 2
Hence we want to try the approximants
12 n 12 n
Sn(wn) = - -
1 +1 +...+1 + Wn 1 + lH h(l + v/4n + 5)/2 *
Table 5 compares the approximants 5n@) and Sn(wn). The last column
contains the approximants suggested in Example 30.
We can in fact prove that Sn(wn) converges faster than 5n@) to the
same value /; i.e. that
,. / - Sn(wn)
lim — x / = 0.
n-^oo f-Sn@)
To do this we can use the oval sequence theorem, Theorem 26, with
centers Cn = wn — (\/4ti + 5 — l)/2 and radii Rn = R = 1/2. This
choice satisfies the conditions of Theorem 26. Moreover a G En as given
by D.4.16) if and only if
Limit periodic continued fractions
173
n
1
2
3
4
5
6
7
8
9
10
Sn{0)
1.0
0.33
0.67
0.44
0.583
0.487
0.553
0.506
0.540
0.515
Sn{yjn)
0.5
0.535
0.5205
0.5275
0.5238
0.52592
0.5247
0.52544
0.5249
0.52527
Ex.30
n=2: 0.5168
n=3: 0.5218
n=4: 0.5236
n=5: 0.5244
Table 5: Example 31.
Hence an = n ? En if and only if
4n(l
n + 5) - (\/472
5 + 4n
+4n < 2\/4ra + 5 4-472
that is if
-4n(l 4- \/4n 4- 5) 4-
- l)B\/472 + 5 4- 4n 4- 5) + 472
< 2\/472 4- 5 + 4n 4- 5.
Straight forward computation shows that this holds for all n. Hence
Vn = {w\ \w — u;n| < 1/2} is a sequence of value regions for K(ti/1).
Since Vn is contained in the right half plane $l(w) > —1/2, at least
from some 72 on, it follows from the parabola theorem with a — 0 that
/(") € Vn for all n; that is, |/(n) - wn\ < 1/2. Hence, by formula A.4.7)
in Chapter II
- Sn(wn)
f-sn(o)
W ~ wn)
(K 4- wTl)fH
1/2
Wn ~ 1/2
0
since hn > 0 and wn > 0 and thus \hn/(hn 4- wn)\ < 1.
We can also derive truncation error estimates. From Theorem 26 it
174 Chapter 111. Convergence criteria
follows that
\f - Sn{wn)\ < 211— || — —
1 + wn - R ^ 1-f wk -f R
(A slightly smaller R would have given better bounds.)
O
5.1 Analytic continuation
Let us illustrate the idea by a trivial example.
Example 32 The periodic C-fraction
az az az az
a?C\{0} E.7.1)
converges in the cut plane D = {z G C; |arg(l -f ±az)\ < w} to the
holomorphic function
w
(z) = (y/l i Aaz - l)/2 where ^(v/l +4az) > 0 . E.7.2)
(See Theorem 28.) This function w(z) can be extended analytically
to a 2-sheeted Riemann surface D" with branch points of order 1 at
z = —I/4a and at z — oo. Let W(z) denote this function,
W(*) = @ +4az)l/2-l)/2 foTzeD". E.7.3)
Then K{anz/l) converges to W(z) for z G D C D*. The classical
approximants of J?(anz/l) are rational functions of z, and thus have
no branch points. Hence there is no way that these approximants can
converge to W(z) for z G D~ \ D. For the modified approximants
az az az
liowever, the picture is totally different. For these we have Sn(W(z)) =
W(z) for all 7i and all z G D*. That is, they "converge" to W(z) for all
Limit periodic continued fractions 175
z ? D*. So this choice of approximants lead to analytic continuation of
the value w(z) of the continued fraction ~K(az/l).
O
As already mentioned, this example was trivial. But what if we try to
use modified approximants
c nxn w aiZ a2Z a3Z anZ r^7A\
S«(W(z)) = —+—+—+ + I-^y E.7.4)
for a continued fraction K.(anz/1) where an —> a? Can we also then ob-
obtain convergence of Sn(W(z)) to a holomorphic or meromorphic function
in a larger domain C D* than D where J^(anz/l) is known to converge?
The answer is yes under proper conditions:
Theorem 35 Let a ? C \ {0}, c>0,0<r<l
and
where W(z) is as given in E.7.3). IfK.(anz/l) satisfies
\an~a\<Crn for n = 2,3,4,..., E.7.6)
then {Sn(W{z))} converges to a meromorphic function F(z) in D*. The
convergence is uniform on compact subsets of D* where F(z) ^ oo.
Remarks:
1. D* is a domain in Dm. D* C Dl if r > t. D\ = D and jDq is equal
to D" where the point 0 is removed from the sheet D* \ D.
2. D; is all of D* except for a bounded hole H = D* \ D;. This hole
is contained in the sheet Dx \ D, and it is symmetric about the
axis arg(—az) = 2tt. It is bounded away from the branch points
z = — l/4a and z = oo.
3. The observations in the previous remark imply that the limit func-
function F(z) also has branch points of order 1 at z = — l/4a and
z = oo.
176
Chapter III. Convergence criteria
4. The computation of the approximants Sn(W(z)) is unstable for
z G D* \ D. Small inaccuracies lead to approximants which con-
converge to /(?) ^ F(z)i where z is the projection of z onto D.
This unstability can be avoided by using the Bauer-Muir trans-
transformation as described in Chapter II. The new continued fraction
thus obtained (the Bauer-Muir transform of K(anz/l) with re-
respect to W(,z)) can often be accelerated by methods described in
the present section, or be extended analyticly even further.
For the proof of Theorem 35 we refer to [Lore]. We shall rather show an
example.
Example 33 The limit 1-periodic S-fraction K(anz/l) where
an = 0.25 -|- @.3)n for n = 1,2,3,...
converges to a meromorphic function in the cut plane D = {z G C; | arg(l
+z)| < tt}. According to Theorem 35, however, its approximants
S«{W{z)) =
CL\Z CL2Z
anz
W(z) =
for z G D* converges to a meromorphic function for z G D*K. Here D* is
the 2-sheeted Riemann surface with branch points of order 1 at z = —1
and at z — oo, and Dq 3 is the subset of D* where
W(z)
W{z)
_1_
(U
10
3
Observe that z G D^ if 5RA + zI'2 > -7/13 whereas z G D if and only
if SRA + zfl2 > 0. Moreover z G D?K if |1 + z\ll2 > 13/7. Hence Z>*>3
contains D and neighborhoods of the branchpoints z = — 1 and z = oo.
.0
Problems 177
Problems
A) Determine whether K{an/bn) converges or diverges and whether
its even and odd parts converge or diverge in each of the following
cases.
(a) All an = 1 and bn = z/n2 for z ? C.
(b) All an = z/n2 and bn = 1 for z ? C \ {0}.
(c) All an = zn2 and bn = 1 for z ? C \ {0}.
B) Given the continued fraction
a a a a
+ K26= + 26 + 26 + 26+---
(a) Prove that if a > 0 and 6 > 0 then 6 + K(a/26)converges to
y/a + 62. Use this to find a rational approximation to vl3
with an error less than 10~4.
(b) For which values of (a, 6) ? C X C does 6 + K(a/26) con-
converge/diverge? Find its value if it converges.
C) In Example 26 we suggested approximants Sn(wn ) for the con-
continued fraction
a!L_l 1/2 1/B-3) 2/B-3) 2/B-5) 3/B-5)
KT~T+1+ 1 + 1 + 1 + 1 +•¦-
n/BBn-l)) n/{2Bn + l))
+ i + i +•••
for log 2 (natural logarithm).
Does {Sniwn )} or {Sn(iVri )} have an oscillating character which
can be used to obtain upper bounds for the truncation error | log 2—
D) We want to improve the speed of convergence of the continued
fraction
an _ 6 + @.9) 6 + @.9J 6 + @.9K 6 + @.9)'
K 1 " 1 + 1 + 1 + 1 +•••*
(a) Prove that K(a«/1) converges to a finite value /.
178 Chapter III. Convergence criteria
(b) Suggest a value wn = w for all n such that
,. / - Sn(w)
lim —; „ . . = 0.
n -> oo / - 57I@)
Does the sequence {Sn(w)} have an oscillating character which
can be used to obtain upper bounds for the truncation error
(c) Suggest values for wn such that
lim = o
„ -> oo / _ 5n(ii/)
where w is the value from a). Does {Sn(wn)} have such an
oscillating character?
E) For which values of z does the 4-periodic continued fraction
n_
KT ~ T+ITT+T+TT
T T+I-T-T+T+T-T-T+-..
(a) converge /diverge generally?
(b) oscillate by Thiele oscillation?
(c) converge in the classical sense?
What is the value of this continued fraction when it converges
generally?
F) Let z be chosen such that the continued fraction 7<L(an/l) in Prob-
Problem 5 converges generally. Determine the asymptotic behavior of
the tail sequences of K(a«/1) in the sense of Theorem 7.
G) For which bn > 0 is V = {w ? C; |u; — 1| < 1} a value set for
K{l/bn)l For which an > 0 is V a value set for K(an/1)?
(8) Let V be the half plane V = {w G C; $l(w) > p} where p > 0 or
p < —1. Do there exist continued fractions K(an/1) or K(l/bn)
(with complex elements arn bn) such that V is a value set for this
continued fraction?
(9) Let K(l/6n) be the continued fraction where bn = 4 + @.9)n for
all n.
Problems 179
(a) Find a connected value set V for K(l/&n)- (Try to make V
small.)
(b) Does K(l/&n) converge to a finite value /?
(c) Are the classical approximants fn of K(l/6n) all distinct; i.e.
771?
(d) Use the value set V found in (a) to derive upper bounds for
the truncation error |/ — 5n(it;)| for suitably chosen it; G C.
A0) Let p, q,r ? No, and
v
:, a2n{z) = Yiki2O1*
b2n-i{z) =
be polynomials in n for n = 1, 2,3,..., where all q^, 7a.., C^ and
6k are entire functions of z and ap(z)~fp(z)/3q(z)8r(z) p 0. Further
let D = {z ? C; /3(](zNr(z) ? 0 and all an(z) ? 0}.
Prove that K(an(^)/bn(z)) converges in D if D is open and con-
connected and
(a) q -f t > p and D C D
(b) q + r = p and
(c) q + r = p - 1, ocv{z) = *)p{z) and
D C {z ? D; /3yB;)^rB;)/QpB;) ^ [-00, 0]}
(d) 9 + r = p - 2, apB;) = 7PB;) and
qp(z)
180 Chapter HI. Convergence criteria
(Hint: To prove (b)-(d) one can first prove that the even and
odd parts of K.(an(z)/bn(z)) converge to meromorphic functions
or functions identically oo in D. Then use the parabola theorem
to prove that these functions are identical on some (large enough)
subset of D. Finally one can use Stieltjes-Vitali's theorem to prove
that they are identical in D.)
A1) Use the parabola theorem to derive a priori truncation error bounds
for
, -x _ r, ^ _ / 7( 3) 22/B-3) 2tVB • 5)
+1) - k — - T+^r+—— + i + i
where a2n = n/BBn— 1)) and c^n+i = n/BB7i+1)) for all n > 1.
Compare these with the Gragg-Warner truncation error bounds in
Theorem 24.
A2) Let K{an/l) have all elements an G E = {w G C; \w-3-i\ < 0.4}.
Find a C G C and an jR < |1 + C\ such that E is a subset of the
cartesian oval in Theorem 25. Use this to prove that K(an/l)
converges to a finite value and to find truncation error bounds for
suitably chosen approximants of K(an/1). (Hint: See Remark 4
to Theorem 25.)
A3) Use Theorem 28 to estimate the speed of convergence of Sn(w) for
the continued fraction K.(ani/1) in Problem 11 when
w = (Vl + i - l)/2, $(y/TTi) > 0.
A4) Let an be as in Problem 11. Does K(—an/l) converge?
A5) Let K(on/1) be given by
an = x(l -f sc) -f rn where 0 < \x\ < |1 + x\ and 0 < \r\ < 1.
Choose approximants 5n(«ii ) for K.(an/1) according to the
scheme in Subsection 5.5 such that
f _ c / (m)\
lim n\ n _ q ^ m = 1 2,3,... .
n —» oo / c f '•"-"'
Compare with the Bauer-Muir transformation in Example 11 of
Chapter II.
Problems 181
A6) Let K(an/l) have real elements an such that (—l)nan > 0 and
|«2n-l| < 1 + a2n, \d2n+l \ < 1 + «2n for all 71.
Prove that {S\n+p@)}^-i converges for p = 1,2,3 and 4.
A7) Suggest expressions for wn such that the approximants Sn(wn) of
(hopefully) converge faster to the value of ]&.(an/l) than 5n@).
Compute the first 6 approximants of 5n@) and Sn(wn) and use
the oscillating character to determine an error bound for Sq@) and
182 Chapter III. Convergence criteria
Remarks
1. The idea of using value regions for continued fractions to derive
convergence criteria was presented by Paydon and Wall in 1942,
[PaWa42]. They had V = {w ? C; |w - 1| < 1} and tIt(w) =
1/A -f an+|it;), and studied continued fractions
1 d] a-2
1+ 1 + 1 + 1 | ...
with the property that tn(V) C V for all n.
This fruitful idea was further exploited by Wall and Wetzel,
[WaWe44] in two papers on positive definite continued fractions.
W. J. Thron realized the potential of this idea, and in a long series
of important papers, some of which in collaboration with others, he
refined it and used it to derive several useful convergence criteria.
We can mention his work on the parabola theorem [Thron58] and
twin convergence regions [Thron59]. A nice survey is given in
[Ihron74]. See also his book with W. B. Jones [JoTh80] for further
references.
In this classical work, value regions or value sets always contained
the classical approximants 5n@) of the continued fractions. It was
not until 1982 that one realized that value sets for other approxi-
approximants Sn(wn) could be used in the same way, [Jaco82], [Jaco83],
[Jaco86].
2. The demand for truncation error estimates became more promi-
prominent in the 1960s with the growing use of computers. W. J. Thron
[Thron58] realized that value sets could also be used for the pur-
purpose of deriving such estimates. This started a series of useful
publications in this area, such as [HePf66] and [JoTh76]. For fur-
further references we refer to [JoTh76, p. 298].
3. The first parabola theorem was published by Scott and Wall in
1940, [ScWa40]. It was proved by exploiting what they called
the fundamental inequalities. The result was generalized almost
immediately [PaWa42] and [LeTh42]. The most general parabola
theorem is due to Jones and Thron, [J0TI168].
Remarks 183
4. Limit periodic continued fractions, included those where an —> oo
or bn —> oo, have been extensively studied, the reason being that
so many useful continued fraction expansions have this form. One
important issue here is the question of how to choose approximants
{Sn(wn)} in order that Sn(wn) shall converge as fast as possible to
the value of the continued fraction. The first known result in this
direction dates back to Sylvester in 1869 [Sylv69]. More recently
Wynn [Wynn59], Gill [Gill75], Masson [Mass83], [Mass85], Thron
and Waadeland [ThWa80] and Jacobsen, Jones and Waadeland
[JaJW87] contributed to this area.
The idea of using asymptotic side conditions to improve the speed
of convergence even further, was published by Jacobsen and Waade-
Waadeland [JaWa88], [JaWa90] and improved by Levrie [Levr89].
In the paper [Waad66], the idea of deriving analytic continuation
of the value f(z) of a continued fraction K.(an(z)/bn(z)) by careful
choice of approximants Sn(wn(z)) was introduced. Independently,
Masson came up with the same method, [Mass83]. A thorough
presentation of the method can be found in [ThWa80], [ThWa81].
References
[GI1173] J. Gill, Infinite Compositions of Mobius Transformations,
Trans. Amer. Math. Soc. 176 A973), 479 -487.
[GU175] J. Gill, The Use of Attractive Fixed Points in Accelerating
the Convergence of Limit-Periodic Continued Fractions,
Proc. Amer. Math. Soc. 47 A975), 119-126.
[Gill80] J. Gill, Convergence Acceleration for Continued Fractions
K(an/1) with lim<zn = 0, "Analytic Theory of Contin-
Continued Fractions", (W.B.Jones, W.J.Thron, ll.Waadeland,
cds), Lecture Notes in Mathematics 932, Springer-Verlag,
Berlin A980), 67-70.
[GrWa83] W. B. Gragg and D. D. Warner, Two Constructive Results
in Continued Fractions, SIAM J. Numer. Anal. 20 A983),
1187-1197.
[HePf66] P. Henrici and P. Pfluger, Truncation Error Estimates for
Stieltjes Fractions, Numer. Math. 9 A966), 120-138.
[HiTh65] K. L. Hillam and W. J. Tliron, A General Convergence
Criterion for Continued Fractions K.(an/bn), Proc. Amer.
Math. Soc. 16 A965), 1256-1262.
[Hille62] E. llille, "Analytic Function Theory", Vol 2, Ginn, Boston
A962).
[Jaco82] L. Jacobsen, Some Periodic Sequences of Circular Conver-
Convergence Regions, "Analytic Theory of Continued Fractions",
184
References
185
Lecture Notes in Mathematics 932 (W. B. Jones, W. J.
Thron and H. Waadeland eds.), Springer-Verlag, Berlin
A982), 87-98.
[Jaco83] L. Jacobsen, Convergence Acceleration and Analytic Con-
Continuation by Means of Modification of Continued Fractions,
Det Kgl. Norske Vid. Selsk. Skr. No 1 A983), 19-33.
[Jaco86] L. Jacobsen, General Convergence of Continued Fractions,
Trans. Amer. Math. Soc. 294B) A986), 477-485.
[JaJW87] L. Jacobsen, W. B. Jones and H. Waadeland, Conver-
Convergence Acceleration for Continued Fractions K.(an/l) where
an —* oo, "Rational Approximation and Its Applications in
Mathematics and Physics", Lecture Notes in Mathemat-
Mathematics 1237 (J. Gilewicz, M. Pindor and W. Siemaszko eds.)
Springer-Verlag, Berlin A987), 177-187.
[JaMa90] L. Jacobsen and D. R. Masson, On the Convergence
of Limit Periodic Continued Fractions Jt(an/l), where
an —> -1/4. Part 111., Constr. Approx. 6 A990), p.363-374.
[JaTh86] L. Jacobsen and W. J. Thron, Oval Convergence Re-
Regions and Circular Limit Regions for Continued Fractions
K(aw/l)j "Analytic Theory of Continued Fractions" II,
Lecture Notes in Mathematics 1199 (W. J. Thron ed.),
Springer-Verlag, Berlin A986), 90-126.
[JaWa86] L. Jacobsen and H. Waadeland, Even and Odd Parts
of Limit Periodic Continued Fractions, J. Comp. Appl.
Math. 15 A986), 225-233.
[JaWa88] L. Jacobsen and H. Waadeland, Convergence Acceleration
of Limit Periodic Continued Fractions under Asymptotic
Side Conditions, Numer. Math. 53 A988), 285-298.
[JaWa90] L. Jacobsen and II. Waadeland, An Asymptotic Property
for Tails of Limit Periodic Continued Fractions, Rocky
Mountain J. of Math. 20A) A990), 151-163.
[JoTh68] W. B. Jones and W. J. Thron, Convergence of Continued
Fractions, Canad. J. of Math. 20 A968), 1037-1055.
186 Chapter III. Convergence criteria
[JoTh70] W. B. Jones and W. J. Thron, Twin-Convergence Regions
for Continued Fractions J?(an/1), Trans. Amer. Math.
Soc. 150 A970), 93-119.
[JoTh76] W. B. Jones and W. J. Thron, Truncation Error Analysis
by Means of Approximant Systems and Inclusion Regions,
Numer. Math. 26 A976), 117-154.
[JoTh80] W. B. Jones and W. J. Thron, "Continued Fractions: An-
Analytic Theory and Applications", Encyclopedia of Mathe-
Mathematics and its Applications, 11, Addison-Wesley Publish-
Publishing Company, Reading, Mass. A980). Now distributed by
Cambridge University Press, New York.
[Lane45] R. E. Lane, The Convergence and Values of Periodic Con-
Continued Fractions, Bull. Amer. Math. Soc. 51 A945), 246-
250.
[LaWa49] ft. E. Lane and H. S. Wall, Continued Fractions with Abso-
Absolutely Convergent Even and Odd Parts, Trans. Amer. Math.
Soc. 67 A949), 368-380.
[LeTh42] W. Leighton and W. J. Thron, Continued Fractions with
Complex Elements, Duke. Math. J. 9 A942), 763-772.
[Levr89] P. Levrie, Improving a Method for Computing Non-
dominant Solutions of Certain Second-Order Recurrence
Relations of Poincare-Type, Numer. Math. 56 A989), 501-
512.
[Lore] L. Lorentzen, Analytic Continuation of Functions Repre-
Represented by Continued Fractions, Revisited. To be published
in Rocky Mountain J. of Math.
[Lore] L. Lorentzen, Bestness of the Parabola Theorem for Con-
Continued Fractions. To be published.
[LoRu] L. Lorentzen and St. Ruscheweyh, Simple Convergence Sets
for Continued Fractions K.(an/l). To be published.
[Mass83] D. Masson, The Rotating Harmonic Oscillator Eigenvalue
Problem. 1. Continued Fractions and Analytic Continua-
Continuation, J. Math. Phys. 24 (8) A983), 2074-2088.
References
187
[Mass85]
[PaWa42]
[ScWa40]
[Seid46]
[Stern48]
[Stern60]
[Stie94]
[Stolz86]
[Sylv69]
[Thie79]
[Thron58]
[Thron59]
D. Masson, Convergence and Analytic Continuation for a
Class of Regular C-fractions, Canad. Math. Bull. 28D)
A985), 411-421.
J. F. Pay don and II. S. Wall, The Continued Fraction as
a Sequence of Linear Transformations, Duke. Math. J. 9
A942), 360 -372.
W. T. Scott and II. S. Wall, A Convergence Theorem for
Continued Fractions, Trans. Amer. Math. Soc. 47 A940),
155-172.
L. Seidel, Untersuchungen u'ber die Konvergenz und Diver-
genz der Kettenbruche, Habilschrift Miinchen A846).
M. A. Stern, Uber die Kennzeichen der Konvergenz eines
Kettenbruchs, J. Reine Angew. Math. 37 A848).
M. A. Stern, "Lehrbuch der Algebraischen Analysis",
Leipzig (I860).
T. J. Stieltjes, Recherches sur les fractions continues, Ann.
Fac. Sci. Toulouse 8 A894), J, 1-122; 9 A894), A, 1-47;
Oeuvres 2, 402-566. Also published in Memoires Presentes
par divers savants a PAcademie de sciences de PInstitut
National de France 33 A892), 1-196.
0. Stolz, "Vorlesungen iiber allgemeine Arithmetic", Teub-
ner, Leipzig A886).
J. J. Sylvester, Note on a New Continued Fraction Appli-
Applicable to the Quadrature of the Circle, Philos. Mag. Ser. 4
A869), 373-375.
T. N. Thiele, Bemcerkninger om periodiske Kjasdebr0kers
Konvergens, Tidsskrift for Mathematik DK A879).
W. J. Thron, On Parabolic Convergence Regions for Con-
Continued Fractions, Math. Zeitschr. 69 A958), 173-182.
W. J. Thron, Zwillingskonvergenzgebiete fur Kettenbruche
1 + K.{an/1), deren eines die Kreisscheibe |a2n-i| < p2 ist,
Math. Zeitschr. 70 A959), 310-344.
188
Chapter III. Convergence criteria
[Thron74]
[ThWa80]
[ThWa80]
[ThWa81]
[Waad66]
[WaWe44]
W. J. Thron, A Survey of Recent Convergence Results
for Continued Fractions, Rocky Mountain J. Math. 4B)
A974), 273-282.
W. J. Thron and H. Waadeland, Accelerating Convergence
of Limit Periodic Continued Fractions K(an/l)» Nuiner.
Math. 34 A980), 155-170.
W. J. Thron and H. Waadeland, Analytic Continuation of
Functions Defined by Means of Continued Fractions, Math.
Scand. 47 A980), 72-90.
W. J. Thron and H. Waadeland, Convergence Questions
for Limit Periodic Continued Fractions, Rocky Mountain
J. Math.ll A981), 641-657.
H. Waadeland, A Convergence Property of Certain T-
fraction Expansions, Det Kgl. Norske Vid. Selsk. Skr. 9
A966), 1-22.
H. S. Wall and M. Wetzel, Contributions to the Analytic
Theory of J-Fractions, Trans. Amer. Math. Soc. 55 A944),
373-397.
[WaWe44] H. S. Wall and M. Wetzel, Quadratic Forms and Conver-
Convergence Regions for Continued Fractions, Duke. Math. J. 11
A944), 89-102.
[Wynn59] P. Wynn, Converging Factors for Continued Fractions, Nu-
mer. Math. 1 A959), 272-320.
Chapter IV
Continued fractions and
three-term recurrence
relations
About this chapter
The fact that the canonical numerators {An} and denominators {Bn}
of the continued fraction K.(an/bn) satisfy the equalities
An = bn An_! ¦+- an An_2 , Bn = &„?„_ L + anBTl_2 for n = 1,2,3,... ;
i.e. that {An} and {Bn} are solutions of the three-term recurrence rela-
relation
Xn = bnXn-1 + ariA"n_2 for n = 1,2, 3,...
with initial values A-\ = lt Aq = 0, J?_i = 0 and Bo — 1, is very useful.
The solution space of this recurrence relation has a very nice structure:
it is a linear space. And this fact can be used in the convergence theory
for continued fractions with "surprisingly" good results, as we shall see.
But we can also make use of this connection in the opposite direction, as
the basis for a continued fraction method to compute certain solutions
of three-term recurrence relations. So useful is this connection that one
189
190 Chapter IV. Three-term recurrence relations
has asked the question of whether there are some continued fraction-like
structures which corresponds to "longer" recurrence relations. This will
be touched upon at the end of this chapter.
Readers familiar with hypergeometric functions and/or orthogonal poly-
polynomials will probably guess that this link between continued fractions
and three-term recurrence relations provides a link between continued
fractions and hypergeometric functions and between continued fractions
and orthogonal polynomials. This is indeed so, but will be treated in
later chapters.
Three-term recurrence relations 191
1 Three-term recurrence relations
1.1 The structure of the solution space
Let us take a closer look at the three-term recurrence relation
Xn = 6nXrl_! + anXn-2 for n = 1,2, 3,... ,
where all an and bn are complex numbers and all an ^ 0. A sequence
{Xn}^__, of complex numbers is called a solution of A.1.1) if its ele-
elements satisfy this equality for all n ? N.
Example 1 The sequence {Xn}<^__l = < (L~2 ) \ is a solution
I J n=-i
of the three-term recurrence relation
Xn = XTl-i + Xn_2 for n = 1,2,3,... .
We shall see this by checking that its elements Xn satisfy the relation.
We have, for n > 1
2 \ 2
and
_ , 1 - V5
Arl —
192 Chapter IV. Three-term recurrence relations
which proves the assertion. By the same method we also find that
{^n}^L-i — {( L 2 °) }r^=-i 1S a solution of the same recurrence re-
relation. A third solution is
{Fn}~=_, = A,1,2,3,5,8,13,21,34,55,89,...).
This solution is obtained by starting with F_} = Fo = 1 and using
the recurrence relation recursively: each element Fn is the sum of the
two previous ones. We recognize {Fn} to be the sequence of Fibonacci
numbers. (See Problem 1 in Chapter I.)
O
The set of all solutions {Xn}™__l of A.1.1) is called the solution space
of A.1.1). It has some nice properties:
1) {0}^L_1 is a solution and thus belongs to the solution space.
2) If {Xn} is a solution of A.1.1), then so is a{Xn} = {aXn} for
every fixed complex number a.
3) If {Xn} and {Yn} are two solutions of A.1.1), then {Xn} -{- {Fn} =
{Xn + Yn} is also a solution.
This means that the solution space is a linear space (vector space) with
0-element ^
Theorem 1 The solution space for the three-term recurrence relation
Xn = 6nXn_ i + OnXn-2 » «n, 6n G C , an ^ 0 forn = 1,2,3,...
25 a linear space of dimension 2. The canonical numerators {An} and
denominators {Bn} ofj?(an/bn) form a basis for the solution space.
Proof : We have seen that the solution space is a linear space. We
also know that a solution {Xn} is uniquely determined if X_i and Xo
are given. In particular this means that {^4n} and {Bn} are uniquely
Three-term recurrence relations 193
determined, {An} = A,0,...) and {#„} = @,1,...). An arbitrary
solution {Xn} can therefore be written as the linear combination {Xn} =
X_i ¦ {An} -f -Xo • {Bn}- Hence, the dimension of the solution space is
<2. It remains to prove that {An} and {Bn} are linearly independent.
Let Ci{i4n} -f C2{Bn} = {0} for two complex constants C[ and c2. Then,
in particular, c\A-\ + ?iB-\ — 0 and c\A^ -f c-iBq — 0; i.e. c\ — C2 = 0.
Example 2 The two solutions
n+l
I fT/ t
from Example 1 are linearly independent, and thus form a basis for the
solution space of
Xn = Xrl_, +Xn_2 for n = 1,2,3, A.1.2)
The Fibonacci sequence {Fn} can therefore be written {Fn} = c\
Ci{Xn) for some complex constants C\ and c2. To determine these con-
constants we check the equality for n = — 1 and n = 0:
F_i = C]X_i + c2y_i , that is 1 = cj 4- c2
Fu= c1Xq + c2Y(>, that is 1 = ct-L^4-c-i1^.
Solving these equations for c{ and c2 gives C\ = —A — y/b)/2\/b and
c2 = A4- Vo)/2\/5. We thus have the following closed form for the
Fibonacci numbers:
_
This formula goes by the name of Binet's formula.
O
Example 3 The canonical numerators {An} and denominators {Bn}
of the continued fraction
194 Chapter IV. Three-term recurrence relations
are solutions of the recurrence relation A.1.2). By Theorem 1 they
actually form a basis for the solution space of this recurrence relation.
However, so do also the two solutions {Xn} and {Yn} from Example
2 since they are linearly independent. Therefore there exist complex
constants Qj, ct-2, P[ and ft such that
for all n. Checking these equalities for n — — 1 and n = 0 gives that
a, = A + v/5)/2\/5, a2 = -A - y/h)/2yfe, ft = -l/\/5 and ft = l/vfc
The approximants of K(l/1) can therefore be written in closed form:
f^ _ (
An _onXn + a2Yn = 2VE \ 2 / 2V5 \^~J
B
n
J
7E\ 2 J +je{ 2
(This expression may of course be considerably simplified.) Since (A —
\/5)/2)n+1 -> 0 as n -> 00 and (A + \/5)/2)n+1 -> 00 as n -+ 00, it fol-
follows that An/Bn converges to {y/& — l)/2 which is consistent with The-
Theorem 6C in Chapter III.
O
1.2 Approximants for periodic continued fractions in closed form
The idea of Example 3 can be extended to more general periodic contin-
continued fractions. For simplicity we limit ourselves to the 1-periodic ones:
Theorem 2 Let the linear fractional transformation s(z) = a/(b -f- z)
have two distinct fixed points x and y. Then the 1-periodic continued
fraction ~K(a/b) has approximants
Three-term recurrence relations 195
Proof : Since x = a/(b-\-x) and y = a/(b-\-y), it follows that a = — xy,
b= -(x + 2/) and thus that {(-sc)M+1KJl_, and {(-jf)n+lKIL-.i are two
linearly independent solutions of the three-term recurrence relation
Xn — bXn—-\ -\- dXn _2 for n = 1,2,3,... .
Hence the canonical numerators {An} and denominators {J9n} o
can be written
An = ()()
+l ()+l, forn--1,0,1,....
In particular
_, =l=a,+a2, J?_! =0 =
o = 0 = -q^ - a2y, Bo = 1 = -
so that qx = -2//(k - y), q2 = a:/(aj - y), C\ = -l/(x - y) and f32
l/(x - y). Hence A.2.1) follows.
Remark: Note that the closed expression A.2.1) for fn can be written
1 —(x/v)n
Hence, If |ar| < |y| then lim/n = x, just as proved in Theorem 6C in
Chapter III. Note also that in Problem 5 in Chapter I you were asked
to prove Theorem 2 (by a different method, induction).
Example 4 The periodic continued fraction KC/2) has approximants
,An 3"-(-!)"
Tn Bn
since the linear fractional transformation s(z) = 3/B + z) has the two
fixed points x = 1 and y = —3. See also Example 1 in Chapter I.
O
196 Chapter IV. Three-term recurrence relations
1.3 Linear independence of two solutions
How can we easily see that two solutions of a three-term recurrence
relation are linearly independent? Intuitively one would say that {Xn}
and {Yn} ^ {0} are linearly dependent if and only if Xyv-i = ctK/V-i
and Xn = aY^ for some a ? C and JV G No- This is indeed the case.
The following theorem provides this result together with some other
useful characterizations of linear independence:
Theorem 3 Let {Xn} and {Yn} be solutions of the three-term recur-
recurrence relation
Xn = 6nXn_i + anXn-2 ? «n i=- 0 for n — 1,2,3,.... A.3.1)
Then the following statements are equivalent.
(A) {Xn} and {Yn} are linearly independent.
(B) There exists an N ? Nq such that X;vY}v-i — Yj^X^_l jd 0.
(C) xoy-i-yox_, ^o.
(D) XnYn.{ - ynXn_! ± 0 for all n ? No.
(E) If{Uu}, {Vu} is a basis for the solution space of A.3.1) and {Xn} —
n} + a2{Vn}, {Yn} = Cl{Un} + C2{Vn}, then a^-^p, ^ 0.
To prove this theorem we use the following lemma which is a generaliza-
generalization of the determinant formula, formula A.2.10) in Chapter I. It follows
by induction on n, using the recurrence relation:
Lemma 4 Let {Xn} and {Yn} be solutions of A.3.1). Then
„ - 1 O Q
Three-term recurrence relations 197
Proof of Theorem 3: The equivalence of (B), (C) and (D) follows
directly from Lemma 4.
(A) <^> (C): Let Cl{Xn} + c2{Yn} = {0}. Then
i + c2Y-i = 0 and CiXq + c2Y{i = 0 .
This is a system of linear equations in c^ and c2. It has a unique solution
if and only if its determinant X0K_i — Y§X-\ ^ 0. This unique solution
is C\ = C2 = 0.
(C) <=> (E): The equivalence follows from the identity
where UqV^ - VqXJ-\ ^ 0 by the equivalence (A) <=> (C).
1.4 The adjoint recurrence relation
The adjoint of the three-term recurrence relation
Xn = bnXn-i + OnXn-2 where an ^ 0 , for n = 1,2,3,...
is by definition the recurrence relation
Pn = bnPn+l + an+lPn+2 for n = 0,1,2,... , A.4.2)
where 60 is some (arbitrary) complex constant. (It is not essential that
n runs from n = 0 in A.4.2), but it is convenient for our purpose.)
Solutions of A.4.2) have the form {jPnj^Lo- This recurrence relation is
more natural for the hypergeometric functions:
Example 5 The confluent hypergeometric function
\z 1 z2 1 z3
(c;z) -
c(c+l)(c + 2) 3T
zn
c(c -f-1) • • • (c + n — 1) n\
198 Chapter IV. Three-term recurrence relations
where c is a fixed complex number ^ 0,-1, —2,..., is an entire function.
For such functions we have that
; z) = *(c + 1; z) + * x*(c + 2; z).
c(c + 1)
This can be seen by comparing the coefficient of zk in the power series
on both sides of the equality sign, for k = 0,1,2, Setting Pn{z) =
+ n; z) for n — 0,1,2,..., we see that
A.4.3)
That is, {\P(c -f n\ ^)}^-0 ^s a solution of this recurrence relation for
every z ? C.
O
There is a strong connection between solutions of A.4.1) and its adjoint
A.4.2):
Theorem 5
(A) {Pn}^Lo ^s o, solution of A.4-2) where all an / 0 if and only if
Po = fco^Pi + CL1P2 and {Pn+2lYjii(-aj)}™=-i is a solution of
A.4.1).
(B) {Xn)<^__l is a solution of A-4-1) if and only if {Pn}^L0 given
by Pn = Jrn_2/ll"=i1(-oj) for all n G N, Po = buPi + aYP2 is a
solution of (I.4.2).
(The empty product n?=i • • • = 1 by definition.)
Proof :
(A): Let {Pn} be a solution of A.4.2). Then
Jrn+2 — ~Z *n+l + Jri ~ bn T On
an+1
Three-term recurrence relations 199
so that
n+l n n~1
Pn+2
That is, {Pn+2 Ilj=i(-«j)} is a solution of A.4.1). The if-part follows
from the same relations.
(B): This is a simple corollary of part (A) since
n+l n+l
Xn = Pn+2 Yl [-aj) if a™1 only ^ Pn+2 = Xn/ Y[
Example 6 In Example 5 we saw that Pn(z) = *(c + n; z) is a solution
of the three-term recurrence relation A.4.3) for all z ? C. Hence
«=-1,0,1,...
is a solution of
Xn{z) = Xn_, (z) + — \ ,Xn.2(z) forn = 1,2,3,... .
(c + n — l){c + n)
(Remember that T(x + 1) = xT(x) for the gamma function T(x). By
using the Pochhammer symbol (a)n = a(a + 1) • • • (a + n — 1) =
F(a + n)/r(a), the expression for Xn(z) can be somewhat simplified.)
This illustrates that it sometimes is easier to look for solutions of the
adjoint equation.
O
200 Chapter IV. Three-term recurrence relations
1.5 Recurrence relations in a field F
Until now our recurrence relations have had coefficients ani bn from C,
and their solutions {Xn} have been sequences of complex numbers. In
some cases, for instance in the Examples 5 and 6, this is too special.
Let us look closer at Example 5: A better approach is to regard A.4.3)
as a recurrence relation in the field M of functions f(z) meromorphic
at z = 0. That is, its coefficients anz = z/((c + n)(c -f n + 1)) G M
and bn(z) = 1 6 M. Its solutions {Pn(z)} also consists of elements
from M in this case; for instance the confluent hypergeometric functions
/>„(*) = ?(c + n;z).
More rewarding is to regard A.4.3) as a recurrence relation in the field
L of formal power series J^JJLno cnzn with no 6 Z. That is, we regard
an(z) and bn(z) as elements from L ("very short" power series) and
get solutions {Pn(z)} C L, as for instance the confluent hypergeometric
series Pn[z) = \t(c + n\ z).
To cover this case, we let F denote a field which is either C or L. We
further let || • ||p be a norm in F, as for instance | • | in C (the usual
absolute value; i.e. the euclidean norm). We shall return to the norm in
L later. For the time being we just think of (F, || • ||p) as (C, | • |).
Inspired by the notation C = Cu{oo}, we write L = L U {/«} where
/oo is the equivalence class of formal power series X]^L-oo cnzU where
cn ^ 0 for arbitrary small indices n. (The reason for this choice will
become clear later.) That is, for short, F = FU
Remark: Theorems 1, 2, 3 and 5 still hold when the three-term recur-
recurrence relations are in F. We could of course have considered even more
general fields F, but we shall not need that in this exposition.
Convergence of continued fractions 201
2 Convergence of continued fractions
2.1 Pincherle's theorem
The clue to Pincherle's theorem [Pinc94] can be found in the proof of
Theorem 2 combined with the subsequent remark. The fact that the
canonical approximants of K.(an/bn) can be written
where {Xn} and {Yn} are solutions of
Xn — bnXn-\ + anXn_2 where an ^ 0 for n — 1, 2,3,... , B.1.2)
leads to convergence of {/n} if Xn/Yn —*¦ 0. And not only that, in such
a case we have that lim/n = etilfi'i- This is essentially the idea we shall
pursue in this section.
Let {Xn} and {Yn} be two linearly independent solutions of B.1.2).
Then Xn and Yn can not be zero for the same index n, by Theorem 3.
Hence Xn/Yn is well defined in F for all n.
Definition We say that {Xn} is a minimal (or subdominant) solu-
solution of the linear three-term recurrence relation B.1.2) if {Xn} is non-
trivial (i.e. ^ {0}j, and there exists a solution {YTl} of B.1.2) such that
limn —> oo Xn/Yn — 0. The solution {Yn} is then said to be dominant.
Let {Xn} be a minimal and {Yn} be a dominant solution of B.1.2) Then
all solutions c{Xn} = {cXn}, where c G F\{0}, are also minimal. All
other non-trivial solutions {Zn} are dominant since they can be written
{Zu} = cA{Xn} + c2{YTl} where cuc2 G F, c2 ^ 0.
From this we also see that Zn/Yn—* c2 ^ 0. The converse is also true;
i.e. if Zn/Yn —> c2 G F\{0}, then {Zn} and {Yn} are both dominant:
Theorem 6 Let {Yn} and {Zn} be two linearly independent solutions
of B.1.2) such that hmYn/Zn = R exists in F. Then B.1.2) has a
minimal solution.
202 Chapter IV. Three-term recurrence relations
Proof : If R = oo then {Zu} is minimal. If R ^ oo then Xn =
yn — RZn is a non-trivial solution of B.1.2) with Xn/Zn —> 0; i.e. {Xn}
is a minimal solution. ¦
Theorem 7 (Pincherle's theorem) LetK.(an/bn) be a continued frac-
fraction with elements an and bn from F and all an ^ 0. Then:
(A) K.(an/bn) converges in F if and only if the corresponding linear
three-term recurrence relation B.1.2) has a minimal solution.
(B) If B.1.2) has a minimal solution {Xn}, then K.(an/bn) converges
to -Xo/X-t G F.
(C) If B.1.2) has a minimal solution {^n}; and {Yn} 2S a dominant
solution, then the approximants fn ofK.(an/bn) satisfy
as n —> oo
fn ~ C2(Yn/Xn) as n —> oo if X-\ = 0
for some constants C\ and C2 from F\{0}.
Remark: The notation in B.1.3) is to be understood as follows:
*n ~ Cyn as n —> 00 <=> lim tn/yn = C
n —* 00
for a C 6 F\{0}. We say that tn is asymptotically equal to Cyn. If
F = C and ||*||p = | • |, the usual absolute value norm, then B.1.3)
expresses the speed of convergence of the continued fraction H.(an/bn).
Proof of Theorem 7: Let /„ = An/Bn be the nth canonical approx-
imant of ~K(an/bn).
(A): Assume first that B.1.2) has a minimal solution {A"w}. Let {Yn}
denote a dominant solution of B.1.2). Then {Xn} and {Yn} form a basis
Convergence of continued fractions 203
for the solution space of B.1.2), and there exist elements ai, ct2, ft and
ft from F such that
Y jlRV for n=-1,0,1, B.1.4)
Hence fn can be written on the form B.1.1). If ft ^ 0 it follows that
fn —> a2/ft. If ft — 0 we necessarily have that ft ^ 0 and q2 ^ 0 since
{ilw} and {Bn} are linearly independent. Hence fn —> oo if ft = 0.
That is, the continued fraction converges to o^/ft ? F.
To prove the "only if" part we first assume that K(an/^n) converges to
a value / G F. That is, limn _> ^ An/Bn — f. Since {An} and {!?„} are
two linearly independent solutions of B.1.2), the existence of a minimal
solution then follows from Theorem 6.
(B): In view of the observations above, it suffices to prove that o^/ft =
-i: Setting n = — 1 and n ~ 0 in B.1.4) gives the equations
,, b_, = o = ftx_,+fty_,,
= 0 = alX0+
which has the solution ct\ = Yy/A, ct2 = —Xo/A, ft = — K_i/A, ft =
X_j/A, where A = yoX_, - K_iX0 ^ 0, since {Xn} and {Fn} are
linearly independent.
(C): Let X_! ^ 0. Then
J Jn — a
ft j9i xn + ftyn ft (/3iXn + fti;)
a2ft -
where Xn/Yn —*¦ 0 and ft ^ 0. For the case X_i = 0, the approximants
satisfy
r _ — i--k 4- Q>2Yn YqXti — Xol^i Yq — XuYn/X
-y_,xn -y.
where X() -/ 0 and Y_i ^ 0 by virtue of Theorem 3.
n
204 Chapter IV. Three-term recurrence relations
The value —Xq/X-i is unique, since every minimal solution of B.1.2) is
proportional to {Xn}. If both a minimal solution {Xn} and a dominant
solution {Yn} are explicitly known, then the approximants /„ = An/Bn
and their limit / = —Xq/X-\ are also known, and the truncation error
estimate in part (C) is of no interest. What is often the case, however,
is that {Yn} is only known in the sense that we have an estimate for the
speed by which Xn/Yn approaches 0. Or, alternatively, that the expres-
expressions for Yn are so complicated that we prefer to use such estimates.
Example 7 In Ramanujan's second notebook we find the formula
x -\- a + 1 x -f a x -f 2 a x -f 3a
z + 1 "~a;-l+aj + a-l+aj +2a-!+¦••
for a € C\{0} and x G C\{—a, —2a, —3a,...}. The meaning of this
formula is that the continued fraction on the right side converges to the
value on the left side. Unfortunately Ramanujan very rarely indicated
how his formulas could be proved! So how could he have found and
proved his result?
Well, a clue is that {Xn} given by
Xn = (-l)n(aj + na + cH 1) for n =-1,0,1,2,...
is a solution of the three-term recurrence relation
Xn = (x -\- na — a — l)Xn_i -f (x + na)Xn-2. for n = 1,2,3,....
If {Xn} is a minimal solution, then Ramanujan's formula follows by
Pincherle's theorem. Is {Xn} minimal? The answer is yes. One can
prove (by BirkhofPs method which is described in [Wimp84]) that there
is a solution {Yn} such that the limit
Urn Yn/(n+1I ann(xM-2
n —> oo
exists in C\{0}. That is, Yn ~ C{n - 1)! a1lnxla and {Yn} is dominant.
The speed of convergence of the continued fraction is of the order
O
Convergence of continued fractions 205
Pincherle's theorem can also be stated for b{) + K.(an/bn) and the adjoint
recurrence relation
Pn = bnPn+i + anPn+2 for n = 0,1,2,... : B.1.5)
Corollary 8 Let 6q + K.(an/bn) be a continued fraction with elements
an and bn from F and all an ^ 0. Then:
(A) 60 -f K.(an/bn) converges in F if and only if the three-term recur-
recurrence relation B.1.5) has a minimal solution.
(B) If {Pn} is a minimal solution of B.1.5), then 60 -| K.(an/bn) con-
converges to Pq/P\.
(C) If {Pn} is a minimal solution of B.1.5) and {Qn} is a dominant
solution, then
{fn-Po/Pi)~ C{(Pn+2IQn+2) asn->oo ifPx ? 0,
fn ~ C2(Qn+2/Pn+2) as Tl-+ OO if jP, = 0
for some constants C\ and C2 from F\{0}.
Proof:
(A): By Pincherle's theorem we know that b0 -f K.(an/bn) converges if
and only if B.1.2) has a minimal solution. From Theorem 5A it follows
that {Pn} is a solution of B.1.5) if and only if Pq = b^Pi -f a\P2 and
\Xn}n--\ given by
n+l
Xn = Pn+2 fj(—a>j) for n =—1,0,1,... B.1.6)
is a solution of B.1.2). It follows from B.1.6) that {Pn} is a minimal
solution of B.1.5) if and only if {Xn} is a minimal solution of B.1.2).
(B): Let {Pn} be a minimal solution of B.1.5). By the arguments above
and Pincherle's theorem we then know that 60 + K.(an/bn) converges to
206 Chapter IV. Three-term recurrence relations
(C): This follows from Theorem 7C and the connection B.1.6) between
solutions of the two recurrence relations. ¦
2.2 Auric's theorem
Application of Pincherle's theorem requires knowledge of two solutions
of the corresponding recurrence relation. What can we do if we only can
find one? And how can we decide whether this is a minimal solution or
not?
Theorem 9 Let {Xn} be a solution of the three-term recurrence relation
Xn = 6nXn_i + anXn-2 where an, bri G F, an / 0 for n= 1,2,3,...
B.2.1)
with Xn ^ 0 for all n. Then {Xn} is a minimal solution of B.2.1) if
and only if
g rcu.(-«m) = ^ B 2 2)
Proof : Let {Xn} be a minimal solution. Then it follows by Pincherle's
theorem, Theorem 7, that K.(an/bn) converges to / = —Xo/X_i ^ oo
since Jf_l ^ 0. Let /„ = An/Bn be the approximants of K(an/bn) in
canonical form. Since {An} and {Bn} are linearly independent solutions
of B.2.1), it follows that {Bn} is minimal if and only if An/Bn —> oo.
We have /„ = An/Bn —> / ^ oo. Hence {Bn} is dominant and
Bn/Xn —» oo. Using Lemma 4 we find that
where we have used that I?_i = 0 and B^ = 1. This proves B.2.2).
Convergence of continued fractions 207
Assume next that B.2.2) holds. From B.2.3) it follows then that
—> oo. But this means that {Xn} is a minimal solution. ¦
This result leads directly to the following useful theorem [AuricO7]:
Theorem 10 (Auric's theorem) Let K{an/bn) be a continued frac-
fraction with elements an, bn ? F and all an ^ 0, and let {Xn} be a solution
of the corresponding recurrence relation B.2.1) such that all Xn ^ 0 and
oo .. _
B.2.4)
n=O A»-1A"
Then K.(an/bTi) converges to the finite value —Xo/X-i, and
(A + t±) - C (? n7l(flm)) « n -. oo B.2.5)
m=0
for some constant C G F\{0}.
Proof : That K{an/bn) converges to —X0/X_i is a direct consequence
of Theorem 9 and Pincherle's theorem, Theorem 7. From the proof of
Theorem 9 it follows that {#n} is a dominant solution of the recur-
recurrence relation. Hence, the order of the speed of convergence follows by
Theorem 7C and B.2.3). ¦
Example 8 The continued fraction
18 40
converges by virtue of the parabola theorem, Theorem 20 in Chapter
III. However, since {Xn} given by
n
Xn = Y[(-2k-3) for n= -1,0,1...
A:=0
is a solution of the corresponding three-term recurrence relation
Xn = Xn-i + Dn2 + 10n + 4)Xn_2 for n = 1, 2, 3,... ,
208 Chapter IV. Three-term recurrence relations
we can also use Auric's theorem to study K.(an/l). We have
nti B« + 3) nr=o
OO
^— TT
Pc -Bm + 4)Bm+ 1)
n=(i m= I
Bm + IJ
(-1)" 1^1
^-f, 2n + 3 i-L, 2m + I
g^ (-l)n A 2m+4
ii 2m + 3
This series diverges to oo G C since its partial sums Sk satisfy
l V* / TT 2to + 4 'tt1 HrnjM
_, - " 3 2. | 11 5^^ - 11 2^^
+ OO
3 | 5^^
n=0 ^m=l m=l
3 ^ 4n + 5 1J-
3 ^ An + 5 AJ\ 2m + 3
n=0 m=l
and
1 1 J^ f2!pj.' 2m+4 i^ 2m + 4l
~ ~3 + 3 ^-J 1 11 2m+3 ~ 11 2m+ 3 J
n=l Vrn=l m=l -^
* ' '2n-12m + 4
3 3 ^ 4n + 3 ±J-, 2m + 3
— CXD
Hence K(ari/l) converges to —Xq/X-\ — 3.
O
Corresponding results for the adjoint recurrence relation
Pn = bnPn+i + an+[Pn+2 where an, btl ? C, an ^0 for n = 1, 2,3,...
B.2.6)
are obtained by use of Theorem 5A. The analogue to Auric's theorem,
Theorem 10 is
Tail sequences once more 209
Corollary 11 Let bo + K(«n/6n) be a continued fraction with elements
an->bn G F and all an ^ 0, and let {Pn} be a solution of the corresponding
recurrence relation B.2.6) such that all Pn ^ 0 and
oo
n=l
bo + K(an/^n) converges to the finite value Po/Pi and
II (-am) as n -> oo B.2.8)
=l m=l
/or some constant C G F\{0}.
Proof : Let Xn = Pn+2 ]l"^i (~aj) for n = -1,0,1,.... Then {Xn}
is a solution of B.2.1) by Theorem 5A. Moreover
B.2.9)
Hence K(an/^n) converges to —Xq/X-\ by Theorem 10 and
to + K{an/bn) converges to
Po
The estimate B.2.8) follows from combining B.2.5) and B.2.9).
3 Tail sequences once more
3.1 Connection to recurrence relations
Let JH(an/bn) be a continued fraction with elements aTl and 6n from
(F, || • ||) with an ^ 0 for all n. Recall that {in}JJLo ls a *a2'' sequence for
K(on/6n) if
tn-i=an/(bn + tn) forn = 1,2,3,... C.1.1)
210 Chapter IV. Three-term recurrence relations
with the usual interpretation if tn_^ = 0 or oo. It is called a right tail
sequence if K(ari/bn) converges to to-
Let {Xn} be a non-trivial solution of the corresponding three-term re-
recurrence relation
Xn — bnXn-\ -f anXri_2 for n = 1, 2,3,... . C.1.2)
If Xn-\ ^ 0, then we can divide this relation by Xn-\. Rearranging its
terms gives us
"~L °n — for n = 1,2,3, C.1.3)
bn — Xn
This equation is also valid (with the usual interpretation) if Xn-i = 0.
What happens if Xn_i = 0? First we note that since {Xn} is non-
trivial and all an ^ 0, there are no two consecutive Xn which are both
0. Hence Xn/Xn_i is always well defined in F for all n. Furthermore,
the left side will be 0 and the right side an/oo = 0. So, C.1.3) holds
without exceptions when {Xn} is non-trivial and all an ^ 0. Comparison
of C.1.1) and C.1.3) shows us that {—X7l/Xn_i}J?i0 is a tail sequence
for 6q + K.(an/bn). It is a right tail sequence if K.(an/bn) converges to
-i] i.e. if {Xn} is a minimal solution of C.1.2).
We say that {^n}^=o 1S a sequence of Perron-tails for 6() + K.(an/bn) if
Tn_, = 6n_i + ^r for 72 = 1,2,3,.... C.1.4)
n
Also here we allow Tn = cxd. We say that {Tn} is a sequence of right
Perron-tails if 60 + K(«n/^n) converges to To.
As earlier, let us assume that all an ^ 0. Then {Tn} is connected with
the three-term recurrence relation
Pn = bnPn+i + an+i Pn+2 for n = 0,1,2,... C.1.5)
in the following way. Let {Pn} be a non-trivial solution of C.1.5). Then,
as before, Pn/Pn+\ is well defined in F for all n, and reaxranging C.1.5)
shows that {Pn/Pn+\ }J^0 is a sequence of Perron-tails for bo-\-K.(an/bn).
It is a sequence of right Perron-tails if {Pri} is a minimal solution of
C.1.5) so that 60 -f K(ct«/^n) converges to Po/P\.
Tail sequences once more 211
3.2 Minimal solutions and value sets
So far we have seen two methods to determine whether a solution {Xn}
of the recurrence relation
Xn = 6nXn_! + OnXn-2 , an, bn e C , an ^ 0 for n ? N C.2.1)
is minimal or not. Either by comparing {Xn} to another solution {Yn}
to see if Xn/Yn —* 0, or by using Theorem 9. Here comes a third one
for the special case when (F, || • ||) = (C, | • |). It is often easy to apply if
we already know that K.(an/bn) converges generally, and if we know a
sequence {Vri} of value sets for K.(an/bn) with Vo ^ C. (.4 denotes the
closure of a set A in C.) So assume that this is so.
We plan to use Theorem 9 from Chapter 111, which essentially says that
if Vn is not "to small" for large n, and wTl ? Vn has a positive distance
to the boundary dVn of VTn uniformly with respect to n, then
lim Sn(u;n)->/@) and /(n) G KM for all rc, C.2.2)
n —r co
where /(") is the value of the nth tail of K(an/bn). This leads to the
following strategy: Form the tail sequence tn = —XnjXn-\. Tf tn (?
Vn for arbitrary large n, then {Xn} is dominant. If tn ? Vn for all
n and has a positive distance to the boundary dVn (measured in the
chordal metric), uniformly with respect to n, then {Xn} is minimal
since Sn(tn) = t{) —> f({)). Let us see how this works in an example:
Example 9 We want to prove that the solution
x-=n (z - ttt'
V 4 ? -4- 1
7=0 V J ^
of the recurrence relation
Xn = B - ——r ) Xn-i + ( -^ + — J Xn_2 for n = 1,2,3,...
\ 71+1/ \lo 71/
is a minimal solution. The corresponding continued fraction K.(an/bn)
satisfies
2-
16 + n
+ 1
212 Chapter IV. Three-term recurrence relations
from some n on. Hence we know from the Sleszyriski-Pringsheim the-
theorem (Theorem 1 in Chapter I) that K.(an/bn) converges and that the
unit disk V = {w ? C; \w\ < 1} is a value set for some tail of K.(an/bn).
We form the tail sequence
in = ~Xnj X +
4 71+1
Since tn ? V, bounded away from the boundary dV from some n on, it
follows that {Xn} is minimal.
O
In some cases we know in addition that {5n(V"n)} converges to a one-
point set (the limit point case). Then {Xn} is minimal if tn ? Vn from
some n on (and thus for all 71). Take for instance the parabola theorem,
Theorem 20 in Chapter III:
If all an are contained in the parabolic region Pa for a fixed angle a and
XX71!61"!1) = ao, then Sn(Vn) converges to a one-point set, where Va is
the halfplane which is the value set for Pa. In other words, if {Xn}<^>__l
is a solution of
Xn = Xn- 1 + OnXn-2 for n = 1, 2, 3, ...
where all an ? Pa and J2{n\an\)~l = 00, and tn — —Xn/Xn-i ? Va for
all n, then {Xn} is minimal.
Example 10 {Xn}^__l where X-m-\ — —1> ^2n = n + 1 is a solution
of
Xn = Xn-1 + anXn-2 for n — 1, 2,3,...
where a^n — 1 + 2/n and atn+i — " + 2 for all n. Since an ? Pa for all n,
^(nlanl) = oo and tn = —Xn/Xn-\ > 0 (and thus ? Vb), it follows
that {A*n} is minimal.
O
3.3 Tails and convergence
The connection between tails and solutions of the corresponding recur-
recurrence relations makes it possible to state Pincherle's and Auric's theo-
Tail sequences once more 213
rems in terms of tails. The first result is based on Pincherle's theorem,
Theorem 7:
Theorem 12 Let {tn} and {un} be two tail sequences with finite ele-
elements for the continued fraction K.(an/bn), where ty ^ Wq- Then:
(A) K{an/bn) converges if and only if the limit
n
lim Rn where Rn = TT
> rvi JL -A.
n —> oo
exists in F.
(B) IfllmRn — R, then K.(an/bn) converges to
_ t0 - RuD
1-R
with the usual interpretations if R — 1 or R = oo; and its speed of
convergence is given by
f — fn~ C\ (Rn — R) as n—* oo ifR^ oo, 1,
f — fn~C2/Rn asn—* oo ifR = oo,
fn ~ C[}/(Rn — 1) as n—* oo if R = 1 and thus / = oo
for some constants C[, C2 and C$ from F \ {0}.
Proof : We first observe that since all tn ^ oo and un ^ cxd, we
also have all tn ^ 0 and un ^ 0 since tn = an+\/(bn+i + ^n+i) and
un = art+i/Fn^i -\-un+\). Hence, Rn is well defined. Next we know that
there exist solutions {Xn} and {Yn} of the recurrence relation C.1.2)
such that tn = —Xn/Xn-\ and un — —Yn/Yn-\ for all n. Clearly, all
Xn and Yn are non-zero. We have
Xn
X/X X
T
Moreover, {Xn} and {Fn} are linearly independent since ?q ^ uo-
214 Chapter IV. Three-term recurrence relations
(A): This part follows directly from C.3.1), Theorem 6 and Pincherle's
theorem, Theorem 7.
(B): If R = oo7 then {Yn} is minimal and I?(an/bn) converges to / =
-yo/y-i = w0 by Theorem 7B. Otherwise, {Xn - YnRX-i/Y^i} is a
minimal solution, and K.[anfbn) converges to
Xp- YqRX-JY- ! = t0 - Ru0
i l-R
Further it follows from Theorem 7C that as n —> oo
if {Yn} is minimal; i.e. R = oo ,
C'2(Xn/Yn) = C2Rn
if {Xn} is minimal; i.e. R = 0 ,
R
- —
- Cs{Rn-R) ifJ2^0,oo,l
and
C,
if R = 1 and thus / = oo .
Remarks:
1. A similar result is also valid for Perron-tails {Tn} and {Un} of
bu + K(an/fcn)? since Tr, ^ oo can be written Tn = bn + *n and
JJn y? oo can be written ?/„ = bn -f- wn where {^r,} and {un} are as
in Theorem 12. Hence the value / of 6o + K(an/6n) is given by
l-R
Tail sequences once more 215
and
" = 11 ^ = 11 ^ 11
k=i
k+i + Wit
2. In Problem 12 you are asked to prove that
and
where R = ]imRn and J?n is as in Theorem 12.
Example 11 Let a < b and flf > 0 be arbitrary real constants, such that
a + nd -fi 0,6+rcflf^ 0 for all nonnegative integers n. We shall prove
that the continued fraction
ab (a+ d)(b + d) (a + 2c?)F + 2d)
+ b
+ + d- a + + + +
converges to 6. We find that {Tn} and {Un} given by
To = 6, Uo = a, Tn-b-\-nd-d, Un = a + nd-d for 72 = 1,2,3,
are Perron-tails for this continued fraction. Since
l
the continued fraction converges to
O
If we only know one sequence of tails for 60 + K(an/^n), w^ are in a
situation where Auric's theorem can be used. For tails it takes the form:
216 Chapter IV. Three-term recurrence relations
Theorem 13 Let {tn} with all tn ^ oo; be a tail sequence for the con-
continued fraction K.(an/bn) such that aflan ^L 0. Then K(an/bn) converges
if and only if the limit
n k i , ,
uiii J^ tnhcro Tf — X lit-- rfnA *r • — —— —
n —> oo
, where Rn = J^ TJ kj and kj = —
k=oj=i ~li
exists in F. If Jimlin = R ? F; then K.(an/bn) converges to
f = to(l - IIR)
with the usual interpretation if R — 0 or R — oo; and
f~fn= tO{l/Rn-l/R)
fn = to(l — 1/Rn) if R = 0 and i/iws / = oo .
This was proved in [Waad84]. To prove Theorem 13 we shall use the
formulas
n
C.3.2)
n
JJ(-ij) for n = -1,0,1,2, C.3.3)
3=0
ft
f+tj) • ft (-*i
for n = -1,0,1,2, C.3.4)
for the canonical numerators An and denominators Bn of I?(an/bn) when
{?„} is a tail sequence with finite elements. These formulas can be proved
by straight forward induction on n. (See also Problem 3 in Chapter II.)
Similar expressions for An + An_itn and for An are easy to derive since
An = a\B\t_x where {Bk } are the canonical denominators of
K(aI+i/6n+1)= — — —
Tail sequences once more 217
Proof of Theorem 13: By using C.3.3)-C.3.4) we find that
Ant An - Bnt0
This proves the assertions.
Remark: The corresponding result for Perron-tails {Tn} follows im-
immediately by using the connection Tn = bn + tn. In particular we then
have
Example 12 The continued fraction
x—ai+K = aj—aH for aj ^ 0 , an ^ 0 ,
is one of the many continued fractions studied by Ramanujan. We find
that tn = an+i is a sequence of tails. Hence, the continued fraction
converges if and only if
that is, if and only if
for some J? G F. For instance, if {ay} is bounded, real and alternating in
sign, then R = oo and the continued fraction converges to a? — a\ -\-t^ — x
for all x > lim sup |aj|.
O
218 Chapter IV. Three-term recurrence relations
4 An application to linear recurrence relations
Forward stability of recurrence relations
Let us consider the three-term recurrence relation
Xn = 6nXn_i -f anXn_2 ; an, bn 6 C,
a
n
0 for n = 1, 2,3,...,
which we assume has a minimal solution {Xn}. Assume further that the
two first elements of this solution, X_i and Xo, are known, and that we
want to compute Xn for n > 1. A simple method seems to be to use the
recurrence relation directly, and compute Xj, X2,... recursively.
However, this method does not work in practice. The computation is
unstable; i.e. roundoff errors "blow up" when we deal with minimal
solutions. This can be seen by the following argument. Assume that
all the computations we are doing are totally accurate, and that Xo is
given with its exact value. But in X_i we have a small roundoff error,
such that we begin with the values X_i = X_! + e and Xo = Xo. Then
we are getting a sequence {Xn} which is no longer minimal, since it is
not proportional to {Xn}. If it is not minimal (and not trivial), it has
to be dominant; i.e. Xn/Xn —> 0). The relative error for our values Xn
(after our exact computations) will therefore blow up:
Xn — Xn
Xn
00
We need another method to compute {Xn}. For dominant solutions this
forward computation is in general stable. Under the same assumptions
as above, only with {Xn} dominant instead of minimal, we have the
relative error
Xn — Xn
X
n
Xn - aXn -
Xn
where {Xn} denotes a minimal solution and Xn = aXn + pXn.
Example 13 The three-term recurrence relation
Xn = Xn_! -}- Xn_2 for 71 = 1,2,3,...
D.1.1)
An application to linear recurrence relations
219
has the minimal solution {(A - V/5)/2)n+1}~=_1. (See Example 1.)
The following table shows the actual value of Xn = (A - \/5)/2)n+L,
and what we get when we use the recurrence relation to compute {Xn}.
The computation is done with 4 decimals precision.
n
-1
0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
*» = ((l-V5)/2)»+I
1.0000
-0.6180
0.3820
-0.2361
0.1459
-0.0902
0.0557
-0.0344
0.0213
-0.0132
0.0081
-0.0050
0.0031
-0.0019
0.0012
-0.0007
0.0005
-0.0003
0.0002
-0.0001
0.0001
-0.0000
Xn — Xn-i 4- Xn-2
0.3820
-0.2360
0.1460
-0.0900
0.0560
-0.0340
0.0220
-0.0120
0.0100
-0.0020
0.0080
0.0060
0.0140
0.0200
0.0340
0.0540
0.0880
0.1420
0.2300
0.3720
The value of Xn keeps on decreasing in absolute value, whereas the value
obtained by use of the recurrence relation, will stay positive and keep
on increasing, "faster and faster", as n increases from n — 12 on.
O
What one therefore can do is to compute two linearly independent dom-
dominant solutions {Yn} and {Zn} and then find the minimal solution as a
linear combination of these. This requires that we know, say, X_i and
Xq.
220 Chapter IV. Three-term recurrence relations
4-2 A method for computing minimal solutions
We want to compute the first terms of a minimal solution {Xn} of the
three-term recurrence relation.
Xn = bnXn-\ + anXn-2 ; an,bneC, ari^0, for n — 1,2,3,
D.2.1)
(We assume that such a solution exists.) Strictly speaking, {Xn} is
uniquely determined if we choose one of its elements Xn, say X_i. But
how can we find this solution? Pincherle's theorem, Theorem 7, tells us
that the continued fraction K.(an/bn) connected with D.2.1) converges
to —Xu/X^\. Similarly, using Pincherle's theorem on the iVth tail
K (aN+n/bN+n) = D.2.2)
n=\
of H(an/bn) shows that this tail converges to —X/v/X^v-i- We can
therefore compute Xn (approximately) by the following method:
1) Compute approximants f\, ' or 5}/ \w^+k) of D.2.2) for suffi-
sufficiently large k, for N = 0,1,2,..., n.
2) Set -Xyv/JT/v-L « f(kN) (or S{kN\wN+k)).
3) Compute Xn from the relation
n
Xn = X-i JJ X/v/X/v_i
/v=o
where X_i is given.
This method was suggested by Gautschi, [Gaut67].
Example 14 To see how this turns out in practice, let us return to the
problem in Example 13, where the minimal solution {Xn} is actually
known in advance. The continued fraction corresponding to D.1.1) is
An application to linear recurrence relations 221
Using its 10th approximant fw to approximate its value, we get
55
/ = -Xo/X_, « fw = — « 0.61798.
Indeed, we also have
c r
/ /vr\ i A71 OO
/ = ~-&NlA-N—1 ~ /in — /lO — ~7T ~ U.01 iyo.
89
Choosing X_i =1 we thus get that
^ | « (-0.61798)"+1,
which is very close to the exact value ((l-\/5)/2)n+1) » (-0.61803)n+1.
Of course, using the exact value / = /W = A — Vo)/2 for the value of
K(l/1) leads to the exact values for Xn.
O
Minimal solutions {Pn} of the adjoint recurrence relation
Pn = bnPn+i + an+\Pn+2 for n = 0,1,2,... ,
can be computed similarly, since then
— on + ^ ^ tor iV = 0,1,2,.
Example 15 We want to compute the integral
/•OO g-u^a-l
/(a,b\x)= I rrdu where b ? No and a > 0,a? > 0
Jo A + ajw)D
For 6 = 0 it reduces to the gamma function
/•OO
To get a value if 6 > 0 we observe that
I(a,b;x)= J
OO
e-uua-|(l +
a V
222 Chapter IV. Three-term recurrence relations
Further, by integration by part, we get (when we first integrate ua~l)
that
'(«.»+M*)=,,11xt|)HIjo
1 f°° -e-uua(l + xu)b+l - e~uua(b + lWl + xu)b ,
~~ ¦ Tr,—yIm du
a Jo A + xu)*°~t~z
= -/(a + l,6+l;aj) + —tiaj7(a+1,6 + 2; x).
( , ; )
a a
Hence, {Pn(x)} given by
P-2n{x) = 7(a+n,6 + n;aj), P2n+l(x) = I(a + n,b + n + l;x),
satisfies the linear three-term recurrence relation
Pn(x) = 6nPn+i (») + an+ixPn+2(x) for n = 0,1,2,... ,
where
h 1 a 1 &+rc t
»2n = A > &2n+l = j > «2n = ; T , «2n+l = 1 •
a-\- n a + n — 1
If a > 6 > 0, one idea could therefore be to compute /(a, 6; x) by means
of this recurrence relation, beginning with I(a—b, 0; x) = F(a —6). There
are however two problems connected with this procedure:
1) We do not know the value of I (a — 6,1; x).
2) {Prl(x)} is a minimal solution of the recurrence relation, so the
calculation is unstable. (This follows from Theorem 13.)
The continued fraction technique takes care of both these problems. We
get
JV
+ . ^r , , + 1 +•
a + N ' a + iV
(b + N + l)x (a + iV+L)aj F + N + 2)x
1 + I + 1 + 1 +¦••'
An application to linear recurrence relations 223
and
_
I{a + N + l,b+N I¦ 1;a?) a + TV + 1 + }T , , +
a + iv + 1
1 + 1 +
So let us for instance find an approximate value for
du.
+ 2)aj 1
1 +¦•./
We find that
JE,3;l)
-l
where /"C,0; 1) = TC) = 2! =2,
3 + A^ JV + 1 4 + iV JV + 2 5 + ^ JV + 3
1 + 1 + 1 + 1 + 1 + 1 -f
for N = 0,1,2 and
1 f 7V + 1 4 + N iV + 2 5 + iV JV + 3 1
3 + iVl + l + i + i + i + 1 +.-.J
for JV = 0,1. Using the 20th approximants of these continued fractions
to approximate their values gives
7E,3; 1) « 2{3.3534 • 0.42483 • 3.6410 • 0.37841 • 3.8888} « 0.26202 .
However, here we could have saved some work by using several different
continued fractions instead of several tails of the same continued fraction.
We have
224
Chapter IV. Three-term recurrence relations
where
IE,JV;1) 5 JV
= l -+- — . —
6 AT+ 2 7
/E,iV+l;l) -+1 +1+ 1 +1+ 1 +.
Using the 20th approximants for N ~ 0,1 and 2 gives us
7E,3; 1) « {3.8888 • 4.5126 • 5.2212}"l - 4! « 0.26194 .
5 Some generalizations of continued fractions
5.1 Introduction
We have seen that a continued fraction J<i(an/bn) is closely related to
the three-term recurrence relation
Xn = 6nX,,_i -f anXn-2 for n = 1,2,3,... .
E.1.1)
We have:
(i) {tn} is a tail sequence for Vi[an/bn) if and only if there exists a
nontrivial solution {XTl} of E.1.1) such that tn = — Xn/Xn-i for
all n.
(ii) The approximants of K.(an/bn) can be written on the form /„ =
An/Bn, where {An} and {Bn} are solutions of E.1.1) with initial
values
1 0
0 1
Can we get something similar if we have a "longer" recurrence relation,
say four-term? Or more generally, an (N + l)-term linear recurrence
relation
Ar
k=0
^XT^k = 0 , where a^+^alP ? 0 , for n = 1,2,3,... .
E.1.2)
Some generalizations of continued fractions 225
For N > 2 we can not combine (i) and (ii). We have to settle for one or
the other.
5.2 G-continued fractions
Let us first introduce a generalized form of continued fractions which
is connected with E.1.2) in the sense of (i). The idea is to extend
Gautschi's continued fraction method for computing minimal solutions
to this longer recurrence relation. Let all aj, ' = 1. For the case
N = 2 with afi = —an-, o,n = — &m tne recurrence relation takes the
familiar form
Xn = 6nXr,_! + ariXri_2 where an ^ 0 , for ra = 1,2,3,
For a non-trivial solution {Xn} we thus have that
a
n
Xn-2 bn — Xn/XTl-\
= 8n(-Xn/Xn-i) for n= 1,2,3,...,
where sn(w) is the linear fractional transformation sn[w) =
Gnlipn + w)- So, {— Xn/Xn-i} is a tail sequence for the continued
fraction K.(an/bn). In analogy with this, we have for JV > 0,
Xn + aWXn-L + a!iN-l)Xn-2 + • • • + a^Xn.N = 0 E.2.1)
where a;t ^ 0. Let {Xn} be a solution with all Xn ^ 0. Dividing E.2.1)
by Xn_/v+i leads to
an
B) , r:
= a), > + al
"n ' ~n — ' a
W —1 v
. TT An
+ ¦ • • + 11 —^
such that
where ak ' = 1.
226 Chapter IV. Three-term recurrence relations
Let us introduce the transformations sn and Sn from C^ into C given
by
_a@
sn(wu...,wN_l) =
Si(wu...,wN-\) =
and
for n = 2,3,4, Then E.2.2) can be written
_
\X X
and
c
^J '
f
\
52 I v i • • • > v- I i v- j • • • ? v-
3 ^n \
> • • * 1 v" /
2 ^n-\ /
A_/v+n+l
Following the idea of Levrie and Piessens [LePi87] we define the G-
continued fraction (G in honor of W. Gautschi) Kg(—Qn /an ;...; aj, ')
by its approximations fn = 5n@,0,.. .,0). {X_Ar+n+2/^-7V+n+i}S'Lo
acts as a tail sequence for Kr;(—al /aL ,...;a^ ). Note that for TV = 2
we have Kc(-fln /i ) defined by
_a0)
^n(w) = n— for 7i = 1,2,3,...
a!, + w
and
@ @ @
-a\ J -a2 ; -a>n
Some generalizations of continued fractions
227
whereas the classical continued fraction K(—aii / — a!i ) has approxi-
approximations Sn[w) given by
n(w) -
it;
and
Sn(w) =
a
(L)
0)
-a{2) + -a
B)
2
a
<2)
-«$?>
— w
That is, 5M(«;) = — Sn(—w) and {Xn/X^i-ij^u is a tail sequence for
Kc(-fln /fln ) if and only if {—Xn/Xn-\}^LU is a tail sequence for
K(—a>n I ~ a\i )• This change in sign is also reflected in the theorem
below. We say that the G-continued fraction converges and has the value
/, if / = lim/n exists in C. For such continued fractions we have the
following theorem of "Pincherle"-type due to Zahar [Zahar68]:
Theorem 14 The G-continued fraction K(—a\t 7<zn ';...; ah *)
converges if and only if the solution space of E.2.1) has a basis
such that
lim -t2-= 0 /or» = l,2,...,iV-l,
n —> 0 A,,
E.2.3)
where
-@ \r(iV-1
n ' ' " -A n
X
(N-l)
n-N+2
and An is the determinant we obtain by replacing column number i in
Am by the column (Xn ,.. .,Xn_;Y+2J .
If E.2.3) holds, then the G-continued fraction has the value
228
Chapter IV. Three-term recurrence relations
We shall not give the proof of this theorem, but rather indicate an appli-
application. The forward computation of a solution {Xn ^} of E.2.1), which
satisfies E.2.3), is unstable. Hence, we prefer to compute approximants
t of the G-continued fractions
E.2.4)
<?Li •¦¦•,<?$») form = 0,1,2,....
Since E.2.4) converges to
we can then use
m
in
y{N) TT Aj)
A-N+\ 11 h
j=Q
for some suitable k 6 N. The computation of /? for given m and k
can be done recursively by the formulas
(m+fc-l) _
^f
@
f(
In
(m+k-n) _
l) f {m+k-n+N-2)
j Jri_/V+2 1
,(m+fe-rt+l)x for n - 2 *?
/n_, ^ ior n — z, o,.
., k ,
where fj 3' — 0 for jf > 0. (For N = 2 this corresponds to using
the backwards recurrence algorithm to compute K(—Q>n / — a>n )•) To
improve these approximations one can also use modified approximants
OI Jn — On
5.3 Generalized (or vector valued) continued fractions
A generalized continued fraction of dimension N — 1, is given by its
approximants
/n =
Jn
L/A
Some generalizations of continued fractions
229
where {An } and {Bn} are solutions of the linear (iV + 1)-term recurrence
relation
Xn = bnXn^ + 4y
_2 + • •. +
for n = 1,2,3,
where all <!„ ^ 0, with initial values
•• A
(/V-l)
o
1 ••• 0 0
¦ . • ¦
• * . .
. • . .
0 ••• 1 0
0 ••• 0 1
Hence, this type of continued fraction is connected with this longer re-
recurrence relation, in the sense of (ii) in Subsection 5.1. Its approximants
are (N — 1)-dimensional vectors. We see that a generalized continued
fraction belonging to a given (N + l)-term recurrence relation may fail
to exist. (We may get An = Bn = 0 for some natural numbers n and
1 < i < N — 1.) This can be counteracted by using modified approxi-
approximants
Sn(w) =
where
WN-[) =
An° +
-i + . . . +
B
n
Such generalized continued fractions are used for simultaneous approx-
approximation of (iV — 1) functions, when we require that the approximants
are rational functions with common denominators. They are written
) a{N~l)-b ^
230 Chapter IV. Three-term recurrence relations
Problems
A) Show that the TchebychefT polynomials
[n/2] ( r> \
W = E ( 2 J *n~2k(*2 -1)" far n = 0,1,2,...
of the first kind satisfy the three-term recurrence relation
Tn(x) = 2xTn.l(x) - Tn-2{x) for n = 2,3,4,....
(Here [p] denotes the largest integer < p. Hint: Use that Tn(x) =
cos n6 where x = cos#.)
B) Show that {r(z + 7i)}?_0 is a solution of the three-term recurrence
relation
Pn(z) = ~1—;—vT~pn+i(z)- ~,—;—r^Pn+2{z)ioTn = 0,1,2,...,
where z e C\{-1, -2, -3,...}.
C) Show that the integrals
/•oo
/„(»)== / c"xltanhn*d«
Jo
for x > 0 and n (E N, satisfy the recurrence relation
x
= --In(x) + /n_!(a;) for n > 1.
n
n
(Hint: Use integration by part.) Prove further that they satisfy
X ^ , x 71 Tt _. , •>. ~ _
71 > 2.
D) The incomplete gamma function is given by
1 fT
G{atx) = —— / e'lta~xdt, where a > l,sc > 0.
Prove that
Xn(x) = G(a + n, jc) for n = 0,1,2,...
is a solution of the three-term recurrence relation
Xn+l(x) — I 1 + —¦— ) Xn{x) ¦—Xn-\(x)io\:n = 1,2,3,...
1 a -f- 71/ a + n
Problems 231
E) The function
1 r°° e~uuP-1 , n ^
is well denned for x > 0. Show that {Pn(aj)}^0, where
i = /(a + rz,/3 + rc + 1
is a solution of the recurrence relation
Pn{x) = Pn+i{x) + an+lxPn+2{x) for n - 0,1,2,... ,
where
ln P + , 2n+ 1 = a + 71 .
F) Let
where (g)it = FljLiC1 ~ 9j)- Show that {G(zqn)}™=u is a solution
of the three-term recurrence relation
Pn(z) = Pn+i{z) + qn+l zPn+2(z) forn = 0,1,2,... .
G) Show that
f°° xi , . ! 1-2 2-3 3-4
Jo x+ x + x + x -)
for x > 0. (Hint: Use the result from Problem 3.)
(8) Show that the ratio G(a-\-1, x)/G(a, sc), where G(a, x) denotes the
incomplete gamma function
1 fx
G(a,x)= ---r e~lta~^dt for a > 1, x > 0 ,
l\a)Jo
has a T-fraction expansion of the form
G{a + l,x) oo f Fnx \ -1
—~ r^ = - K I — , where Fn = —— .
G(a,x) n=i\l-FnxJ n + a
(Hint: Use the result from Problem 4.)
232 Chapter IV. Three-term recurrence relations
(9) Show that
by use of the result from Problem 2.
A0) Assume that the continued fraction
a | -f h a\ a-2 + h ai
1 +6+ 1 +6-1
converges. Prove that then
cl\ d\ -f- h q>2 CL'i -\- h
1 4 b -f- 1 + 6 +¦ •.
converges to the same value if it converges.
A1) Show that the recurrence relation
Xn = 3Xn_i — 2Xn_2 for n = 0,1,2,...
has the general solution
Xn = C\ + C2 • 2"
where C\ and Ci are arbitrary complex constants.
What can therefore be said about the convergence/divergence of
the 1-periodic continued fraction K(—2/3)?
A2) Let {tn} and {un} be two different tail sequences for the continued
fraction ~K(an/bn) with all an / 0, tn ^ oo and un ^ oo. Let
further the limit
n
R = lim Rn where Rn = TT
n —> cxa ¦*¦¦*¦
k=Q
exist in C, so that ~K(an/bn) converges to
t0 - RuQ
f =
l-R
Problems 233
by Theorem 12. Prove that
/ -to~
Jn —
and
r _ r _ / . N R- Rn
J Tn
A - R)(l - R
A3) Let {tn} be a tail sequence for the continued fraction K.(an/bn)
with all an ^ 0 and tn ^/- oo. Let further uq E C be arbitrarily
chosen. Prove that then {un}, where
un =
and «
/fc=rj=l
is also a tail sequence for K{an/bn).
A4) We want to solve the differential equation
by Frobenius' method. That is, we try to find a formal power series
solution Yl Cktk- Substituting y[t) = Yl Cktk into the equation gives
oo
oo . oo
Jfc=O A:=0
(a) Find the recurrence relation for {c/t} by matching the coeffi-
coefficients for tk for every fc E No.
(b) Show that this recurrence relation has a minimal solution.
(c) Explain why the minimal solution is of particular interest for
us, in this situation.
(d) How do you propose to find Co, C\,..., cyv if {ck} iS a minimal
solution?
234 Chapter IV. Three-term recurrence relations
A5) In each of the following cases, try to find a tail sequence for the
given continued fraction, and use this to find its value.
n—l
OO
3i 2a;+ 1 •
/j\ ?? 1 u l (n + aJ + (n + a-1) , . ,
(d) K r1- where 0n = ^ w 4- a 4- 1 ' a a 1S a complex
constant ^ —1, —2, —3, ....
Remarks
Remarks
235
1. Linear recurrence relations are closely related to linear difference
equations. Written on the form
A(n)A2Yn + B(n)AYn + C(n)Yn = 0 ,
where AYn = Yn+1 -Yn and A2Yn = A(AYn) = Yn+2 -2Yn+l +Yn,
it is often called a linear, homogenous difference equation of order
2.
Linear recurrence relations may also be given on a matrix form
Yn = A(n)Yn_! ,
where A(n) is an (N X N)- matrix and Yn =
The three-term recurrence relation
Yn = bnYn-l +a»Y1l-2 for n= 1,2,3,
may for instance be written
that is
r v(i^ i
* n
VB)
—
0 1
an bn
r vA) i
yB)
For more information on recurrence relations we refer to [Batc27],
, [Wimp84].
2. Pincherle's theorem was proved already in 1894, [Pinc94]. In their
book, [JoTh80], Jones and Thron presented a generalized version.
This version is the basis for our presentation here.
3. The Indian mathematican Ramanujan left an overwhelming her-
heritage of deep, interesting and useful formulas at his far too early
death in 1920. Among these were more than 50 results on con-
continued fractions, of complex and astonishing character, [Rama57].
Unfortunately, most of these results were left without proofs.
Hence, mathematicians are still working to prove them. We refer
in particular to Berndt, [ABBW85], [ABJL92] and [BeLW85].
236 Chapter IV. Three-term recurrence relations
4. Generalized continued fractions were introduced by de Bruin,
[Bruin74], [Bruin78]. They are based on the Jacobi-Perron algo-
algorithm, [PerrO7]. Independently of this, Graves-Morris introduced
the vector valued continued fractions, [Grav83], [Grav84] based on
a work of Wynn, [Wynn63]. These structures turn out to be the
same.
References
[ABBW85] C. Adiga, B. C. Berndt, S. Bhargava and G. N. Wat-
Watson, Chapter 16 of Ramanujan's Second Notebook- "Theta-
functions and q-series", Memoirs Amer. Math. Soc,
Vol. 53, No 315 A985), 1-85.
[ABJL92]
[Batc27]
[Bern89]
[BeLW85]
G. E. Andrews, B. C. Berndt, L. Jacobsen and R. L. Lam-
phere, The Continued Fractions Found in the Unorganized
Portions of Ramanujan's Notebooks (to appear in Mem.
Amer. Math. Soc. 1992). ^/> ^U V I y9 d
[AuricO7] A. Auric, Recherches sur les fractions continues algebri-
ques, J. Math. Pures et App. F) 3 A907), 105-206.
P. M. Batchelder, "An Introduction to Linear Difference
Equations", Cambridge, Mass. A927), Dover Publications,
Inc., New York A927).
B. C. Berndt, "Ramanujan's Notebook. Part II", Springer-
Verlag A989).
B. C. Berndt, R. L. Lamphere and B. M. Wilson, Chap-
Chapter 12 of Ramanujan's Second Notebook: "Continued Frac-
Fractions", Rocky Mountain J. Math., Vol. 15, No 2 A985),
235-310.
[Bruin74] M. G. de Bruin,, "Generalized C-fractions and a Multidi-
Multidimensional Pade Table", Dissertation, Universiteit van Am-
Amsterdam A974).
237
238
Chapter IV. Three-term recurrence relations
[Bruin78]
[Cruy79a]
[Cruy79b]
[Gaussl3]
[Gaut67]
[Grav83]
[Grav84]
[JoTh80]
[Levr87]
[LePi87]
[MU168]
M. G. de Bruin, Convergence of Generalized C-fractions,
J. of Approx. Theory 24 A978), 177-207.
P. van der Cruyssen, Linear Difference Equations and Gen-
Generalized Continued Fractions, Computing 22 A979), 269-
278.
P. van der Cruyssen, "Computing the Minimal Solution
of a Certain Matrix-Vector Recursion", Report no 79-34,
Universiteit van Antwerpen A979).
C. F. Gauss, "Disquisitiones generales circa seriem infini-
infinite 1 . M~ . <>(<>+l)/3(/3+l) . a(a+l)(tt+2)/3Q3+l)(i8+2) 3
tarn 1 -I- 1>7z + i.2.7.G+i) xxm+ i.2.:j.7:G+i)G+2) x
etc", Commentationes Societatis Regiae Scientiarium Got-
tingensis Recentiores", Vol. 2 A813), 1-46, Werke, Vol. 3
Gottingen A876), 134-138.
W. Gautschi, Computational Aspects of Tree-Term Recur-
Recurrence Relations, SIAM Review 9 A967), 24-82.
P. Graves-Morris, Vector Valued Rational Interpolants I,
Numer. Math. 42 A983), 331-348.
P. Graves-Morris, Vector Valued Rational Interpolants II,
IMA J. Num. Analy. 4 A984), 209-224.
W. B. Jones and W. J. Thron, "Continued Fractions: An-
Analytic Theory and Applications", Encyclopedia of Mathe-
Mathematics and its Applications 11, Addison-Wesley Publishing
Co., Reading, Mass. A980). Now distributed by Cambridge
University Press, New York.
P. Levrie, "Het numeriek oplossen van lineaire recursiebe-
trekkingen: Een veralgemening van de kettingbreukmeth-
ode van Gautschi", Dissertation, Katholieke Universiteit
Leuven, Faculteit Wetenschappen A987).
P. Levrie and R. Piessens, "Convergence Acceleration for
Miller's Algorithm", Report TW88, Department of Com-
Computer Science, K. U. Leuven (February 1987).
K. S. Miller, "Linear Difference Equations", Benjamin,
New York A968).
References
239
[PerrO7]
[Pinc94]
[Rama57]
[Waad84]
[Wimp84]
[Wynn63]
0. Perron, liber die Konvergenz der Jacobi-Ketten-
algorithmen mit komplexen Elementen, Sitzungsber. der
Bayer. Akad. Wiss., Math. Naturwiss. Klasse 37 A907)
401-481.
S. Pincherle, Delle Funzioni ipergeometriche e di varie
questioni ad esse attinenti, Giorn. Mat. Battaglini 32
A894), 209-291, Opere Selecte, Vol 1, 273-357.
S. Ramanujan, "Notebooks", Vol. 2, Tata Institute of Fun-
Fundamental Research, Bombay A957).
H. Waadeland, Tales About Tails, Proc. Amer. Math.
Soc. 90 A984), 57-64.
J. Wimp, "Computation with Recurrence Relations", Pit-
Pitman Advanced Publishing Program, Pitman Publishing
Inc., Boston, London, Melbourne A984).
P. Wynn, Continued Fractions whose Coefficients Obey a
Non-commutative Law of Multiplication, Arch. Rat. Mech.
Anal. 12 A963), 273-312.
[Zahar68]
R. V. M. Zahar, Computational Algorithms for Linear Dif-
Difference Equations, Thesis, Purdue University A968).
Chapter V
Correspondence of
continued fractions
About this chapter
In Chapter I correspondence was a link between formal power series
L(z) and continued fractions K.(on(z)/bn(z)) with polynomial elements
an(z) and bn(z). Judging from the examples there, it may seem as if
K{an(z)Ibn(z)) ~ L(z) (i.e. correspondence) implies that J?(an(z)/bn(z))
and L(z) converge to the same function, or, at least that if K.(an{z)/bn{z))
converges to a function /(-z), then L(z) is a power series expansion of
f(z). It is important to know that this is not always so. In this chapter
we shall look closer at what kind of conditions that enters the picture.
Results of this type can be used to sum divergent series, as in Chapter I,
and to find the value of a continued fraction by identifying the function
to which it corresponds.
241
242 Chapter V. Correspondence of continued fractions
1 The normed field (L, || • ||)
1.1 Introducing the normed field
Let K.(an(z)/bn(z)) be a continued fraction with polynomial elements
an(z) and bn(z). We are interested in the following question: Does
K.(an(z)/bn(z)) correspond to a power series
oo
L(z) = V cnzn where m G Z, cn G C , crn ^ 0 ? A-1-1)
That is, do the classical approximants
Bk{z)
have power series expansions which coincide with L(z) as far out as we
want for k large enough?
Let us introduce some notation. Let L denote the field of all power
series A.1.1) with zero element /q = ]C0zr\ (We do not require that
these series converge at any point z, we are for the moment regarding
them as mathematical objects in their own right. They are what we call
formal power series.) For a function f(z), meromorphic at z = 0, we
let ?(/) denote the Laurent series expansion of / in a neighborhood of
z = 0. That is, ?(/) E L. Finally, the degree of the first non-zero term
of an L € L shall be denoted by A(L), that is
oo
X(L)={m *OT L{z) = ?nCnZn withc™^' A.1.3)
oo for L(z) = lu(z) = ? 0zn .
Then J?(an(z)/bn(z)) corresponds at z = 0 to L(z) iff
i/n :=A(L-?(/„))-*oo. A.1.4)
The number vn is called the order of correspondence of fn(z) to -?B).
If Bf.(z) ^ 0, then the classical approximant fk(z) in A.1.2) is a rational
function, and ?(fk) is well defined. It is simpler though to regard an,
The normed field (L, || • ||) 243
6n, An and BTl as elements from L directly. Then the approximants
__ Ak(z)
~ 6, (z) + 62(z) +• • •+ bk(z) ~ Bk(z)
are again elements from L if B^ / /0. Clearly, L& = ?(/&). For conve-
convenience we shall use this way of presentation, that is, we shall use the same
notation for a polynomial p(z) and its Laurent series ?(p(z)) = p(z).
It was the idea of Jones and Thron [JoTh80, p. 148] to regard A.1.4) as
convergence in L. They did so by introducing the norm
for/, EL. A.1.6)
(Recall that ||X|| is a norm in the field L by definition if
II ^ II > 0.
(ii) ||?|| = 0 if and only if L = Zu,
(iii) ||L1L2||
(iv) || L, + L21| <
for all L, L\ and L<l in L. We see that our norm A.1.6) meets the
requirements.) Then A.1.4) is equivalent to
L-Ln II —0, A.1.4')
where Ln is given by A.1.5); i.e. ~K(an(z)/bn(z)) corresponds to L if
and only if K.(an(z)/bn(z)) converges to L in this norm.
1.2 Correspondence at z = oo
Sometimes it is convenient to consider correspondence at other points
than z — 0. Correspondence at a point z = a is the same as convergence
in the normed field (L(a), || • ||a), where L(a) consists of Laurent series
oo
^3 cn(z — a)n if a ^ oo and cm ^ 0 ,
"S" A-2.1)
if a = oo and cm ^ 0 ,
n
244 Chapter V. Correspondence of continued fractions
and || L \\a = 2~m as before. (A polynomial p(z) can always be regarded
as an element in L(a).) For correspondence at z = oo we have in
particular:
Lemma 1 The continued fraction K.(an(z)/bn(z)) with polynomial ele-
elements an(z) and bn(z) corresponds at z = oo to L ? L(oo) if and only
corresponds to L(]/z) at z = 0.
Example 1 The continued fraction
corresponds at 2; = oo to the power series
1! 2! 3! 4!
1
+ + .--. 1.2.3
z zz zs zx
This result is due to Stieltjes [Stiel8]. It can be proved in several ways.
We shall here only observe that the classical approximants fn{z) of
A.2.2) can be written
AW = i = !.
z 1 JL_
z + 1 1 2 4
z_l_ O "°1 ~ I + ^2 ~ ^3 "^ »
and so on. The order of correspondence turns out to be vn = n.
Another matter is that the power series A.2.3) diverges for all z 6 C.
It is known to be an asymptotic expansion of the function
e-ldt
Jo
(o 1 + zt
The corresponding continued fraction A.2.2) converges to F(z) for all
z in the cut plane | argz| < tt. For more information on how continued
fractions can be used to sum asymptotic series, we refer to Chapter VII.
O
The normed field (L, || • ||) 245
Example 2 The T-fraction in Example 7 in Chapter I corresponds to
one power series at z = 0 and to another power series at z = oo. We shall
see later (in Example 6) that this is always so for general T-fractions
Fnz
+ Gnz) 1 + Giz + 1 + G2z+. ¦.
if all Gn ^ 0. The following (modified) T-fraction
c l(c-a + l)z 2(c-a + 2)z 3(r - a + 3)z
c + A -a)z-c + 1 + B- a)z-c + 2 + C - a)z-c+ 3 + D - a)z
A.2.5)
where we assume that neither a, —c + l nor a — c is a natural number,
corresponds at z = 0 to the power series
„ a a(a+l) 9 a(a+l)(a+2) ^
= 1 + z + >--—'—- z2 + t w XT X2f3 +
c + 1 T(+l)( + 2) ( + l)( + 2)( + 3)
A.2.6)
That this is so will be evident in Theorem 4 in Chapter VI. Here we
shall just check the first 3 approximants to see the pattern:
Lx{z) =
c+ A - a)z
= 1 z+[
c \ c J
L2(z) =
2
z2-
(c + 1) + 2c(l - a)z + A - a)B - a)z2
a A - a)Bc + 2 - a + ac) 9
= 1H z— — -z +
+ C + l C(C+ IJ
a a(a + 1) 9
= H z + 7 V——^-T7^2 + ---, where
c + l (c+l)(c+2)
CO) - c(c +1H + 2) + 3c(c+l)(l-a)z
+ 3c(l - a)B - a)z2 + A - a)B - a)C - a)z3 .
The order of correspondence is vn = n. At z = oo A.2.5) corresponds
to the power series
c 1 cA-c) 1 c(l - c)B - c) 1
l-az (l-a)B-a)z2 A - a)B - a)C - a) z3
246 Chapter V. Correspondence of continued fractions
This is also something we will return to later. (Theorem 4 in Chapter
VI.) At the moment we only check the first 3 approximants again. Let
Lk{z) be the approximants regarded as elements in L(oo). Then
c 1 / c \2 1
l-a2 \1 — a) z2
L2(z) =
- a)B - a)z2 + 2c(l - a)z + c(c
( - c) 1
z (l)Ba)z2 '
1- az (l-a)B-a)zi
- cB - a)C - a)z2 + cCc + 3 - 2ac - a)z + c(c + l)(c + 2)
L3\z) = (^73
c 1 c(l - c) 1
l-az A - a)B - a) z2
c(l - c)B - c) 1
+ ~i WrN— .. —- ~r + • • • , where
A - a)B - a)C - a) z3 '
E(z) = A - a)B - a)C - a)z3 + 3c(l - a)B - a)z2
+ 3c(c + 1)A - a)z + c(c + l)(c + 2).
The order of correspondence is vn = n + 1.
O
Remark: In this chapter we shall always let an(z) and bn(z) be poly-
polynomials, as they are in most applications. However, it is straight forward
to see that most of what we do also holds for the more general situation
where an and bn are functions of z which are meromorphic at z = 0 (or
at some other fixed point z = a ? C).
1.3 Properties of the normed field (L,
In view of the previous section we are led to study convergence in
(L, ||-||). Let us first note some properties of the functional A acting
The normed field (L, || • ||) 247
on L. For L[, L-i G L we have
A-3.1)
A(?,/i2) = A(?,)-A(I2), A.3.2)
is
is
This can be verified by inspection.
The normed field (L, || • ||) is known to be complete. Hence a sequence
{Ltl} of elements from L converges to an element L G L if and only if
{Ln} is a Cauchy sequence; that is, if and only if to every e > 0 there
exists an N G N such that
Ln+m - Ln\\ < e for all m and n G N with n> N . A.3.4)
ln our field, this condition can be simplified:
Lemma 2 {Ln} is a Cauchy sequence in (L, || • ||) if and only if
— Ln\\ —> 0.
Proof : The "only if"-part follows immediately from A.3.4) with m —
1. To prove the "if"-part we assume that ||^n+i — ^n|| —*¦ 0. That is,
to every e > 0 there exists an N G N such that || Ln+i — Ln \\ < e for all
n > N. From A.3.3) it follows that
m
A.3.5)
for m G N. That is
II Ln+m — Ln || < max || Ln+j — LJl+j-\ \\ < e for n > N . A.3.6)
This proves that {//«} is a Cauchy sequence. ¦
Tf \(Ln+\ — Ln) —¦ oo strictly monotonely, then A.3.5) can be written
A(//ri+m — Ln) = X(Ln+i — Ln) for alln and m. A.3.7)
In particular this means:
248 Chapter V. Correspondence of continued fractions
Lemma 3 // ||-?n+i — Ln || —> 0 strictly monotonely, then vn — \[L —
Ln) = X(Ln+\ — Ln) for all n, where L is the limit of {Ln} in (L,
2 Classification of continued fractions
2.1 Criteria for correspondence
We return to our original question: Which continued fractions
K.(an(z) / bn(z)) correspond to some L ? L? We can not expect that
every one does since we can not expect that every sequence {L^z)}
from L converges in (L, || • ||). The first example shows a simple contin-
continued fraction which does not correspond to any L G L:
Example 3 The Thiele interpolating continued fraction is given by
. . Z — Zq Z — Z\ Z — Z2
where all zn 6 C are given, distinct points, [ThieO9]. We shall assume
that all Zk ^ 0. Its approximants can be written
F0 + b2)z
J2\*) -
Hz)
etc. so that
FO61626:
(M
Fo6j
' F162
F,62 — Z\) -f Z
J — 6263Z0 — 6063Z1
»263 — b3z\ — b\z2)
-\- 6963 -|- 6263 — Zq
63 - 63Z] - bvz2) H
- 6,N!
+ FiH
- z2)z
hF,+
z2 -
h63
+ ;
h).
V zoz2)
)z
,2
Classification of continued fractions 249
C(f,) = Fo-..6^(l22. ) + ••¦ etc.
As we see, the constant term changes each time we increase the index,
so there is no chance that this continued fraction corresponds to a power
series.
For the sake of justice, let it be mentioned that the application of this
continued fraction has nothing to do with this kind of correspondence.
It interpolates a function f(z) with given values f(zn) = fn(zTl) at the
interpolation points zo, zj, z-i, The coefficients 6ri are calculated
from the equations f(zn) = fn(zn) for n = 0,1,2, —
O
So, what does it take for K.(an(z)/bn(z)) with polynomial elements to
correspond to some IeL? Or in other words, what does it take for its
approxirnants Lk{z) = ?(/^(z)) to be a Cauchy sequence in (L,
According to Lemma 2 we ought to look at the difference
- Bn{z)Bn+x{z) '
where the last step follows from the determinant formula A.2.10) in
Chapter I. Using A.3.1) - A.3.2) this leads to
n+l
X(Ln+i-Ln) =
/0, B.1.2)
or, translated to the language of continued fractions:
Theorem 4 Let K.(an(z)/bn(z)) be a continued fraction with polyno-
polynomial elements an(z) ^ 0 and bn(z), and let An(z)/Bn(z) be its approxi-
mants in canonical form. Then the following statements hold.
2.50 Chapter V. Correspondence of continued fractions
(A) K.(an(z)/bn(z)) corresponds to some L ? L if and only if
n+l
^2 Hak) ~ H^n) ~ HBn+1 ) —> OO ttS U —> OO . B.1.3)
k=l
(B) If K.(an(z)Ibn(z)) corresponds to L G L, then L is uniquely deter-
determined.
(C) If B.1.3) tends strictly monotonely to oo, then the order of corre-
correspondence of An(z)/Bn(z) to L is given by
n+l
vn = ? A(afc) - \(Bn) - X(Bn+l). B.1.4)
Proof : (A): Let K.(an(z)/bn(z)) correspond to an L G L. Then {Ln}
converges to L in (L, || • ||), wliich implies that Bn(z) ^ 0 from some n on.
Hence B.1.3) follows from B.1.2) and Lemma 2. Conversely, if B.1.3)
holds, then, again, Bn(z) ^ 0 from some n on, and the correspondence
follows similarly.
(B): Since L is the limit of a convergent sequence in (L, || • ||), it is unique.
(C): This follows from Lemma 3 and B.1.2). ¦
Theorem 4 is essentially due to Jones and Thron [JoTh80, p. 151-153].
Example 4 How does this fit in with the observation in Example 3? For
the Thiele interpolating continued fraction we have A(ajt) = A(z — z^-i) =
0 unless Zk~\ = 0, but that can happen for at most one index. Hence
ii A(<Zfr) < 1- At the same time Bn(z) are polynomials so that
n) > 0 for all n. So, there is no way B.1.3) can be satisfied, not even
if we allow one of the points z*. to be = 0-
O
Example 5 In Chapter I we saw examples of regular C-fractions 1 +
K.(anz/l) which correspond to power series. For these we have
Classification of continued fractions 251
= n + 1 whereas Bn(z) are polynomials with Bn(Q) = 1 so
that A(Pn) = 0 for all n. This means that regular C-fractions always
correspond, to power series. The order of correspondence is vn > n -f 1
by virtue of Theorem 4C.
O
Example 6 Let us look at the correspondence properties of a T-fraction
A.2.4). Also here ??+{ \{Fkz) = n+1 and Bn(Q) = 1 so that A(?n) = 0
and i/n = n -f 1 if all 2Y) / 0. That is, T-fractions always correspond at
z = 0 to a power series.
To study possible correspondence at z = oo we shall apply Lemma 1.
We have
Fjz F2/z FJz Fv F2z F3z
where X(FX) = 0, ??+.] A(Ffcz) = n and Bn@) = GiG2-Gn. Hence,
at least if all Gn ^ 0, the T-fraction corresponds to a power series at
z — oo. The order of correspondence is vn — n if all F^ ^ 0 and all
Gk ? 0.
O
2.2 Terminating continued fractions
There is always a question of how to deal with continued fractions
K(an(z)Ibn(z)) where a/v(z) = 0 for some N € N. Different authors
have chosen different ways to handle this. Following Henrici and Pfluger
we defined a continued fraction in Chapter I, Subsection Jf.^by requiring
that all an ^ 0. However, it is convenient to allow an = 0 at times. We
shall say that if
aN(z) = 0 , an(z) ^0 for 1 < n< N , B.2.1)
then the continued fraction terminates after N — 1 terms. That is, we
make no distinction between the two structures
oo
K {an(z)/bn(z)) with aN(z) = 0 B.2.2)
252 Chapter V. Correspondence of continued fractions
and
K (an(z)/bn
In fact, we also say that two continued fractions B.2.2) arc equal if they
have equal elements up to and including the first partial numerator which
is 0. From B.2.3) we can see that a terminating continued fraction with
polynomial elements always corresponds to a power series if /vv-
oo, namely Z-yv-i = ?(/;v-i).
2.3 Why classifications?
The correspondence properties of a continued fraction cq(z) +
K.(an(z)/bu(z)) with polynomial elements are closely connected to the
degree and form of these polynomials. For instance, comparing Exam-
Examples 4 and 5 shows that the two continued fractions
K((an + z)/l) and K(anz/l) where 0/flrl?C
have very different properties. The first one does not in general cor-
correspond to any power series L ? L at z = 0, whereas the second one
always corresponds to some L G L. This has led to the introduction of
a long list of various "types" of continued fractions with different corre-
correspondence properties. We shall here look at oidy one of these, and see
what kind of questions one can answer, and how.
2.4 C-fractions
A C-fraction is a continued fraction of the form
c0 + K(anzan/1) where c0 G C , an G C and an G N . B.4.1)
(Also the form cy/(l + K(an^a"/1)) is called a C-fraction. We shall
even say that c0 + K.{an{z)/bn(z)) is a C-fraction (modified) if it can be
brought to one of these forms by an equivalence transformation and/or
a change of variable z = l/?. In the following, though, we shall use
Classification of continued fractions 253
the form B.4.1).) A regular C-fraction, as described in Example 5, is
a special kind of C-fraction with all ctn = 1. The importance of C-
fractions lies in their powerful correspondence properties combined with
their simple form. Before stating these properties, it is convenient to
define the subset
B.4.2)
of L. That is, series from Lo are Taylor series in the sense that they do
not contain terms with negative exponents. Lq is no longer a field. The
following can be found in [LeSc39]:
Theorem 5
(A) To every C-fraction B.^.1), terminating or not, there corresponds
a uniquely determined L 6 Lo. The order of correspondence of the
nth approximant fn{z) is
n+l
(for n < N — 1 if the C-fraction terminates with ayv = 0^.
(B) To every L € Lo there corresponds a uniquely determined C-
fraction, terminating or not.
(C) L E Lo is the Taylor expansion at z = 0 of a rational function
holomorphic at z = 0 if and only if its corresponding C-fraction
terminates.
Proof: (A): C-fractions B.4.1) have canonical denominators Bn(z)
such that ?_,(z) = 0, B0(z) = 1 and Bn(z) = Bu_i(z) + anzCXnBn_2\z)
for n = 1,2,3 This means that Bn@) = 1 for all n > 0, so that
X(Bn) = 0 for all n > 0. In the same way X(An) > 0 for all n > 0 and
thus X(An/Bn) > 0; i.e. An/Bn 6 Lo for all n > 0. The correspondence
follows therefore trivially if the C-fraction terminates. Assume next that
254 Chapter V. Correspondence of continued fractions
all an ^ 0. Then B.1.3) in Theorem 4 holds since
n+l
k-\
(Xk)-X(B
»)
— AG?n+
.)
n+l
/ j
k—\
where a^ are natural numbers > 1. In fact, Ylak ~~* °° strictly mono-
tonely, and the result follows from Theorem 4.
(B): Let L G Lo be given, say
L{z) = k
If Cfc = 0 for all k ? N, then L(z) — cy, which can be regarded as a
terminating C-fraction. Otherwise, let n be the first positive index for
which cn ^ 0. Then L(z) can be written
L(z) = c0 + cn.
and we choose aj = cn and aj = n. The power series in the brackets
can be inverted in Lo, and we obtain
where xO(z) = |1+g^} eLo.
B.4.4)
If all the coefficients of L^l\z) are zero, apart from its constant term
which is 1, then
a terminating C-fraction, and we are finished. Otherwise we repeat the
procedure with L^lUz) to obtain
and thus
Hz) =
Classification of continued fractions 255
and so on. Either the process stops, and we get a corresponding C-
fraction which terminates, or it never stops, and we get an infinite C-
fraction. That this C-fraction actually corresponds to L(z) follows since
X(L — Ln) = A(Xn+i — Ln). It remains to prove that this C-fraction is
unique. Assume that the C-fraction do -f J?(bnz@n/I) also corresponds
to L(z). Since
K(anzan/l)~alZai +-.. and K(bnz^/l) ^byz01 +•-¦
it follows directly that do = c0, b\ = a\ and @\ = oty. Hence
lM(z) ~ 1 + K (anza« /I) , L^(z) ~ 1 + K (&„**"/1) .
n=2 n=2
But then 62 = 02 an(l Pi — a2 f°r the same reason as above. Continua-
Continuation of this process shows that bn = ari and f3n = ctn for all n (up to and
including the first zero-term if one of the C-fractions terminates). That
is, the C-fraction is unique.
(C): If the C-fraction terminates at a at = 0, then it corresponds to
Z/v-t = C(Aj\'-i /Bn-[) by definition. (See Subsection 2.2.) To prove
the converse we assume that the non-terminating C-fraction
K(anz""/1) corresponds at z = 0 to L(z) = P(z)/Q(z) = J2kS
where P and Q are polynomials. Without loss of generality we assume
that Q@) = 1. Then we can write
=co + in?) B-4-6)
= W)
where ai and a] are determined as in B.4.4), and
where Qi(^) = (P(^) — c0QB))/a1zO!l is a polynomial with Qi@) =
1. The degree deg(Qi) of this polynomial must satisfy the inequality
< max{deg(P),deg(Q)} — an. Repeating this process with
we get from B.4.5) that
Q(z) - 0
256 Chapter V. Correspondence of continued fractions
where Q-i(z) = (Q(z) — Q\(z))/a2zct:i is a polynomial with $2@) = 1 and
degree deg(Q2) < *nax{deg(Q),deg(Qi)} - a2l and so on. This can not
go on for ever, since a polynomial QTi(z) always must have deg(Qn) > 0.
Hence the continued fraction terminates. ¦
Part C of this theorem implies that a non-rational function, holomorphic
at z = 0, always has a corresponding non-terminating C-fraction. This
can be useful to know in advance.
Theoretically, the result in part C can also be used to determine whether
a given power series L(z) is the MacLaurin series of a rational function
or not. However, this is often simpler to decide by other means.
The proof of part B is constructive in the sense that it actually describes
a method to produce the C-fraction corresponding to a given power
series. The algorithm is easy to program on a computer. It is described
as Method 1 in Subsection 2.6.
The proof of part C is also constructive. It describes a method to pro-
produce a C-fraction corresponding to a given rational function. In Subsec-
Subsection 2.6 we shall see how these methods can be generalized.
Another question is: if the C-fraction converges to a function f(z) holo-
holomorphic in a neighborhood of z = 0, will then L = ?(/)? The answer
is in fact yes, at least if the convergence is uniform, as we shall see later
in Theorem 10.
2.5 When does f(z) have a regular C-fraction expansion?
Let f(z) be holomorphic at z = 0 and let L = C(f). Then we know
from Theorem 5 that L has a corresponding C-fraction CQ + J?(anzan /I).
When is this C-fraction regular (all ctn = 1), and when does this regular
C-fraction converge to f(z) in a neighborhood of z = 0?
One way to find out is to actually find the corresponding C-fraction and
look at it. Is it regular? Does it converge? This can be used if the
C-fraction terminates or if we can find a formula for its elements an.
Classification of continued fractions
257
What we would like to have, though, are criteria which can be checked
before we go to the trouble of actually finding the C-fraction. We shall
list four different results to this effect, all of them without proof.
Theorem 6 The C-fraction expansion of the power series L(z) =
53??ocj.zfc is regular and non-terminating if and only if H^ H^ ^ 0
for k = 1,2,3,..., where
*Lm) =
n+k-1 cn+k m ' •
B.5.1)
// h[X)H^] ? 0 for all k ? N, tfien L(z) - c0 + K(an^/l) w/iere
w@ ;
B.5.2)
k-\
This is really not much improvement over the scheme already presented.
Fortunately we have some more user-friendly results. The two next ones
concern S-fractions (Stieltjes fractions) which are regular C-fractions
with Co > 0 and all an > 0. (See for instance Subsection J^.3 in Chapter
III.)
Theorem 7 (Stieltjes, [Stiel8]) The power series L(z) =
has a corresponding S-fraction if and only if there exists a distribution
function \E : [0, oo] —¦ R such that
/•OO
ck = / (-t)kdV(t) fork = 0,1,2,.., .
B.5.3)
By a distribution function we mean a real function which is bounded,
nondecreasing with infinitely many points of increase. We shall return
to this in Subsection 3.1 in Chapter VII. A nice and simple consequence
of Theorem 7 is obtained if L(z) has a positive radius of convergence
258 Chapter V. Correspondence of continued fractions
R. Then L(z) converges locally uniformly to a function f(z) in \z\ < R,
and
oo oo
(-t)kd*(t)
r00 S l. r°° d$(t)
= / y^(-tz) d^U) = /
on compact subsets of \z\ < R. Hence, f(z) has a corresponding S-
fraction if and only if it can be written in the form
>«> - r f
where ^(t) is a distribution function on [0, oo).
Theorem 8 (Carleman [Carl26]) If L(z) = ?°f0 ck*k satisfies B.5.3)
and
oo
»*|-1/BA:) = °°, B-5.5)
k=0
then its corresponding S-fraction converges to f(z) given by B.5-4) l°~
cally uniformly in the cut plane \ arg(z)| < tt.
For the proof of this theorem we refer to Wall's book [Wall48, p. 330].
A different approach was made by Lubinsky:
Theorem 9 (Lubinsky [Lubi85, Theorem 2]) Let f(z) be an entire func-
function with C(f) = ]ClS) ckzk where all Ck ^ 0 and
</?2 fork = 1,2,3, B.5.6)
Jfere /> = 0.4559 ... is J/ie positive root of the equation
oo
2 ^ /? = 1. B.5.7)
Then f(z) has a regular C-fraction expansion which converges locally
uniformly to f(z) in C.
Classification of continued fractions 259
2.6 Algorithms for producing corresponding continued fractions
Let
oo
L(z) = J2 c^k B.6.1)
k=tn
be a given power series from L. We shall look at some methods to pro-
produce a continued fraction 60 + K(an(z)/6n(a:)) with polynomial elements
of given degree and/or form, which corresponds to L at z = 0.
Method 1 The idea from the proof of Theorem 5B can be used to
find a corresponding C-fraction. In fact, this was done in Chapter I,
Subsection 3.2, where it was used to find the beginning of the regular
C-fraction expansion of
lOg{kl + z) = Z-j + j-j+j-.~. B.6.2)
The method was called successive substitutions. This method can of
course also be used to find other types of continued fractions.
Example 7 We want to expand B.6.2) into a continued fraction of the
form
2
axz2 + b\3? a2z + 62* a3* + b3z
co + ciz+ - B.6.3)
i + l + i +• ¦ •
We have no guarantee that this is possible, even though B.6.3) always
corresponds to some power series. In fact, we have by Theorem 4C that
n+l
vn = ? Kakz2 + bkz3) - X(Bn) - X(Bn+l) > 2(n + 1)
k=\
as long as not both a^ and bk are equal to zero. (Why?) Say we want
the fourth approximant of B.6.3). Then we need to use the 2D + 1) = 10
first terms of the series in B.6.2) which we denote by L(z). (We also
count the constant term Co = 0.) We have
L(z) = z+ 2 ^
260 Chapter V. Correspondence of continued fractions
where
z2
z1 z3 lzA 5z5 2221z6 2603z7
~~2~+15~180+189~ 113400 + 170100
1 . ~T + 15
?3
15
z2 z
1
1 90
¦' 7z4
i_
1 1575
L
1 189
47250
2221z()
113400 '
2306z5
1 118125
2603z7
170100
90 + 1575
and
7z2 67z3
_ ~~90~+ 1575
^_ 6 97 2306z5
~~90~+ 1575 " 47250 + 118125
1297z2 104382T1
3675 + 180075 '
This means that
log(l
2
1 + 1+ 1 +
1297z2
3675 + 180075
B^
+ 1 +•••
Let us look a little closer at this continued fraction for z = 1 We
used the first 10 terms of the Taylor series for log(l + z). Summing
Classification of continued fractions 261
these 10 terms with z = 1 gives 1627/2520 « 0.645635 which is not a
good approximation to log 2 « 0.6931472. Computing the approximant
/i(l) of B.6.4) gives /.t(l) = 545953/787182 % 0.6935537 which is much
better.
O
The disadvantage of this method is that we repeatedly have to invert
power series. We shall now see how this can be avoided:
Method 2 In the proof of Theorem 5C, L(z) was a rational function,
L(z) = P(z)/Q(z). This in turn led to
$?$& B-6-5)
where
Q(z)
~ P(z) - cu(z)Q(Z) =
where Qi(z) is again a polynomial if c^(z) is a polynomial and a\ (z) is a
polynomial which divides the polynomial P(z) — Cu(z)Q(z). And so on.
What happens if we start with an arbitrary power series L(z) ? Lq?
Let us define P(z) = L{z) and Q(z) = 1. Then B.6.5)-B.6.6) still
holds, only P(z) and Q\(z) are no longer necessarily polynomials. The
coefficients of the new series Q\(z) in B.6.6) can be found from the
relation
Qi{z) = (P(z) - co{z)Q(z))/ai(z) B.6.7)
which becomes particularly simple to compute if a\ (z) is just a constant
or a contant times a power of z. Repeating this process gives the con-
continued fraction (if we have chosen a form for an{z), bn(z) which works).
Example 8 Let L(z) = ]>] Ckzk be a given power series from Lq. Then
we know from Theorem 5B that L(z) has a corresponding C-fraction. Let
us say that we know (or believe or hope) that this C-fraction is regular.
That is, we want to find a continued fraction of the form c0 +
262 Chapter V. Correspondence of continued fractions
corresponding to L(z). Following B.6.5) with P(z) = L(z) and Q(z) = 1
we get
P(z) Cq + C\Z + C2Z2 + ••¦ C{Z
Q(z)
where
<?,(*) = -(P(z) - c0Q(z)) = 5>*+i** =: X)
Repeating the process we find that
C\Q(z) ciQ(z)-Ql(z) c2tQz
Qi{z) OW Q()/
where
02(^) = -(ciQ(z) - Qi W) and c2,o =
and
OW "C|f0
where Q:i(z) = ~{c2yoQ\{z) - cU[)Q2(z)) and c:Ji0 = Q3@).
Writing Qn(z) — 52?So cn,kzk•» the general step becomes
@()n-1f0 ~ Qn-
that is,
Cn,ib = Cn_iioCn_2,fc+l - Cn-Z.oCn-l.lt+l B.6.8)
for fc = 0,1, 2,..., n = 2,3,4, From the expansion of Q i (z) we see
that
el,* = cfc+i for fc = 0,1,2,... . B.6.9)
Writing Qt)(z) = Q(z) = 1 and Q-\{z) = P(z) so that
co,o = 1 , co,jt = 0 and c_i,jt = c^ for fc = 1,2,3,... , B.6.10)
we find that also B.6.9) follows from B.6.8) with n = 1.
Classification of continued fractions
263
This particular recurrence system B.6.8)-B.6.10) goes by the name of
Viscovatov's algorithm, [ViscO6], [CuWu86, p. 16]. If all cn>o ^ 0 we get
L(z)~co+
ci,o
,o H
which is easily converted to a regular C-fraction by an equivalence trans-
transformation. If cnio = 0 for an index n, then either we have a terminating
regular C-fraction (all cnk = 0 for this particular n), or the C-fraction
corresponding to L(z) is not regular after all. Let us try this algorithm
on the power series
L(z) = 1 - z - z
- zl -
7z
6
where the coefficients Ck are given by the (k X fc)-determinant
c, =
The recursion B.6.8) and initial values B.6.9)-B.6.10) then gives
Ck+l '
-ci,fc+i :
cl,A:+l + C2,k+1 '•
—4c2,Ar+l — c3,/:+l
8c3ja;+i + 4c4ifc+i
1
1
-1
1
-4
: 8
: 0
-1
0
-1
-3
4
-8
0
-1
0
3
1
4
-8
3
0
-1
5
-12
-1
0
-5
-7
-5
0
7
7
0
Here the algoritlim breaks down. Hence the C-fraction either terminates
after these terms or the next term has a higher degree in z. We can
determine which of these two cases we actually have by computing the
264
Chapter V. Correspondence of continued fractions
general expression for c^^ to see if they all vanish or not. We get:
C2,fc =
- -H('+7f)(-
A:+2
c3,Jfc =
C4.it =
=> c2lo = 1,
= Ck+2 ~ ck+3
k+2 ^
=> c.Jj0 = -4 ,
(>*$) (=
k+3
=> c4(o = 8,
= 0.
Hence the continued fraction terminates, and we get
z z 4z Sz z z 4z 2z
.o
Looking a little closer at method 2, we find from B.6.5)-B.6.6) that
what we essentially do is setting
and in the general step
Qn(z)
-•*¦>
that is,
= bn+l(z)
Qn{z) = bn+\(z)Qn+i(z) f an+2{z)Qn+2{z).
This means that the sequence {P{z),Q(z),Q\(z),Q-2(z),...} is essen-
essentially a solution of the three-term recurrence relation.
Pincherle's and Auric's theorems in (L, ||-||) 265
There exists a wide variety of algorithms for producing various types
of continued fractions corresponding to a given power series. (See the
remarks at the end of this chapter.)
3 Pincherle's and Auric's theorems in (L,
3.1 Interpretation
We have already defined correspondence of a continued fraction Cu(z) +
K(an(z)/6n(z)) to a power series L G Las convergence of its approxi-
mants An(z)/Bn(z) = C(fn(z)) to L(z) in the norm ||-|| in L. In Chap-
Chapter IV we tied convergence of a continued fraction to the existence of
dominant /minimal solutions of the corresponding three-term recurrence
relation
Xn(z) = 6n(z)Xn_1(z) + an{z)Xn.2{z) for n = 1,2,3,... . C.1.1)
With further applications in mind we permitted convergence in some
normed field (F, ||*||). So, what we want to do now, is to apply these
results to the field (L, || • ||). For convenience we still restrict the elements
an and bn to be polynomials, and we use the simplified notation art, 6M,
An and Bn to denote both the polynomials and their Taylor expansion
at z = 0.
We recall that {Xn} is a minimal solution of C.1.1) if not all Xn =
0 and if there exists another solution {Yn} such that Xn/Yn —> 0. If
{Xn} is minimal, then every solution {Zn} of C.1.1) which is linearly
independent of {XTl} is dominant; i.e. Xn/Zn —> 0. In the field (L, || • ||)
we thus have that {Xn}; Xn ? L is a minimal solution of C.1.1) if not
all Xn(z) = /o and if there exists another solution Yn(z) of C.1.1) such
that j| Xn/Yn || -> 0, i.e. X(Xn) - X(Yn) -+ oo.
Theorem 10 (Pincherle's theorem, modified)
(A) The continued fraction b^ + Ji.(an(z)j^bn(z)) with polynomial ele-
elements an(z) ~? 0 and bTl(z) corresponds to some formal power se-
266 Chapter V. Correspondence of continued fractions
ries L(z) 6 L if and only if its canonical denominators {Bn} form
a dominant solution of C.1.1) in (L, ||
(B) If {Xn} is a minimal solution of C.1.1) in (L, ||-||) with
X-\(z) ^ /(j, then bu(z) + K.[(in(z)/bn(z)) corresponds to the for-
formal power series L = 6q — Xq/X_i 6 L.
To determine whether a given solution {Xn} is minimal or not is often
easier in (L, || • ||) than in (C, | • |). A typical situation is for instance that
Bn@) ^ 0 for all n, whereas X(Xn) —> oo. Then we immediately know
that {Xn} is a minimal solution.
Example 9 In Example 5 in Chapter IV we saw that Pn(z) =
+ n; z) is a solution of the three-term recurrence relation
= Pn+\(Z) + /r,wvlt,,1Xf"+2W for 71 = 0, 1, 2, . . . ,
{c + n){c + n + I)
where c is a complex constant / 0, — 1, —2, This can be regarded as
a recurrence relation in L, and we want to use this fact to prove that
1 + K(fln2/1), where an = l/(c + n — l)(c + ra), corresponds to \t(c; z)j
+ 1; z), [Gaussl3]. From Example 6 in Chapter IV it follows that
+ n + 2;,)^f^ +$^^ for n =-1,0,1,...
is a solution of the recurrence relation
Xn(z) = Xn-i(z) + anzXn--2(z) for n = 1,2,3,...
for {Bn(z)}. Since X(Xn) = n + 1 whereas \(Bn) = 0 for all ri, it follows
that X(Xn/Bn) —> oo; i.e. ||Xn/??n || —> 0. Hence, {Bn} is dominant,
{Xn} is minimal, and by Theorem 10, 1 4- K.(anz/1) corresponds to
c(c+ 1) *(c + l;z)
O
Pincherle's and Auricfs theorems in (L, || • ||) 267
Theorem 11 (Auric's theorem, modified) LefK(an(z)/bn(z)) be a
continued fraction whose elements an,bn ? L are polynomials with all
an ^ /o. Further let {Xn}, where Xn ? L, be a solution of the corre-
corresponding recurrence relation C.1.1) such that Xn / /o for all n. Then
T?{an(z)/bn(z)) corresponds to —Xq/X-i if and only if
n —* oo
lim_ || Rn || = oo where Rn = ^ ^"^^ e L • CX2)
Condition C.1.2) may not be so easy to check. (Sums are always worse
to control than products and ratios. See Subsection 1.3.) We shall derive
a useful consequence of this result:
Theorem 12 Let K.(an(z) / bn(z)) be a continued fraction where an, bn ?
L are polynomials with all an / /o, and let {Xn(z)}} where Xn ? L;
be a solution of the corresponding recurrence relation C.1.1) such that
Xn ^ /(j for all n. If
AFri_!) + AFn) < A(an) for n = 1, 2,3,... , C.1.3)
and
X(bn) < XiXjXn-i) for n = 1,2, 3,... , C.1.4)
then 'K.(an(z)/bn(z)) corresponds (at z = 0) to —Xq(z)/X-\(z).
Proof : According to Theorem 11 we need that \(Rn) —* — oo, where
Rn is given in C.1.2) by a sum of n + 1 terms Rj such that
J) for j = 0,1,2,
m-\
In view of the rule A.3.3) for calculations of X(L\ ± L-i) it thus suffices
to prove that dJ+I < dj under our conditions. We have
and thus
J^n _ t , anAn-2
Xn-1 Xn-1
268 Chapter V. Correspondence of continued fractions
By use of C.1.4) and the rules A.3.1)-A.3.3) of calulation it follows that
so that
This gives
= A(ai+2) - X(bj+l) - X(bj+2) > 0
by C.1.3). Hence dj+i < dj. ¦
This result is essentially proved by Jones and Thron [JoTh80, Thm. 5.2,
p. 152] (in a slightly different way).
By means of Theorem 5 of Chapter IV this result can be "translated"
to solutions of the recurrence relation
Pn = buPn+l + <xn+|jPn+2 for n = 0,1,2, C.1.5)
Corollary 13 Let K.(an(z)/bn(z)) be a continued fraction, where
«n>^ri ? k are polynomials with all an / /<j, and let {Pn}\Pn ? L;
be a solution of C.1.5) with bo = 0 and Pn / /q for all n. If
n) < X(an) forn = 1,2,3,... , C.1.6)
and
A(Pn/Pn+i) + AFn_l)<A(on) forn= 1,2,3,..., C.1.7)
then K.(an(z)/bn(z)) corresponds to Pu/P\.
Example 10 Let us use this to prove that the regular C-fraction 1 +
K(a»z/1) in Example 9 corresponds to \t(c; z)/\?(c + l;z). In that
Pincherle's and Auric's theorems in (L, || • ||) 269
example Pn(z) = ^(c + n\ z) so that Pn@) = 1 for all n, and thus
A(Pn/Pn+1) = 0. Further bn — 1 so that AFn) = 0, and an(z) = anz
so that A(an(z)) = 1. This means that C.1.6)-C.1.7) are satisfied, and
the conclusion follows.
O
Example 11 Recall that the hypergeometric function 2^1 (a? 6;c; z) is
given by
, x ab z a(a+ 1NF+ 1) z2
where a, 6 and c are complex constants and c ^ 0, —1, —2, We can
also regard 2^1 (a> 6; c; z) as an element from L(), in which case it is often
referred to as the hypergeometric series. The following formulas due
to Gauss [Gauss 13] can be verified by comparing the coefficients of the
series on each side of the equality signs:
(c-aJFi(a-1,6; c; z) = (c-a-bJF\ (a, 6; c; z)+b(l-zJFi(a, 6 + 1; c; z)
and
(c-6-lJFI(a,6;c;z) = (c - a - 6 - lJF,(a,6 + l;c;z)
This means that the sequence {PnB)}?To of elements from Lo given by
P2n(z) = 2F\(CL + Tl — 1 6 \ W C Z) P2n+l(^) = 2^1 (a + ^? 6 + M", C z)
is a solution of the three-term recurrence relation
Pn{z) = bnPTl+l(z) + an+I(l - z)Pn+2(z) for n = 0,1,2,3... ,
where
_ 6 + n i _ c — a — b — 2n
~~ c — a — n 2n ~ c — a — n »
_ a + n — 1 l _ c — a — 6 — 2n + l
270 Chapter V. Correspondence of continued fractions
if a, 6, a — c and 6 — c are ^ 0,-1, —2, It is therefore tempting to
believe that the continued fraction
«n( *) c-q-6
o + K : = r
6n c-a c-a-j-1 +
c — o —
6+1
c — a — 6 — 2 -f c — a — 6 — 3
c — a — 1 c — b — 2
{c — a —
c — a[ c — a — b— 1 +
a(c — a —
c-a-6-2 + c-a-6-3 +••
corresponds to Pq[z)/P\(z) at z = 0. According to Auric's Theorem for
the adjoint recurrence relation as stated in Corollary 11 in Chapter IV,
this is equivalent to
Rn\\ = oo where Bn = )TPjPj+l{l - z)j f[ (-arn) e L.
3=2 m=l
This is impossible since
4,(o)=y, PjWPj+iw n (-a™)=e n (-«".) ^ °
j=2 m=l p2m=l
for infinitely many indices n, except possibly for very special values of
the parameters a, b and c. Hence, we do not have correspondence at
z = 0. Another matter is that the correspondence at z = 1 follows
easily if P,(l) 7^ 0 for infinitely many indices.
O
3.2 A link between correspondence and classical convergence
We have two ways of assigning a function f(z) to a continued fraction
J?(an(z)/bn(z)): 1) Convergence of the continued fraction to f(z) in
Pincherle's and Auric's theorems in (L, ||-||) 271
some domain D, and 2) Correspondence of the continued fraction to
?(/) at z = 0 if / is meromorphic at z = 0. Convergence is usually
what one wants in applications. Correspondence is often what one has
or what one is able to establish. So, the important question is then: say
K.(an(z)/bn(z)) corresponds at z = 0 to a power series L(z) which is the
Laurent expansion of a function f(z) in a neighborhood of z = 0. Will
then K(an(z)/bn(z)) converge to /(z)?
One has the feeling that if the classical approximants are uniformly
bounded in a domain D with 0 ? D, then the answer is yes. This feeling
is based on the experience that uniform boundedness + "something else"
often lead to uniform convergence of a sequence of analytic functions.
And indeed, so it is:
Theorem 14 (Jones and Thron [JoTh80]) LetD be a deleted neighbor-
neighborhood of the origin. Further let K.(aTl(z)/bn(z)) with polynomial elements
an(z) ^ 0 and bn(z) correspond to an L ? L and have holomorphic
approximants in D. Then the following statements hold.
(A) K.(a.n{z)/bn(z)) converges locally uniformly in D if and only if its
approximants are uniformly bounded on every compact subset of
D.
(B) If K.(au(z)Ibn(z)) converges locally uniformly in D, then its value
f(z) is holomorphic in D, meromophic at z = 0 and L = C(f).
Proof : (A): Let first C be a compact subset of D and K.(an(z)/bn(z))
converge uniformly in C. Then {fn(z)} converges to a holomorphic
function in C, and \f(z) — fn(z)\ < l*n for all z ? C where {/xn} is a
sequence of positive numbers converging to 0. Hence |/«(z)| < |/(^)| +
fin < max{|/(z)|; z ? C} -f max{/zn;n ? N} =: M < oo for all n. This
proves the "only if" part.
To prove the "if" part, we let 0 < 8 < r^ < r>z < R be positive numbers
such that the annulus A = {z ? C; 6 < \z\ < R} C D, and Mx =
max{|/n(z)|;7i ? N and z ? A}. Since each fn{z) is holomorphic in D
272 Chapter V. Correspondence of continued fractions
and meromorphic at z — 0, it has a Laurent expansion Ln(z) = C(fn) G
L which converges to /n(z) in A. By Cauchy's estimate we thus know
that
oo
c».*2*> where
Let i/?Nbe chosen arbitrarily. Since || L — Ln || —> 0 and thus X(L -
Ln) —> oo, there exists an n G N such that A(?n+? — //„) > v for all
n > N and Aj G N. This means that
oo °°2M
< E lc«+*.i - c'vlklj < E r (
C.2.2)
for all z G i4x ={^C;Kz<ri} and all n > iV, k G N. This proves
that {/n(^)} is a (uniform) Cauchy sequence on A\ and thus converges
on A\. The locally uniform convergence of {fn(z)} on D follows then
by Stieltjes-Vitali's theorem.
(B): Let K(an(z)/&n(z)) converge locally uniformly to f(z) in D. Then
f(z) is holomorphic in D. We need to prove that / is meromorphic at
z = 0 and that L = ?(/). Let L(z) = ?g?m cfcz*. Since A(L - Ln) =
i/n —> oo, it follows from C.2.1) that 7?i = 7n(n) from some n on and
that |c/b| < M\/r% for all fc. Now
|/(z) - X(z)| < |/(z) - fn(z)\ + |/n(z) - Xn(z)| + \Ln(z) - L(z)\ C.2.3)
where \f(z) - fn{z)\ —» 0 uniformly in A^ |/n(z) - Ln(z)\ = 0 in Au
and |X?l(z) — /yB)| < M2{ri/r2)Un in A] by the same argument as in
C.2.2). Hence, the right hand side of C.2.3) can be made arbitrarily
small, so L(z) converges to f(z) in A\. Since the inner radius 6 in A,
can be made arbitrarily small, it follows that L — C(f). In particular
then / is meromorphic at z = 0. ¦
As we can see, the theorem is a simple consequence of the normality of
the family {/n(z)} of approximants for K.(an(z)/bn(z)), combined with
the correspondence of K.(an(z)/bn(z)). This theorem provides a useful
method for proving convergence of a continued fraction with polynomial
elements. In fact, there are even cases of continued fractions with con-
constant elements where it pays to introduce an auxiliary variable z, just
to apply this method:
Pincherle's and Auric's theorems in (L, ||-||) 273
Example 12 We want to find the value of the so-called Rogers-Rama-
nujan continued fraction
2 3
i^c 0<|g|<i. C.2.4)
(We already know that it converges, since it has the form K(cn/1) where
cn —> 0. See for instance Example 1 in Chapter II.) Trying to find explicit
solutions of the three-term recurrence relation
Pn = Pn+l + <7n+1 Pn+2 for 71 = 0, 1, 2, . . . ,
turns out to be rather difficult. So also for
An — An_i -f- ^ -^n-2 Ior n — 1, Z, O, . . . .
Let us introduce a complex variable z, to get the regular C-fraction
qz q2z q*z .
1 + — ^— ^— • C.2.5)
1 + 1 + 1 +--- ^ ^
Then we find that
oo
where
(9)o = 1 and (q)k = A - g)(l - q1) ¦ ¦ ¦ A - gk) ,
is a solution of
Pn{z) = Pn+i(z) + qn+lzPn+2{z) for n = 0,1,2,...,
and that C.2.5) corresponds to Pu(z)/P\(z). (See Problem 6 in Chapter
IV.) We are interested in the convergence for z — 1. Let D — {z ?
C; \z\ < R} for an R > 1. Then there exists an TV ? N such that
\qnz\ < 1/4 for all n > N. That is, the continued fraction
T+1+T+. • • C-2-6)
satisfies the conditions of Worpitzky's theorem for z ? Z), and thus its
approximants are uniformly bounded by 1 + 1/2 in D. Hence C.2.6)
converges to P^-\(z)/Pj\(z) in D by virtue of Theorem 14. This in
274 Chapter V. Correspondence of continued fractions
turn implies that C.2.5) converges to P{)(z)/P\(z) in D and thus, in
particular,
'¦) 11? ~w] '
where the equality sign stands for convergence.
O
3.3 Tails and correspondence
In the previous subsections, correspondence of a continued fraction
~K.(an(z)Ibn(z)) was tied to properties of solutions of the correspond-
corresponding three-term recurrence relation
Xn(z) = bn(z)Xn^(z) + an(z)Xn.2(z) for rc= 1,2,3.... C.3.1)
Of course, each time we have such a non-trivial solution {Xn(z)} we also
have a tail sequence {—Xn(z)/Xn-\(z)}. (See Subsection 3.3in Chapter
TV.) Hence, Theorem 12 is actually a theorem based on properties of
a tail sequence. Similar relationships exist between Perron-tails and
solutions of the adjoint of C.3.1), so that Corollary 13 is in reality based
on properties of Perron-tails.
4 Branched continued fractions
A simple example
The idea of correspondence can be extended to functions of several vari-
variables. The application is still the same, namely to find rational approx-
approximations to such functions or to sum divergent series. Let us look at a
very simple example of how this can be done:
Example 13 We consider the function
— x
Branched continued fractions 275
of the two complex variables x and y. The Taylor expansion of f(x,y)
around @,0) can be found in the following way:
L{x,y) =
y2 y3 )
4 U . . . >
2! 3! I
{ + 2! + 3J"
Now
'K XXX
4- sc* H ~~ = - -r=:K
1 1 1
1 — x 1 — 1 1
which can be regarded as a terminating regular C-fraction, and
. y2 . y* . y y y y y y y
y -L _U 4.... r^, — — — — — — —
9 2! 3! 1-2 + 3-2 + 5-2 + 7
~~* ^ ,(o) >
(see Problem 1), so
a?]x r@)
L(x,y) - 1 + K^—+ K-
xy
21 + 3T +
()
x Cn'y xy
L[{x,y)'
where
2! + 3! + ''"
y , y2 y'x , y6
276 Chapter V. Correspondence of continued fractions
, A y , y2 yl , \
\ 2 2 - 3! 3! 5! J
y:i
Again — x can be regarded as a regular C-fraction K(a« ^/l) and
y
2!
Finally,
1
2! 2
f 2
2/
•3!
y2
3!
1
2/
3!
y
3!
5!
i .
5! '
y y
. m fN^
O 1 Q
Zi -\- O
2 + 3-
y
+ 3-
y y y y
_2+5-2+7—
1
y y y y
2+5-2+7
y y y y
2+5-2+7
_. 1C Cn
Therefore
f{x,y) - l+K-^ + K
+
a;
2 + K-^f
-O
Let us look closer at what we got in D.1.1). The right side can be
regarded as a terminating continued fraction
where the partial denominators BUi B\ and #2 are (sums of) continued
fractions. D.1.1) ib an example of what we call a branched continued
Branched continued fractions 277
fraction. More specifically it is a special case of what is called a TDCF
(two-dimensional continued fraction), sometimes called a regular two-
dimensional C-fraction. The structure of these regular two-dimensional
C- fractions is in general
xy xy xy .
Bo + -jf -jf / , D.1.2)
where
(k) (k)
/i,\ OO ah. X OO Cn V
Bk = b™ + K -^- + K -^ for* = 0,1,2,...; D.1.3)
i.e. sums of regular C- fractions in x and y separately.
4.2 Approximants
To form approximants of D.1.2) (or any other branched continued frac-
fraction), we need to truncate all the continued fractions involved. This can
be done in many different ways, and it is not always easy to know which
ones will serve meaningful convergence or correspondence purposes. If
we truncate after the same number of terms in each branch of D.1.2),
then its 2. approximant (of classical type) would be
. (o) a\}x (o)
1 + 6<°> '
xy
A)
! 41}
B) B)
0 + B) I
42)
This choice is, however, usually not so good. Since ordinary, conver-
convergent continued fractions have values which depend mostly on their early
278 Chapter V. Correspondence of continued fractions
elements, a better choice might be to include more terms in the first
branches than in the later ones. The 2. approximant of D.1.2) could
then be for instance
,@)
oo +
«i
(())
o, +
'X
ci
M
i •
y
(°)
D.2.2)
This choice, where branch number k in the nth approximant uses (n—k)
terms in each continued fraction is often the best choice.
Also for branched continued fractions one may choose to replace tails by
some modifying factors Wnjt when forming approximants, both for the
main fraction and for the branches.
Example 14 We consider again the regular, branched C-fraction ex-
expansion D.1.1) of ey/(l — a;), and we form approximants of the form
D.2.2). We get
() (o)
a, 'x c\ y xy
a\'x c\'y t xy
2 + 2/
I- X 1 - tt 1 - fl! - ^ 4
K y xy
6 + 4y +
A-20F-230
and so on.
Branched continued fractions 279
4-3 Another example
A power series L(z) — ]T cnzn has a natural ordering of its terms cnzn.
This fact was the basis for our concept of correspondence. This is no
longer so if we move on to power series L(x, y). Correspondence between
L(x,y) and a TDCF will depend on how we order the terms in L(x,y).
The TDCF in D.1.2) corresponds to a power series with the ordering
@0 00 \ /oo 00
)
)
n—\ n=l / \n=\ n=l
(Correspondence is here tied to the main continued fraction D.1.2) and
not to its branches D.1.3).)
Example 15 We return to the function
l-x
from Example 13, but this time we arrange its Taylor expansion L(x, y)
differently:
L(x,y)
y y2 t/3
2 v3
bn
1
v y2
" 1 I O!
A. m ?u m
1
v v2
' 1! ' O?
280 Chapter V. Correspondence of continued fractions
(lTl
x
d
n
where 1 + J<L(cTly/dn) is the regular C-fraction expansion of ev as given
in Problem 1. The TDCF has now the structure
(o)
xx
i xv
n "ri
Problems 281
Problems
A) The exponential function ez has the corresponding regular C-frac-
tion
_ z z z z z z z
e* ~ 1 + - - - - - - -
1-2+3-2+5-2+7
at z — 0. Find the Taylor series expansion of the first four approx-
imants and compare to the Taylor series of ez at z — 0.
B) The function f(z) = log((l + z)/(l — z)) has the corresponding
C-fraction
1 + z 2z l2z2 2V 3V
log - — -— —- -— at z = 0 .
1 — z 1— 6 — 5 — 7 — •••
Show that the continued fraction
2 12 2* 3*
z — 3z — 5z— Iz
corresponds to the function log(B; + l)/(z — 1)) at z — oo.
C) P-fractions introduced by Magnus [Magn62A], [Magn62B] are con-
continued fractions of the form
1 1 1
where each bn(z) is a polynomial in 1/z with degF^.) > 1 for k > 1,
bn(z) = ? 4"'// •
ib=0
Show that a P-fraction always corresponds at z — 0 to a power
series L(z), and determine its order of correspondence.
D) Regular ^-fractions introduced by Lange [Lange82] are continued
fractions of the form
diz d2z
Ot) — +
where 6n is either 0 or 1 for every n and where 6q, dn are complex
constants with dn+i = 1 for each n such that ?n = 1.
282 Chapter V. Correspondence of continued fractions
(a) Show that a regular ^-fraction always corresponds at z — 0
to a power series L(z), and determine its order of correspon-
correspondence.
(b) Show that to every L(z) E Lo there exists a corresponding
regular ^-fraction.
(c) Show that L(z) 6 Lt) is the Taylor series expansion at z = 0
for a rational function if and only if its corresponding regular
^-fraction terminates.
E) Hermitian PC-fractions (Perron-Caratheodory-fractions) are
continued fractions of the form
J
+8xz-\- 8\ -\-62z+ 52 H
[J0NT86], [Perr57]. Show that if all 8n ^ 0 then the hermitian
PC-fraction corresponds to a power series L(z) at z = 0 and to a
power series L(l/z) at z = 00. What is the connection between
L(z) and 1A/2)?
F) Use Viscovatov's algorithm to develop the first 5 terms of the C-
fraction corresponding to
L(z) = 1- z + 2z2 - 2z:i - 4z4 + 22z5 + • • ¦ .
G) Find the first 5 terms of the ^-fraction (see Problem 4 for defini-
definition) corresponding at z — 0 to the power series
L(z) = 1 + z - 2z2 + 4z3 - Uzl + 58z5 + • • ¦ .
(8) Show that if
a2z a
1+1 + 1 + 1 +¦•¦
whore all an ^ 0, then
C2Z C'AZ C4Z
where {cn} is given by c-i = a\ + a-} and
= cn+l
at z — 0
Problems 283
for all n > 1, if all cn ^ 0.
Hint: Compare the odd part of the first continued fraction to the
even part of the second one.
(9) Let 1 + J?(anz/[) be a non-terminating regular C-fraction corre-
corresponding (at z = 0) to the formal power series L(z) = 1 + X] cnZn-
(a) Prove that L(z)L( — z) = 1 if and only if
and a2n = —a-m-y for all n > 2.
(b) Prove that c-2k+\ = 0 for all k E N if and only if
a2Jfc+i = —a-2k f°r all fc G N .
Hint: Use the result in Problem (8).
A0) Show that the non-terminating general T-fraction K(Fn2;/(l +
Gnz)) with all Fn ^ 0, Gn / 0, corresponds to L(l/z) = -1
at z = oo if and only if Gn — — Fn for all n.
A1) Let the non-terminating general T-fraction K(Fnz/(l -\-Gnz)) cor-
correspond to
L(z) = c{z + c2z2 -f C32;3 + • • •
at z = 0 and to
dx z~x + d2^ + d3z~3 + • • •
at z = 00. Prove that L(z) = — zL(l/z), i.e.
l — c:\z~2 - • • •
if and only if all Gn — — 1.
A2) Find the first terms of the branched C-fraction described in D.1.2)
corresponding to:
(a)
(b) \n(x\ny)
284 Chapter V. Correspondence of continued fractions
Remarks
1. C-fractions go all the way back to Worpitzky, Pringsheim, Sleszyri-
ski and others, who used them extensively. Leighton and Scott
made a systematic study of them in their paper from 1939, [LeSc39].
One of the advantages of C-fractions is that every L 6 Lo has a
corresponding C-fraction.
2. T-fractions were closely examined by W. J. Thron, [Thron48],
[Thron77]. As mentioned in Chapter I, 0. Perron therefore sug-
suggested the name Thronsche Kettenbruche (or T-fractions) for these
structures, [Perr57, p. 174]. Their surprising correspondence prop-
property, that they correspond to two power series, one at z = 0 and
one at z — oo, was established by J. H. McCabe and J. A. Mur-
Murphy [McMu76], [McCa78]. They called their continued fractions
M-fractions, but M-fractions are essentially T-fractions. See also
[Waad64].
3. There exist several algorithms for finding the continued fraction
(of given type) which corresponds to a given power series L(z).
For instance:
Regular C-fractions: The qd-algorithm introduced by Rutishau-
ser [Ruti54]. This algorithm is also described in [JoTh80, p.
227]. Henrici [Henr63] has written a very interesting survey
on applications of this algorithm. The stability is discussed
in [Ruti63], [Henr74, Sect. 7.6].
The classical approximants of regular C-fractions are Pade
approximants. Hence, one can also apply algorithms which
produce these approximants directly. This is in particular
useful if we do not need the continued fraction itself.
C-fractions: E. Frank [Frank46] suggested a method for simul-
simultaneous computation of the denominators Bn(z) and the ele-
elements anzan of the C-fraction corresponding to a given power
series. The method is also described in [Perr57, p. 111].
T-fractions: The FG-algorithm was introduced by Jones and
Thron, [JoTh80]. In [McCa83] McCabe showed that this al-
algorithm can be regarded as an extension of the qd-algorithm.
Remarks 285
4. There are several special examples of Theorem 7. For a survey we
refer to [JaWa89].
5. Special versions of Theorem 14 have been known for a long time.
See for instance [LeSc39].
6. Branched continued fractions were introduced by V. Ya. Skorobo-
gat'ko. We refer to his book in Russian [Skor83] on the subject.
He also wrote an article in English for the conference proceedings
of a French-Polish meeting in Lancut in Poland [Skor87]. The
typical feature of such continued fractions are that the partial de-
denominators of the main continued fraction Bo + K(an/^n) a*e
again (sums of) continued fractions. Branched continued fractions
have in general no natural connection to three-term recurrence re-
relations. For more information we refer to the extensive works of
Kutchminskaya, Cuyt, Wuytack, Verdonk, Siemasko and Bodnar.
See for instance [Kuch78], [Kuch80], [Siem80], [Bodn86], [KuSi87],
[CuWu86], [CuVe88] and the references therein. Let it merely be
mentioned that the type of TDCF in Subsection J^.l was intro-
introduced independently by O'Donoghue, Kutchminskaya and Cuyt
and Verdonk.
References
[Bodn86]
[Carl26]
[CuVe88]
[CuWu86]
[Frank46]
[Gaussl3]
[Henr63]
[Henr74]
D. I. Bodnar, "Branched Continued Fractions", Kiev
Naukova Dumka A986). (In Russian.)
T. Carleman, "Les Fonctions Quasi Analytiques", Paris
A926), 78-96.
A. Cuyt and B. Verdonk, A Review of Branched Contin-
Continued Fraction Theory for the Construction of Multivariate
Rational Approximants, Appl. Numer. Math. 4 A988).
A. Cuyt and L. Wuytack, "Nonlinear Methods in Numeri-
Numerical Analysis", North-Holland, Amsterdam A986).
E. Frank, Corresponding Type Continued Fractions, Am. J.
of Math. 68 A946), 89-108.
C. F. Gauss, Disquisitiones generates circa seriera infini-
,3 i
+ 22-T +
+1-7a!+ 1-2.7G+1) TG)G)
etc., Commentationes Societatis Regiae Scientiarum Goet-
tingensis Recentiones, Vol. 2 A813); Werke, Band 3,
Koniglichen Gesellschaft der Wissenschaften, Gottingen
A876), 123-162.
P. Henrici, Some Applications of the Quotient-Difference
Algorithm, Proc. Symp. Appl. Math. 15, Amer. Math. Soc,
Providence, R.I., A963), 159-183.
P. Henrici, "Applied and Computational Complex Analy-
Analysis", Vol. 1, Wiley, New York A974).
286
References
287
[JaWa89] L. Jacobsen and H. Waadeland, When does f(z) have
a Regular C-Fraction or a Normal Fade Table?, Journ.
Comp. and Appl. Math. 28 A989), 199-206.
[JoTh80] W. B. Jones and W. J. Thron, "Continued Fractions: An-
Analytic Theory and Applications", Encyclopedia of Math-
Mathematics and its Applications, Vol. 11, Addison-Wesley
A980). Now distributed by Cambridge University Press.
[J0NT86] W. B. Jones, O. Njastad and W. J. Thron, Schur Fractions,
Perron-Caratheodory Fractions and Szego Polynomials, a
Survey,"Analytic Theory of Continued Fractions II, Pro-
Proceedings, Pitlochry and Aviemore", 1985 (W. J. Thron,
ed.) Lecture Notes in Math., No. 1199, Springer-Verlag
Berlin, Heidelberg A986), 127-158.
[Kuch78] K. I. Kuchminskaya, Corresponding and Associated
Branched Continued Fractions for Double Power Series,
Dokl. Akad. Nauk Ukr. SSR, Ser. A 7 A978), 614-617.
(In Russian.)
[Kuch80] K. T. Kuchminskaya, On Approximation of Functions by
Continued and Branched Continued Fractions, Mat. Met.
Fiz. Meh. Polya 12 A980), 3-10.
[KuSi87] K. I. Kuchminskaya and W. Siemasko, Rational Approxi-
Approximation and Interpolation of Functions by Branched Con-
Continued Fractions, "Rational Approximation and its Appli-
Applications in Mathematics and Physics, Proceedings, Lancut
1985", (J. Gilewicz, M. Pindor, W. Siemaszko, eds.), Lec-
Lecture Notes in Math., No. 1237, Springer-Verlag Berlin,
Heidelberg A987), 24-40.
[Lange82] L. J. Lange, 6-Fraction Expansions of Analytic Functions,
"Analytic Theory of Continued Fractions, Proceedings,
Loen, Norway 1981", (W. B. Jones, W. J. Thron and
H. Waadeland, eds.), Lecture Notes in Math., No. 932,
Springer-Verlag Berlin, Heidelberg A982), 152-175.
[LeSc39] W. Leighton and W. T. Scott, A General Continued Frac-
Fraction Expansion, Bull. Amer. Math. Soc. 45 A939), 596-
605.
288
Chapter V. Correspondence of continued fractions
[Lubi85] D. S- Lubinsky, Pade Tables of Entire Functions of Very
Slow and Smooth Growth, Constr. Approx. 1 A985), 349 -
358.
[Magn62A] A. Magnus, Certain Continued Fractions Associated with
the Pade Table, Math. Zeitschr. 78 A962), 361-374.
[Magn62B] A. Magnus, Expansion of Power Series into P-Fractions,
Math. Zeitschr. 80 A962), 209 -216.
[McCa78]
[McCa83]
[McMu76]
[Perr57]
[Ruti54]
[Ruti63]
[Skor83]
[Skor87]
[Siem80]
J. H. McCabe, A Further Correspondence Property of M-
Fractions, Math, of Comp. 32 A978), 1303-1305.
J. H. McCabe, The Quotient-Difference Algorithm and the
Pade Table: An Alternative Form and a General Continued
Fraction, Math, of Comp. 41 A983), 183-197.
J. H. McCabe and J. A. Murphy, Continued Fractions
which Correspond to Power Series Expansions at Two
Points, J. Inst. Maths. Applies. 17 A976), 233 -247.
O. Perron, "Die Lehre von den Kettenbriichen", Band II,
B. G. Teubner, Stuttgart A957).
H. Rutishauser, Anwendungen des Quotienten-Differenzen-
Algorithmus, Z. Angew. Math. Phys. 5 A954), 496-508.
II. Rutishauser, Stabile Sonderfdlle des Quotienten-Diffe-
renzen-Algorithmus, Numer. Math. 5 A963), 95-112.
V. Ya. Skorobogat'ko, Theory of Branched Continued Frac-
Fractions and Their Applications in Computational Mathemat-
Mathematics, ed. Nauka, Moscow A983). (In Russian.)
V. Ya. Skorobogat'ko, Branched Continued Fractions and
Convergence Acceleration Problems, "Rational Approxima-
Approximation and its Applications iti Mathematics and Physics,
Proceedings, Lancut 1985", (J. Gilewicz, M. Pindor, W.
Siemaszko, eds.) Lecture Notes in Math., No. 1237,
Springer-Verlag Berlin, Heidelberg A987), 46-50.
W. Siemasko, Branched Continued Fractions for Double
Power Series, J. Comp. Appl. Math. 6 A980), 121-125.
References
289
[Stiel8]
[ThieO9]
[Ihron48]
[Throti77]
[ViscO6]
[Waad64]
[Waad66]
T. J. Stieltjes, Recherches sur le fractions continues, Ann.
Fac. Sci. Toulouse Sci. Math, et Sci. Phys. 8 A894), 1-
122; 9 A895), 1-47. Oevres completes, Tome 2, P. Noord-
hofT, Groningen A918), 402-566. Also published in Mem-
Memoirs presences par divers savants a l'Academie de Sciences
de PInstitut National de France, 33, 1-196.
T. N. Thiele, "Interpolationsrechnung" Teubner, Leipzig
A909).
W. J. Tliron, Some Properties of Continued Fraction 1 -f
doz + K{z/(l + dnz)), Bull. Amer. Math. Soc. 54 A948),
206-218.
W. J. Thron, Two-Point Fade Tables, T-Fractions and
Sequences of Schur, "Pade and Rational Approximation",
(E. B. Saff and R. S. Varga, cds.), Academic Press, New
York A977), 215-226.
B. Viscovatov, De la methode generate pour reduire toutes
sortes de quantites en fractions continues, Mem. Acad.
Imperiale Sci. St. Petersburg 1 A803-1806), 226-247.
H. Waadeland, On T-Fractions of Functions Holomorphic
and Bounded in a Circular Disk, Det Kgl. Norske Vid.
Selsk. Skr. 8, Trondheim A964), 1-19.
II. Waadeland, A Convergence Property of Certain T-
Fraction Expansions, Det Kgl. Norske Vid. Selsk. Skr. 9,
Trondheim A966), 1-22.
[Wall48] II. S. Wall, "Analytic Theory of Continued Fractions", Van
Nostrand, New York A948).
Chapter VI
Hypergeometric functions
About this chapter
Hypergeometric functions 2^1 form an important class of special func-
functions. They satisfy three term recurrence relations which lead to very
nice continued fraction expansions. This was pointed out already by
Gauss in 1812, [Gaussl2]. He obtained a regular C-fraction expansion of
the ratio 2^1 (a, b] c; z)J2F\ (a, 6 +1; c + 1; z)> the so-called Gauss fraction.
It has very nice convergence properties compared to the hypergeometric
series itself. Also other types of continued fraction expansions for ra-
ratios of hypergeometric functions have been developed. We shall present
some of them here.
The basic hypergeometric functions (or q-hypergeometric functions) 2<?>i
also have natural connections to continued fractions. The regular C-
fraction expansion of 2^1 (a> &>c> 9» z)/2*Pi (a» bq\ cq\ q; z) is the q-analogue
of the Gauss fraction. It was developed by Heine in 1847 [Heine47], and
we call it the Heine fraction.
As an illustration of the role the hypergeometric functions play in the
continued fraction theory, we refer to the appendix. Most of the func-
functions there are related to hypergeometric functions.
291
292 Chapter VI. Hypergeometric Junctions
1 The hypergeometric functions 2^1
1.1 Why and how
Let us look at the hypergeometric series
q(q+l)fe(fe+l)z2
- + ... , A.1.1)
where the parameters a, 6 and c are complex constants. For short we
denote it by F(a, 6; c; z). For obvious reasons we assume that c 0 Z \ N.
If a ? Z \ N or 6 G Z\N then jP(a, 6; c; 2) reduces to a polynomial.
Otherwise the infinite series in A.1.1) has radius of convergence = 1.
This can be seen by the ratio test. It converges at z = 1 if -R(c—a—b) > 0.
(See for instance [AbSt64, p. 556].) The function to which it converges
can be extended analytically to the cut plane
D = {z?C; |arg(l - z)\ < tt} , A.1.2)
that is, to the complement of the real interval [1, 00). It is known as the
hypergeometric function, or more precisely, the principal branch of the
hypergeometric function, and we use the same notation F(a, 6; c; z) for
this function as for the series. Special examples of such functions are
jPA, 1; 2; z) = -z~l log(l — z) , (natural logarithm)
= z~'arctanz,
F(a,b;b;z) = A - z)~" for fc?Z\N.
The hypergeometric functions 2^1 293
For special values of z we get for instance
| + la)
if | + |
l~cT(c)T(-
For more examples we refer for instance to [AbSt64, p. 556-557], [Bern89],
[Erde53], [Bail64]. In Subsection 3.1 of Chapter I we claimed that
F(a,6;c;z) ^ a^ O2? W A14)
F(a,6 + l;c + l;z) 1 + 1 + 1 +
where
(a -f n)(c — 6 + n)
a'2n+1 — —
(c + 2n)(c + 2n
F + n)(c- a -f
We even indicated that 1 + K(an2/1) Converges to the function on the
left side of A.1.4) in the cut plane D given by A.1.2) (which of course
is much larger than the convergence disk of radius 1 for the series). We
shall justify this. Let us first assume that all an ^ 0.
Correspondence. By comparing the coefficients of zn on both sides of
the equality, we derived that
F(a, 6; c; z) = F(a, b + 1; c + 1; z) - ; ~ ( zF(a + 1, b + 1; c + 2; z) .
c(c+ 1)
A.1.6)
Since F(at 6; c; z) = F(b, a; c; z) we therefore also have
a,6 + l;c+l;z) = F(a + 1,6 + 1; c + 2; z)
(t?l)(c + 2I)gi;i(fl+1'6+2;C+3;g)-
294 Chapter VI. Hypergeometric functions
This means that {P«B)}^Lo> where
P-2n{z) = F(a+n, 6+n; c+2n; z), P2n+i = F(a+n, 6+n+l; c+2n+l; z)
is a solution of the three-term recurrence relation
+ an+lzPn+2(z) for 71 = 0,1,2,.... A.1.7)
The correspondence A.1.4) follows therefore from Corollary 13 in Chap-
Chapter V.
Convergence. We see from A.1.5) that an —> —1/4 asn-» oo. Hence
1 + TS.{anz/l) is limit periodic of loxodromic type for z ? D, uniformly
on compact subsets C C D such that oo ^ f(C). (See Theorem 28 and
Theorem 31 in Chapter III.) That f(z) = P0(z)/P,(z) follows then by
Theorem 14B in Chapter V.
The point z = 1. At this point 1 -f K(anz/L) = 1 -f K(an/1) is limit
periodic with an —> —1/4. Therefore the continued fraction may con-
converge or diverge, depending on how {an} approaches —1/4. In Problem
10 you are asked to prove that 1 -f K(an/l) converges to 1 — a/c if
?R(c — a - 6) > 0 or if c = a + 6, and that 1 + K(an/l) also converges if
R(c-a-b) < 0. 1 + K(an/1) diverges if c - a - 6 = it with* 6 R\{0}.
For 0?( c— a — b) >0 or c = a + b the value agrees with A.1.4) since
F(q,6;r;l) _ T(c)T(c - a - b) /T(c + l)r(c - a - 6)
a, 6 + 1; c + 1; 1) ~ T(c - a)r(c - 6) / T{c + 1 - a)r(c - b)
- r^ r(c-a+l) = ?-a_
The cut z > 1. We shall not go into details here. The fact of the
matter is however that the continued fraction diverges for z > 1. (See
[Lore].)
So far we have assumed that all the coefficients an of K(anz/1) are non-
nonzero, such that the regular C-fraction is non-terminating. It remains to
look at:
The terminating case. We have that aw = 0 for some N 6 N if either
a?Z\N, or 6 G Z \ No, or c - b 6 Z \ N, or c - a 6 Z \ No.
The hypergeometric functions 2F1 295
Case 1: a € Z\N. Let a = -k, k E No. Then F(a+ /;,& + &; c + 2fc; z) =
F(a 4- fc, 6 -f k + 1; c -f 2/j + 1; z) = 1 and GL2A-+1 = 0. By repeated use of
A.1.7) we find that
'
Hence the choice N — 2k in this relation gives
F(-k}b\c;z) aiz_ a^ a-lkz
1 + 1 +...+ 1
for all z E C. On the other hand a-ifc+i = 0 so the right hand side
of A.1.10) is equal to 1 + K(an^/1). Hence we still have /(z) = 1 +
J?(aTlz/l). In a similar way we can prove that 1 + K(anz/1) has the
value as given by the left side of A.1.4) if 6 = —k for a k E N.
Case 2: c - b ? Z \ N. Let c - 6 = -k, k E No. We shall use the well
known formula (see [Erde53, p. 69])
F(a, 6; c; z) = A - z)r-a-fcF(c - 6, c - a; c; z).
Since c — b = — &, it follows that
F(a+k, fc+ib; c+2fc; z) = (l-z)r-a-fcF@, c-a+k\ c+2A;; z) = (l-z)c'a'b
and similarly
F(a + A, 6 + ib + 1; c + 2ib + 1; z) = A - 6
Further, a.2k+i — 0, so A.1.10) still holds and its right side is equal to
1 + K(a«z/1). The argument for the case c — a E Z \ No is essentially
the same.
Of course, if z = 0 then the continued fraction also terminates. Then
both sides of A.1.4) are equal to L, and equality holds trivially. But this
case is already covered by the previous arguments, (z = 0 E D.)
We collect all these results to get:
Theorem 1 (Gauss fractions) Let a, b and c be complex constants
with c ^ Z \ N, and let {an} be given by A.1.5). Then:
296 Chapter VI. Hypergeometric functions
(A) 1 + K(anz/l) ~ F(a, 6; c; z)/F(a, b + 1; c + 1; z).
= f(z) = F(a, 6; c; z)/F(a, b + 1; c + 1; z) ire
plane D — {z ? C; | arg(l — z)| < 7r}. 77mz? is, 1 + K(an
verges to the well defined, meromorphic function f(z) in D. The
convergence is uniform on every compact subset of {z ? D\ f(z)
oo}.
(C) 1 + K(an/1) = /(I) = 1 - a/c if »(c - a - b) > 0 or c = a + b.
That is, 1 + K(an/l) converges to /(I) (given as in B) under
these conditions. If !R(c — a — b) < 0 then 1 + K(an/l) converges
—> i_ f(z).
(D) 1 + K(an2/1) = f{z) = F(a1b',c]z)/F(a,b + l;c+l;z) for all
z 6 C if the continued fraction terminates.
(E) l + J<i(anz/l) diverges if all an ^ 0 and either z = 1 with c — a — b =
it;t e R\{0}, or z > 1.
1.2 A special case
Tf b — 0, then F(a, 6; c; z) = F(a, 0; c; 2) = 1. This means that Theorem 1
can be used to obtain a continued fraction expansion of F(a, 0;c;z)/
/^(a, 1; c+ 1; z) = l/F(a, 1; c+ 1; z). Let us replace c by c — 1. Then we
get
a(c- 1) (c- q
1- 1 - 1 - 1
l(c — a)z (a-fl)cz
1-c- c+1 - c+2 -
(a+2)(c+l)z
c + 3 - c + 4
for c 0 Z \ N, c / 1, z € D = {z 6 C; | arg(l - z)\ < ?r}.
V - • ;
The hypergeometric functions 2F1 297
Example 1 We apply Theorem 1 to the special examples mentioned in
the beginning of Subsection 1.1. We get
log(l-z) = -zF(l,l;2;z) =
? l!f l!i ?!f ?!f ?!i ?!i
1_ 2-3-4-5-6-7
for 2 € D ,
2z ^z2 l2z
2z2
1_3_5_ 7 _9
2 2 2 2
2z l2z2 22z2 32z2
l_3-5-7-9 '
arctanz = zF(?, 1; §;-z2)
z IV 2V 3V 4V
D
log [z +
A-
arc sin
(i - *)
(i + -2)'
z
1/2
1
for
2
1-
1-
3-
z
- 3
z2
- 3
2
1-
4z2
9
1-
+
-z2
1.3.
' 2'
—
2z2
3
5
—
2z2
5
el
' 2Z
5
2
1
—
•6z
11
1
+ '
2 2
—
•2z2
5
2
¦ ¦ •
•2z2
• |,
7
2
3
—
•
3
h 9
r2 2
—
-4z2
7
for
•4z2
+ ••
37
'22
9
2
z2
3-
•
2 3
—
4z2
• ^z2
2"
>
T+ 3 + 5 + 7 + 9+-
for - z2 E D .
-O
298
Chapter VI. Hypergeometric functions
1.3 Choice of approximants
What kind of approximants should one choose for 1 + K(an^/l) in The-
Theorem 1? Since anz —» — z/4 we can use the idea from Subsection 5.5 in
Chapter III and use
a\Z
A.3.1)
where
x{z)=
A.3.2)
with
z) > 0 for z € D
(If one needs rational approximants, then y/1 — z can be approximated
by a constant, a polynomial or a rational function.) But we can do
better. From A.1.5) we find that
1 4n(fc - a + |) + c2 - 4a(c - 6) + c
and
a2n+l ~ 4 ' 4(c + 2n)(c + 2n
1 4rc(a - 6 - i) + c2 - 46(c - a) - c
4 +
Hence, writing
= (an + ^
we find that
n —> oo
= <
so that
Wl'iZ) = <
.3.3)
1 ifa-6-^=0 A.3.4)
and c2 - c - Abe + 4fc2 + 26 ^ 0 ,
A.3.5)
The hypergeometric functions 2F1 299
is an even better choice according to Theorem 33 in Chapter III. (If
a - b - \ = 0 and c2 - c - 46c + 462 -f 26 = 0, then all 6^ = 0 and
1 + K(an^/l) is periodic with value x(z).) Continuing this process we
can write
#>(*) = anz - «,«,!>,(*) (l +wW(z)) A.3.6)
to find Wn . We distinguish between two cases.
Case 1: P = a - 6 - 1/2 = 0.
Suppressing the variable z we have
) ^. l( + ¦
@)
2zJ '
where C = Qz/(c + n - l)(c + n), Q = (c - 26)(c - 26 - l)/4. Hence
l>n+\/6n —> 1. In fact, by induction one finds that S^\/6n —* 1 as
n —> 00 for every m > 0, just as in Example 25 in Chapter III. So we
choose
A.3.8)
Case 2: P = a - 6 - 1/2 ^ 0.
Now we get
where
+ Qu .@) _ -Pn
2n ~ (c+2n- l)(c + 2n)Z? 2rt+1 " (c + 2n)(c + 2n+ i
300 Chapter VI. Hypergeometric functions
with Qo = (c2 — c)/4 - fc(c — a) and Q\ = (c2 + c)/4 — a(c — 6). Hence
c@ _ W-P + Qo + Qj )? + P2z]n2 + lower degree terms
2n = ~* (c + 2n - l)(c + 2nJ(c + 2 + 1)
and
A) [4(Qu + QiK1 + P"z]n2 + lower degree terms
2/1+1 = ~2 (c + 2n)(c + 2n + lJ(c + 2n + 2)
and thus
and
1 .
* ~ 2" ^ ( 3 }
just as in Example 26 in Chapter III. Hence, by Theorem 34 in Chap-
Chapter III we get faster convergence to the right value if we choose the
approximants Sn(wn ') where
•" *=w)
and
« *=w")+1/:{',' (x-3-12)
1 + x + iq
q = Q\ = — • A.3.13)
q
* 1 + X - Xt\
Let us look at some examples.
Example 2 Let a — 1/2, b — 3/2 and c = 5/2 in Theorem 1, so that
1-2. 5-6 _3-4, 7-8 .
1
1 - 1 - 1 - 1 -••
We have P — a — b — 1/2 ^ 0, so we can use
The hypergeometric functions
301
Since by A.3.10) - A.3.11)
lim
- 2x(z)
n
we choose
= W
where
3z
3z
t
3z- 1(
- 10zB)'
- 10i(z)
-2x(z) '
, _ 8i2(z)
We stop here, although we could have continued the process. We shall
instead study the effect numerically, for given values of z.
We first choose z = — 1. Then 1 + K(an^/1) 1S a continued fraction
with positive elements, and we expect fast convergence. The first 8
approximants are given in Table 1. The value of the continued fraction
is 1.0397662053001, correctly rounded to 14 digits.
n
1
2
3
4
5
6
5n@)
1.057...
1.0387...
1.0401...
1.039736...
1.039774...
1.0397653...
Sn{x)
1.047...
1.0409...
1.03988...
1.03978...
1.0397685...
1.03976659...
$,.(«?})
1.03986...
1.03963...
1.0397675...
1.0397647...
1.039766226...
1.039766182...
1.039791...
1.03976603...
1.03976642...
1.0397662018...
1.0397662079...
1.03976620523...
Table 1: z - -1. /(-I) = 1.0397662053001
Notice also the nice oscillation properties of {5n@)}, {5n(iUn )} and
n )}. In fact, Theorem 4 in Chapter III can be applied to deter-
determine when Sn(wn) oscillates regularly about its limit.
302
Chapter VI. Hypergeometric functions
In Table 2 we show how fast the various types of approximants reach the
value of the continued fraction, correctly rounded to the given number
of digits for some values of z. The number N is the smallest index such
that the approximants take this value for all indices n > N.
z
-1
-2000
10 + 0.H
100 + 0.H
Value of the
continued fraction
1.039766
1.198201
1.2152-0.1424z
1.21728-0.018383i
iVfor
Sn@)
7
202
2991
> 5000
iVfor
Sn(x)
7
161
1149
>5000
AT for
5
141
291
1483
AT for
2
93
37
183
Table 2.
As expected, the convergence is slower when z is close to the cut z > 1
of D. But it is for such values of z that the gain by using 5ri(iUn ) or
Sn{wn ) is most dramatic.
Another question is: IIow can we use the (approximate) value of A.3.14)
to find for instance F(l/2,5/2;7/2; zI We have
P( I 5. 7. _\ _
x V2> 2' 2' ^) ~
I 5. Z. 7\
2' 2' 2' /
1 3. 5. \
2 ' 2' 2 ' /
I1 I1 I1 I p>( l_ •> . .±.
\ 'I 'i \ \ 2' 2' 2'
2' 2 ' 2' /
where F{ — ^, |; :|; 2) is known to be equal to A
where
1 13
2» 2» 2»
1 *±
2' 2» 2'
= 1-
3-4
3-5
3-4
9 • 11
for z ? D and
hk
1 _ 1 _ 1 _ 1 _.
by Theorem 1, and f(z) =
A.3.14). Hence
IT/I 5. Z._|\
¦* V2' 2' 2' XJ
±, §; f; z)/F(^/j] \\ z) is the value of
1.039766
0.768692
1.030766
The hypergeometric functions 2^\
303
Example 3 According to Example 1 we have
log—= 2zF(l, !;§;,')- y.-j- ______
for z2 ? D — {w ? C; | arg(l — w)\ < 7r}. Or, equivalently,
log((l + z)/(l - z)) ~ 2z/(l + K^2(anz/1)) where
l 1 9v,^/l 3
-,0;-;«2)/F(-,l;-;*
Therefore we are in Case 1 where P = 0, and by A.3.8) we choose
for 77 = 1,2, 3,... ,
oo
1+ K
n=:2
and so on. For z — 52 we have
1 + z
log
1
where ^ = arg(l + 5i) « 1.37340077. That is, the continued fraction
converges to 2.74680152, correctly rounded to 8 digits. Its first approxi-
mants are given in Table 3.
n
1
2
3
4
5
6
7
8
5n@)
10.0...2
1.07. ..2
4.79. ..2
Lot). . • 2
O.Oo. . • 2
2.308. ..2
3.08. ..2
2.540...z
Sn(x)
%J m ? I m m m L
2.679...2
2.7658...z
?j* i oyy... 2
2.7496... 2
2.7454...z
2.7474...2
2.74647... 2
2.891... 2
2.731...i
2.750...z
2.7457... z
2.74716...z
2.74665... z
2.746863... i
2.746773... i
Sn(w™)
2.7427... z
2.7475. ..2
2.74660... z"
2.746860... 2
2.746780... z
2.7468095... i
2.7467982... i
Table 3: z = 52. /(z) = 2.74680152
A similar exposition as given in Table 2 is given for this continued frac-
fraction in Table 4.
O
304
Chapter VI. Hypergeometric functions
&
52
1 +»
5 + O.li
Value of the
continued fraction
2.7468 2
0.804719 + 2.0344442
0.4053 + 3.133i
TV for
Sn@)
30
15
1448
N for
Sn(x)
12
9
69
TV for
8
7
21
iVfor
6
6
10
Table 4.
1.4 Other continued fraction expansions
So far we have looked at regular C-fraction expansions of ratios
F(a, 6; c; z)/F(a, b + l;c+ 1;^), usually called Gauss fractions. They
are very useful. In this section we shall briefly mention two other clas-
classical expansions. The first one is due to Norlund [N6rl24].
Theorem 2 (Norlund fractions) Let a, b and c be complex constants
with c ? Z \N. Then:
(A) The continued fraction
<c+l)
+ b 4- 3
4.
a 4- 6+ 5
c+2
4
corresponds at z = 0 ?0 ?/ie series
L(z) =
A.4.1)
A.4.2)
The continued fraction A.4.1) converges to the function in A.4.2)
if it terminates, or if $l(z) < 1/2, or if z — 1/2 and \$s(a + b)\ <
-a-6- 1).
The hypergeometric functions 2F1 305
Proof : (A): {F(a + n,6|n;c|n; z)}5S=o ls a solution of the three-
term recurrence relation
a + b + 2n + 1
^ ~
(c + n){c-\- n -f
for n = 0,l,2, A.4.3)
(This can be verified by comparing the coefficients of the power series
involved.)
(B): An equivalence transformation brings A.4.1) to the form 6o
K(cn(z)/1) where
= (a + n)(b + n)(z - z2
(c + n - (a + b + 2n + l)z)(c +n-l-
for n > 2. Hence lim^ —> oo cn(z) = c*(z) = (z — z2)/(l — 2zJ. Since
c"(z) is real and negative < —1/4 if and only if $l(z) = 1/2, z / 1/2, it
follows that A.4.1) converges to a meromorphic function for 9ft(z) ^ 1/2.
By Theorem 14B in Chapter V we find that this function is A.4.2) in
the domain 9ft(z) < 1/2.
For z = 1/2 we find that cn(z) reduces to
For 5RBc - a - b) > 1 we let a = - argBc - a - b - 1) where \a\ < v/2.
We plan to use the parabola theorem, Theorem 20 in Chapter III, to
prove convergence of 60(l/2) + K(cn(l/2)/l). We have for n > 2 that
cn(l/2) G Pa from some n on if and only if
(a + n){b + n)
Bc- a- b- 1)-
2
1. e.
\n2 + (a -f b)n + ab\ < n2 + 5R((a -f ^>)^ + a6) H—|2c — a — 6 — 1|2 cos2 a
?*
306 Chapter VI. Hypergeometric functions
from som n on. Squaring this equation and collecting terms of the same
degree in n gives
n2 ([$s(a + 6)]2 - |2c - a - b - l|2 cos2 a) + lower terms < 0
which holds from some n on if \$s(a + b)\ < |2c — a — b — 1| cos a =
The terminating case can be proved in the same way as in Theorem 1.
The Norlund fraction A.4.1) can be multiplied by c and simplified some-
somewhat by means of an equivalence transformation to give
(a + 2)F + 2){z -
c + 2- (a+b + 5)z +
o,6;c;z)
By substituting z = 1 — u in A.4.1) we can also clarify what happens if
> 1/2 or if z = 1/2 with 5RBc - a - b) < 1:
Corollary 3 Le? ?/ie continued fraction A.4.1) be non-terminating.
Then it converges to
c — a — b — 1 F(ai b;a-\- b -\- 1 — c]X — z)
c F(a + l,6 + l;a+^ + 2-c;l-z)
if either $l(z) > 1/2 or z - 1/2 with |S(a+ 6)| < -5RBc - a - 6 - 1)
The hypergeometric functions 2F1 307
Proof : With z = 1 — ii, the continued fraction A.4.1) can be written
{a f 2)F + 2){u - u2)
2-a-6-5 + (a + 64
+
1
-C - 1 + (a + 6 + 3)u
a+2)F +
5W4.... I
where c* = — c + a + 6-fL An equivalence transformation brings this
over to the form
+ c* + 2 - (<
which we know converges to
2)(tt-i/2) I
a + b + S)u -\ J
c F(a+ 1,6+ l;c*+ l;w
for 5ft(w) < 1/2 or for u = 1/2 with 5(a + 6)| < ftBc* - a - 6 - 1) by
Theorem 2 and A.4.4). Substituting u = 1 — z and c* = — c + a+6+1
gives the result. ¦
PfafF's transformation
F(a, 6; c; z) = A - z)~fcF(c - a, 6; c; z/(z - 1)) A.4.5)
can be verified (formally) by comparing the coefficients of the power
series G Lo on both sides. If we apply this in Theorem 2 and use the
substitution z/(z— 1) —> z in A.4.1)- A.4.2), we get a continued fraction
which essentially is due to Euler, [Euler27], [Euler67]:
Theorem 4 (Euler fractions) Let a, b and c be complex constants
with c ? Z \N. Then:
308 Chapter VI. Hypergeometric functions
(A) The general T-fraction
_ a + 2)z-c + 2 + F - a
A.4.6)
corresponds at z = 0 to
cF(a, 6; c; >z)/F(a, 6 + 1; c + 1; z) A-4-7)
and converges to A.4.7) if\z\ < 1, or i/z = —1 with \$s(c — a
+ a — 6 — 1), or i/ A.4.6) terminates.
(B) Ifb — a^ —2,-3,-4,..., then A.4.6) corresponds at z — oo to
A.4.8)
converges to A.4.9) if\z\ > 1 or if z = —1 wi
c + a-b-l).
Remark: Strictly speaking it is A.4.6) divided by c which is a T-
fraction in the usual sense, since this can be written on the form 1 -f
Goz + K{Fnz/(l + Gnz)) where
_ _(ca + n)F + n) ^ 6-q
n~ "( + nl)(c + n)' n~
c-l-n
The hypergeometric functions
309
Proof (A): By Theorem 2 and A-4.4) we know that
-c
c+l-(a + 6 +
(a + !)(>+
-fl
(a+ 2)F
c + l + (a + 6 + 3-c-l),
A.4.9)
where z — —f/(I — f), converges to
=
=
F(a + 1,6 + 1; c + 1; 0 A -
- a, 6 + 1; c + 1;
Uj
for »(() < 1/2 or ? = 1/2 with |S(o + 6)| <
cancel the factor A — ?). Replacing a by c
< 1/2 <=> |z| < 1, and ( = 1/2 <=> z =
- a - 6 - 1). We can
a gives the result since
-1.
(B): By Corollary 3 we find that for 3?(f) > 1/2, i. e. for \z\ > 1, or
for f = 1/2 with |S(a + 6)| < — 5RBc — a — 6 — 1), the continued fraction
310 Chapter VI. Ilypergeometric functions
A.4.9) converges to
F(q,6;q + 6 + l-c;l-e)
^a+ c;JF(a+ij6 + l;a + & + 2-c;l-O
f "fcF ( 6 + 1 - c, 6; a + 6 + 1 - c;
= -(a + 6+l-c) ?
Again we replace a by c — a and ?/(l — f) by —z to get the result.
Example 4 Let 6 = 0 in A.4.1) and replace a by a— 1. Then Theorem 2
leads to the expression
«41.C+1.J)=Ja
c- az + c + 1 - (a + 2)z+ c + 2 - (a + )
A.4.11)
for 5R(z) < 1/2. Similarly, with 6 = 0 in A.4.6) - A.4.7) we find that
)
A.4.12)
for |z| < 1. Hence, also counting A.2.1), we have three different contin-
continued fraction expansions for F(a, 1; c + 1; z). They converge in somewhat
different domains, but let us compare them for the function
for some values of z where they all are valid. All three of them have the
form K.(an(z)/bn(z)) where the limits
= a{z)
exist. Hence we use the approximants Sn(bn(z)x(z)) where
x(z) = ( yj\ + Aa(z) ~ 1J /2 where 5R^/l + 4a(z) > 0 .
With JV as used in Table 2 and 4, the results are given in Table 5.
Confluent hypergeometric functions
311
z
0.49
0.99z
-0.99
0.49 + 0.8*
-0.2
-z~x log(l - z)
1.37417
0.788256 + 0.345024i
0.6950855
0.941249 + 0.510734*
0.911608
JVfor
A.2.1)
7
8
9
10
5
N for
A.4.11)
139
23
17
278
6
JVfor
A.4.12)
13
299
ca 700
83
7
Table 5.
Of course the convergence is slower the closer we are to the boundary of
the convergence regions. Still the Gauss fraction seems to be doing very
well.
O
One can find several continued fraction expansions of similar nature for
hypergeometric functions. For instance, in [AbSt64, p. 558]: one can find
quite a number of three-term recurrence relations for such functions. In
the cases where the hypergeometric functions are minimal solutions of
these relations, minimal regarded as elements in (F, || • ||), (see Chap-
Chapter V), we immediately have a corresponding continued fraction. The
usefulness of this continued fraction is normally tied to its convergence
properties.
2 Confluent hypergeometric functions
2.1 Notation
Let us introduce the Pochhammer symbols
(aH=l, (a)fc = a(a + l)(a
for Jb€N. B.1.1)
312 Chapter VI. Hypergeometric functions
Then the hypergeometric series in (l.l.l) can be written
[^h N. B.1.2)
A generalized hypergeometric series is defined for given numbers p, q ?
No:
where ai,... ,ap,6i,... ,bq are complex parameters with 6],... ,6q ^ Z\N.
The series in B.1.2) is a 2^1 - The series ^(c; z) in Example 9 in Chapter
V is a o-Fi. We shall look at some cases which can be derived from
2.2
The series 2^1 (a? &5 ci z/a) converges locally uniformly for \z/a\ < 1.
Hence, we can let a —> 00 termwise in this series. Since
lim ^ = lim A + -\ (l + -V • • A + ^—^ = 1 B.2.1)
«—> 00 a« a—>oo \ a) \ a) \ a )
for all n ? N, we find that
00
lim
a —> 00
for c ^- 0,-1,-2,... . If we let a —* 00 term by term in the Gauss
fraction 1 + I?((anz/a)/\) corresponding to
= /(f)
that is, an is given by A.1.5), then it transforms into the continued
fraction
1 , v dnz x j c-b + n
1 + K -7- where d2n+1 = -
1 ~znTI (c + 2n)(c + 2n
h 4- 77
^2n =
(c+ 2n- l)(c-
There are at least two ways in which we can prove:
Confluent hypergeometric functions 313
Theorem 5 Let b and c be complex constants with c $ Z \ N. Then the
continued fraction B.2.3) corresponds (at z — 0) to the function
g(z) := ,F{F; c; z)/iFl (b + 1; c + 1; z) B.2.4)
and converges to g(z) for all z G C. The convergence is uniform on
compact subsets of DfJ = {zE C;g(z) ^ oo}.
Correspondence. We can either proceed as in the proof of Theorem 1
and start with the recurrence relation
Pn{z) = Pn+i{z) + dn+lzPn+2{z) for n = 0,1,2,... ,
or we can use the following theorem due to Perron, [Perr57, Satz 3.10,
p. 112]:
Theorem 6 Let
V 2 i4----, B.2.5)
+ V , V V + i +
I -f 1 + 1 +
where the coefficients a*, and c& are functions of a parameter p such that
lim a^ = at and lim c^ — cl for k = 1,2,3,... B.2.6)
p—*po p—*poK
where pu G C and a*k ^ 0 if a^ = a^.(p) ^ 0. Then
n
i -f- i -f i -+¦
. B.2.7)
It simply gives that the correspondence 1 +K{dnz/1) ~ ^(z) is inherited
from the correspondence 1 -f K((c?n2;/a)/l) ~ f(z/a). Theorem 6 is a
direct consequence of the following observation:
Lemma 7 Let the C-fraction 1 -f 'K(anz/1) correspond to the power
series L(z) ? Lo and have approximants Ln(z) ? Lo- Then
Ln+\(z) — Ln(z) = dnzUn + higher order terms B.2.8)
314 Chapter VI. Hypergeometric functions
and
L(z) - Ln(z) = dnzVn + higher order terms B.2.9)
where
dn = (-1)" II ak and "n = Yj(*k. B.2.10)
Proof of Lemma 7: Let Ln(z) = An(z)/Bn(z) (canonical form).
Then we find, (as in Chapter V, formula B.1.1)) that
T
n+l(z) Ln(z) - rr , ,
_ An+\{z)Bn(z) - ATl(z)Bn+l(z)
Bn+l{z)Bn(z)
=
Bn+l{z)Bn{z)
= dnzUn -f higher order terms. B.2.11)
The expression for L{z) — Ln(z) follows then by Theorem 4C in Chapter
V. ¦
Convergence. That l|KDz/l) in B.2.3) converges to g(z) in B.2.4)
can also be seen in two different ways. Either we observe that dnz —¦ 0
locally uniformly with respect to z in C. This means that 1 + K(rfn^/1)
converges to a meromorphic function in C. If all dn ^ 0, then this
function must be g(z) because of the correspondence.
The other way to see this, is that, on each compact set C C C with 0
as an interior point, 1 -f K((^n2/«)/l) has a tail 1 -f K((d2N+nz/a)/l)
converging uniformly to 2iri (« + N, 6 + JV; c + 2iV; z/a)/2F\ (a + JV, 6 -f
N + 1; c -f 2N + 1; z/a) in C by Theorem 1. (The index N depends on
C, of course.) Moreover z/a —* 0 uniformly in C. Hence the uniform
convergence of 1 + K{d<2N+nz/l) to tF,F + N\ c + 2iV; z)j^Fi{b + JV +
l;c + 2iV + 1"}2) follows. This proves that 1 + K(^n<z/1) converges to
Confluent hypergeometric functions 315
The terminating case, d^j — 0 for some N ? N only if a^ = 0 in
1 + K.{an(z/a)/l). From Theorem ID it follows therefore that g(z) =
1 + K{dnz/l) for all z e C if dN = 0.
Now we have proved Theorem 5. The regular C-fraction converges to
g(z) for all z ? C. On the other hand, the \F\-series in B.2.2) also
converges for all z ? C, so we do not gain anything when it comes to
domain of convergence. What we gain is speed of convergence and less
chance of overflow. As approximants we choose ?n@) which normally
converges reasonably well for these continued fractions. The squareroot
modification as described in for instance Example 27 of Chapter III is
not a good idea here because of the alternating character of {dn}. But
one might try Sn(wn) where wn is the value of the 2-periodic continued
fraction
t \ _ dn+iz dn+'2z dn+\z dn+2z
if it converges.
Example 5 The confluent hypergeometric series 12*1A; c + 1; z) can be
written iF|(l; c -f 1; z)/\F\@\ c; z) since iFi@]c;z) = 1. Application of
Theorem 5 therefore leads to
z \-z (c+l)z 2z
T-c + l + c + 2- c + 3 +c + 4- c + 5 +c+6 "
Let c = 2. We shall compare the speed of convergence of the hypergeo-
hypergeometric series \Fi(l\3]z) and the corresponding C-fraction
1 z 1 • z
for some values of z. Of course we neither need the series nor the con-
continued fraction to compute
On the other hand, since we know the value of \F\(\.\ 3; z) already, the
convergence is easier to study. Table 6 gives the number of terms needed
to reach the given accuracy.
O
316
Chapter VI. Hypergeometric functions
z
1
10
-10
10+lOi
lOi
-1000
Value
1.4365636569
440.3039
0.180000908
-119.9286+184.9278i
0.03678143+0.2108804i
0.001998
Series
N = 12
N = 28
N = 38
iV = 40
AT = 35
overflow
Sn@) for c.fr.
N=l0
N = 24
N = 20
N = 29
JV = 23
iV = 116
Table 6.
2.3
If we replace z by cz in Theorem 1 and let c —> oo, we obtain
Theorem 8 Let a and b be complex constants. Then
where p2n+\ — — {a + n)i P2n = —{b-\-n) B.3.1)
corresponds (at z = 0) to
2F[)(aJb;z)
B.3.2)
and converges in the cut plane D = {z 6 C; | arg(—z)\ < ir} to a func-
function h(z)} meromorphic in D. The convergence is uniform on compact
subsets of Dh = {z ? D\ h(z) ^ oo}.
In this case the hypergeometric series has radius of convergence zero.
That is, 2-^0(^5 b\ z) diverges for all z ^ 0 if a,6 ^ Z \ N. However the
continued fraction 1 -f- lK.(puz/l) corresponds to B.3.2) and converges
to h(z) in D. It turns out that the connection between h(z) and the
divergent series 2-^o(a, b; z) is that 2^o(«j ^5 z) is an asymptotic series for
h(z) in D as z —> oo. For more information on asymptotic series we
refer to Chapter VII.
Since pnz —> oo, approximately as (—nz) as n—> oo, the squareroot
modification is likely to work well for this continued fraction. That
Confluent hypergeometric functions 317
is, we may use approximants Sn(wn) where
pn+lz pn+lz pn+]z
and expect these to converge faster to the value of B.3.1) than the
classical approximants 5n@).
2.4 o
Still another confluent case arises if we replace z in Theorem 5 by z/b
and let b —> oo. We get:
Theorem 9 Let c be a complex constant with c ? Z \ N. Then
where qn = J— iW ^ ,, B-4.1)
(c + n — l)(c + n)
corresponds at z — 0 to
converges to k(z) for all z 6 C. T/te convergence is uniform on
compact subsets of D^ = {z 6 C; A?(z) ^ oo}.
This is exactly the continued fraction that we studied in Example 5 in
Chapter IV and Example 9 in Chapter V. It is connected with the Bessel
function Ju{z) of the first kind of order v. For v $ Z\ No we have
318 Chapter VI. Hypergeometric functions
so that
= 2(^ + l):
i
= 2(i/ +
2(i/ + l)-
1
2 ^2
i/ + 2) - 2(i/ + 3) - 2{v + 4) —
3 Basic hypergeometric functions
.1 Definition
In 1847 Heine [Heine47] studied the series
where \q\ < 1. By use of L'llopital's theorem we find that
s
1 a
lim = lim
2
C.1.1)
This means that C.1.1) is transformed into 2JF1 (a, /3; 7; 2) when g —> 1.
C.1.1) is called a basic hypergoometric series (ot just a q-hypergeometric
series). Many results for hypergeometric series have their counterpart
for q-hypergeometric series.
Watson [Wats29] simplified the notation in C.1.1) by writing a = grt,
6 = qQ and c = q1. Inspired by the Pochhammer symbol B.1.1) used in
hypergeometric series, we define
(a;g)o= 1, (a\q)n = (l-a)(l-aq){l-aq2) • • -{l-aqn-1) for n ? N.
C.1.2)
Basic hypergeometric functions 319
Then C.1.1) can be written
00
which strongly suggests the connection to hypergeometric series. The
parameters a, 6, c and q are complex constants with \q\ < 1. To avoid
zeros in the denominators we assume that c is chosen such that (c; q)n ^
0 for all n\ that is, c/ 1, g, q~2, <j~3, .... Tf the series in C.1.3) is
non-terminating, then it converges locally uniformly for \z\ < 1. Hence,
the function to which it converges is analytic in the unit disk \z\ < 1.
On the other hand, if a ^ q and b ^ q then
A - zJ<px (a, 6; c; q; z)
= . , ^(a;g)n-i(*;q)n-i f(i-ag-'Xi - 6g"~') A „
?
g
- 6/g) ^ (c; g)n(9; 9)TJ
r-a-6 + cj-g ab- cqf 2 ri\
( M + —jr-iq z) )
l2^1 (~'~5c;g;^)-1 [
{ \q q ) J
-1)- C-L4)
Tf a = 9 ot 6 = g, similar expressions can be found. Dividing by 1 — z
we thus find that the function, which we also denote by 2(p\ («j b\ c; q\ z),
is analytic for \z\ < q~l except for a simple pole at z ~ 1. Repeating
the argument, we find that 2^1 (a, b\ c; q\ z) is analytic in the whole com-
complex plane except at the points z = 1, z = g, z — q~2, ... where
it has simple poles. This function 2(P\{aib; c;q\ z) is called the basic
hypergeometric function or the q-hypergeometric function.
320 Chapter VI. Hypergeometric functions
Just as for hypergeometric functions, one may generalize to get
r<ps(ax,..., ar; 6,,..., 6.,; q\ z) = > )- { tt r— 7 r- . C.1.5)
3.2
The q-analogue of the recurrence relation A.1.6) for 2F1 is
2<p\(a,b\c\q\z) =
2
^ w1-c N
C.2.1)
(Notice the simpe and straightforward transformation of A.1.6) into
C.2.1)! That C.2.1) really is true can for instance be checked by com-
comparing the coefficients of zn on each side of the equality.) Based on this
relation, it is natural to study the continued fraction
- bq)
()()
(l-c)(l-cg)Z A - cg)(l - eg2) (T?)(l - eg3)
1 + 1 + 1 +•••
C.2.2)
and its possible connection to the series (or function)
2y?1(q,6;c;g;z)
t\z) — — -. {6.2.6)
<p(abq]cq;q\z)
This was first done by Heine himself [Heine47], and C.2.2) goes by the
name of Heine's continued fraction. By the same methods as used in
Subsection 1.1 one finds:
Theorem 10 Let a, 6, c and q be complex constants with \q\ < 1 and c
q~n for all n ? Ny. Then the continued fraction in C.2.2) corresponds
at z = 0 to F(z) in C.2.3) and converges to F(z) in the whole complex
plane. The convergence is uniform on compact subsets of{z ? C; F(z)
oo}.
Basic hypergeometric functions 321
The continued fraction in C.2.2) has the form 1 + K(anz/1) where
A - aqn){cqn - b)qn
- bqn)(cqn - a)qn-1
a'2n ~ (l-cq2n-l){l-cq2n) '
That is, an —> 0, and the classical approximants {5ri@)} represent a rea-
reasonable choice. For large \z\ one can also use the squareroot modification
Sn(wTl) where
an+vz an+lz an+lz y/l + 4a,l+1z 1
wn[z) = = . M.z.o
K J 1 + 1 + 1 +¦-. 2 K J
Some other continued fraction expansions for hypergeometric functions
also have their counterparts for basic hypergeometric functions. The
relation A.4.3) has the analogue
a + h - ab - abq
— z
(I - aq)(i - bq) . 2 ., .,
_ c \
C.2.6)
which leads to:
Theorem 11 Let a, b, c and q be complex constants with \q\ < 1 and
c ^ q~n for all n 6 Nu- Then the continued fraction b^ + K(an/^n
an(z) = A - ag")(l - bqn)qn~lz(c - zabqn) ,
corresponds at z = 0 ?o f/ie function
F(z) = A - cJ^i(a,&;c;g;z)/2<pi(ag,6<7;c<7;g;z) C.2.8)
converges to F(z) for all z ? C. T/ie convergence is uniform on
compact subsets of {z 6 C; F(z) ^ oo}.
322 Chapter VI. Hypergeometric functions
The proof is left as an exercise.
Here also the classical approximants represent a reasonably good choice
since an —> 0. The continued fraction 6q + H(an/bn) is the q-analogue
of Norlunds continued fraction A.4.1).
4 Continued fractions bo + K(an/6n) where an
and bn are polynomials in n
4.1 Introduction
The continued fraction expansions of (ratios of) hypergeometric func-
functions presented in the previous sections have all had the form b^z) -+-
K.(an(z)/bn(z)), where a2n+p{z) and b2n+p(z) have been polynomials in
n for p = 0,1. This leaves the impression that maybe every continued
fraction 60 + K(an/bn) where an and 6n (or a2n+p, b2n+p\ p = 0,1) are
polynomials in n, is related to hyp crgeome trie functions? This question
is easy to answer affirmatively in some special cases.
4-2 Some special cases
Let
an = 17k=o Pk">k with pr / 0 ,
D.2.1)
bn = EiLo ^nk with qH ^ 0 .
Do we then know the value of the continued fraction 6o + K(<Jn/6n) in
terms of hypergeometric functions?
The case r = s = 0: Then all an = pu and bn = go- This means that
bo + K(a«/6n) is 1-periodic, and thus has the value
f (v/l + 4po/go2 + l) = |(l + , fo(-l/2; -4R,/«g)) ; S(D > 0
Continued fractions 6^ + K(an/6M)
323
if
The case r = 0, .s = 1: From Theorem 9 it follows that
= c
0JFi(c+l;z)
where the equality sign means convergence for all z 6 C as long as
c ? Z \ N. Hence, if go ^ 0 then
Po
Po
Po
go
<7o
— +
If go = 0 then
Po Po Po
Pu/g?
+¦
Pu/g? Po/g?
2 + 3 +.
gi o^i A;
Po/gi
x o^iA;Po/g?)
^B/?)
case r = 1, .s = 1: If we replace z by z/a in the Euler fraction
A.4.6) and let a —> oo, we get the continued fraction
D.2.2)
By the same argument as used in Subsection 2.2 one can prove that
D.2.2) converges to the function
D.2.3)
324
Chapter VI. Hypergeometric functions
for all 2 ? C. (This identity can be found in [Perr57, p. 279], and
in formula D.1.5) in the appendix in a slightly different form.) Our
continued fraction is therefore
Pi
3p,
q\
o , Pi
919
Qi
2
Pi ^?,qEL
2 2 ¦ " 2
qj_ qf qf
2 '
J>1
01 gf
2
r
T/ie case r = 2, s = 1: We want to find the value of
V\ + P2 Po
Po
32
32p2
q\
+¦
D.2.4)
and the general T-fraction A.4.6) in Theorem 4 seems to be a nice
continued fraction to compare with. It has parameters a, 6, c and z
which we will try to adjust to match D.2.4). We evidently need p2 — —z
and gi = 1 -f z. But this can only be done if qi + pi — 1. To get enough
freedom we introduce an extra unknown 6^0 and write D.2.4) as
H
u-
D.2.5)
Then we need that p26z — —z,p\62 = — (c— a -f ^>)>z, Pu^2 = — (c — a
gj ^ = 1 + z and <7q? = c + F — a + 1 )z. This is a system of 5 equations to
determine our 5 unknowns a,b,c,z and 6. If we find one solution with
\z\ ¦? 1, then Theorem 4 gives us the value of D.2.4).
Example 6 To find the value of
1
K -\
Tl=l 77
1 (^ c
?\5 + J
OO -
K
=i ^/2 +
2^2n2 1
26n J
Continued fractions bo -f ~K(an/bn) 325
by comparing with A.4.6) we get the system
262 = -z, 0 = -(c - a + 6J, -?2/2 = -(c
5 = 1 + z9 6/2 = c + F - a +
of equations. This system has the solutions
A) a=I,* = -l,a=l,6=l,c=-J,
B) $=!,z = -I,a = 0,6=-?,c=i,
C) $ = -1, z = -2, a = 0, 6 = -?, c = ?,
D) 5 = -1, z = -2, a = 1, b = ?, c = i.
The first solution leads to the value
by Theorem 4A. So does also the second solution. If we choose solution
C), we apply Theorem 4B which gives the same value
as it should.
O
326 Chapter VI. Hypergeometric functions
Problems
A) Prove that Pn{z) = i F\ (a + n; c + n; z) where a and c are complex
numbers ^ 0, — 1, — 2,..., is a solution of the three-term recurrence
relation
(c + n)Pn(z) = (c-\ n- z)Pn+l(z) + — —~zPn+2{z)
c f- n + 1
for n = 0,1,2,...
in L. Determine the correspondence and convergence properties
of the corresponding general T-fraction
(a + l)z (a + 2)z (a+ 3)
+ I - 2 + C + 2 - z + c + 3- z-\ '
B) Compute the first two approximants of each type 5n@), Sn(x),
Sn(iVn ') and Sn{wn ^) for the C-fraction expansion of arctanz for
2=1 and compare with arctanl. (This C-fraction expansion can
be found in Example 1.)
C) Establish a formula for approximants of the type Sn(wij }) for the
Norlund fraction in Theorem 2.
D) Establish a formula for approximants of the type Sn{wn ) for Eu-
ler's T-fraction in Theorem 4.
E) Use Theorem 5 to determine the C-fraction expansion of ez.
F) Use Theorem 9 to determine the C-fraction expansion of tanz =
sinz/ cos z.
G) Prove Theorem 11.
(8) Express the value of
2 2 4 6 8
by means of hypergeometric functions.
Problems 327
(9) Express the value of
(l2-t2)z B2-t2)z C2-t2)z , ,
1 + - V-^J r~^J 7-1-. for |argz|<7r.
by means of hypergeometric functions.
A0) Let 1 + K(on/1) be given by A.1.5); that is
fl2n =
(c + 2n)(c + 2n) '
and assume that neither a, 6, c — a, c — 6 nor c is a non-positive
integer.
a) Prove that fn, where
"~ c + 2n' +1 c + 2n+l
is a tail sequence for 1 -f K(an/1)-
b) Prove that 1 -f K(an/1) converges to
1 + t0 = 1 - a/c
if 3ft(c - a - b) > 0 or if c = a + b.
c) Prove that 1 + K(an/1) converges if 5?(c — a — 6) < 0.
(Hint: Theorem 13 in Chapter IV may be of help.)
328 Chapter VI. Hypergeometric functions
Remarks
1. Gauss' work on hypergeometric functions [Gaussl2] is very useful.
His contiguous relations lead to recurrence relations for hypergeo-
hypergeometric functions which again lead to continued fraction expansions
of ratios of such functions. The Gauss-fractions are the most well
known and widely used of these expansions. But also the Euler
fractions and the Norlund fractions, among others, can be obtained
from these relations. See for instance [AbSt64, p. 558].
Another important source for continued fraction expansions of ra-
ratios of hypergeometric functions or functions strongly related to
these is llamanujan's work. This extraordinary mathematician
had a strong liking for continued fractions. In [ABBW85] and
[Bern89] some of his results are presented with comments and
proofs.
Apart from this, almost every book on continued fractions contains
a section on hypergeometric functions. We mention in particular
[Perr57] and [JoTh80].
2. Quite a number of formulas for hypergeometric functions have an
analogue valid for basic hypergeometric functions. From a contin-
continued fraction point of view the most striking is the Heine fraction
[Heine47] which is the q-analogue of the Gauss fraction. But this
is not the only one. Again we refer to Ramanujan's notebooks as
described in [ABBW85] for a wide selection of continued fraction
expansions related to basic hypergeometric functions.
3. The art of finding the value of a given continued fraction 60 +
K(an/6n) where an and bn are polynomials (or rational functions)
in n, is well described in [Perr57, p. 276—*].
References
[AbSt64] M. Abramowitz and T. A. Stegun, "Handbook of Mathe-
Mathematical Functions with Formulas, Graphs and Mathemati-
Mathematical Tables", National Bureau of Standards, Appl. Math.
Ser. 55, U.S. Govt. Printing Office, Washington, D.C.
A964).
[ABBW85] C. Adiga, B. C. Berndt, S. Bhargava and G. N. Watson,
"Chapter 16 of Ramanujan's Second Notebook: Theta-
Functions and q-Series", Memoirs, Amer. Math. Soc, 315,
Providence A985).
[Bail64] W. N. Bailey, "Generalized Hypergeometric Series",
Stechert-Hafner, New York A964).
[Bern89] B. C. Berndt, "Ramanujan's Notebooks", Part II, Springer-
Verlag A989).
[Erde53] A. Erdelyi et al, "Higher Transcendental Functions",
Vol. 1, McGraw-Hill, New York A953).
[Euler27] L. Euler, "Institutions Calculi Integralis", Vol. 2, 3. ed.,
Impensis Academiae Imperialis Scientiarum, Petropoli
A827); Opera Omnia, Ser. 1, Vol. 12, B. G. Teubner, Lip-
siae A914), 1-413.
[Euler67] L. Euler, De fractionibus continuis observationes, Comm.
Acad. Sci. Imp. St. Petersbourg, 11A767), 32-81 Opera
Omnia, Ser. 1, Vol. 14, B. G. Teubner, Lipsiae A925),
291-349.
329
330
Chapter VI. Hypergeometric functions
[Gaussl2]
[J0TI18O]
[Lore]
[N6rl24]
[Perr57]
[Wats29]
C. F. Gauss, Disquisitiones Generates circa Seriem Infini-
3 ¦
+
--
7 7G) ^G+ )(y+)
etc., Pars prior, Comm. soc. regiae sci. Gottingensis rec. 2
A812), 1-46; Werke, Band 3, Konigliche Gesellschaft der
Wissenschaften, Gottingen A876), 123-162.
[Heine47] E. Heine,
Untersuchungen u'ber die Reihe
¦x2 + >¦-, J.
reine angew. Math. 34 A847), 285-328.
W. B. Jones and W. J. Thron, "Continued Fractions. An-
Analytic Theory and Applications", Encyclopedia of Math-
Mathematics and its Applications 11, Addison-Wesley, Read-
Reading, MA A980). Now distributed by Cambridge University
Press, New York.
L. Lorentzen, Divergence of Continued Fractions Related
to Hypergeometric Series. To appear.
N. E. Norlund, "Vorlesungen uber Differenzenrechnung",
Springer-Verlag, OHG, Berlin A924).
O. Perron, "Die Lehre von den Ketteiibruchen", Band TT,
B. G. Teubner, Stuttgart A957).
G. N. Watson, A New Proof of the Rogers-Ramanujan Iden-
Identities, Journal London Math. Soc. 4 A929), 4-9.
Chapter VII
Moments and
orthogonality
About this chapter
The threefold, rather modest, intention of this chapter is reflected in its
three sections: In the first one the connection between orthogonal poly-
polynomials and continued fractions is established, key words being Favard's
theorem and Jacobi fractions. Tn the second section the denominators of
the approximants of the Jacobi fraction are used to construct the clas-
classical Gaussian quadrature formula. The third section is different: For
the classical Stieltjes moment problem necessary and sufficient condi-
conditions for existence of a solution may be expressed in terms of continued
fractions. This is also true for uniqueness.
The chapter contains very few proofs. Examples are used to illustrate
the theorems. Even the topics chosen are meant merely as examples of
connections between orthogonality, moments and continued fractions.
331
332 Chapter VII. Moments and orthogonality
1 Orthogonality and continued fractions
1.1 Three examples
Example 1 The TchebychefF polynomials Un(x) of the seond kind are
defined by
find generally
sin(n
=
We easily find, e.g.,
U2{x) = 4k2 - 1,
G3(ar) = 8z3 - 4z.
We shall first establish two properties: a) They satisfy a certain three-
term recurrence relation, b) They are orthogonal with respect to a
certain weight function.
a) From
sin((n + 2H) + sin(n0) = 2 sin((n + 1H) cos 6
we find
Un+l{x) = 2xUn{x) - Un-i{x),
valid for n > 0 if we define U-i(x) = 0.
b) From
r s'm(k6) - sm{pe)d6 = 0
Jo
for integers k, p, k ^ p we get
i
' sin(n + 1N sin(m + 1N
— —— ¦ — • sin 6 - sin
sm0 sin0
f Un{x)Urn(x){l-x2)l'2dx = Q
J-i
for m ¦? 7i, which means orthogonality on [—1, +1] with respect to
the weight function w(x) = A — a;2I/2. For m = n the integral
= ?r/2.
Orthogonality and continued fractions 333
A natural connection to continued fractions is through the recurrence
relation, from which it follows that Un(x) is the canonical denominator
of the nth approximant of the continued fraction
-1 -1 -1 -1
2x-{-2x-{- 2x -\ f- 2x -\ *
The polynomial Un(x) is of degree n, and the coefficient of a:" is 2". The
polynomials
Un(x) = 2-"Un(x) (M.)
are monic, i.e. the coefficient of xn is 1. We have
U0{x) = l, Ul{x) = x,
and generally
Un+l(x) = X • Un(x) - \Un-l{*) (Rl)
for n > 0 ifU-i(x) = 0. The monic polynomials Un(x) are the canonical
denominators of the approximants of the equivalent continued fraction
1
2
X + X -{- X -\ h X +•
(J.)
The reason for using the symbol J is that the continued fraction here
is a special case of what is called a Jacobi continued fraction. We shall
return to other special cases as well as to the general Jacobi fraction
later.
We still have the orthogonality, in fact
/" Un[x) ¦ 0m(x)(l - x2)l'2dX = ^2-<+">«mn . @,)
/
(The symbol 6mn is called the Kronecker delta, and has the value 0 for
772 ^ n and 1 for m = n.) In conclusion we emphasize the following key
points: The properties of being monic (Mi) and orthogonal (Oj), the
recurrence relation (Rl) and the Jacobi continued fraction (Ji). These
points will also be present in the other examples, and will play a crucial
role in the general theory.
O
334 Chapter VII. Moments and orthogonality
Example 2 The Legendre polynomials Pn{x) are given by
and generally
(n + l)Pn+i(x) = {2n + l)xPn(x) - nPn_, (x)
for n > 0, if we define P_i(») = 0. We easily find the first polynomials
3
4 2 ,
-x --x +_
The Legendre polynomials are known to be orthogonal on the interval
[—1, +1] with respect to the weight function w(x) = 1, we have in fact
/
2n
The recurrence relation can be written
For a continued fraction ~K{an/bn) with arbitrary a\ / 0 and
2n + 1 n
n
the recurrence relation above is exactly the recurrence relation for the
canonical numerators and denominators of the approximants. The se-
sequence {Pn(x)}™=l is thus the sequence of canonical denominators for
the approximants of the continued fraction
1 _2 3
ai ~2 ~3 ~4
7
a; x —
2 3 4
3 -f5 .1-7
—a; ' — x —x
2 3 4
Orthogonality and continued fractions 335
By an equivalence transformation this continued fraction can be changed
to
I2 22 32
X -\- X -f X -f X H '
which is again a J-fraction. The recurrence relation (for the denomina-
denominators) is
2
PTl+l(x) = xPn(x) - — -Pn-iix) , (ll2)
and the initial values are
We easily find that
P2(x)
P3(x)
3
-
5
and generally
Pn(a:) = k" ¦+- lower powers of x ,
i.e. the coefficient of a;" in Pu[x) is 1, Pn(x) is a monic polynomial. The
orthogonality is of course preserved:
r\
I Pm(x)Pn(x)dx = 0 for m^n. @2)
J-\
The same four key points as in Example 1 are indicated in a similar
way, by writing (J), (R), (M), @).
O
Example 3 Let G be the following function of two variables x (real)
and w (complex), given by
336 Chapter VII. Moments and orthogonality
The Taylor expansion at w = 0 is
w w2
n=0 nm
valid at least in |w;| < 1. Here
and generally
Cn(x) = (-1)"
n
From this formula it is not difficult to prove that these polynomials are
determined by the initial values Cq(x) — 1, C\(x) — x — 1 and the
recurrence relation
Cn+i(x) = (x-n- l)Cn(x) - nCn-i(x), n > 1. (R3)
Cn(a;) is a polynomial of degree n, and the coefficient of xn is 1, i.e.,
Cn(x) is monk. (M;i)
We find from the recurrence relation that Cn(x) is the canonical de-
denominator of the nth approximant of the continued fraction
a.\ —1 —2 —3 —n
• -f x — n — 1-}-- • •
which is also a J-fraction. The value of a\ ^0 can be chosen arbitrarily.
Also for the polynomials Cn(x) we have orthogonality:
Cn{k) • Cm(k)— = e • n\ ¦ 6tnn . (O:J)
The proof is left as an exercise, see Problem 3 (with hints). This is
orthogonality on R with respect to the discrete mass distribution with
mass 1/fc! at x — k. An alternative way of writing the orthogonality
Orthogonality and continued fractions
337
relation is as a Ricmanii-Stieltjes integral with respect to the function
, defined by:
=: <
0
1
1+A
for x G ( — 00,0)
for x G [0,1)
for x G [1,2)
^i fora; G [k,k+l)
/•OO
/ Cm(x) • Cn{x)dip(x) = e • n!
J—oo
The polynomials Cn(x) are special cases of the monic Charlier polyno-
polynomials. The function G(x, w) is a generating function for the polynomials
Cn(x)
n! '
meaning that the Taylor expansion at w — 0 is such that these polyno-
polynomials are the coefficients of wn in the expansion.
O
Remark: The polynomials in the Examples 1 and 2 can also be pro-
produced by generating functions:
oo
UJx) - wn (See Problem lb.)
n-0
OO
v1 —
w
n=0
For Charlier polynomials as well as other orthogonal polynomials and
their elementary theory we refer to the first chapter of Chihara's book
[Chih78].
338 Chapter VII. Moments and orthogonality
1.2 Moment sequences and moment functionals
In the three examples in Subsection 1.1 we dealt with different integrals
of polynomials,
l i
/ Q(x)dip(x) — I Q{x)\/\ — x2dx in Example 1,
-l -i
i i
I Q{x)dij>~{x) = f Q(x)dx in Example 2,
-i -l
OG
/ Q(x)dip[x) in Example 3.
—oo
If
Q(x) = au -f aix -\ + anxn
we may in Example 1 write
/_'
— x2dx
1
/•l /.i
= a0 / 1->/l-x2dx + --- + an xn\J\-x1dx
J-i J-\
h an/in ,
where fi^ = /I, xk\/l — x2dx — /I, xkd7p(x) for k — 0,1,2,... and
similarly for the other examples. The integrals are in all cases examples
of linear functionals acting on the space of polynomials. We shall now
look at this more generally.
Definition Let {/^n}JJLu ^e a sequence of complex numbers and L a
complex linear functional defined on the space of all polynomials by
L[xn] = fin , n — 0,1,2,... .
Then L is called the moment functional determined by the moment se-
sequence {fJ>n}' Vn is called the moment of order n.
Orthogonality and continued fractions
339
The polynomials ]C?=o ckxk *° De considered have complex coefficients,
whereas the symbol x is regarded as being real. Since the functional is
linear we have
n
?
U-=u
and, since x is real (and z means the complex conjugate of z),
n
Uk=O
n
A:=0
We shall now define the concept of orthogonal polynomial sequence. The
concept of orthogonality in itself, and in this setting will come later,
after having introduced an inner product.
Definition An orthogonal polynomial sequence {Qn(x)}™=o for L is
defined by the requirements
Qn(%) has exact degree n,
L[Qn(x)Qm(z)] = 0 for m ^ n ,
Since {Qn(%)}n=u sPan the space of polynomials of degree < iV, it is a
consequence of this definition that also
L[xnQN(x)] = 0 for n< N
for every TV. In the examples in Subsection 1.1 the moment functionals
and the moment sequences are as follows:
In Example 1 we have
L[Um(x)UTl(x)] = f Um(x)Un(x)(l-x2)^2dx,
Vn = f xn{l-x2ff2dx.
340 Chapter VII. Moments and orthogonality
For odd n we have /zn = 0, since the integrand in this case is an odd
function. For even n, n — 2fc, k — 0,1,2,... we have (jlq = tt/2 and
= r cos2*" 0 • sin2 0d6
Jo
[* cos2k6d6- [* cos2k+20d0
q Jo
7T l-3---BJfe-l)
:
2-4---2A:
The details are left as an exercise (Problem 4).
Tn Example 2 we have
( 0 for odd n,
. for k > 1.
., "^ | ^^ for n = 2k,k = 0,1,2,
In Example 3 we have
oo
/
—OO
in particular
= e,
OO i 2
L 111
- 1V = ^ ^ifc - IV
U" k=l KK L)-
oo 1 oo
Next we find /X3 = 5e. The proof is left as an exercise (Problem 5.)
In the examples in Subsection 1.1 the orthogonal polynomial sequence
was the starting point, or more precisely: We were in each case given a
polynomial sequence, which turned out to be an orthogonal polynomial
sequence if the moment functional was properly defined. In the present
Orthogonality and continued fractions
341
subsection, however, we shall go in the opposite direction: We shall start
with a moment functional L, or equivalently, with a moment sequence
{//}?L0, and ask for necessary and sufficient conditions for existence of
an orthogonal polynomial sequence for L. Let us look at "the start of an
answer": The two first polynomials (assumed to be monic) must be of
the form Pq(x) — 1, P\(x) = x + ai for some constant a.\. The conditions
L[P0(xJ] / 0, L[PQ(x)Pi(x)] = 0, L[Px{xJ} ± 0 take the form
L[P0(x)Pi(x)] = L[x + ax] = i±y +ai/z0 = 0, i.e. a, =
Mo
L[Pv{xJ] = L[x2 + 2a, x + a2] = fi2 + 2a,//, +
2 2 2
Mo
The two first conditions are thus
:= Mo
Mo
Mo Mi
Mi M2
The general answer is given in terms of the determinants (Hankel deter-
determinants)
Mo Mi
Mi M2
^ F A.2.1)
by the following theorem, here stated without proof (see e.g. [Chih78],
p. 11):
Theorem 1 Let L be a moment functional and {fin} Us moment se-
sequence. Necessary and sufficient for existence of an orthogonal polyno-
polynomial sequence is that
n ^ 0 for n — 0,1,2,... .
A.2.2)
Remark: A moment functional for which A.2.2) is true, is called
quasi-definite.
342 Chapter VII. Moments and orthogonality
In the examples we met in Subsection 1.1 the moment functional L was
defined by an integral: In Example 1 and Example 2 we had a Riemann
integral with a non-negative weight function (\/l — x2 in Example 1 and
1 in Example 2), in Example 3 we had a Stieltjes integral with respect
to a function tp. The most important orthogonal polynomials are such
that the moment functional L is defined by a Riemann integral with a
weight function or more generally, as a Stieltjes integral
/•OO
L[xn]= /
•/— OO
A.2.3)
where ip is a bounded, non-decreasing function with an Infinite number
of points of increase, called distribution function. It can be proved,
that this is the case, if and only if L is such that L[p(a;)] > 0 for all
polynomials p(x) which are > 0 for all real x and not identically zero,
or equivalently, if and only if
all moments are real and all An > 0. A.2.4)
Such moment functionals are called positive-definite. They have some
important properties, for instance having a corresponding orthogonal
polynomial sequence of real polynomials with only real, simple zeros.
For a positive-definite moment functional it is easily verified that
J~ A.2.5)
defines an inner product on the space of all polynomials in one real
variable. What we so far have called orthogonality of a sequence, with-
without any reference to an inner product, is in fact orthogonality with
respect to the inner product A.2.5) in the case of a positive-definite
moment functional. (We have in fact for polynomials Pm and Pn in
an orthogonal polynomial sequence, that (Pm, Pn) — L[Pm(x)Pn(x)] —
L[Pm(x)Pn(x)\ ~ 0 for m ^ n. This follows immediately from the prop-
property L[Pm(x)xn] = 0 for all n < m — 1.) The whole theory of inner
product spaces will be at hand, e.g.: starting from
we can by the Gram-Schmidt-process recursively construct an orthogo-
Orthogonality and continued fractions 343
nal sequence {pnC3*)} of polynomials in the usual way:
Pi (x) = x- apo(x), a = L[xpo(x)],
pi(x) = F
and generally
Pn+i(x) = *"+' - E*=o<*kPk(x), a* = L{xn+lpk{X)},
Observe that we, through the standard Gram-Schmidt process, get or-
orthogonal polynomials which are normalized by the requirement
L[pn(.nJ] = 1, i.e. orthonormal polynomials, rather than by the require-
requirement that the coefficients of xn be 1. They are (if desired) transformed
to monic orthogonal polynomials by suitable divisions by factors inde-
independent of x. Furthermore, having constructed an orthonormal sequence
{pn(sc)} we can find a Fourier expansion of an arbitrary polynomial Q(x)
of degree n:
n
Q(x) = J2 CkPk(x), ck = L[Q(x)pk(x)]. A.2.7)
Example 4 In Example 1 we have
f Un(x)Un(x){l-x2y/2dx= rsin2(GH 1)^)^=^.
J-\ Jo I
Hence the sequence {w/t(ai)}^.(), where
is an orthonormal sequence. We expand a;2 in a Fouries series based
upon {//fc}) and find
x2 = couo(x) + ciui(x) + c2u2{x),
where
/
-i
sin0
344
hence
and
i
Chapter VII.
^, Cl =
c)-f w2(k))
0 , C-2
Moments and orthogonali
^
" 8 " ^
l + 4i2- 1) = xA .
O
Example 5 In Example 2 it can be proved (for instance by using the
generating function) that
I O
PUfd
/¦I
/ PUxfdx =
J-i K }
Hence the sequence {pkfa)}^^ where
2Jfe
is an orthonormal sequence. A Fourier expansion based on these poly-
polynomials is then given by
x2 = dopu(x) + dxpx(x) -f d2p\{x),
where
= x2pk(x)dx,
J-i
hence
rfo = ~^-, rfi=0, d2 = —
o 15
and
3 r w 15
V2 1 2
-O
Orthogonality and continued fractions 345
1.3 Favard's theorem and Jacobi fractions
In the three examples we have studied in Subsection 1.1, the monic
orthogonal polynomials satisfied a recurrence relation of the form
Qn{x) = {x- cn)Qn_i(a0 - XnQn.2{x), A.3.1)
valid for n > 1 if we define Q-i(x) = 0. In Example 1 we had cn = 0 and
Xn = 1/4 for all n, in Example 2 we had cn = 0 and An = n2/Dn2 — 1),
whereas in Example 3 we had cn = n and An = —n + 1 for n > 2. This is
actually a general property for monic orthogonal polynomial sequences
for a quasi-definite moment functional. If in particular the functional is
positive-definite, then cn is real and \rl+\ > 0 for n > 1 (as in the three
examples). See [Chih78, Thm. 4.1].
An important property in the theory of orthogonal polynomials is that
a converse type of result is also true. The theorem carries the name of
Favard. According to Chihara [Chih78, p. 21] it was discovered at about
the same time independently by J. Shohat and I. Natanson. Jones and
Thron point out that it can be deduced form a result in Perron's book
on continued fractions [Perr57].
Theorem 2 (Favard's theorem) Let {cn}^=l and {Ki}^Li be arbi-
arbitrary sequences of complex numbers, and let {Qn{x)}^=o be defined by
the recurrence formula
Qn(x) = (x- cn)Qn_i(as) - AnQn_2(ar), n = 1,2,3,... A.3.1)
with Q-x(x) = 0, Qo{x) — 1- Then there is a unique moment func-
functional L such that L[l] = Ai, L[QTTl(x)Qn(x)] = 0 for m ^ n, m,n =
0,1,2, This L is quasi-definite and {Qn{x)} 25 the corresponding
monic sequence of orthogonal polynomials if and only if An ^ 0, and L
is positive-definite if and only if cn is real and An > 0 for n > 1.
For the proof we refer to [Chih78], p. 22.
These two results show the close connection between monic orthogonal
polynomials and Jacobi fractions:
346
Chapter VII. Moments and orthogonality
a) For an arbitrary quasi-definite moment functional there is a J-
fraction
x —
+ x - c2 -| -f- ar - cn -|
such that the sequence of denominators of the approximants is
the corresponding monic orthogonal polynomial sequence. If in
particular the functional is positive-definite, the parameters cn are
all real and \Tl+i > 0 for n > 1.
b) For an arbitrary J-fraction A.3.2) with all An ^ 0 there exists
a unique quasi-definite moment functional L with X[l] = X\ such
that the sequence {Qn{x)} of denominators of the approximants is
the sequence of monic orthogonal polynomials for L. If in particu-
particular cn is real and An > 0 for all n > 1, then the moment functional
is positive-definite.
c) It follows from Theorem 4 in Chapter V that the J-fraction A.3.2)
corresponds at x = oo to a formal Laurent series
oo
E
n=0
X
This correspondence represents in fact an important link between
continued fractions on the one hand and orthogonality on the
other. Let us first look at the start of the corresponding series
in the Examples 1, 2, 3, with Ai = /xo = X[l] > 0 in all cases.
In Example 1: Ai = jlzq = j[^(l — x'2)l/2dx = 7r/2. J-fraction
7T 1
2. Zi
1
i
1
i
1
i
Corresponding series:
7T 7T 7T
X X
With weight function
X
7T
X
7T
X
w
(x)= A -
Orthogonality and continued fractions 347
the first moments are:
2' '8' '16' '128' '256' '
In Example 2: Ai = fly = J_x dx = 2. J-fraction:
I2 22 32
1 1 -3 ~3-5 ~5-7
x-\ x + ar + x -|
Corresponding series:
2 2 2
X X* X° X
Corresponding series:
e e 2e be 15e
a; a;^ a;-* aj1 a;j
With distribution function
-O
With weight function
the first moments are:
- 0 - -
'°'3' '5' '7'0'""
O
In Example 3: Aj = /io = c. J-fraction:
e -1 -2 -3
where ^ is defined as in Example 3, the first moments are:
o
348 Chapter VII. Moments and orthogonality
We observe in the three examples that the first coefficients of the
corresponding series coincide with the first moments. It can be
proved that this goes on, such that the sequence of coefficients co-
coincides with the sequence of moments. And more so: This holds
generally: If {Qn} is a sequence of polynomials satisfying a 3-term
recurrence relation of the form A.3.1) with real cn and positive
An for all n > 1, then the J-fraction A.3.2) given by the recur-
recurrence relation A.3.1) corresponds at x = oo to a Laurent series
A.3.3), where the coefficients /xn are the moments with respect to
the unique moment functional of Favard's theorem. In our case
the functional can be represented as an integral with respect to a
distribution function rp{x), which means that we will have
[°° A.3.4)
[
—oo
in particular X\ — J^000dijj(x). (Tn the Examples 1 and 2 this
is to be understood as follows: Extend the definition of w(x) in
both cases to the whole real line by puttting w[x) = 0 for all
x ? [—1,-f-l]- Next take ip(x) to be the absolutely continuous
function
ip(x) = f w(t)dt.)
J —oc
We leave out the proof, and refer to the monographs [Wall48],
[Chih78] and [JoTh80] for a more thorough treatment of the sub-
subject.
2 Gaussian quadrature
2.1 A quadrature formula
We shall derive a formula for numerical integration of /f^ f(x)w(x)dx,
or more generally /^ f(x)dtp(x), where $(x) is a distribution function.
If f(x) is a polynomial we can write this as X[/(aj)], where L is the
positive definite moment functional corresponding to tp.
We shall first get aquainted with the Lagrange interpolation polynomial,
Gaussian quadrature 349
which is a polynomial Ln(x) of degree (n — 1) taking prescribed values
y^y2,..-,yn at given points xl,x2,.. .,xn:
Let x\, x2,.. .,arn be n distinct numbers, and let F(x) be given by
n
x^ \*"i — I I l«» — *"k) m I ^dil I
For any k we find that
is a polynomial of degree (n — 1) (when the removable singularity at
x — Xk is removed, whereby /nfcCzfc) = !)• Moreover lnk{zj) = 0 for all
k. Then the polynomial
n
B-1-3)
takes the value y^, for ar = Xj. This formula is called the Lagrange
interpolation polynomial. The points X\,X2,.. .<,xn are called the nodes.
The significance of the Lagrange polynomial is that it interpolates a
function / at the points ajl5 x2,. • •, xn by taking yk = f{^k)- In such a
case the name Lagrange interpolation formula is used for
n
B.1-4)
although we only know that it holds for x — Xk-, k = 1, 2,..., n. (But it
is often very useful as an approximate formula for other aj-values.)
We shall now use the Lagrange interpolation polynomials to compute
L[f(x)], where / is a polynomial of a real variable k, and where L is a
positive-definite moment functional. We know in this case that L can
be represented by an integral, as earlier mentioned:
L[f(x)]= [bf(x)d^(x). B.1.5)
J
Observe that whereas the lefthand side is defined only for polynomials
f(x), the righthand side is defined for all /, integrable with respect to
ip on the interval in question ([a, 6] if a and 6 are finite).
350 Chapter VII. Moments and orthogonality
We shall use nodes, determined by L itself: It is known, that the or-
orthogonal polynomials Qn{x) all are real for real x, and that they have
simple, real zeros, located in the interval. Let the zeros of Qn(x) be
We shall use them as nodes. This particular choice will prove to be very
profitable compared to other choices. For a given polynomial f(x) we
replace f(x) by the Lagrange interpolation polynomial, and find
n
n
= ? Ank • /(*,.*) , B.1-6)
k=\
where
Ank = L[lnk(x)] = / lnk(x)dip(x).
Ja
For a polynomial of degree < n — 1 the Lagrange polynomial equals
the polynomial, since it is uniquely dermined by its values at n points.
Hence, for such polynomials the righthand side of B.1.6) gives the exact
value. This would be true for any choice of nodes. But with our partic-
particular choice we get much more: The formula B.1.6) actually holds for all
polynomials up to and including the degree 2n — 1. This can be seen as
follows:
Let f(x) be a polynomial of degree < 2n — 1, and let Ln(x) be the
Lagrange interpolation polynomial constructed as above. Then
f(x)-Ln(x)
has degree < 2n — 1 and vanishes at the nodes. Hence it is equal to
Qn{x)-R{x),
where R(x) is a polynomial of degree < n — 1. The Fourier expansion of
R(x) is a linear combination of the polynomials Qk(x), k = 0,1,..., n— 1.
Since L[Qn(x)Qk{x)] — 0 we find
L[f(x)-Ln(x)] = 0.
In conclusion we have: With the notation introduced the formula
L
f{x)diP{x) = ? Ankf{xnk) B.1.7)
k=i
Gaussian quadrature 351
holds when / is a polynomial of degree < 2n — 1. This is the Gauss
quadrature formula. It has turned out to be of great use in numerical
analysis, as an approximate formula in cases when it is not exact. We
shall not go further into this here. Note that both the nodes xnk and
the weights Ank are independent of f(x).
2.2 An example
We shall illustrate the formula B.1.7) on a specific example. For the
example, as well as generally, we notice that the normalization of the
orthogonal polynomials is insignificant for the formula.
Example 6 For the first Legendre polynomials we have
We shall find the quadrature formula in two cases. Here the weight
function is 1 and the interval is [—1, +1].
n = 2. The equation P'z[x) = 0 leads to the two nodes
and (with the notation used)
^l - I
3 - 2
352 Chapter VII. Moments and orthogonality
and the formula takes in this case the form
It holds for polynomials up to and including degree 3 (which is also
easily verified directly).
n = 3. The equation PK\(x) — 0 leads to the three nodes
/3 „ /3
K:ii= V5' K->2 = °3 333= Yg-
We furthermore get
M*) = " x o v"y =**2-
6 6
5 2
3* '
5 .
= 5*
and thus
_ 5
~ 9'
10
In this case the formula takes the form
It holds for all polynomials up to and including degree 5 (also easily
verified directly).
An illustration with Tchebycheff polynomials of the second kind is left
as an exercise (Problem 7).
Moment problems 353
3 Moment problems
3.1 The Stieltjes moment problem
A moment problem is roughly as follows: When is a sequence of num-
numbers the sequence of moments with respect to some distribution func-
function? And when is (in case of existence) the function unique up to an
additive constant, except at the points of discontinuity? We shall make
the first question more precise in one special case, which is a classi-
classical problem, the Stieltjes moment problem. Following Jones and Thron
[JoTh80, p. 331] we shall for a, 6 with -oo < a < b < +oo let $(a,6)
denote the family of all real-valued, bounded, non-decreasing functions
ip with infinitely many points of increase on [a, 6] if a, 6 E R, else on
(—oo, 6], [a, oo) or (—oo, oo).
Stieltjes moment problem: Find conditions on a sequence {cn}^0 of real
numbers to ensure the existence of a ip E $@, oo), such that
/•OO
cn = / {-t)ndi/;{t), n = 0,1,2,... . C.1.1)
Jo
(The factor ( — 1)" is included for practical reasons.) Such a function is
called a solution of the moment problem.
There are several reasons for being interested in this problem, and in
moment problems generally. Let it here merely be mentioned (as will
also be seen in Theorem 3 and subsequent examples) that solutions of
moment problems can be used to "sum" certain divergent series or to
find closed integral representations of certain continued fractions.
We shall use the concept of asymptotic expansion: We say that the series
oo
? cnz~n C.1.2)
n=0
is an asymptotic expansion of F(z) at z = oo with respect to the angular
region |argz| < a, 0 < a < tt, if there exist sequences of positive
354 Chapter VII. Moments and orthogonality
numbers {rjtl} and {Rn}, such that for each n > 0
n
-k
F(z) - ? ckz
C.1.3)
for \z\ > Rn and | argjz| < a.
Remarks:
1. The asymptotic expansion C.1.2) may very well diverge, in fact:
in many of the important cases it does. The point is, that for any
fixed n the section 53JjJ=o Cfcz~k is an approximation to F(z) that
improves with increasing \z\ in the sense of C.1.3).
2. Asymptotic expansions C.1.2) may also be defined with respect to
other angular regions, or other regions stretching to oo for that
matter.
For the Stieltjes moment problem the following holds:
Theorem 3 Let ip 6 $@, oo) be a solution of the Stieltjes moment
problem for a sequence {cn}™=(J. Then the integral
JO
z + t
is a holomorphic function F(z) in the cut plane |arg,z| < ir, and the
series
oo
X>^-n C.1.2)
71=0
is the asymptotic expansion of F(z) at z = oo with respect to the angular
region \ argz\ < a, 0 < a < tt.
Moment problems 355
Step in the proof: Crucial in the proof is the connection between
C.1.4) and C.1.2):
Jo z+t Jo
71 /-oo (_
a correspondence we recognize from Subsection 1.3.
We shall illustrate Theorem 3 by an example:
Example 7 (first time). For the sequence {cn}J?_0, where
cn = (-
we see by direct verification that
ip(t) = 1 - e~l for 0 < t < oo
is a solution of the Stieltjes moment problem. By Theorem 3 the function
/•OO yp — t
F(z) = / ^— dt
Jo z + t
is holomorphic in the cut plane | arg z\ < tv. The series
1 - 1! z~x + 2! z~2 - 3! z~3 + • • ¦
is an asymptotic expansion of F(z) at z = oo with respect to the an-
angular region |argz| < a, 0 < a < 7r. This is easily verified. We have
therefore, by determining ip(t), succeeded in summing the divergent se-
series Ylcnz~n-
O
Let it briefly be mentioned, that for bounded intervals the following result
by Markov holds: If in C.1.1) and C.1.4) the interval of integration is
changed to [a, 6], —oo < a < b < oo, then the J-fraction corresponding
to C.1.2) converges to C.1.4) for all z G C not on the segment [—6, —a]
on the real line [Mark95].
356 Chapter VII. Moments and orthogonality
3.2 Connection to continued fractions
When it comes to the solution of a moment problem there are in fact
three questions to handle: existence, uniqueness and the actual con-
construction of a possible solution.
Let us first consider the question of existence. In Favard's theorem we
found that a positive definite moment functional L is uniquely deter-
determined by the J-fraction
A > A ;Afl>0, cn<ER. C.2.1)
X — C\ — X — C2 — X — C-j — •
Moreover, in the subsequent discussion we found that the series
oo
/ -i ~n+l
to which this J-fraction corresponds, has coefficients fin which actually
are the moments of L\ i.e. /zn = 2/[scn]. Hence, a given sequence {/xn}
consists of the moments of a positive definite moment functional L (i.e.
ip exists) if and only if the series C.2.2) has a corresponding J-fraction
C.2.1) with all An > 0 and cn ? R. But this is a ip G $(-oo,oo). In
the Stieltjes moment problem we are looking for a ip (E $@, oo).
We shall first see that this is equivalent to the existence of a ip G
#(—oo,oo) with ip(—x) = —ip(x). Let tp G $(—00,00) with ip(—x) —
— ip(x). Then dip(-x) = dtp(x), and the moments are
°° ( 0 if n is odd,
] ^() ifniseven.
-00 I 0
Hence 2dip(*Jx) restricted to x > 0 gives a function ^1 G $@,oo) with
moments fik — //2fc- Similarly, given ^1 G $@,oo), an odd extension of
gives a^G $(-00,00) with moments C.2.3).
One can prove that ip ? $(—00, 00) is an odd function if and only if all
Cfc in the corresponding J-fraction C.2.1) are zero. Hence the Stieltjes
Moment problems
357
moment problem for a sequence
series
^_0 has a solution if and only if the
n=0
x.x
2n
n
has a corresponding J-fraction
A3
A,|
x-x-x
One can also prove that the solution is unique if and only if this partic-
particular J-fraction converges. Multiplying by x and substituting z = —a?2
now gives:
Theorem 4 The Stieltjes moment problem for a sequence {cn}^_0 has
a solution if and only if the series
co + c^-1 + c2z~2 + • • •
corresponds at z = 00 to a continued fraction of the form
a 1 do
C.2.4)
C-2-5)
where an > 0 for n = 1, 2,3,.... The solution is unique if and only if
C.2.5) converges for \ arg z\ < 7r.
The continued fraction C.2.5) is traditionally called a modified Stieltjes
fraction, or more generally, if all an (E C \ {0}, a modified regular C-
fraction. Note also that C.2.5) converges for all z with |argz| < -k if
and only if it converges for one such z (see Theorem 22 in Chapter III).
We shall illustrate Theorem 4 on the Stieltjes problem for the sequence
in Example 7.
Example 7 (second time). We have (again) the sequence
{(-l)n -ti!}?10. To the series
358 Chapter VII. Moments and orthogonality
corresponds the modified regular C-fraction
11.12233 n n
which is a modified Stieltjes fraction since the coefficients all are > 0.
From Theorem 4 it then follows that the Stieltjes moment problem in
this particular case has a solution. From Theorem 22 in Chapter III it
follows that the continued fraction converges for |argz| < 7r, and hence
the solution of the Stieltjes problem is unique. (We already know one
solution, namely
ip(t) = l-e~e, 0 < t < oo,
and hence the solution is
V-(t) = K - e-(,
where K is arbitrary.)
O
Again bounded intervals represent a simpler situation. Markov has
proved, that if there is a solution tp (E $@,fc), 0 < 6 < oo, then C.2.5)
converges for all z ? C not on the segment [—fe, 0] of the real line (to
C.1.4), where oo is replaced by 6) [Mark95].
In Theorem 4 necessary and sufficient conditions for
a) existence and b) uniqueness
of a solution to the Sieltjes moment problem were presented. In both
cases the conditions were expressed in terms of conditions on a continued
fraction expansion. We shall not here go into the question of the actual
construction of a solution. We shall merely indicate briefly one way,
where also in fact continued fractions are used as a tool: We assume
that the continued fraction C.2.5) is a modified Stieltjes fraction, in
which case we know that there exists a solution. Let {An(z)/Bn(z)} be
the sequence of approximants. It can be proved, that the zeros of all Bn
are real, negative and simple. The partial fraction decomposition of the
Moment problems
359
approximants can be written as a Stieltjcs integral. To illustrate this
take A,i(z)/B,i(z) in Example 7.
1112
z2 + 4z + 2
2+n/2 ^
I 2
roo
JO
z + t
Here
=z <
r o
2+v^2
2+n/2
-1
forO < t < 2-
for 2- y/2 < t <
If the continued fraction converges, the expression
An(z) f°° zdipn(t)
converges to
A (z\ r00
Bn(z) Jo
roo
F(z) = /
Jo
z + t
dip(t)
From F(z) the distribution function can be determined by using Stieltjes
inversion formula, see for instance [Chih78, p. 90],
Mt) -
1
= lim
7T y —> 0+
/
iy)}dx .
If the continued fraction diverges, the solution is no longer unique. In
such a case its even and odd parts converge, and by the above procedure
one can get two different solutions ip\ and ip2 (and hence infinitely many,
aij)\ +A -a)V>2,0 < a < 1).
Remarks:
1. Closely related to the Stieltjes moment problem is the Hamburger
moment problem, in which the interval is ( — oo,oo) instead of
@, oo). Observe that a solution of the Stieltjes problem auto-
automatically gives a solution of the Hamburger problem, by defining
360 Chapter VII. Moments and orthogonality
it also for t < 0 by ip(t) = 0. Furthermore, if we have (in either
problem) a double sequence
. . . , C_2, C_i, C(j, C\, C'2, . . .
instead of a simple one {cn}?L0, the moment problems are called
strong problems. The strong Stieltjes problem and the strong
Hamburger problem have both been studied recently, see for in-
instance the survey article [JoTh82]. All problems mentioned above,
as well as other moment problems, play an important role in the
analytic theory of continued fractions, and may be successfully
dealt with by using continued fractions.
2. Some times it is useful to have simple tests. A simple test for
uniqueness of solution of the Stieltjes problem (if we know the
existence) is the Carleman criterion
oo
= oo,
n-=l
see also Theorem 8 in Chapter V.
Problems
Problems
361
A) (a) Prove that the monic TchebychefF polynomials of the second
kind, Un(x), can be expressed by the (n x n)-determinant
X
1
4
0
(b) Prove that the function
1 0
1
1 x
0 0
i
4
0
0
•
1
x
1 — 2xw -f w2
is a generating function for the Tchebycheff polynomials of
the second kind Un(x).
B) (a) Take for granted the expression
oo
= "V Pn
for the Legendre polynomials Pn(z). Then establish the re-
recurrence relation for the Legendre polynomials. Hint: Prove
first that
_^_
dw
w
X —
= 0.
(b) Find the coefficient of xn for the Legendre polynomials Pn(z).
(c) Prove the following connection between Legendrc polynomi-
polynomials and TchebychefF polynomials of the second kind:
n
Un(x) =
Pn-k(*)
k=0
362 Chapter VII. Moments and orthogonality
C) Prove the orthogonality of the Charlier polynomials (i.e. the rela-
relation O3). Hint: Establish the two expansions
„ G(k,w)G(k,z)
> ,. = c • e~™ =
f-1 k\
and
oo
CnW - lzn,wnt
1 «-» **" ii \ I r\ III" ll>. n.
fc=O rn,n=O \ fc=O
where ^(Zjiy) is the generating function €~w(l + w)x. Then com-
compare coefficients.
D) Compute the moments for the functional in Example 1. (See Sub-
Subsection 1.2.)
E) Compute the moments [i-i and /i.i for the functional in Example 3.
(See Subsection 1.2.) What can you say about /jLn generally?
F) Find the Fourier expansion of x3
(a) in terms of TchebychefF polynomials of the second kind, and
(b) in terms of the Legendre polynomials.
Find in both cases the expansions also in terms of the correspond-
corresponding orthonormal polynomials.
G) Find the Gauss quadrature formula for n = 2 and n = 3 for the
integral
f(x)\/l-x2dx.
(8) In the continued fraction in Example 7 write An(z)/Bn(z), n = 2, 3
in the form
zdipn(z)
r
Jo-
(9) Use Carleman's test to prove the uniqueness of the solution of
the Stieltjes problem for the sequence in Example 7. (Hint: Use
Stirling's formula.)
Remarks 363
Remarks
1. For a deeper study of orthogonal polynomials, including their con-
connections to continued fractions, we refer to the book [Chih78] by
T. S. Chihara. It also contains the concept of chain sequences,
which will be briefly touched upon in Chapter X on zero-free re-
regions. See also Wall's book [Wall48]. Other useful expositions are
for instance [Nevai79] and [Lubi87]. As an example of orthogonal
rational functions we refer to [HeNj89].
2. For moment problems we refer to the book by N. I. Akhiezer
[Akhi65] and the book by U. Grenander and G. Szego [GrSz58].
But the topic is treated in a large number of books and papers. A
useful survey article is [JoTh82].
3. For the Hamburger moment problem a result related to the one
for the Stieltjes moment problem in Theorem 3 holds. Essential
differences are that the integral
/•CO
J — CO
zdip(t)
t
represents two different functions in different regions (half-planes),
and the series
oo
is an asymptotic expansion at z = oo to the two functions in the
two regions.
For the strong Stieltjes problem we have two series, being asymp-
asymptotic expansions of
zdi/>(t)
Jo
+ t
at 0 and oo. For details, see [JoTh80], Section 9.2.
4. Moment theory can be established on different real or complex
sets, and several things have been done, which will not be discussed
here. Moment theory on the unit circle is one special topic which
has attracted attention recently. The topic is old, but much of the
theory developed from the old roots is rather new. We refer to
364 Chapter VII. Moments and orthogonality
the paper [JoNT89] and to the bibliography therein. Orthogonal
polynomials on a circular arc are studied in [Gaut89] and [DeBr90].
References
[Akhi65] N. I. Akhiezer, "The Classical Moment Problem and Some
Related Questions in Analysis", Hafner, New York A965).
[Chih78] T. S. Chihara, "An Introduction to Orthogonal Polynomi-
Polynomials", Mathematics and Its Applications Series, Gordon and
Breach, New York A978).
[DeBr90] M. G. de Bruin, Polynomials Orthogonal on a Circular Arc,
J. Comp. and Appl. Math. 31 A990), 253-266.
[Gaut89] W. Gautschi, On Zeros of Polynomials Orthogonal on the
Semicircle, SIAM J. Math. Anal. 20 A989), 738-743.
[Gragg74]
[GrSz58]
[HeNj89]
[JoThSO]
W. B. Gragg, Matrix Interpretations and Applications of
the Continued Fraction Algorithm, Rocky Mountain J.
Math. 4 A974), 213 -225.
U. Grenander and G. Szego, "Toeplitz Forms and their Ap-
Applications", University of California Press, Berkeley A958).
E. Hendriksen and 0. Njastad, A Favard Theorem for Ra-
Rational Functions, J. Math. Anal. Appl., 142, 2 A989), 508-
520.
W. B. Jones and W. J. Thron, "Continued Fractions: An-
Analytic Theory and Applications", Encyclopedia of Mathe-
Mathematics and its Applications 11, Addison-Wesley Publish-
Publishing Company, Reading, Mass. A980). Now distributed by
Cambridge University Press, New York.
365
366
[JoNT89]
[JoTh82]
[Lubi87]
[Mark95]
[Nevai79]
[Pcrr57]
[WaU48]
Chapter VII. Moments and orthogonality
W. B. Jones, 0. Njastad and W. J. Thron, Moment Theory,
Orthogonal Polynomials, Quadrature, and Continued Frac-
Fractions associated with the Unit Circle, Bull. London Math.
Soc. 21 A989), 113-152.
W. B. Jones and W. J. Thron, Survey of Continued Frac-
Fraction Methods for Solving Moment Problems and Related
Topics, "Analytic Theory of Continued Fractions", Lec-
Lecture Notes in Math. 932 (W. B. Jones, W. J. Thron and
H. Waadeland eds.) Springer-Verlag, Berlin A982), 4-37.
P. S. Lubinsky, A Survey of General Orthogonal Polynomi-
Polynomials with Weight Functions on Finite and Infinite Intervals,
Acta Appl. Math. 10 A987), 237-296.
A. Markov, Deux demonstrations de la convergence de cer-
taines fractions continues, Acta Math. 19 A895), 93-104.
P. G. Nevai, Orthogonal Polynomials, Mem. Amer. Math.
Soc. 213, Providence, R. I. A979).
0. Perron, "Die Lehre von den Kettenbriicheii", Band II,
B. G. Teubner, Stuttgart A957).
II. S. Wall, "Analytic Theory of Continued Fractions", Van
Nostrand, New York A948).
Chapter VIII
Pade approximants
About this chapter
Any decent book on continued fractions should contain a section on
Pade approximants (and vice versa). Anything else would mean re-
renouncing one's nearest of kin. On the other hand, the topic of Pade
approximants or more generally rational approximations is treated in
numerous expositions, such as the monograph [BaGr81], to name but
one example. The many conferences on the subject illustrate the rapid
development of the field, as well as the increased interest in applications,
for instance in physics. One example is Lhe 1985-conference in Lancut,
Poland, [GiPS87]. The field and its applications are thus pretty well
taken care of. This justifies, in our opinion, the low profile we have
chosen in our book. We have restricted ourselves to an example-based
introduction to the basic, classical elements of the theory. Next we have
emphasized connections to certain continued fraction expansions whose
approximants follow certain paths in the Pade table. The Pade table
and continued fraction expansions are based upon the same principle,
the principle of correspondence. This means that convergence (or diver-
divergence) results for continued fraction expansions may lead to convergence
(divergence) results for paths in the table, and vice versa.
367
368 Chapter VIII. Pade approximants
Pade approximants, as well as continued fraction expansions for a given
function, can be derived by certain practical algorithms. We decided to
leave these out here, and merely refer to them in the remarks.
The rapid and fruitful development of the theory of Pade approximants
has also lead to interesting generalizations. In the second part of the
chapter some of them are mentioned in a way which is to be regarded
as a list of keywords with some comments.
Classical Pade approximants 369
1 Classical Pade approximants
1.1 A creative problem
We shall let three examples serve as introduction to the main topics of
this chapter.
Example 1 Given the formal power series
(which happens to coincide with the Taylor expansion of exp(z) at z =
0). We want to find the rational function R\^(z) (numerator degree
< 1, denominator degree < 1) whose Taylor expansion at z — 0 agrees
with the given formal series as far out as possible. More precisely: If
the expansion is
= ao + a\z + a2Z2 H ,
we want oq = 1, a\ = l/l!, ..., an = i/n\ for an n as large as possible.
Since Rit\@) = ao = 1, the constant terms in numerator and denomi-
denominator must be equal and ^ 0 (if we ignore the case when they both are
zero). Without loss of generality we may assume them to be = 1, in
which case we have, for R]\ and its Taylor expansion at z = 0:
1±H + b{b - a)z2 + b\a -
= l +
L + DZ
From
a-b - 1
6F-a) = \
we find agreement up to and including the z2-term if and only if
a — - , b — — ,
2' 2'
370 Chapter VIII. Pade approximants
i.e.
_ l + 2Z __ z2 *
is the unique solution to our problem. Since 1/4 ^ 1/3!, the agreement
terminates at the 22-term. We write this L(z) — R\^(z) —
We could raise the problem more generally for rational functions Rm,n(z)
(numerator degree < ra, denominator degree < n). Let us briefly look
at the solution in the case m = 2, n = 1, in which case R2,i(z) and its
Taylor expansion may be assumed to be of the form
#2,1B:) = -J——-— = 1 - (bi - ax)z -f (a2 - aibi + b\)z2
1 + b\z
Agreement with A.1.1) in the coefficients of z, z2 and z3 is obtained if
and only if
+ 6j = -,
-F1 - axb\ + a2b\) = — .
Simple computation leads to the unique solution
2 l a X
O D O
i.e. to the rational function
1 — jjZ 2 D 18
The agreement with the given series terminates at the 23-term (L(z) —
R2j(z) = Olz*]), since 1/18 ^ 1/4!. Since the solution was unique, we
did not have any possibility to require agreement any further. (Observe
that the denominator in R2,i differs from the denominator in R\t\.)
For large values of m and n the computation becomes more complicated.
We shall (try to) require L(z) - Rmin{z) = O[zm+n+l]. One case where
Classical Pade approximants
371
we can write down the solution right away, is the case n — 0, i.e. where
the rational function is a polynomial. In this case the solution is
,m
The solutions may be arranged in a table with increasing m going down
and increasing n going to the right. In the present example the start
of the table (i.e. the top left corner, actually the only corner) is easily
computed, and is shown below. Observe in particular the symmetry
property
which is a consequence of the property exp(z) = [exp(—z)]~ for the
exponential function.
0
1
2
3
4
0
1
1 + z
1 + z + \
i + 2 + T + ir
•
l
i
1-2
i+J«
•
2
l
i-*+ir
•
3
i
l-x+f-«f
1--X+-X2-—23
•
*
•
4
•
¦
We shall look at a related problem, illustrated on the same formal power
series as the one in Example 1.
Example 2 Let L(z) be the formal power series of Example 1. Find
polynomials (^ zero-polynomials)
= a0 + axz + a2z2 ,
Q\(z) - 60 + 612,
372 Chapter VIII. Pade approximants
such that as many as possible of the first consecutive terms of the formal
series
Qi(z) - L(z) - P2(z)
vanish. We find for the start of this formal power series
F0 + bxz) • (l + z + -z2 + -z:i + .. -J - (oo + a,z + a2z2) =
(bo - ao) + F0 + 61 - a^* + (j + 6i - a2) ^ + (y + y
and a system of equations with the following start:
bo — a0 = 0
&o + &i - a>\ = 0
^ = 0
^ + ^ = 0
6 2
This is necessary and sufficient for vanishing of the first four coefficients
of Q1 • L — P-2 (starting with the constant term). Since the total number
of coefficients in P2 a^d Q\ is 5, this is all we can require. Simple
computation leads to the values
"' - 3'
a0 = 60,
2
bo
i.e.
Qi(z) = bQ-jz, P2(z) = bu + huz + ^
where fc0 ^ 0 (since we do not accept the zero polynomial). We have
Qi(z)- L(z) - P2(z) = O[z1]. For the rational function P2(z)/Qi(z) we
find, after having cancelled the factor 6q
\
P2{z) 1 + \z
_
Classical Pade approximants 373
which is the same as the function Ri\{z) in Example 1. Actually, for
the formal series in Examples 1 and 2 the problem of finding Pm(z) and
Qn(z) such that as many as possible of the first consecutive coefficients
of Qn • L — Pm vanish, leads to the same rational functions Rm,n{z) —
Pm(z)/Qn(z), and thus to the same table of rational functions.
O
Remark: Since the numerator and denominator polynomials in
both depend upon m and n, it might have been better to write Pm,
and Qm,n(z)i rather than Pm(z) and Qn(z). The latter notation is chosen
in order to avoid too many subscripts.
We conclude this section by making an observation which illustrates the
connection between continued fraction expansions and tables as the one
in Example 1. We shall use the same formal power series.
Example 3 Let L(z) be the formal power series from Examples 1 and
2. We know from Chapter V, Problem 1, that this series has a corre-
corresponding regular C-fraction of the form
1+1+1+1+1 +•••"
The first approximants are
/o = 1, fi = l + z, ft = 1 + 7 . -^ = ¦:—I,
1+ 1 1-f
2
f _ ,.* -2* g
- , .* zif if zi
+1 1 1 1
12
1 i z iJ b LU
O
374
Chapter VIII. Pade approximants
We look at the table in Example 1 and observe that the approximants
f\i fit hi fi all are in the table, they actually form a diagonal staircase
in the table. If we had extended the table to the case m — 3, n = 2, we
would also have found /r, there. Their location in the table is illustrated
below:
/i
h
h
h
A natural question is if this holds more generally, and in fact, the answer
is YES, under certain conditions. It represents an important example of
the connection between such tables and continued fractions.
1.2 Pade approximants
We shall here, as in Subsection 1.1, let Pm(z) and Qn{z) denote poly-
polynomials of degree at most m and n respectively, and with complex co-
coefficients. We shall furthermore assume Qn(z) to be different from the
zero polynomial. Moreover, we shall regard two rational functions
Pm(z)
and
as identical iff PmQn —QnPm = 0. The three examples in Subsection 1.1
are different examples of approximating a formal power series by means
of rational functions. In the first one a formal power series
L(z) =
c2z
2
A.2.1)
is approximated by rational functions Rm,n(z) in the metric defined in
Chapter V, Subsection i.i, i.e. the metric "turning correspondence into
convergence". The problem is called the Hermite rational interpola-
interpolation problem, and is as follows: To a given formal power series A.2.1)
Classical Pade approximants 375
and given non-negative numbers m, n find an RTnin(z)-> such that when
Rm,n(z) is replaced by its Taylor expansion at 0, then
L(z) - Rm,n(z) = O [*m+"+l] , A.2.2)
meaning that the series on the left-hand side starts with a term of degree
at least m + n -f 1. In the second one we approximated by making
Qn(z)L(z) — Ptn(z) small in the metric of Chapter V, Subsection 1.1,
i.e. starting with a high degree term. This again means, more precisely,
to replace A.2.2) by
Qn(z) ¦ L{z) - Pm(z) = O [zm+"+1] . A.2.3)
Whereas the Hermite interpolating problem does not always have a so-
solution (we shall soon see an example), it can be proved that the second
problem always has a solution.
The rational functons jRm>ri(z) = Pm(z)/Qn(z), where Pm and Qn satisfy
A.2.3) and QTl(z) ^ 0, are the Pade approximants of L, and the two-
dimensional array
R\,0 #1,1 #1,2
#2,0 #2,1 #2,2
is called the Pade table of L.
In the Examples 1 and 2 it seemed that the Hermite problem had a
solution, at least for the (m, n) we computed, and it seemed to lead
to the Pade approximants. It is readily seen, that the solution of the
Hermite interpolation problem, if it exists, is the Pade approximant.
Observe that if Qn@) ^ 0 it follows from A.2.3) that
We have used the term approximation problem, meaning approximation
in the "correspondence metric". On the other hand we have the classical,
376 Chapter VIII. Fade approximants
well established name Hermite interpolating problem. The significance
of the word interpolation is that if the formal power series represents a
function /, we ask for a rational function R{z), for which
jR(*)(O) = /W@) for 0 < Ik < m + n.
(Here /M@) denotes the Jfeth derivative of f(z) at z = 0.) The next
example shows a case where the Hermite problem A.2.2) has no solution.
Example 4 Given the formal power series
_2 I 2n
AY
L(z) = 1 - — + (-1Y
1 ] 2 + 24 { }
+AY 4
2 + 24 { } Bn)\ ^
(which is of course the well known Taylor series expansion of cos z at 0).
Take m = n — 1. A possible solution of the Hermite problem is of the
form
If we ignore the case 60 = 0, which implies that a$ — 0, and thus
that z can be cancelled, then we must have aa = fco in order to have
correspondence in the first term, and with
a := — , b := —
we find
For the desired correspondence we need simultaneously
a — 6 = 0, b — ab = — — ,
which is impossible.
For the other problem A.2.3), i.e. the one leading to the Pade approxi-
approximants, we must solve the equation
z2
F0 + bxz) [ 1 - — + • • •) - (<*> + a{z) =
Classical Pade approximants 377
i.e. the coefficient equations
b0 — a0 = 0 (Constant term),
b\ — a\ = 0 (^-coefficient),
^o*( —2) = ® (^-coefficient).
This system has the solution
ao = *>o = 0, ai = 61 ^ 0.
Hence (with the earlier notation)
Pi(z) = a,2, 61B;) = aiz,
and for the Pade approximant we get in this case
R11 = 1.
We observe that in this case the rational function in the A, l)-place in
the Pade table may be expressed as the ratio of polynomials of degree 0
(i.e. < 1). Observe also that L(z) — R\(z) = O[z2]; i.e. the interpolation
is not good enough as compared to A.2.2).
O
The next example is less trivial.
Example 5 We shall look at the same two interpolation problems for
a formal series starting with
z2 zx
for m = 3, n = 1. A solution of the Hermite problem must have the
form
Oq + CL\Z -\- CL'iZ -f- CL3Z
where fco /0 and a0 = 60 Without loss of generality take ay = 60 = 1,
then R[it\(z) and its Taylor expansion at z — 0 must be of the form
-f
b\)z2
— CI361 -f- fl'2^1 ~~ al^i "f" ^l)'2' + ' * ' •
378 Chapter VIII. Pade approximants
We have the desired correspondence if and only if the following equations
are simultaneously satisfied:
a, - 61 = 1,
ah ±h* l
CLj — ^2^1 + fllO| — b\ = 0 ,
L , x.2 ,3,1 1
— CI3O1 -f 0,20^ — djOj + 0| =^ ——• .
We find successively from the first three equations
1
0J 1 ~ 2 '
When this is inserted into the last equation we find the contradictive
statement 0 = 1/24, showing that the Hermite problem has no solution.
For the problem A.2.3), i.e.
l + 2-y + — j - (a0 + axz + a2z2 + a;,zJ) = O\zh]
the solution is given by
if
= 0,
= 0,
= 0,
y - 0,
^ = 0.
24
We find 60 = ao = 0» ai = ^2 — ^i» a3 — —b\/2,. Without loss of
generality we may take 61 = 1, and find
=: Z + Z2 - -Z* ,
Classical Pade approximants 379
and hence (after cancelling of the factor z)
-z2
Observe that the rational function in the C, l)-place in the Pade table
is expressed as the ratio of polynomials of lower degrees than 3 and 1,
actually 2 and 0.
O
We have here chosen to use A.2.3) as the basis for our definition of
the Pade table, but also A.2.2) is widely used. Both approaches have
their advantages and disadvantages. The use of A.2.2) is in a way more
natural, since we are aiming at a rational approximation (in the corre-
correspondence metric) to a formal series. On the other hand, as illustrated
in the Examples 4 and 5, the A.2.2)-approximation does not always ex-
exist. Important is, however, that when it exists, it coincides with the
Pade approximation in our definition. This implies, for instance, that
the table in Example 1 is a Pade table. If the A.2.2)-approximation fails
to exist for a pair G71,72), then the G71,72) Pade approximant is equal to
a A.2.2) approximant of lower order. This will be evident in the next
subsection.
1.3 Normal tables. Block structure.
In the Pade table in Example 1 we have seen that our entries RTn,n(z)
rational functions where the degrees in numerator and denominator are
exactly 771 and n respectively, and can not be reduced by cancellation,
and the entries are all different. Such a Pade table is called a normal
Pade table. It can be proved that the table in Example 1 is normal (the
whole table, not only the part we have seen). In Example 4 the rational
function in the A, l)-place was 1, the same as in the @,0)-place. It is
easily seen that we also get 1 in the places A,0) and @,1). The upper
380
Chapter VIII. Pade approximants
left corner of the Fade table in this case is
0
1
2
0
1
1
*
1
1
1
*
2
*
*
*
Observe the square block of equal elements.
In Example 5 we found in the C, l)-place the function
1 + z - ?z2
This is obviously also the function in the B,0)-place (since it is a section
of the given series) and even the C,0)-place, since the z3-term in the
series is 0. Simple computation (Problem 3) shows that it is also the
function in the B, l)-place. In the Pade table for the series in Example
5 we have a square block of equal elements as shown below.
0
0
2
3
1
1
+
+
z —
1
z-
1
2Z
1
1
+
+
Z
1
z-
1
1
2'
Z2
Z2
These observations reflect a general structural pattern of the Pade table:
Equal entries appear only in square blocks. In a normal table each block
consists of only one function. The "block theorem" is as follows:
Classical Fade approximants
381
Theorem 1 (The block theorem) Let R(z) = P(z)/Q(z), where P
and Q are relatively prime polynomials of degree m and n respectively.
Assume furthermore that R(z) occurs in the Fade table of a formal power
series L. If, for a non-negative integer r the formal power series
QL-P
A.3.1)
starts with the term of degree m -f n + r + 1, then the set of places where
R(z) occurs is a square block with (r + IJ places and opposite corner
places in (ra, n) and (ra + r, n -f r).
For a proof we refer to [Gragg72].
Remark: Observe that in a square block of size > 1 the elements not
in the upper, leftmost corner have numerator or denominator degrees (or
both) lower than the place (ra, n) "should indicate". This is illustrated in
the Examples 4 and 5. The computation (solution of linear equations)
in these cases indicate why it happens, and a corresponding general
discussion is essential in the proof of the theorem.
The theorem extends to r = oo, in which case the block is unbounded
down and to the right and QL — P is the zero series. This can only
happen if L(z) is the Maclaurin series of the rational function P/Q.
The word normal is used for the approximants R(z) = P(z)/Q(z), where
P and Q are relatively prime and of degree 771 and n respectively, mean-
meaning that QL — P starts with the term of degree m -\- n -f 1. It is used
for the table, meaning that all elements are normal, and for the formal
power series, meaning that the table is normal. Criteria for normal-
normality of a power series ^ cnz" may be expressed in terms of the Toeplitz
determinants
m
cm-l
-in
A.3.2)
where c^ = 0 for k < 0, cmo = 1, m = 0,1,2,..., in e.g. the following
theorem:
382 Chapter VIII. Pade approximants
Theorem 2 An (m, n) Pade approximant of a formal power series
co + c^-f c2z2 + ..- , co ^0, A.3.3)
is normal if and only if the determinants
are a// ^ 0.
For a proof we refer to [Gragg72].
It follows from Theorem 2 that a formal power series and its Fade table
are normal if and only if
m,n
0 for all m, n — 0,1,2,... . A.3.5)
This shows in particular that a formal power series with gaps, i.e. where
at least one c/t = 0, k > 1, is not normal, since cjt,i = c*.
1.4 Connection to continued fraction expansions
In Example 3 we saw (at least for the first entries) that the corresponding
regular C-fraction to the given formal power series was such that the
successive approximants coincided with the rational functions i?o,o> -Ki,o,
-fti,i» ^2,1» ^2,2, • - • in the Pade table for the formal series. This property
actually holds generally, under the condition of normality.
Theorem 3 Let
1 + cxz + c2z2 + c3z3 + - - - A-4.1)
be a formal power series with the property that the Pade approximants
all are normal. Then A.4-1) has a corresponding regular C-fraction
l+K^, A-4.2)
whose approximants fn satisfy
flm — Rmjnt f2jn+\ = Rm+l,rm m = 0, 1, 2, . . . . A.4.3)
Classical Pade approximants 383
For a proof we refer to [JoTh80], p. 191. See also Problem 4.
Even a converse result holds:
Theorem 4 Let A.4-2) be a given regular C-fraction, and A.4-1) the
corresponding formal power series. Then the successive approximants
fn of the C-fraction come as a staircase in the Pade table of A.4-1) by
satisfying A-4-3).
Idea of proof: The correspondence of C-fractions (not only regular C-
fractions) to power series is described in Theorem 5, Chapter V. We also
know that the successive approximants of a regular C-fraction is such
that /„ = An/Bn, where the degrees are given by deg(yl2m+1) = m + 1,
deg(J52m+i) < m, deg(A2m) < "*, deg(J?2m) = m. Furthermore, the
correspondence is such that the Taylor expansion at 0 of Ari/Bu agrees
with the formal series up to and including the term zN, where N = m+n.
For a detailed proof we refer to [JoTh80, p. 192].
Theorem 4 tells about one illustration of the connection between Pade
tables and continued fraction expansions. There are several. Generally,
if {Rmk,nk} is any path in the Pade table with Rmk+l,nk+l ^ Rmk,nk,
then there is a corresponding continued fraction with approximants
{Rmk,nk}'kLo- (See Corollary 8 in Chapter II.) We shall not go into that
here, only mention very briefly one interesting example due to Arne
Magnus [Magn62a], [Magn62b]. He introduced the P-fractions (princi-
(principal part continued fractions), which in a way is related to the regular
continued fractions. Whereas the regular continued fraction may be
constructed by repeatedly taking the integer part of a number and the
reciprocal of the fractional part, the P-fraction is constructed in a simi-
similar way by letting the principal part plus the constant term play the role
of the integer part, and the Taylor part minus the constant term play
the role of the fractional part:
k
-f C2Z2 -f
384 Chapter VIII. Pade approximants
For the formal power series of Example 1 we find the following start of
the P-fraction:
— 4- — a. — -L ... I z 2 ' z
1 I O! Q!
J-. ?i. O.
For the first approximants we find:
/u = 1, /¦ = 1 + ! f = —jf
2 "" 2 L 2
.2
1 1 l+f +
z2
f _ 1 i ~ ~ _ ~ ' 2 ' 12
J2 ~ ^ ^ 1 1 I 12 — -. * . Z2
z
We observe that these elements are the first three diagonal elements in
the Pade table for the series, and it can be proved that the successive
approximants of the P-fraction in this case are in turn the diagonal
elements of the Pade table. This actually holds generally, in the following
sense: For any formal power series
Co + c, z + c2z2 + • • • , Co ^ 0 ,
let fn{z) denote the nth approximant of the corresponding P-fraction.
Then fn is the nth element in the main diagonal of the corresponding
Pade table if we only count distinct elements (i.e. one element from each
square block the main diagonal passes through). This result, proved
by Arne Magnus [Magn62a], tells that the P-fraction picks up exactly
one element from each block intersecting the diagonal. For a normal
table fn(z) is the element on the (n, n)-place. In [Magn62a] it is proved
that the P-fraction also can create the side-diagonals: For any integer
5, take the nth approximant fn\z) of the P-fraction corresponding to
the formal power series
zsL(z) = z'(cq + c,z + c2z
2
Then fn z~s is the nth distinct element in the (m, m — s)-diagonal for
s < 0 and the G71 + s, m)-diagonal for 5 > 0.
Classical Pade approximants 385
1.5 A convergence result
There are several results on convergence of Pade approximants, and sev-
several open questions. They concern different types of convergence. We
have of course some obvious results in the metric defined in Chapter V,
Subsection i.i, the "correspondence metric". Any path in the Pade table
with 77i + n —> oo is such that the sequence of corresponding approxi-
approximants converges to the series from which the Pade table is constructed
in this particular metric. But this is only a restatement of the corre-
correspondence property. Usually one wants more, for instance pointwise or
uniform convergence. We have of course already some results: Conver-
Convergence results for continued fraction expansions may lead to convergence
results along paths in the Pade table, for instance in the case of regular
C-fractions. Some examples are promising, such as for instance the Pade
approximants to the circumference of the ellipse, discussed in Chapter
I, Subsection 3.5, but there are also some nasty results on "nonconver-
gence", for instance that there exists an entire function / such that the
diagonal sequence of Pade approximants is divergent everywhere in the
complex plane except at the origin [Wall74]. We shall here restrict our-
ourselves to one single convergence result, perhaps the most famous one for
Pade approximants. It is due to de Montessus de Ballore [Mont02], and
concerns vertical sequences in the Pade table, i.e. sequences {Rm,n}m=Q •
Theorem 5 (Montessus de Ballore) Let f(z) be holomorphic in the
disk
1*1 < a,
except for n simple poles p\, p2? • - • >Pn> where
Take the formal power series to be the Taylor expansion of f(z) at 0,
and let Rm,n{z) denote the [m,n)-Pade approximant. Then for all z in
the disk \z\ < R minus the poles we have
lim RmAz) = f(z) >
m—»oo
and the convergence is uniform on compact subsets of this set.
386 Chapter VIII. Pade approximants
Proofs may be found in for instance [Gragg72] and [Perr57]. In the
case n = 0 this reduces to convergence of the Taylor series of a function,
holomorphic in the disk \z\ < R. For a fixed n > 0 we get a generalization
to functions, holomorphic in \z\ < R, except for n simple poles in 0 <
\z\ < R. All the rational functions Rm>n with this fixed n have n poles,
tending to the poles of f(z) when n —> oo.
2 Generalizations and extensions
2.1 Two-point Pade table
The ordinary Pade table interpolates one formal power series at one
point, usually, z — 0 (Hermite interpolation), and is connected to con-
continued fractions like regular C-fractions and P-fractions in the way de-
described in Section 1. For different types of continued fractions corre-
correspondence to a formal power series is a property which is appreciated,
and more so if the correspondence is strong enough to make the approx-
approximants entries in the Pade table. This is — as we have seen — the case
for regular C-fractions, whose approximants form a staircase in the Pade
table, and P-fractions, whose entries form a diagonal in the table. The
T-fractions
z z z
1 + d\Z-\r\ + diZ-\-\ -f d$z-\
introduced in 1948 by Thron [Thron48], have a simple structure, and
much can be said about convergence. It also corresponds to a power
series. However, none of the approximants are in the Pade table (except
for the case dn = 0 for all n). (This was said with some implicit regret.)
Much later it was discovered that in addition to correspondence to a
power series
Co 4- C\Z + c-2Z2 -f c3z3 4- • • • (correspondence at 0), B.1.1)
the general T-fraction
Generalizations and extensions 387
also corresponds to a power series
Cq + c*_{z~x + cL22~2 + • • • (correspondence at oo) B.1.3)
under the additional condition Gn ^ 0 for all n. Under certain determi-
determinant conditions on the coefficients we also know the converse, i.e. that to
a pair of series B.1.1) and B.1.3) there corresponds a general T-fraction
B.1.2). The interpolation provided by the general T-fraction is shared
between interpolation at 0 and at oo (actually roughly equally shared
as far as degree of correspondence is concerned). The two-point Pade
table (the points being 0 and oo) is constructed from a pair of series
B.1.1) and B.1.3) in a way related to what is done for ordinary Pade
tables (one-point tables) of one formal series. And in such a table we
find the approximants of the general T-fraction B.1.2). This was first
observed by McCabe and Murphy [McMu76] (for M-fractions, which are
closely related to general T-fractions). Let L and L* denote the series
B.1.1) and B.1.3) respectively. Then the two-point Pade approximant
Pm,n/Qm,n °f (^> L*) is defined by simultaneous requirements on the
orders of the first terms of the series
,/)^ ~~ -* m,n and Qm,n^ ~~ *m,n •
It can be done in different ways. We want the correspondence to be
close to be "equally shared" by the two interpolations, we require (for
even m + n + 1)
- Pm,B = O [z*^] , Qm,nL* - Pm,n = O
m,B = O [z^] , Qm,nL - Pm,n [
B.1.4)
(and a related condition for odd 7n + n + 1)- We shall not go further
into this. We refer to [Magn82] for a precise definition of the two-point
Pade approximants and properties of the two-point table, as well as an
example and references.
Another important connection between continued fractions and two-
point Pade tables is given by the PC-fractions (Perron-Caratheodory
fractions) introduced by Jones, Njastad and Thron. They are continued
fractions of the form
/3, 1 a:iz 1 a5z 1
i ~7> ~~7,— ~7, ~^— ~7> (Z.l.Ol
1 +/32+ ft +&Z+ ft +PZ+
388 Chapter VIII. Pade approximants
where K\ ^ 0 and a.2n+\ = 1 — fanfan+i 7^ 0 for n = 1,2,3, For
these one has an even/odd correspondence as follows
Q~2n ' L- J ' ~ Q2n+]
? - — = O , j , ? - —— = O * I J . B.1.6)
For a description and also more correspondence properties of these and
a proof of the connection to two-point tables we refer to [J0NT86]. Let
it finally be mentioned, that the PC-fractions, or rather certain subfam-
subfamilies are closely related to the trigonometric moment problem, Gaussian
quadrature on the unit circle, and Szego polynomials. The even and odd
parts of PC-fractions are T- and M-fractions respectively if they exist.
See also Remark 2 in Chapter V.
One may have other points of interpolation, and there may be more
than two points. This leads to multiple point Pade approximants. The
reference list in [JoTh80] provides a relevant bibliography on the sub-
subject. There are also bridges between continued fractions and multiple
point Pade approximants. E. Hendriksen and 0. Njastad introduced in
[HeNj89a] multipoint Pade fractions (to mention but one example). See
also [HeNj89b] and the references therein.
One approach to multiple point Pade tables is through the formal New-
Newton series, where the formal power series L is replaced by a formal New-
Newton series
00 n
L = co+X!c»nB:-A)' B-L7)
71=1 k=\
where the points /3fc, not necessarily distinct, are the interpolation points.
Certain staircase sequences of normal Newton-Pade-approximants are
the approximants of a Thiele continued fraction
a2{z - ft,)
" •
-j- 1 +¦ ¦ •
Observe that if all interpolation points /3^, = 0, we are back to the
ordinary normal Pade table and the regular C-fractions. We refer to
[CuWu87] and the references therein.
Generalizations and extensions 389
2.2 Pade type approximants
In the process of interpolating a function or a formal series by rational
functions one sometimes want other conditions to be satisfied in addition
to the correspondence. That, of course, has its price, the payment being
a reduction in the degree of correspondence.
The Pade type approximants, introduced by Brezinski (see e.g. [Brez80])
represent such a case. The background for inventing and studying such
a concept is that the poles of Pade approximants are essentially beyond
control, since the Pade approximants are uniquely defined by the corre-
correspondence requirement. Sometimes, however, we want to choose some of
the poles of the approximant and then determine the numerator and the
denominator in such a way that the Taylor expansion at z = 0 matches
the given formal series as far out as possible. We then get what is called
Pade type approximants. There are two extreme cases: On one hand we
can choose all the poles, on the other hand we choose no pole. Tn the
latter case we are back to ordinary Pade approximants.
We shall not go further into this topic, only refer to [Brez80]. Let it also
be mentioned that further generalizations have been made along the
same line, by fixing not only poles, but also zeros of the approximants
(pseudo-approximants).
2.3 Multivariate Pade approximants
There are different ways of obtaining the univariate Pade approximants,
such as solving the system of equations for the coefficients, or to use
continued fractions to produce staircases of diagonals, both mentioned
earlier in the chapter. Other methods are using some kinds of recursive
schemes to produce the table. These different approaches, being equiv-
equivalent in the univariate case, have been generalized to the multivariate
case. The equivalence between the different techniques, however, is no
longer there in the multivariate case. Also in the multivariate case ra-
rational approximants and interpolants can be constructed by using con-
continued fractions. In Chapter V, Subsections J^.l and ^.5, we have briefly
390 Chapter VIII. Pade approximants
touched upon branched continued fractions and different ways of defining
approximants in order to obtain (a meaningful type of) correspondene.
The definition used in [CuWu87] preserves several properties of the uni-
univariate Pade approximants. The starting point is the bivariate function
/ with Taylor expansion
oo
/(*,?)= E c.\i*V". B.3.1)
From the determinant solution of the univariate Pade approximation
problem (obtained from the system of equations by the Cramer rule)
the multivariate Pade approximants are defined by analogy: The nu-
numerator and denominator polynomials p(x,y) and q(x,y) are given by
determinants related to the ones for P(x) and Q(x) in the univariate
case (approximant P(x)/Q(x)). We refer to the exposition [CuWu87],
which also includes (among other things) muitivariate versions of meth-
methods of computation, such as e-algorithm and <7<i-algorithm (see remarks
in Chapter V), and also examples.
Problems 391
Problems
A) Use the method of Example 1 to compute #2,3B) and #3,2B) in
the Pade table for the series A.1.1).
B) Use the method of Example 2 to compute Pz{z) and Qz(z) of
degrees < 2 and 3 respectively, such that as many as possible of
the first consecutive terms of
H*)Q3(z) - P2(z)
vanish, when L is the power series A.1.1). Compare P2(z)/Qj(z)
to #2,3B) of Problem 1.
C) Compute the function in the B, l)-place in Example 5.
D) Given the formal power series A.4.1). Assume that it has a cor-
corresponding regular C-fraction A.4.2). Compute the approximant
/:1, in terms of the coefficients of A.4.1). Compute next the Pade
approximant #2,1B), and compare it to
E) Determine the start of the P-fraction expansion of the series in
Example 4 (Taylor-expansion of cos z at 2 = 0) up to and including
the third term (i.e. the Oth, the 1st, the 2nd and the 3rd). Compute
the approximants, and verify their positions in the Pade table.
392 Chapter VIII. Pade approximants
Remarks
1. For practical computation of Pade approximants different algo-
algorithms are available. We refer to [CuWu87], Chapter II, Section
3 and to the references therein. Some key words deserve to be
mentioned: The gd-algorithm, the method of Viscovatov, Gragg's
algorithm, and the e-algorithm.
2. The method of vector valued interpolation is introduced by
[Wynn63] and further developed by Graves-Morris and others, see
e.g. [Grav83]. See also Subsection 5.3 and Remark 4 in Chapter
IV. For any proper vector v in a complex finite-dimensional linear
space we define the vector inverse, called Samelson inverse by
where * denotes complex conjugation. It is easily verified, that
with this definition
v~l-v = l and (v~l)~l = v .
As observed by Peter Wynn these inverses may be used to general-
generalize the Thiele continued fraction to treat the case of vector valued
interpolation.
References
[BaGr81] G. Baker and P. Graves-Morris, "Pade Approximants: Ba-
Basic Theory." Encyclopedia of Mathematics and its Appli-
Applications, Vol. 13, Addison-Wesley Publishing Co., Reading
Mass. A981).
[Brez80] C. Brezinski, "Pade-Type Approximation and General Or-
Orthogonal Polynomials", International Series of Numerical
Mathematics, Vol. 50, Birkhauser, Basel A980).
[CuWu87]
[Gragg72]
A. Cuyt and L. Wuytack, "Nonlinear Methods in Numeri-
Numerical Analysis", North-Holland Mathematics Studies in Com-
Computational Mathematics 1, Amsterdam A987).
[Gile78] J. Gilevicz, "Approximants de Pade", Lecture Notes in
Mathematics 667, Springer-Verlag, Berlin A978).
[GiPS87] J. Gilewicz, M. Pindor and W. Siemaszko, "Rational
Approximation and its Applications in Mathematics and
Physics", Proceedings, Lancut 1985, Lecture Notes in
Mathematics 1237, Springer-Verlag, Berlin A987).
W. B. Gragg, The Pade Table and its Relation to Cer-
Certain Algorithms in Numerical Analysis, SI AM Review 14
A972), 1-62.
[Grav83] P. R. Graves-Morris, Vector Valued Rational Interpolants
J, Num. Math. 42 A983), 331-348.
393
394
Chapter VIII. Pade approximants
[HeNj89a]
[HeNj89b]
[J0NT86]
[JoJ?h80]
[Magn62a]
[Magn62b]
[Magn82]
[McMu76]
[Mont02]
[Perr57]
[Thron48]
E. Hendriksen and O. Njastad, A Favard Theorem for Ra-
Rational Functions, Journal of Math. Anal. AppL, Vol. 142,
2 A989), 508-520.
E. Hendriksen and O. Njastad, Positive Multipoint Pade
Continued Fractions, Proceedings of the Edinburgh Math.
Soc. 32 A989), 261-269.
W. B. Jones, O. Njastad and W. J. Thron, Continued Frac-
Fractions Associated with Trigonometric and Other Strong Mo-
Moment Problems, Constructive Approx. 2 A986), 197-211.
W. B. Jones and W. J. Thron, "Continued Fractions: An-
Analytic Theory and Applications", Encyclopedia of Mathe-
Mathematics and its Applications Vol. 11, Addison-Wesley Pub-
Publishing Company, Reading, Mass. A980). Now distributed
by Cambridge University Press, New York.
A. Magnus, Certain Continued Fractions Associated with
the Pade Table, Math. Zeitschr. 78 A962), 361-374.
A. Magnus, Expansions of Power Series into P-Fractions,
Math. Zeitschr. 80 A962), 209 -216.
A. Magnus, On the Structure of the Two-Point Pade Table,
"Analytic Theory of Continued Fractions" (W. B. Jones,
W. J. Thron and H. Waadeland eds.), Lecture Notes in
Mathematics No. 932, Springer-Verlag, Berlin A982), 176-
193.
J. H. McCabe and J. A. Murphy, Continued Fractions
which Correspond to Power Series Expansions at Two
Points, J. Inst. Maths. Applies. 17 A976), 233-247.
R. de Montessus de Ballore, Sur les fractions continues
algebriques, Bull. Soc. Math. France 30 A902), 28-36.
O. Perron, "Die Lehre von den Kettenbruchen", 3. Auflage,
Band II, B. G. Teubner, Stuttgart A957).
W. J. Thron, Some Properties of Continued Fraction 1 +
duz + K{z/(l + dnz)), Bull. Amer. Math. Soc. 54 A948),
206-218.
References
395
[Wall74]
[Wall83]
[Wynn63]
H. Wallin, The Convergence of Pade Approximants and the
Size of the Power Series Coefficients, Applicable Analysis
4 A974), 235-251.
H. Wallin, Convergence of Multipoint Pade Approximants
with a Fixed Number of Poles, Det Kgl. Norske Vid. Selsk-
abs Skrifter, No. 1 A983), 151-158.
P. Wynn, Continued Fractions whose Coefficients Obey a
Non-Commutative Law of Multiplication, Arch. Rat. Mech.
Anal. 12 A963), 273-312.
Chapter IX
Some applications in
number theory
About this chapter
Books in Number Theory usually have a chapter, or at least some sec-
sections, on continued fractions, mostly restricted to regular continued frac-
fractions. This restriction is of course highly understandable, in view of the
role these continued fractions have played (and play) in number theory.
On the other hand, as an undesired side-effect, many people (meaning
mathematicians) think of a continued fraction as "something" within
number theory, and only there. They are highly surprised (hopefully
pleasantly surprised) to see the many fields in mathematics and adja-
adjacent subjects where continued fractions are of use as a descriptive or
problem-solving tool.
Here we have the opposite situation: A book on continued fractions, that
contains a chapter on number theory, and a very restricted one, as far
as topics are concerned. Nevertheless, we do not think we are running
any noticeable risk of making people think that Number Theory, the
"Queen of Mathematics", is a small subset of the Theory of Continued
Fractions.
397
398 Chapter IX. Some applications in number theory
As in most books on Number Theory, this chapter uses exclusively reg-
regular continued fractions (although others would also have been of inter-
interest). We have chosen, in this chapter, wliich is placed in the "applied
part" of the book, to present two applications in number theory: The
simplest examples of the very classical field of diophantine equations,
and the likewise old problem of factoring numbers, which however, in
view of modern (but classically based) cryptography has caused renewed
interest and has led to research, where the combination of classical math-
mathematics and modern computer technology has demonstrated its power.
Some basics on regular continued fractions 399
1 Some basics on regular continued fractions
1.1 The Euclidean algorithm
Let a and b be two positive integers. We want to find the greatest
common divisor of a and 6, here written
gcd(a, 6).
As will be seen in Section 3 we shall for practical reasons be very inter-
interested in finding this quantity, in particular for large numbers a, 6. The
algorithm used for this goes all the way back to Euclid's Elements, al-
although in a slightly different form, and is usually called the Euclidean
algorithm. It goes as follows: There is a unique non-negative integer 5u
and a unique integer i"i,0<ri<6 — 1, such that
a = q$b + ri • A.1.1)
If r[ =0, then 6 divides a, written 6|a, and we have
a
The process stops. In this case the greatest common divisor is 6. If
7*1 ^ 0, we let F, r{) replace (a, 6) in the argument above:
+ r2, 0<r2<r1-l, A.1.2)
and, if 7*2 / 0:
f\ = 92»*2 + 7*3 j 0<7>3<r2-l, A.1.3)
and so forth. Since 0 < r\ < b — 1,0 < r2 < i"i — 1, and so on, we will,
after a finite number of steps, reach an r^+i, which is 0, whereas r{ ^ 0
for all i < fc, and we have
, where qk ^ 0 . A.1.4)
Hence r^rk-x-, and from
Tk-2 = Qk-lTk-l + r A:
400 Chapter IX. Some applications in number theory
it follows that r^|rA_2- Step by step we reach a and 6, and find that r*
is a common divisor of a and 6. Conversely, if d is a common divisor
of a and 6, it must divide in turn r\, r-z,..., 77.. Hence the last non-zero
residuum r^ is the greatest common divisor of a and 6,
gcd(a,6) = rk. A.1.5)
Example 1 Find gcdB587,1547).
2587 = 1-1547 + 1040
1547 = 1-1040 + 507
1040 = 2-507 + 26
507 = 19-26 + 13
26 = 2-13
Hence: gcdB587,1547) = 13.
O
Example 2 Find gcd(96577,1155).
96577
1155
712
443
269
174
95
79
16
15
= 83-1155 + 712
= 1 • 712 + 443
= 1 • 443 + 269
= 1 • 269 + 174
= 1-174 + 95
= 1-95+79
= 1-79+16
= 4-16 + 15
= 1-15+1
= 15-1
Hence: gcd(96577,1155) = 1. We also say, that 1155 and 96577 are
coprime.
O
Some basics on regular continued fractions 401
The equalities of Example 2 can be written
96577 n 712 1
= 83 + —- = 83 +
1155 ' 1155 ' 1155
712
1155 _ 443 _ J_
712 " 1+712~1+ 712
443
79 15
— = 4 + —
16 " " ' 16 " * ' 16
15
15
By repeated substitution of the fractions we find
96577
1155
" + 111 + 1 + 1 + 1+1 + 4
Quite similarly we find that Example 1 gives rise to the following termi-
terminating continued fraction
2587 _ 1 1 J_ 1
1547 " + 1 + 2 + 19 + 2"
Continued fractions where all partial numerators are 1 and all partial
denominators are positive integers are called regular continued fractions.
We permit a term in front (positive integer), and we permit it to termi-
terminate, in which case we call it a terminating regular continued fraction.
We have seen, through the two examples, that the Euclidean algorithm
gives rise to a terminating regular continued fraction
a _ l_ 1 1
b q\ + #H YQk '
where qo is an integer > 0 and </,, 1 < i < k are positive integers. In
expanding a fraction a/b in a terminating regular continued fraction as
seen here, we often assume a and 6 to be coprime, gcd(a, b) = 1, but it
also works (of course) without this assumption.
402 Chapter IX. Some applications in number theory
1.2 Representation of positive numbers by regular continued frac-
fractions
An alternative way of obtaining a regular continued fraction expansion of
a positive number x{i is as follows, and please observe that this algorithm,
contrary to the Euclidean algorithm, is not restricted to merely rational
numbers Xq:
We let (as usual) [a] mean the integer part of a, and start by writing
in the following way:
x0 = [x0] + (x0 - [x0]). A.2.1)
If xq is not an integer, we have Xq — [aj0] > 0, and we define
xi = j—r- A.2.2)
If x i is not an integer, define
^ (L2-2'}
and so on. There are now two possibilities:
1) Either we hit, sooner or later, an integer xn, or 2) no xn will be an
integer.
We shall comment on both possibilities:
1) Since
r i 1
+ —
X\
r i 1
[Xl\+
X 2
and finally
X
n
Some basics on regular continued fractions 403
we have Xq = [xq] + J_ J_ 1 (L2>3)
in which case Xq is a rational number a/b. On the other hand, if we
start with a rational number x^ = a/b, the steps indicated above
all coincide with the steps of the Euclidean algorithm. It suffices
to look at the first step: With xq = a/b, a, b positive integers we
have from A.1.1)
a 7*1
x0 = T - ft) + — •
Here go must be the largest integer < Zo, i.e. [sco], and r\/b the
"fractional part" Xq — [x0], since 0 < rv < 6 — 1. For the later steps
the argument is the same. This means that for rational numbers
the last algorithm is the same as the Euclidean as far as regular
continued fraction are concerned. We have also proved that x0
is a rational number if and only if it has a terminating regular
continued fraction expansion.
Is the expansion unique? Unfortunately the answer is No, as may
be seen from the following example:
3 and 2 + -
are both regular continued fraction expansions of one and the same
number 3. The left one is according to the two algorithms, since
[3] = 3, the other one is not. The same goes for any positive integer
xn > 2, which may be written as
zn or xn - 1 + - .
But the expansion is unique if we require the last partial denomi-
denominator to be > 1.
2) If we never hit an integer in the described algorithm we get a non-
terminating continued fraction, which is a continued fraction in
the proper sense, as defined in Chapter I, Subsection 1.2. Since it
is of the form K(l/&n) where all bn are positive and ? bn = oo, it
follows from Theorem 3 in Chapter III (the Seidel-Stern Theorem),
that the continued fraction converges. See also Example 10 in
404 Chapter IX. Some applications in number theory
Chapter I. To see that it converges to the right value x^, which in
this case has to be an irrational number, we realize that
[scn] < xn < [xn] -f
from which it follows that x^ in value (for all n) lies (properly)
between the two approximant values
r , 1 1 . . . 1 1
[Bo] + r—f , , ?—r and [x0] +
From this and the convergence, it follows that the convergence
is to the right value. Uniqueness of the expansion follows as in
the rational case. Let it also be mentioned, that since the regular
continued fraction is a positive continued fraction where all q^ > 1,
we have (from Theorem 2 in Chapter III) that the sequences of
even order approximants and of odd order approximants both are
monotone,
/o < h< f\<-< hn<"'< /2n+i < h < h < h i A-2.4)
and that we, for the value x0 have the truncation error estimate
A-2.5)
We summarize the results of the discussions in two theorems [Perr54,
Satz 2.2 and Satz 2.6, p. 25 and p. 33].
Theorem 1 Every terminating regular continued fraction represents a
unique positive rational number, and every positive rational number is
uniquely represented by a terminating regular continued fraction where
the last partial denominator is > 2 (or where the last partial denominator
is I).
Theorem 2 Every non-terminating regular continued fraction repre-
represents (converges to) a positive irrational number, and to every positive
irrational number there is a unique, non-terminating regular continued
fraction converging to that number.
Some basics on regular continued fractions
405
To compute the approximants of a regular continued fraction, we can
use the familiar recurrence relations, which in this case, since all a* = 1,
take the form
+
and the initial conditions
/1_! = 1, A0 = 60 ,
B-i = 0, Bo = 1.
We arrange the partial denominators and the approximants in a table
as follows:
0 1
L 0
bo
Au-
Bq
A2
• • •
• a •
The lines indicate the relation
A2 =
We shall illustrate this on four examples.
Example 3 For the rational number 4199/1155 we find the following
regular continued fraction expansion:
4199 _ IIIIIIIII
1155 ~ 3+ T + I + I T T '
In this case the table looks as follows:
0 1
1 0
3
3
1
1
4
1
1
7
2
1
11
3
2
29
8
1
40
11
8
349
96
1
389
107
4
1905
524
2
4199
1155
We shall use this example later.
-O
406
Chapter IX. Some applications in number theory
Example 4 The continued fraction
111 1
1 + 1+ 1H +1H
is known from Problem 3 in Chapter T, and we know that it converges
to (\/5 - l)/2. The table now looks like
0 1
1 0
0
0
1—1
1
1
1
1
1
2
1
2
3
1
3
5
1
5
8
1
8
13
1
13
21
1
21
34
i—i
34
55
¦ ¦ •
We know from Problem 1 in Chapter I that the numerators and denomi-
denominators are the Fibonacci numbers Fn, and the sequence of approximants
is {Fn/Fn+l}.
O
Example 5 z0 = y/2. This is known from Chapter I, where the treat-
treatment was informal and heuristic. If we here use the integer part algo-
algorithm we find
x0 =
v/2
and as we see, all later Xk must be V^2 +1. We find the continued fraction
/-_ -.111 1
~ 2 + 2 + 2 + - ..+ 2+
The table is as follows
0 1
1 0
1—1
1
1—1
1
2
3
2
1.5
2
7
5
1.4
2
17
12
1.417
2
41
29
1.414
2
99
70
1.4143
• ¦ •
• • •
• * •
Some basics on regular continued fractions
407
We have earlier observed how quickly the approximants approach y/2.
O
Example 3 was a terminating continued fraction and represented a ra-
rational number. Examples 4 and 5 were periodic continued fractions, in
both cases of period 1. For convergent periodic regular continued frac-
fractions we know from Theorem 6 in Chapter 111 that their value is of the
form
AN +
where x is the attractive fixed point of
That is, the value is of the form
yfD
E
where D is a non-negative integer, C, E are integers, E ^ 0.
The next example is neither terminating nor periodic. It is the regular
continued fraction expansion of 7r.
Example 6 With x0 = tt = 3.1415926535... we find the following (start
of a) continued fraction:
111111
+ 7 +15+1 +292+1 + 1+..."
The table with the approximants is as follows:
0 1
1 0
3
3
1
3
7
22
7
3.143
15
333
106
3.1415...
1
355
113
3.1415929...
292
103993
33102
3.141592653...
1
•
•
•
Observe how good this is already for n = 3 and n = 4. That it is so
good for n = 3 has to do with the very small tail, caused by the partial
denominator 292.
O
408 Chapter IX. Some applications in number theory
1.3 Best approximation
Already in Chapter I, in Subsection 2.1, we mentioned that a regular
continued fraction produces the best rational approximation to an irra-
irrational number. The main purpose of the present section is to prove this.
We first recall the definition of bestness (A, B, P, Q are integers):
Definition For fractions A/B, gcd(i4, B) = 1, B > 0 we use the term
best rational approximation to a real number ?, if (and only if) every
other fraction P/Q, Q > 0, with |f — P/Q\ < |f — A/B\, has a larger
denominator.
The main theorem to be proved in the present subsection is the following
[Perr54, Sektion 15]:
Theorem 3 The regular continued fraction approximants of order > 1
for a positive number ? are the best rational approximations to
Remark: The lattice point illustration in Subsection 2.1 of Chapter
I illustrates the bestness. No lattice points (P, Q) are contained in the
polygon with corners in @,0), A,0), @,1) and the points (Arn Bn) cor-
corresponding to the nth regular continued fraction approximants off (and
a point on the ray y = ?sc), as described in Example 2, Chapter T.
Our main tool in the proof of bestness is the following Lemma, which in
fact is a theorem of Lagrange, see for instance [Perr54, Satz 2.17]:
Lemma 4 Let ? be a positive number, and let An/Bn be the nth app-
roximant (n > 1) of the regular continued fraction expansion of f in
canonical form. Let P and Q be positive integers such that P/Q ^
An/Bn with 0 < Q < Bn. Then
\Qi -P\> |Bn_,? - An.y | > \BnZ - An\. A.3.1)
Some basics on regular continued fractions 409
Proof of Lemma 4: This proof goes back to Legendre. Let M and
N be such that
AM + AN P , .
Since the determinant of this 2 X 2-system of linear equations with un-
unknowns M, JV is
AnBn-\ — BnAn-\ = ±1,
such numbers M, JV exist uniquely, and they are integers. It is readily
seen, that JV ^ 0, since TV = 0 would lead to An/Bn = P/Q, which is
assumed not to be the case. Furthermore M is either = 0 or has opposite
sign of JV, else Q > /?n, contradicting the conditions.
With M and JV as given above we study the identity
nt, - An) + N(Bn^i - An_,). A.3.3)
Here the two expressions in parantheses have opposite signs by property
A.2.4), and M, JV also have opposite signs (unless M = 0). Hence
\Qi -P\ = \M{Bn(, - An)\ +
and, since JV is an integer / 0, we have
A.3.4)
Since always |i?n_if — An_i | > \Bn? — An\ (Problem 4), the lemma is
proved. ¦
Proof Theorem 3: Let An/Bn be a continued fraction approximant
for ? (canonical form), and let P/Q / An/Bn be closer to f or equally
close, and 0 < Q < Bn. That is,
Simultanous multiplication, left by Q and right by Bni gives
\Qt ~P\< \Bnt ~ An\ .
This contradicts Lemma 4, and the theorem is thus proved.
410 Chapter IX. Some applications in number theory
2 Some diophantine equations
2.1 Linear diophantine equations
Diophantine equations are algebraic equations in two or more unknowns,
with integer coefficients, where one seeks integer solutions. They are
named after Diophantos of Alexandria (around 250 A.D.). Linear equa-
equations in two unknowns are of the form
ax + by = c, B.1.1)
where a, b and c are integers, and where the problem is to find all integer
solutions (k,2/). In some cases one wants to find particular solution(s)
satisfying certain conditions, for instance all positive solutions sc, 3/, both
less than some fixed N.
If d / 1 is a positive integer which is a common factor for a and 6, then
any combination ax + by with integers sc, y must be divisible by d. Hence,
unless also d is a factor in c, the equation does not have any solution
at all. On the other hand, if such a factor exists, it can be cancelled,
and hence without loss of generality we assume a and b to be coprime,
gcd(a,6) = 1.
We shall at first find the "structure" of the set of solutions. Assume
that we somehow have found a solution (aj0,2/0), i.e.
+ by0 = c .
Then any other solution (x,y) must satisfy the equation
a(x - so) + b(y - y{)) = 0 .
Since gcd(a, 6) = 1, the set of solutions is given by
x - xo - tb , y - 2/0 = -ta,
where t ? Z, i.e. the set of solutions is
tbty0-ta)-tteZ}. B.1.2)
Some diophantine equations 411
This means: If we have one solution, we have them all. Differently
phrased: If we are able to find one solution, we have the general solution.
We shall see how we can use regular continued fractions to find one
solution.
In addition to the condition gcd(a, b) = 1 we shall now assume a and 6
to be positive. (As we shall see later, this is no severe restriction.) We
expand a/b into a regular continued fraction. Then the last approximant,
An/Bn must be = a/b. Since gcd(a, 6) = 1 and gcd(An, Bn) = 1, we
have
An = a, Bn = 6,
and hence, by the determinant formula A.2.10) in Chapter I,
From this it immediately follows that
a- (-l)"~li?n_,c + &- (-l)ni4n_ic= c.
Here we have our special solution
x0 = {-\)n-lBn_lC, 2/0 = (-l)nAn_lC, B.1.3)
and the following:
Theorem 5 Let a and b be positive coprime integers, and let An_ \ / Bn-\
be the second to last approximant in one of the two regular continued
fraction expansions of a/b. Then the general solution of the diophantine
equation
ax -j- by — c
is
We shall illustrate this by an example.
Example 7 We shall find all integer solutions of the diophantine equa-
equation
3k + 5y = 2.
412 Chapter IX. Some applications in number theory
The regular continued fraction expansion of 3/5 is
3 „ 1 1 1
0 +
The second to last approximant is A2/B2 — 1/2, and we have A2 = 1,
?2 = 2. We then have the general solution (n = 3)
z = 2.2f5t = 4 + 5(
y=:-1.2-3t = -2-3i ' fc
O
If a and 6 are not both positive it is just as simple. Since we disregard
the case when a or 6 is 0, we may without loss of generality assume
a > 0. If b is negative we get the same aj, but opposite sign for y as
compared to the case a > 0, 6 > 0. We have for instance that
- by = 2
has the general solution
s = 4 + 5t
(Compare Example 7.) Rather than thinking in terms of formulas, such
as the ones in Theorem 5 or some modification for other a, 6-signs, we
should keep the main ideas in mind:
1) To get a special solution by using the determinant formula (from
the expansion of |a|/|&| or |6|/|a|).
2) To get from one special solution to all of them.
We illustrate this by a final example.
Example 8 We shall find the general solution of the diophantine equa-
equation
4199z- 11552/= 3.
Some diophantine equations 413
In Example 3 we found the regular continued fraction expansion of
4199/1155. The second to last approximant was found to be 1905/524.
The determinant formula gives
4199 • 524 - 1155 • 1905 = 1,
and hence
4199-1572- 1155-5715 = 3.
The general solution is then
y = 5715 + 4199* ' G
We find in particular that t = — 1 gives the solution with smallest abso-
absolute value of x as well as y. This solution is
a;'= 417, 3/'= 1516.
_O
2.2 Pell's equation
The Pell equation is a diophantine equation of the form
22 = lt B.2.1)
where D is a positive integer, and where we are looking for integer
solutions (sc,2/) different from the trivial ones (±1,0). If D = C2 for an
integer C / 0, then B.2.1) can be written
(x - Cy)(x + Cy) = 1,
which has only the trivial solutions (±1,0). Therefore we shall assume
that D is not the square of an integer. We shall also be interested in
diophantine equations
x2 - Dy2 = -1. B.2.2)
Let us first look at a special case, which indicates how the regular con-
continued fraction expansion of \[T) enters into the process of solving the
two equations.
414 Chapter IX. Some applications in number theory
Theorem 6 Let D be of the form m2 -f 1, where m is a positive integer,
and let An/Bn be the nth regular continued fraction approximant for
j canonical form. Then the following holds for all k = 0,1,2,3,...;
4!t - DB>k = -1, A\M - DBik+l = 1. B.2.3)
Proof : We find that the regular continued fraction is
y/D = y/m2 + 1 = m + —
2m-f 2m H f-2m-| '
Hence the sequence of right tails /(") is /(") = \D — m for all n.
Therefore
and hence
n+l + An(,/D - m)V Aj
)
Bn+l + Bn{y/D - m)
+ 2AnBn(An+lBn - AnBn+l)(VD - m)
- TTl)
(-1)" (Aw+1 + An(y/]9 - m))Bn + (Bw+I + Bn{yfD - m))An
(Bn+X + Bn{y/B - m))
(-1)" VS
B2 ' Bn+{ + Bn(y/D - m)'
where we used B.2.4) to arrive at the last equality. From B.2.4) it
follows that x = y/D is a solution of the quadratic equation
x(Bn+i + Bn(x - m)) = An+l + 4n(sc - m)
i.e.
2 - mBn - An)x - An+l + Anm = 0.
Some diophantine equations 415
This means that Bn+i —mBn — An = 0; i.e. An = Bn+l —mBn. Inserted
into the last expression for D — A^/B^ this gives
(-1)" y/DBn + Bn+l - mBn (-1)"
which proves B.2.3).
Remark: It can be proved, that the solutions given by Theorem 6 are
the only non-trivial solutions.
Example 9 Take D = 2, that is: We study the equations
x2 - 2y2 = I
and
x2 - 2y2 = -1 .
This example fits right into Theorem 6 with m = 1. We have in Example
5 the regular continued fraction expansion of y/D = y/2, as well as its
first approximants in canonical form. From the table of approximants
we find, by using Theorem 6:
Solutions (x,y) of
x2-2y2 = 1: C,2), A7,12), (99,70),...
x2-2y2 = -1 : A,1), G,5), D1,29),...
Numerical verifications:
992 - 2 ¦ 702 = 9801 - 2 • 4900 = 1,
412-2-292 = 1681-2-841 =-1.
O
Example 10 D = 50 is also an example of Theorem 6. Here m = 7.
The continued fraction expansion of v 50 is
J_ jL _L
T4+T4+14+.
416 Chapter IX. Some applications in number theory
The first approximants are
7 99 1393 19601
1 ' 14 7 197 ' 2772 '*"
The first solutions are :
Foraj2-50y2 = 1: (99,14), A9601,2772),...
Fora;2-50y2 = -1 : G,1), A393,197) ,...
We have, for instance,
196012 - 50 • 27722 = 384199201 - 50 • 7683984 = 1,
and
13932 - 50 ¦ 1972 = 1940449 - 50 ¦ 38809 = -1.
O
We shall see that also for general D the solutions are found by using the
regular continued fraction expansion of y/l).
It is a well known fact that if D is a positive integer, not a perfect square,
then y/D has a periodic regular continued fraction. (This is even true if
D is a rational number such that y/D is non-rational, [Perr54, Satz 3.9,
p. 79].) It turns out that the length of the period enters into the process
of solving the equations and also into the solution itself. Theorem 6
deals with the very simplest case, with period length 1. We conclude
this section by stating without proof a general theorem by Legendre
[Perr54, Satz 3.18, p. 93], followed by two examples as illustration.
Theorem 7 Let D be a positive integer, not the square of an integer.
Further let k be the length of the primitive (shortest) period in the regular
continued fraction expansion of y/l) and let An/Bn be the approximants
in canonical form. The Pell equation x2 — Dy2 — 1 is always solvable.
The set of non-trivial solutions consists of all (a:, y) with
x — Ank-1 , y = Bnk-1 ,
for n — 1,2,3,... for even k, n — 2,4,6... for odd k. The equation
x2 — Dy2 = —1 is only solvable for odd k, and the set of solutions
consists of all (x, y) with
l} 71=1,3,5,....
Some diophantine equations
417
(Observe again that Theorem 7 contains Theorem 6 as a special case
(k = 1). Observe again that the Pell equation x2 — Dy2 — 1 always has
the trivial solution x = ±l,y = 0.)
Example 11
x2 - 51jT = ±1.
VoT has the following 2-periodic regular continued fraction expansion
(Problem 3b):
1111
^ 714 7 14
The first approximants are listed in the table below :
0 1
1 0
7
7
1
7
50
7
14
707
99
7
4999
700
14
70693
9899
* • *
• • •
From Theorem 7 we know that the equation x2 — 51y2 = — 1 has no
solutions (the period length k = 2). The Pell equation x2 — 51y2 = 1
has the solutions (x,y) = (^2m-i>i?2m-i) f°r ro = 1,2,3,..., the first
ones being:
, y) = E0, 7) : 502 - 51 • 72 = 2500 - 2499 = 1,
(se, y) = D999,700) : 49992 - 51 • 7002 = 24990001 - 24990000 = 1.
Let us, just for fun, se what happens to
j i-e- we compute
for D = 51 for some small m- values :
m = 0 : 72 - 51 • I2 = -2 ,
m = 1 : 7072 - 51 • 992 = 499849 - 499851 = -2 ,
m = 2 : 706932 - 51 - 98992 = 4997500249 - 4997500251
= -2
(All of you will be able to guess, some of you may be able to prove what
happens for larger in-values.)
O
418
Chapter IX. Some applications in number theory
Example 12
x2 - 53y2 = ±1 ,
\/53 has the following 5-periodic regular continued fraction expansion
(Problem 3c):
71 I I i L I
Period
From Theorem 7 we know that
^iom-6 ~ 53
for m = 1,2,3,..., and that
for m = 1,2, 3,... The first approximants are listed in the table below:
0 1
1 0
7
7
1
3
22
3
1
29
4
1
51
7
3
182
25
14
2599
357
* • •
« • •
We have for instance
A\ - 53 ¦ B\ = 1822 - 53 • 252 = 33124 - 33125 = -1 .
If we go on with the table we find A9/J&9 = 66249/9100, and we have
A% - 53 • Bl = 662492 - 53 • 91002 = 4388930001 - 4388930000 = 1.
-O
3 Factoring integers
3.1 Introduction
The problem of factoring integers with a large number of digits B0, 30,
40 and more) has become gradually more important, also from a prac-
practical point of view, throughout the last 10-15 years. The reason for this
Factoring integers 419
is the rapid development and increased use of number theoretic cryp-
tosystems for secret communication in business, for instance in banking.
Such systems are beautiful, interesting and, in some cases, rather sim-
simple examples of modern applications of classical (old) mathematics. We
shall not go into this here, but refer the interested reader to the books
[Kobl87], [Schr86].
One important feature of the crypto-system (RSA-cryptography) we
here have in mind, is that the way of encrypting a message (in form
of a number, a string of digits) is publicly known, whereas the way of
decrypting an encrypted message is known only to the receiver of the
message. The way to break the code depends upon the possibility of
writing a certain known number (usually a product of two publicly un-
unknown primes) as a product of at least two proper factors. A proper
factor is a factor that is different from 1 and the number itself. The
methods developed to factor large numbers are of course not aiming at
a criminal type of application. They serve two purposes:
1) They represent examples of what can be accomplished by using
mathematical algorithms adjusted to the potential and power of
present computer technology.
2) They help draw the line for what is presently possible or not, a
line which is vital for the use of cryptography.
With the technology present in the 19807s, the factoring of a number of
100 digits was estimated to take 74 years. A number of 200 digits would
take 3.8 • 109 years. This means that the banks (or other users) were
pretty safe in using as their crucial number for their crypto-system one
that is a product of two prime numbers of approximately 50 digits each.
(A "mere-luck" breaking of the code has probability of the order 1O~50.)
The time estimates do not exclude the possibility of finding one or more
factors more quickly. We can e.g. see it immediately if a 200-digit is di-
divisible by 2 and by 5, say, and if other small prime numbers are present,
they are easily traced in a short time. But there still remains the fac-
factoring problem for a number with almost as many digits as the original
one. Experience seems to indicate, that numbers with few, large factors
are the worst.
420 Chapter IX. Some applications in number theory
A description of factoring methods can be found in for instance [Kobl87]
and [Ries85]. In the present exposition we shall restrict ourselves to one
single method, where a basic continued fraction property plays a crucial
role.
3.2 Fermat factorization
In the rest of the chapter the problem to be discussed is how to factor
a positive integer n. In most cases we will assume n to be an odd
number, not a perfect square (i.e. not the square of an integer). These
restrictions are not severe, they are only meant to rule out the two most
trivial cases. If n is a perfect square, we already have a factorization,
and we may proceed the factoring process on y/n. Similarly with n/2, in
case n had been even. If we do not test these things in advance, it will
show up through the method. Another remark before we start: Once
we have found a proper factor of n, we are through, because what we
are aiming at here, is to describe a method for factoring a number n
into two proper factors. If we need a further factorization, for instance
down to the prime factors, we can use the method again and again on
the factors we successively find (or a simpler method, when we come
down to smaller numbers).
Assume now that n is an odd integer, not a perfect square. Fermatys
method is to search for positive integers a;, y, such that x2 — y2 = n, in
which case we have
n= (x -
Unless x — y — 1, this gives a proper factorization. Since x > y/n we
start the search at x — [y/n\ -f 1 and go on to i = [^/n\ -f 2, x —
[y/n\ -f 3,.. .and every time ask the question: Is x2 — n a perfect square?
When (if) we get a conclusive answer we are through: x2 — n = y2 gives
n = x2 — y2 = (x — y)(x + y). Even if x — y = 1 we are finished. If this
is the very first x for which the answer is yes, then n is a prime. Hence:
The search in Fermat's method will always, sooner or later, lead to a y
with x2 — y2 — n. If it happens "as late as for" x = (n + l)/2, then n
is a prime. (Another story is that for numbers of more than 3-4 digits
we would never go as far as that in the search. Already a prime number
near 1000 would cost more than 450 steps in the process.)
Factoring integers
Example 13
421
a) n = 44377, ^=210.658...
X
211
x2 — n
144
Square?
Yes A22)
n = 44377 = 2112 - 122 = B11 + 12)B11 - 12) = 223 • 199
b) n = 1018579, yfn = 1009.2457...
X
1010
x2 — n
1521
Square?
Yes C92)
n = 1018579 = 10102 - 392 = A010 + 39)A010 - 39) = 1049 ¦ 971
c) n = 962001, v^= 980.816...
X
981
982
983
984
985
•
¦
•
999
1000
1001
x2 - n
360
2323
4288
6255
8224
•
¦
36000
37999
40000
Square?
No
No
No
No
No
•
•
•
No
No
Yes B002)
n = 962001 = 10012-2002 = A001 + 200)A001-200) = 1201 • 801
-O
422
Chapter IX. Some applications in number theory
In working out the last list, we have used the fact that the square of an
integer never ends with 2, 3, 7 or 8.
The Fercnat method obviously is useful only if n can be written as a
product of two factors rather close to >/n. If this is not so, the number
of cases will be large.
A slight modification of Fermat's method is to take a small, positive &,
and with x = [\/kn] + 1, x = [y/kn] -f 2,.. .ask the question: Is x2 — kn
a perfect square? If the answer is Yes, we have
x2 — kn = y2
for some positive integer, and hence
kn — (x + y)(x - y).
C.2.1)
C.2.2)
This is a factorization of kn. Since here k is known, (we have chosen it)
we divide by k and get a factorization of n.
Example 14
a) n = 2813. Take k = 3. Vkn = 91.864
X
92
x2 - 3ra
25
Square?
Yes E2)
3n = 922 - 52 = 97 • 87, n = 97-29
b) Let us try the Fermat method unmodified on the same number:
n = 2813. y/n = 53.0377...
X
54
55
¦
¦
62
63
x2 — n
103
212
¦
¦
1031
1156
Square?
No
No
•
«
No
Yes C42)
Factoring integers 423
n = F3 + 34)F3 - 34) = 97-29
-O
Observe that the modified process in this example is much quicker than
the ordinary Fermat method, the reason being that n — a • 6, where 36
is near a.
Since in most non-trivial cases we have no way of finding in advance a
"good" &-value, like the one in Example 14, we need a further modifica-
modification. Instead of looking for sc, y such that x2 — y2 = n or kn for a given
k, we just look for jc, y such that n is a factor in x2 — y2,
n\(x2 -y2), C.2.3)
or written a different way:
x
2 _
= yl (mod n). C.2.3')
If we are able to find such x, y-values, we find a factor in n by deter-
determining gcd(aj — y, n) or gcd(aj -f y, n) by using the Euclidean algorithm,
which is a very simple and quick procedure. The rest of our discussion
on factoring integers n will be to establish a procedure for finding such
x,y- values.
3.3 Factor bases
In the search for x, y with x2 = y2 (mod n) we shall be helped by the
concept of a factor base, which is a set
B = {pi,P2,--,Ph}, C.3.1)
where the numbers pt are distinct primes, except that p\ may be — 1.
For a given odd integer n and a given factor basis B an integer A shall
be called a B-number iff the unique number a, given by
A2 = a (mod n), -- < a < - C.3.2)
424 Chapter IX. Some applications in number theory
can be written as a product of factors from B.
A more precise name would perhaps have been ^-number with respect to
n, since n is a part of the definition. The way it will be used, though, is
that we are keeping n fixed throughout the process (to is the number to
be factored), whereas we may have to change from B to a B or B* etc.
Accordingly, there will be ^-numbers, B-numbers, i^-numbers etc., all
with respect to n.
We shall now illustrate the concepts of factor base and JB-numbers
through an example. In this example the base and the jB-number seem
to come out of the thin air. This example, however, will come back
repeatedly, and in the end be the first one to illustrate the method we
are aiming at. We will in congruences often omit (mod n) when it is
clear from the context.
Example 15 (First time.) Let n = 6649, and let
B = {-1,3,5}
be the factor base. Then, since
822 = 75 = 3 • 52 ,
1632 = -27 = (-1)-3:|,
10602 = -81 = (-!)• 3%
we see that 82, 163 and 1060 are i?-numbers. Observe that if we switch
over to the factor base B = { — 1, 3}, the numbers 163 and 1060 are B-
numbers, but not 82.
O
Let h be the number of elements in the factor base #, and let Th be the
vector space of dimension h over the field of the two elements 0, 1 with
operations + (mod 2) and • (mod 2). To each /^-number A we associate
a vector in Tin such that the ith coordinate counts the number of times
(mod 2) the base number p, occurs as a factor in a (i.e. 0 if we have
no pi or pi raised to an even number, 1 if we have a p, raised to an odd
number). We shall illustrate this:
Factoring integers 425
Example 15 (Second time.)
n = 6649, B = {-1,3,5}.
From the factors of the a's we find that the ^-numbers are associated
with the following vectors in Ty.
82 > @,1,0),
163 —•» A,1,0),
1060 > A,0,0).
O
We shall now see how these vectors are used. We still assume that we
have an odd, positive number n, not a perfect square, and that we have
a factor base B and some jB-numbers A{. Let a; be the numbers given
by
A2 = n- <T n- <T — A ^ 9'^
i * ' OO
i is supposed to belong to a certain index set /. Assume now, that the
set of vectors associated with the set of A^s is linearly dependent. Since
we only can have 0 or 1 as coefficients in a linear combination, the linear
dependence means that the sum of some of the vectors is @, 0,0,..., 0)
(we may as well assume all, since we may throw away the A^s where
the associated vector has the coefficient 0 in the linear combination in
question). Since the sum of the vectors is the zero vector, the product of
the a;'s must have all its factors Pi,p2, • • • ,P/i an even number of times,
i.e. it must be a perfect square C2. From
H Af = H a; (mod ti) C.3.3)
we find
A2 = C2 (mod n), C.3.4)
where A is the product of the A^s, reduced (mod n) to a positive num-
number < 7i. C is also reduced (mod n) to ± a fixed positive number. But
now we have found what we have been looking for, two squares such
that 7i divides the difference:
n\(A - C)(A + C). C.3.5)
426 Chapter IX. Some applications in number theory
Unless we have bad luck, meaning that A = ±C (mod n) we find a
factor of n by computing
gcd(A + C, n) or gcd(j4 - C, n) .
We illustrate this by going back to our example:
Example 15 (Third time.)
n = 6649, B = {-1,3,5}.
i?-numbers and their vectors:
82 —> @,1,0),
163 _> A,1,0),
1060 > A,0,0).
The three vectors are linearly dependent, since their sum is @,0,0). The
product of the a,'s is (see Example 15, first time.)
75 • (-27) • (-81) = 3 • 52 - (-1) • 33 - (-1) • 31
= (-1) -3 -5 ,
which is a square. We take
C = (-l)-3l-5= -405.
We furthermore use
JJ At = 82 • 163 ¦ 1060 = 14167960 = 5590 = ^1.
We now know, that
6649|E590 - 405)E590 + 405).
We seek the greatest common divisor of 5590 — 405 = 5185 and 6649:
6649 = 1 • 5185 + 1464
5185 = 3-1464 + 793
1464 = 1-793 + 671
793 = 1 • 671 + 122
671 = 5 • 122 + 61
122 = 2-61
Factoring integers
427
Hence the greatest common divisor is 61. We find
6649 = 61-109.
Since both factors are prime numbers, no further factoring is possible.
-O
What we have seen in Example 15 is of no use, unless we, to a given
n, are able to pick a base B and jB-numbers in a "good" way, i.e. such
that we can find .4, B with A2 = C2 (mod n) by using a reasonable
amount of effort. Here we will be helped by the continued fractions. In
the next section we shall present a lemma, which is the basis for the use
of continued fractions in the problem of factoring numbers.
3-4 A lemma on continued fractions
Lemma 8 Let x > 1 be a real number, and {Ai/Bi} its sequence of
regular continued fraction approximants in canonical form. Then
I A? - x'2Bf\ < 2x for all
C.4.1)
Proof : From the proof of Theorem 3 in Chapter III we have, since
the continued fraction is non-terminating,
— X
1
from which it immediately follows that
2r>2i
\M-*'Bi\ =
i-xB>\ =
\2x
Bf
Bi
C.4.2)
— x + 2x
— x
Bf
428 Chapter IX. Some applications in number theory
, Bi 1 \
= 2x —-i- + —5- ) <2x
< 2k
since i?;+i = b{+\Bi -f #i-i > 1?? + 1 for i > 1. This proves the lemma.
Our use of the lemma is for the case
X2 = 7? ,
where n is a positive integer, in which case we have
C.4.3)
As before we assume that n is an odd number, not a perfect square. The
significance of this result to us is that if we from such an A{ compute
Af and reduce it (mod n) to get an a, with
we even get
- 2y/n < ai < 2y/n. C.4.5)
In connection with the choice of base one needs to factor the at's. And
if n has say 40 digits, the a,'s we get by using the continued fraction
approximant numerators will have at most roughly 20 digits rather than
the expected 40. This makes a huge difference as far as computing the
factors of a, is concerned.
We are now ready to describe a continued fraction method for factoring
large numbers.
3.5 The continued fraction factoring algorithm
Let n be the positive integer we want to factor. Without loss of gener-
generality we assume that n is an odd number, not a perfect square.
Factoring integers 429
Step 1 Expand yfn into a regular continued fraction (meaning the start
of the expansion, as far out as it turns out to be needed).
Step 2 Find the corresponding numerators A; of the approximants (ca-
(canonical form), if desired reduced (mod n).
Step 3 Compute Af and reduce (mod n) to a number dj between — n/2
and -f-n/2, which, as we know from Lemma 8, is between —2y/n
and 2y/n.
Except for the question about how far out one should go, these steps
are straightforward and require no estimation or test. The next step is
more of a trial-and-error step.
Step 4 By studying the factors of the computed a^-values we try to
choose a base B. We do this by picking the indices such that
—1 (possibly) and primes occur more than once among the picked
ai's. Let B consist of these primes and possibly —1. Then all Ai
with the corresponding indices are i?-numbers. List the associated
vectors in T\t- If we find a linearly dependent set of vectors, take
as A the corresponding product of the A^s (mod n) and as C2
the product of the corresponding at-'s (since we know that it is a
square). Unless A = ±C (mod n) we are through, since gcd(A +
C, n) or gcd(A — C, n) is a proper factor in n, which is then easily
found by the Euclidean algorithm.
Step 5 If we have not found a linearly dependent set of vectors in
or if we have reached the situation A = ±C (mod n), we have to
continue the expansion and run through steps 1, 2, 3, 4 (and in
unfortunate cases 5) again.
We shall ilustrate this method on the number in Example 15.
Example 15 (Last time.)
n = 6649
%/6649 = 6o + KA/6B) = 81 + j \ J -j-
1 +1 + 5+iH
430
Chapter IX. Some applications in number theory
i
bi
Al
ai
0
81
81
-88
1
1
82
75
*
2
1
L63
-27
*
3
5
897
80
4
1
1060
-81
*
Observe the sizes of the a,'s, compared to
-] = 3324 and [2y/n] = 163 .
Interesting z-values are indicated by *. They are interesting because
the set of a^values {75, —27,-81} is such that all factors occuring, i.e.
—1, 3,5, occur more than once^ and hence it may be a good idea to try
out
B = {-1,3,5}
as a factor base, in which case A\ = 82, A2 = 163, and A\ = 1060
automatically are i?-numbers. Now we are back to what we did in
Example 15 (Third time). The rest of the argument is identical with
what we did there, leading to the factorization
6649 = 61-109.
-O
The only new thing in this version of Example 15 is that factor base and
B-numbers are the result of a search based upon some simple principles.
A final remark in connection with "interesting i-values": Why could
not other i-values have been chosen within the ones for which Ai is
computed? Obviously i = 0 is out, since 11 is a factor in a0 and nowhere
else within the table. But i = 3 could have been chosen. We have 2 as a
factor more than once, but 5 only once. Hence i = 1 has to be chosen,
since also there we have 5 as a factor. But in order to match the factor
3 there we must choose i = 2 or i = 4. But if one of these is chosen,
the other one must be chosen, in order to match the factor — 1. These
considerations suggest the factor base
5 = {-1,2,3,5}.
Factoring integers
ii-numbcrs and vectors are
82
163
897
1060
431
V, = @,0,1,0)
V2 = A,0,1,0)
?:, = @,0,0,1)
Ki = A,0,0,0)
We find that V1,V2, Vt are linearly dependent, V3 is not needed in the
argument. The element 2 G B is not needed to establish a working
factor base. It is a good strategy to keep the factor base as simple as
possible. Thus we are back to B.
We shall briefly present two more examples. They are worked out pre-
precisely as Example 15, and will be presented without comments.
Example 16 Factoring n — 9073 by the c.f. method:
3+I+26 + 2 + ---
i
b;
A{ (mod n)
a, = Aj (mod n)
0
95
95
-48
*
1
3
286
139
2
1
381
-7
3
26
1119
87
4
2
2619
-27
*
Interesting i-values are indicated by *. Suggested factor base: B
{—1,2,3}. ^-numbers and vectors:
95
2619
A,0,1)
A,0,1)
They are linearly dependent (Sum = @,0,0)).
C2 = (-48)-(-27) = (-lJ-2t-31
C = f-l)-22-32= -36
A = 95 • 2619 (mod n) A = 3834
432
Chapter IX. Some applications in number theory
We find gcd(A - C,n) = gcdC870,9073) = 43, for instance by the
Euclidean algorithm. Conclusion: 9073 = 43 • 211.
(Since 43 and 211 both are primes, no further factorization is possible.)
O
In the next example we show how the method at first fails (since A =
±C), and next how we can make it work again by increase of i and
change of base.
Example 17 A case of bad luck.
n = 26069
1 1 1 J_ 1 1
+ 4 + 1+1 +16+ 2 + 4+•
t
bi
Ai (mod n)
0
161
161
-148
1
2
323
53
2
5
1776
-173
3
1
2099
140
*
4
1
3875
-119
5
2
9849
52
*
6
5
982
-229
7
1
10831
61
8
4
18237
-133
*
9
2
21236
65
*
10
4
24974
-149
11
1
20141
172
12
T—l
19046
-19
*
13
16
12049
•
•
•
•
Interesting i- values are indicated by a *. Suggested factor base:
B = {-1,2,5,7,13,19}.
The ^-numbers Ai, the corresponding a; = A* (mod n) and the vectors
are in the table below.
Factoring integers
433
At
2099
9849
18237
21236
19046
a-i
140 = 22 • 5 • 7
52 = 22 ¦ 13
-133 = (-l)-7-19
65 = 5 • 13
-19 = (-!)• 19
Vector
@,0,1,1,0,0)
@,0,0,0,1,0)
A,0,0,1,0,1)
@,0,1,0,1,0)
A,0,0,0,0,1)
The vectors are linearly dependent (Sum = @,0,0,0,0,0)).
A = Y[Ai = 85!!
C2 = (-lJ.2l.52-72-132.192
C = (-l)-22- 5 -7- 13 -19= -34580= -8511. Too bad!
... followed by good luck:
Go on to i = 13. We find
and
= 12049 (mod n)
3 = A23 = 140 (mod n).
Now the interesting i-values are 3 and 13. They suggest the factor base
^ = {2,5,7}.
J5-numbers Ai, corresponding at and vectors are
a:i = 140 @,1,1)
Ai:i = 12049 a13 = 140 @,1,1)
The vectors are linearly dependent, sum = @,0,0).
A = 43-^,3 = 3921
C2 = 2J • 52 • 72 , C = 22 • 5 • 7 = 140
gcdD + C,n) = gcdD061,26069) = 131
Conclusion: 26069 = 131 ¦ 199.
Since 131 and 199 both are primes, no further factorization is possible.
O
434 Chapter IX. Some applications in number theory
A final suggestion: Look at the a^'s, how small they are. They are always
picked to be in each case the unique one for which
|a,| < 13034 (= [|]),
but since we have regular continued fraction numerators as Afs, we
know they must satisfy
Problems 435
Problems
A) Use the'Euclidean algorithm to find:
(a) gcdA19,221),
(b) gcdC839,1711),
(c) gcdD9907,22243).
B) Find the regular continued fraction expansion and the approxi-
mants for the following numbers:
(a) 47/99,
(b) 3839/1711,
(c) 15015/7429.
C) Find the regular continued fraction expansion, including the pe-
period, and some of the first approximants for
(a) >/82,
(b) v/51,
(c) \/53.
D) With notation and conditions as in Lemma 4 prove that
> \Bn?-An\.
E) Find the general solution of the following linear diophantine equa-
equations:
(a) 11k - 3j/ = 5,
(b) 99k - 472/ = 3,
(c) 3839x - 17112/ = 1,
(d) 3839k + 17112/ = 1.
In all cases find in addition the particular solution with the small-
smallest positive k-value. In problem (c), before solving it, justify the
existence of solutions from Problem lb.
436 Chapter IX. Some applications in number theory
F) Find explicit formulas for An and Bn for the continued fraction
used in the proof of Theorem 6. Use these formulas to prove that
i - DBl = (-!)
"+'
(Hint: Put r = y/m2 -f- I — ra, and hence r l = y/m2 -f- 1 + tji.)
G) Find two solutions of each of the following diophantine equations:
(a) x2 - Wy2 = ±1,
(b) x2 - 37y2 = ±1.
(Use Theorem 6.)
(8) Use Theorem 7 to decide which of the following diophantine equa-
equations have solutions and which ones do not. In cases of existence,
find two solutions.
(a) x2 - 6y2 = ±1,
(b) x2 - UOy2 = ±1.
(9) (a) For n = 2077 take B = {-1,2,3,13}. Prove that 45, 46 and
91 are ??-numbers, and use this to factor 2077 as shown in
Subsection 3.3 and Example 15.
(b) For n = 6649 take B = {-1,2,3,5}. Prove that 75, 163, 1060
are B-numbers, and try to use this to factor 6649. (Compare
with Example 15.) Why does it fail?
A0) Use the continued fraction factoring algorithm to factor
(a) 943,
(b) 2077.
A1) Pretend that you overlook the fact that 286 is an even number.
Go ahead with the continued fraction factoring algorithm and see
what happens.
Remarks 437
R emarks
1) For historic information on the Pell equation we refer to Perron's
book [Perr54, Sektion 27]. As remarked there, the name Pell equa-
equation is misleading. It was Euler, who incorrectly believed that a
method, used by Wallis, was due to the contemporary mathemati-
mathematician Pell.
2) The continued fraction method for factoring integers, described in
the present exposition, goes back to 1931, to D. H. Lehmer and
R. E. Powers [LePo3l], but was for a long time regarded as being
of little practical value, because of its fallibility, and was not used.
Towards the end of the 60's John Brillhart suggested that the
advent of electronic computers might have changed the practical
basis for the use of the method. He was in many ways supported
by Lehmer and Donald Knuth. Together with Michael Morrison he
worked out the method, programmed and tested it and attacked,
by use of the IBM 360/91 at UCLA, the seventh Fermat number,
Fj — 2l28 +1, a 39 digit number. They succeeded in factoring it on
September 13,1970. This was the first successfull attempt to factor
ivV, although it had been known since 1905 that it was not a prime.
Brillhart and Morrison published a description of the method in
1975 [MoBr75]. They wrote in their paper that it took 90 minutes
(over a period of 7 weeks) to factor F-j and that this most likely
could be pushed down to 50 minutes, without corning anywhere
near the factoring of F«, though. They emphasized strongly the
importance of the "small" Af (mod n).
The method has been followed up in different ways. On one
hand running time analyses have been made [PoWa83], [Wund79],
[Wund84], often heuristic. On the other hand, the method has
been modified. One version is Cfrac. In the papers [PoWa83]
and [Wund85] implementations of Cfrac on parallel machines are
described.
3) Lenstra and Manasse in April 1989 completed the factorization
of a 106 digit number. This was done by using 80 Firefly multi-
multiprocessor workstations in California, and by borrowing computers
all over the world through electronic mail. They estimated that
this would have taken a century on one processor operating at 1
438 Chapter IX. Some applications in number theory
million operations per second. These informations were given by
Carl Pomerance in a talk at the American Mathematical Society
Short Course in Cryptology and Computational Number Theory
in Boulder, Colorado, August 6-7, 1989. In the same talk he also
mentioned that Alford and Pomerance, by using 140 Zenith PC's
have factored 95 digit numbers through nights and week-ends over
a 5 week period, corresponding to 2 years on a processor as the
one mentioned above.
The use of continued fractions for factoring numbers is likely to
become history pretty soon. Other methods are about to take over,
such as the quadratic sieve method and the elliptic curve method.
But the Morrison-Brillhart method, being essentially the Lehmer-
Powers continued fraction method with factor bases introduced to
combine the congruences in an efficient way, certainly deserves its
place in a chapter of applications of number theory as well as in
the history of factoring numbers. Carl Pomerance, in his Boulder
talk, said: "It can be safely said that the Morrison-Brillhart paper
began the modern era of advances in factoring."
References
[LePo31]
[MoBr75]
[Perr54]
[PoWa83]
[Kobl87] N. Koblitz, "A Course in Number Theory and Cryptog-
Cryptography", Graduate Texts in Mathematics, Springer-Verlag,
Berlin A987).
D. H. Lehner and R. E. Powers, On Factoring Large Num-
Numbers, Bull. Amer. Math. Soc, Vol. 37 A931), 770 776.
M. A. Morrison and J. Brillhart, A Method of Factoring and
the Factorization of F7, Math, of Comp., Vol. 29 A975),
183-205.
0. Perron, uDie Lchre von den Kettenbriichen", Band I,
Dritte Aufl., B. G. Teubner, Stuttgart A954).
C. Pomerance and S. S. WagstafF, Jr., Implementation of
the Continued Fraction Integer Factoring Algorithm, Pro-
Proceedings of the 12th Winnipeg Conference on Numerical
Methods and Computing, A983).
[Ries85] H. Riesel, "Prime Numbers and Computer Methods for
Factorization", Birkhauser, Boston A985).
[Seid46] L. Seidel, Untersuchungen u'ber die Konvergenz und Di-
vergenz der Kettenbruche, Habilitationsschrift Miinchen
A846).
[Schr86] M. R. Schroeder, "Number Theory in Science and Com-
Communication", Second Enlarged Edition, Springer Series in
Information Services, Springer-Verlag, Berlin A986).
439
440 Chapter IX. Some applications in number theory
[Ster48] M. A. Stern, Uber die Kennzeichen der Konvergenz eines
Kettenbruchs, J. Rcine u. Angew. Math. 37 A848), 255-
272.
[Wund79] M. C. Wuiiderlich, A Running Time Analysis of Brillhart's
Continued Fraction Factoring Method, "Number Theory,
Carbondale 1979", Lecture Notes in Mathematics 751,
Springer-Verlag, Berlin A979), 328-342.
[Wund84] M. C. Wunderlich, Factoring Numbers on the Mas-
Massively Parallel Computer, Advances in Cryptology (David
Chaum, ed.) A984), 87-102.
[Wund85] M. C. Wunderlich, Implementing the Continued Fraction
Factoring Algorithm on Parallel Machines, Math, of Com-
Computation, Vol. 44 A985), 251-260.
Chapter X
Zero-free regions
About this chapter
The largest part of the present chapter deals with the problem of find-
finding zero-free regions for sequences of polynomials, given by three-term
recurrence relations. The close connection between continued fractions
and the three-term recurrence relations makes it natural to try contin-
continued fractions as a tool in determining such zero-free regions. The main
purpose of this part is to give examples of how this can be done. In
those examples we will mainly see what can be done by direct use of
established results on continued fractions, i.e. we will illustrate an ap-
approach based upon "continued fraction attitude" and basic knowledge
of continued fractions.
There are, on the other hand, some continued fraction based methods
that are tailor-made (and often very fit) for certain important special
sequences. It is beyond the scope and the purpose of this little chapter
to include a discussion of those methods, let alone bring a "catalogue"
of them. We have included some good references and also a little sub-
subsection, where two such methods are briefly described.
The chapter finally contains a brief discussion of stability of polynomials
including a continued fraction test for stability.
441
442 Chapter X. Zero-free regions
1 Zero-free regions for certain sequences of po-
polynomials
1.1 Introduction
The problem of locating zeros of polynomials is important in mathe-
mathematics and applications of mathematics. Up to and including degree
4 the problem can be solved by explicit formulas (although not always
very manageable). From degree 5 on, however, this can only be done
in special cases. Hence numerical methods are needed. Often we use a
combination of some general result on location of zeros and an algorithm
for the actual determination of the zero. A reference on location of zeros
is volume 1 of [Henr86].
In the present section we shall be concerned with the problem of finding
zero-free regions for polynomials given by certain three-term recurrence
relations. We shall briefly list some of the familiar examples of such
sequences of polynomials, all given by recurrence relations of the form
with some initial conditions, and where bn and an are polynomials of
low degrees.
Tchebycheff polynomials:
First kind:
Tn(z) = 2zTn_1(z)-T,l_2(z), n>2,
= 1, Tx(z) = z.
Second kind:
Un{z) = 2zUn-l(z)-Un-2(z),
Legendre polynomials:
Pn{z) = B-^jzPn^(z)-^l-^jPn.2(z), n>2,
Zero-free regions for certain sequences of polynomials 443
= 1, Pi(z) = z.
Laguerre polynomials:
() ( i) ()n>2,
Hermite polynomials:
IIn{z) = 2zHn_x(z)-2{n-l)Hn_2{z), n > 2,
H0{z) = 1, H
The close connection between continued fractions and three-term recur-
recurrence relations makes it natural to try continued fraction techniques in
the search for zero-free regions for such sequences of polynomials. And
indeed, this has been done. We refer to the recent survey article by de
Bruin, Gilewicz and Runckel [BrGR87]. There some of the most im-
important techniques are described (also for some polynomials satisfying
a fc-term recurrence relation). The article contains an extensive bibliog-
bibliography.
Many of the papers referred to in [BrGR87] establish continued fraction
like methods for the purpose of determining regions where the zeros must
be located, or equivalently: zero-free regions. We shall not do this here.
We shall give some examples of how direct use of established continued
fraction results can lead to results on zero-free regions. See [Waad88].
The idea is to use continued fractions K.(an(z) / bn(z)) where the polyno-
polynomials in question are canonical denominators of the approximants. The
key to the argument is then the following simple observation:
Lemma 1 Let An and Bn be the canonical numerators and denomina-
denominators of J?(an/bn), where all an ^ 0. Then
An + An-Xw and Bn
cannot vanish simultaneously for any w € C.
444 Chapter X. Zero-free regions
Proof : One way to see this is that
An + An^w
Sn(w) =
Bn -f
is a well defined, non-singular linear fractional transformation when all
an / 0. It also follows immediately from the determinant formula, since
= AnBn_i — J5nAn_i = —
n
Hence, if all an(z) ^ 0 for each z in some set jD, and V(z) is a value set
for K.{an(z)/bn(z)) for each z (E D with cx> ^ V{z)i then
J5n(z)
for w(z)
and thus Bn(z) -}- Bn-\(z)w(z) ^ 0 for z (E D. (Of course, one can
carry out the same type of argumentation if {Ki(^)} is a sequence of
value sets for J&.(cLn(z) I bn(z)) with oo 0 Vo(z). Sometimes numerators
of approximants are more convienient to use, rather than denominators.
The argument is the same, only with oo replaced by 0.)
As an illustration of this we shall first see what we can get "almost
for free" about the zeros of the polynomials above. We temporarily
disregard the knowledge we may have about their zeros.
Example 1 From the recurrence relations and the initial conditions we
sec that the TchebychefF polynomials of the second kind for n > 1 are
the denominators of the approximants of the continued fraction
-1 -1 -1
2z + 2z + 2z -\
By using Sleszynski-Pringsheim's theorem, Theorem 1 in Chapter I, we
find that for \z\ > 1 all approximants fn(z) = An(z)/Bn(z) satisfy
< 1. Since An(z) and Bn(z) cannot have any zero in common by
Zero-free regions for certain sequences of polynomials 445
Lemma 1, we may conclude that the polynomials Un(z) have no zeros
in the set given by \z\ > 1. (See also Example 9, Chapter I.)
Although this is already pretty good we can do better. We may restrict
the discussion to the disk \z\ < 1. By an equivalence transformation the
continued fraction changes to
2z
1 + 14-14-
Take any point z in the disk \z\ < 1 and off the real diameter, i.e.
z = rei0, 0<r<l, O<|0|<tt.
Then the point
1 - * e-2i0
4z2 4r2
is in the complement of the ray (—oo, — -j] of the negative real axis and
hence in some parabolic region Pa from the Parabola Theorem (Theorem
20 in Chapter 111). Hence the sequence of approximants will have all its
elements in (a bounded part of) the half plane Vn for this particular z.
(Here V(X is the value set for Pa as described in the parabola theorem.)
The classical approximants for the two equivalent continued fractions
are of course the same (although their canonical forms differ). Hence all
the approximants of K(—l/2z) are finite, and none of the denominators
can have a zero at that z. We thus conclude that the complement of the
interval ( — 1,4-1) of the real axis (complement w.r.t. C) is zero-free.
For later use we also observe that for each such z the sequence of ap-
approximants is bounded away from the boundary of the half-plane Vn, in
particular from — ^.
The first part (that \z\ > 1 is zero-free) can also easily be proved by
using Worpitzky's theorem. This is left as an exercise (Problem 1).
Observe that we, at no point in the proof, have used the fact that the
continued fraction is periodic. To use this would have given another
(but more special) way of finding a zero-free region.
446
Chapter X. Zero-free regions
For TchebychefF polynomials Tn of the first kind we find that they are
denominators of the approximants of the continued fraction
-1 -1 -1
which can be written
where
U = 2
z -
1
f 2z
1 H
4
+ 2z -f
1
z
-u1
1
: 2
- « • •
1
4*2
Since the expression in paranthesis is bounded away from — \ for each
z (fc ( —1,+1), we find again the same zero-free regions. The details are
left to the reader (Problem 2).
O
Example 2 The Legendre polynomials are easily seen to be the denom-
denominators of the approximants of the continued fraction
, where au — — A — — ) and bn = B ) z.
\ nj \ nj
From n = 2 on the Sleszyiiski-Pringsheim condition is satisfied in
\z+ IC (<*„/&„)
71=2
HK)
71
and thus all approximants of the continued fraction
Li
n—2 Dn
have absolute value < 1 when \z\ > 1. Hence the approximants of the
original continued fraction are all finite, and we have established \z\ > 1
as a zero-free region for the Legendre polynomials. (More information
on zeros can be obtained by using the parabola theorem or the limit
periodicty of the continued fraction.) By using the fact that the zeros
are all real (Chapter VII) we may conclude, that the zeros are all located
on the interval (—1, +1) of the real axis.
O
Zero-free regions for certain sequences of polynomials
447
Example 3 The Laguerre polynomials are denominators of the approx-
imants of the continued fraction
1-2+ K
n=2 2 -
1 + 2
n
Again by Sleszynski-Pringsheim's theorem we find that if 2 is such that
for all n > 2
1
n
i.e.
|2n - I - z\ > In - 1,
which holds iff $l(z) < 0, then the values of the approximants of
oo an
K IT
n=2 Dn
are all of absolute value strictly less than one. Since |1 — z\ > 1 when
$l(z) < 0, all approximants of the continued fraction A.1.1) are finite
when $l(z) < 0, and hence the closed left half plane is established as a
zero-free region for the Laguerre polynomials. (Again, by using the fact
that the zeros are real, we find now that they are all positive.)
O
Example 4 The llermite polynomials are the successive denominators
of the approximants of the continued fraction
This can be written
1 -2
22+ 22 -f
-2
1 4z2
1 +
-4
-22
1
22
-4
422
1
-6
+ 22+---*
-6
422
+!+••¦
If 2 is not real, the elements
-2 -4 -6
422 ' 422 ' 422 '
448 Chapter X. Zero-free regions
are located equidistantly along a ray ^ negative real axis from the ori-
origin to infinity. From Theorem 20 in Chapter III it follows that the
approximants of
-2 -4 -6
1 + 1 + 1 +•¦•
all are finite and located in the half plane Va, where
2a =
and hence bounded away from — 1. The approximants of
_1_ -2 -4 -6
22+ 2z + 2z + 2z H
are thus all finite, and the set consisting of the open upper and lower
half-planes (i.e., C minus the real axis) is a zero-free set for the llermite
polynomials.
O
It is well known that for TchebychefT polynomials of both kinds, as well
as for Legendre polynomials all zeros are located on the interval (— 1, -f-1)
of the real axis. The Laguerre polynomials have all their zeros on the
positive real axis, and the Hermite polynomials on the whole real axis.
Observe that in the TchebychefF and Legendre cases the method we used
gave us (as zero-free regions) the whole plane, except for the segment
(or line) where all zeros are located.
1.2 An application of Van Vleck's theorem
The following theorem is due to Runckel [Runc86]. He proved it by using
a continued fraction technique. We present an alternative (continued
fraction based) proof.
Theorem 2 Let {BTl(z)} be the sequence of polynomials, given by the
recurrence relation
Bn{z) = (gnz + Jin)tfn_i (z) + Bn-2{z), n > 1, A.2.1)
Zero-free regions for certain sequences of polynomials
449
with initial values
A.2.2)
Let furthermore ^R(hn) > 0 for all n, and a < arg^n < C for all n,
where j3 — a < ?r. Then Bn(z) ^ 0 for all n > 0 when z is in the angular
opening
A.2.3)
7T
-a- - < argz < -
Figure 1.
Proof : Bn(z) is the (canonical) denominator of the nth approximant
for the continued fraction
+ hA +g2z -f h2 +• •
From van Vleck's theorem, Theorem 2 in Chapter I, we know that if all
guz + hn are in the angular opening
-- + e < arg(ynz + hn) < - - c, 0 < e < - ,
then all approximants are finite, and located in the angular opening
7T
450 Chapter X. Zero-free regions
Here we are merely interested in the finiteness of An(z)/Bn(z), from
which it follows that Bn(z) ^ 0 for all n.
Assume now that, for all n > 1,
\ z + hn) < |. A.2.4)
Then to each TV there is an e^v > 0, such that the condition holds with
e/v for all n < TV, and hence Bn(z) ^ 0 for n < TV. Since TV is arbitrary
we have that A.2.4) implies Bn(z) ^ 0 for all n.
Now, from the conditions of the theorem it follows that
~2 < arg@»z) < 2 for ~ a ~ 2 < Mg Z "^ ~^ + 2 '
and since 3ft(/in) > 0 we have
~2 < ™g(9nZ + hn) < -,
and the theorem is proved. ¦
Observe that the more we know about grn i.e. the more narrow the
angular opening for gn is, the more we can say about zero-free regions.
Assume that for a fixed 7 we have arg^n — 7 for all n. Then a and C
can be chosen arbitrarily close to 7 (a < 7 < C). From Theorem 2 it
follows that for all n, Bn(z) ^ 0 in the angular opening
(-7 -«) + €< arg z < (-7 + -) - c
for any e > 0, and we get
Corollary 3 // in Theorem 2 the condition on arg gn is replaced by
arg#n = 7 for all n, A.2.5)
then Bn(z) ^ 0 /or all n> 0 w&en z ?s m //ie /ia// plane
-7-| <argz<-7 + |. A.2.6)
J/; in particular, all gn are positive, the right half plane is a zero-free re-
region for all Bn(z). If all gn are purely imaginary with positive imaginary
parts, the lower half-plane $s(z) < 0 is zero-free.
Zero-free regions for certain sequences of polynomials 451
1.3 An application of the parabola theorem
We shall use the parabola theorem (Theorem 20 in Chapter IIT) to ob-
obtain results on zero-free regions. Again, we are only interested in the
finiteness of the approximants, not convergence or the value set in itself.
We shall here be interested in polynomials given by recurrence relations
of the form
Bn(z) = Bn.^ + a^Bn^z), n>l, A.3.1)
= 0, J?0(z) = l, A.3.2)
where all an ^ 0. It may seem more natural to study the problem
with anz instead of anz2. But if the latter is solved, it is a simple
transformation to obtain the solution of the problem with anz. The
reason for choosing anz2 is partly that the results are geometrically
more appealing.
The polynomials Bn(z) are the (canonical) denominators of the approx-
approximants of the continued fraction
9 1 9 1
c a-jz anzc
1 + 1 + 1 +•••+ 1 +•••
From the value set part of the parabola theorem we know that if all anz2
are in a parabolic region
|w;| < SR(^e^) + - cos2 6>
for some fixed 6 6 (—7r/2,7r/2), then all approximants An(z)/Bn(z) are
finite (and located in a half plane), and hence Bn(z) ^ 0 for z ^ 0. For
z = 0 we can see that Bn(z) = 1^0 for all n.
We shall, for a fixed 6 and a fixed ra, describe the set 5r,@), which is
such that z G SnF) iff anz2 is in the parabolic region described above:
SnF) is the set of all z, such that
\anz2\ < 3l{anz2e-2ie) + - cos2 9 A.3.3)
holds. With an = |an|e2^n, —?r/2 < ipn < tt/2, this transforms into
\z\2 < ft (z2e2i^-°)) + ^?JL. A.3.3')
'2 an
452
Chapter X. Zero-free regions
(Keep in mind that all an ^ 0.) With
this can be written
COS
>20
or
cos 6
A.3.4)
This shows that SnF) is a parallel strip, as illustrated in Figure 2.
Figure 2.
Obviously all Bn(z) ^ 0 when
oo
z g sF) = f] sn@).
n=2
Since this statement is true for all 6 E (—7r/2,7r/2), the set 5 =
the union taken over all 6 6 (—7r/2,7r/2), is a zero-free region for the
sequence {Bn(z)} of polynomials.
Zero-free regions for certain sequences of polynomials
453
This describes a method for determining zero-free regions for the se-
sequences {Bn(z)} given by recurrence relations and initial condition at
the beginning of this section. But unless we know more about the coef-
coefficients, this method is hardly more than a "pre-method". In the next
subsections we shall study two special cases where the determination of
5 can be carried out to an explicit, simple result.
The Stieltjes case
We are still studying the sequence {Bn} from Subsection 1.3, but now
with the condition
an > 0 for all n.
The reason for calling this the Stieltjes case is that the continued frac-
tions
Figure 3.
00 anw
n=l 1
a
n
0
A.4.1)
are Stieltjes fractions (as in Chapter Til, Subsection ^.5). Let
A = sup an .
n>2
454
Chapter X. Zero-free regions
Then, if A < oo, the set S@) is the parallel strip
I V /I —
as illustrated in Figure 3.
A.4.2)
Observe that the strip intersects the real axis at an angle of #, and that
the boundary lines intersect the imaginary axis in the points ^zi/By/A),
regardless of 6. If A — oo, the strip degenerates to a line. In both cases
we find that the zero-free region
A.4.3)
is the whole plane C minus the two cuts from i/By/A) to oo along the
positive imaginary axis and from —i/By/A) to oo along the negative
imaginary axis. An illustration (with A < oo) is shown in Figure 4.
Figure 4.
Another way of phrasing this result is: All zeros of Bn(z) are located on
the imaginary axis at distance > l/B\/~A) from the origin. (For A — oo
this means the whole imaginary axis minus the origin.)
Zero-free regions for certain sequences of polynomials 455
We shall now replace the z2 by w. It follows immediately that all Bn(z)
are polynomials in z2 and hence in w. Let Gn denote those polynomials:
Gn(w) := Bn(yfiS).
(Branch of y/w is insignificant, since we never have y/w to an odd power.)
The mapping
w = z
maps the two z-cuts onto the w;-cut
-oo, —
' 4A
on the negative real axis, and the rest of the plane to the complement
of the cut (or rather: two copies of it). We thus have the result:
Theorem 4 Let {Gn(w)} be the sequence of polynomials, given by the
recurrence relation
Gn(w) = Gn-\ (w) + anwGn-2{w), n > 1 , A.4.4)
and the initial values
G-i{w) = 0, G0(w) = 1. A.4.4')
If an > 0 for all n > 2, then all Gn(w) ^ 0 in the cut plane
ec;
arg \w + —J < ttJ , A.4.5)
where A — sup an, 0 < A < oo.
The zero-free region is illustrated on Figure 5.
Theorem 4 is a well known result for Stieltjes fractions [IIePf66]
oo anw
K -V" , an > 0 .
n=l 1
Moreover, it is really not much more than a restatement of Remark 2 to
Theorem 20 in Chapter III, the parabola theorem.
456
Chapter X. Zero-free regions
Figure 5.
1.5 The case when an G
The "pre-method" of Subsection 1.3 can be carried out to a method
under additional conditions on an, like e.g. in the last subsection, where
all an > 0. Another natural condition is to require a fixed value, not
necessarily 0 for the argument of all an, or, more generally: to require
argan to be in a given finite set. We shall here let arga.n ? {0,7r}; that
is, all an are real. We are thus interested in the sequence {Bn(z)} of
polynomials, given by the same recurrence relation and initial values as
in Subsection 1.4,
Bn{z) =
n
= 0,
A.5.1)
A.5.2)
but where we now require an G R (instead of an > 0). Tf the set of
negative an is empty, we are back to the situation discussed in Subsection
1.4- If the set of positive an is empty, we get back to the situation in
Subsection 1.4 by the transformation ( = iz. We shall thus, without
loss of generality, assume that neither the set of positive an nor the set
of negative an is empty (although we shall occasionally comment on it).
We want to determine the set SF) by taking the intersection of all the
Zero-free regions for certain sequences of polynomials
457
parallel strips 5ri@), see Subsection 1.3, in particular Figure 2. Let
supari =: A+,
sup(-an) =: A_.
Since the sets of positive an and negative an are both nonempty, we have
0 < i4f < oo,
0 < A- < oo.
Let S+@) be the intersection of all Sn@) with an > 0, and S-(#) the
intersection of all SnF) with an < 0. Then, just as in Subsection 1.4,
we find that S+(Q) is the parallel strip, given by
<
2,/A.
A.5.3)
(See Figure 3.) Almost the same way, but with zie l0, because ipn = — f
when an < 0, we find that S-F) is the parallel strip, given by
-io-
$s{zie-'u)\ = \Sl(ze-lU)\ <
cos 6
A.5.4)
We have SF) — S+@) f\ S-F), which is a rectangle, as shown in Fig. 6.
Figure 6.
458
Chapter X. Zero-free regions
The boundary lines of S+@) go through ±i/By/A+) (indicated by •),
and the boundary lines of S-@) go through ±1/B^^4T), (x). Thus, by
elementary geometry: When 6 varies, the corners of the rectangle will
describe four circles with the four line segments
from
from
from —
and from %
2y/A +
as diameters. Only the semicircles between any two neighboring points
of the four indicated ones and not going through the origin are of interest
to us. The zero-free set
= U5@).
0
is the set bounded by the four semicircles described below (see Fig. 7).
Figure 7.
The result is more simply expressed by switching from z to l/z, since the
four circles then are transformed into straight lines. We replace Bn(l/z)
by Dn(z):
Zero-free regions for certain sequences of polynomials
459
Theorem 5 Let {Dn(z)} be the sequence of polynomials, given by the
recurrence relation
an ? 0, A.5.5)
and the initial values
= 0, D0(z) = L.
A.5.6)
// an 6 R for all n, and there is at least one positive and one negative
an, then the zeros of all Dn(z) are all in the closed parallelogram with
corners in
±2iy/A+ and ±
where A+ = supan; yl_ = sup(—an).
In Figure 8 the zeros are in the "white" region (which is bounded in
this case). The indicated points correspond to the ones in Figure 7, by
z —» l/z.
Figure 8.
460 Chapter X. Zero-free regions
Remark: So far we have used value sets in the argument, in the way
described just before Example 1. Another way of doing it would be to
use critical tail sequences, since hn(z) = Bn(z)/Bn-\(z), where we know
that Bn(z) and Bn_i (z) can not have any common zeros if all an(z) ^ 0.
If {Vn} is a sequence of value sets for J?(an/bn), then
-hm <?Vm => — hn $ Vn for all n > m.
This follows since sn(Vn) C Vn_, => Vn C *-l{Vn-}) => S-l(C\Vn^)C
C \ Vn and since { — hn} is a tail sequence so that —hn — s"^ —/in_i).
1.6 A fundamental recurrence formula
The recurrence relation
Pn(z) = (Z- Cn)Pn_, (z) - \nPn-2{z) , A-6-1)
An ^ 0, P-\(z) = 0, Pq{z) = lj is of great importance in the theory
of orthogonal polynomials. On the one hand, if to a given quasi-definite
moment functional (Chapter VTT, Section 1) Pn{z) are the corresponding
monk orthogonal polynomials, then there exist constants cn and An ^ 0,
such that A.6.1) holds. On the other hand, by Favard's theorem (Chap-
(Chapter VII, Theorem 2), any sequence {PnB)} satisfying some recurrence
relation A.6.1) (with all Ar, ^ 0 and including the initial conditions) is
an orthogonal polynomial sequence for some linear functional.
Since the question about location of zeros of orthogonal polynomials is
important, it is of interest to describe procedures leading to information
about zeros of the polynomials in A.6.1). We shall here restrict ourselves
to the very simplest types of argument.
Observe first that for all n > 0 the polynomials Pn{z) are the canonical
denominators of the Jacobi continued fraction
XA A
Z- C1+Z-C2+---+2- CrI+...
If we assume z to be different from all cn this continued fraction is
Zero-free regions for certain sequences of polynomials 461
equivalent to
Ai —A^ — \n
z-ci (z-Cl)(z-c2) (z - cn_v)(z - cn) A6 3)
1 + 1 +•••+ 1 H '
By using well known element/value set results we find zero-free regions
by using the principle stated earlier in this chapter. From Worpitzky's
theorem we know that if z is such that
A
n
. < - for all n > 2,
(z - cn-i)(z - cn) 4
then all approximants of A.6.3) are finite, and thus all Pn{z) are 7^ 0.
A special case of this is the following result which also can be found in
[Wall48, Thm. 26.2] (proved in a different way):
Proposition 6 If in A.6.1) 0 < |An| < M2 for all n > 2 and \cn\ < N
for alln > 1, then all zeros of Pu{z) are located in the disk \z\ < 2M+N.
Proof : For \z\ > 1M + N we have z / cn for all n and
A,
ln
(z - cn^i)(z - cn)
BM + N - N){2M +N - N) 4
Hence the result follows from Worpitzky's theorem. ¦
We can also base our arguments on the parabola theorem, as we did in
Subsections 1.3-1.5. if, for some 6 E ( — f, f) and all n > 2
\dn(z)\ - RKOOe-™) < ^ cos2 9, A.6.4)
with
then Prt(z) 7^ 0 for all m. A special example of this is as follows, [Jaco89,
Cor. 3.4]:
Proposition 7 // in A.6.1) all cn = 0 anrf all \n are real, ^ 0 and
An < M2, then all zeros of Pn(z) are contained in the strip
<2M.
462 Chapter X. Zero-free regions
Proof : For 5ft(z) > 0 take 0 = - arg z, and for 3?(;z) < 0 take 0 =
it — arg z where the value of arg z is taken to give — ^ < 0 < ^. In both
cases we find in A.6.4):
Left hand side =
\z\
Right hand side = - .
II
From this the conclusion of Proposition 7 follows immediately. ¦
1.7 Chain sequences
As in the previous subsection we shall consider monic orthogonal polyno-
polynomials {Pn{z)} satisfying the recurrence relation A.6.1) for some cri G C,
An G C, Xn ^ 0. We plan to use the following part of the parabola
sequence theorem, Theorem 21 in Chapter III: Let — *¦ < 0 < 5 and
{gn}%Lo be fixed numbers such that 0 < go < 1 owrf 0 < gn < 1 /or
n = 1, 2,3, // \an\ — 8l(ane~l20) < 2^rl_i(l — gn) cos2 6 for all n > 2,
then the approximants An/Bn o/K(an/l) are all finite. We continue
to use the notation dn(z) as in A.6.5) for z ^ c*. for all &. We get:
Proposition 8 Let {Pn(z)} 6e ^zven 6j/ A.6.1) with F_i(z) = 0
= 1. Then Pn(z) ^ 0 for all n G N /or all z ? C such that
\dn(z)\ < gn-i{l - 9n) for n= 2,3,4,... A.7.1)
where 0 < gn < 1 for all n.
Proof : According to the parabola sequence theorem with 0 = 0, we
have that An(z)/Bn(z) ^ oo for all n > 0 if c?i(z) ^ oo and
for 72 > 2.
This holds in particular if |cfn(z)| + |c?rt(-z)| < 2^n_i(l — gn)', i.e. if A.7.1)
holds. ¦
Zero-free regions for certain sequences of polynomials 463
Remarks
1. A sequence {/3n} = {A -7n-iOn} where 0 < 70 < 1 and 0 < 7n <
1 for n = 1, 2,3,... is called a chain sequence. The sequence {7n}
is called a parameter sequence for {/3n}- (A parameter sequence for
a given chain sequence is not necessarily unique.) The condition
A.7.1) can therefore be interpreted as |dnB:)| < /?„ for some chain
sequence {/?„} (with parameter sequence such that 7n = 1 — gn).
This is a classical result in the special case where all An > 0 and
cn G R.
2. Let {/3n} be a chain sequence with parameter sequence {7n}- Then
-Pn = -(l-7n-iO« = -0n-i(l-0n) where gn - l-7n as above.
That is, {—gn}™=o or equivalently {7n — 1}?LO is a tail sequence
forK(-/3n/l).
The following lemma is a classical result which easily follows from the
continued fraction theory:
Lemma 9 Let {/3n} be a chain sequence, and let 0 < j3n < f3n for all n.
Then {/3n} is also a chain sequence.
Proof : Let {7n} be a parameter sequence for {/3n} and let gn = 1 — fn
for all n. We shall use the following part of the oval sequence theorem,
Theorem 26 in Chapter III, with all Cn = 0 and Rn = gn: If for all n
Vn is the disk \w\ < gn and En is the disk \a\ < gn-\(l — gn), then {Vn}
is a sequence of value sets for En. Since — f3n G Eu for all n, it follows
that the continued fraction K(—/3n/l) has a tail sequence {— gn} such
that —gn ? Vn for all n. Hence J3n = A — 7n-iOn where 7ri = 1 — gn for
all 72. ¦
A reformulation of Proposition 8 is therefore: All Pn(z) 7^ 0 for
z ? C such that \dn(z)\ is a chain sequence. This result appeared from
a rather rough application of the parabola sequence theorem. More
careful arguments yield:
464 Chapter X. Zero-free regions
Proposition 10 Let {Pn{z)} be given by A.6.1) with Xn G C \ {0},
cn e C, P-i(z) = 0 and Pu(z) = 1. Then Pn(z) ^ 0 for all n ? N for
all z 6 C such that {/3n}, with
Pn(z,V) = » 2~/j /or n = 1,2,3,...
Zi COS u
is a chain sequence for some 6, — ^ < 0 < |.
Remark: An equivalent expression for f3n{z,6) is
which is easier to check in some cases. (It does not matter which branch
of \/dn(z) we choose since the result is raised to the power 2.) For more
information we refer to [Jaco89].
1.8 Two theorems on zero-free regions
So far our aim has been to give examples of how standard continued
fraction results can be used directly to establish zero-free regions for
polynomials satisfying certain three-term recurrence relations. But it
would be strange to write a section on zero-free regions without including
some of the established results in the theory. We restrict ourselves to two
examples. In both cases the proofs make use of continued fraction type
arguments, essentially on value sets, as we have done in the more direct
approaches. Space does not allow for comparison between methods, but
examples will be included to illustrate the theorems. The first theorem is
the prominent Parabola Theorem by SafFand Varga [SaVa76] (not to be
confused with the parabola theorem in the analytic theory of continued
fractions).
Theorem 11 (Saff-Varga's Parabola Theorem) Let the polynomi-
polynomials qn(z) be defined by the recurrence relation
Qn(z) = (z + f3n)qn-\{z) - CLnzqn-*(z), n > 1 A.8.1)
Zero-free regions for certain sequences of polynomials 465
and the initial values
g_,(z) = O, 5b(*) = l, A.8.2)
where ctn > 0 /or 2<n<N,Pn>0 for 1 < n < N, and
DN := rnin{{Pn - an); 1 < n < N} > 0, A.8.3)
with ct\ = 0. T7ie7i ^B) ^ 0 for 1 < n < N and all z in the parabolic
region
{w G C; \w\ < $l(w) + 2?>/v} . A.8.4)
Example 5 Take in A.8.1)
n n
Then X>, = 2 and DN = I for all N > 2. From the Parabola Theorem
by SafT and Varga we find that the parabolic region, given by
M
is zero-free for all <7r,
With
if = u -+¦ zv , w, v G R,
the parabolic region can also be described by the following inequality:
v2 < 4(u+ 1).
Numerical examples:
n = 2 : q2(z) = z2 + 3z + 3
In Figure 10 these zeros are indicated by •.
n = 3 : g:,(z) = z:i + 4z2 + -^z + 4
Zeros:- 1.61, -1.19±1.03i
In Figure 10 these zeros are indicated by •.
466
Chapter X. Zero-free regions
Figure 9.
Figure 10.
_O
We refer to the survey article [BrGR87] and to the references therein,
in particular reference [32]. For applications and extensions we refer to
Remark 1 at the end of the chapter and also to Problem 10 there.
The second theorem is due to Runckel, [Runc84], see for instance the
survey article [BrGR87]. It gives an angular zero-free region under con-
Zero-free regions for certain sequences of polynomials 467
ditions which are the same as the ones in the SafF-Varga Parabola The-
Theorem.
Theorem 12 Let the conditions be as in Theorem 11. Let furthermore
ct
:= max Bn, Qn'= max ~^ ¦ A.8.5)
l<</V 2<n</V /3 V '
Then qn(z) ^ 0 for 1 < n < N in the angular opening given by
z = re1*, r>0, A.8.6)
A.8.7)
Bn
As an example, we shall apply this to the sequence earlier studied in
Example 5 by the SafF-Varga Parabola Theorem.
Example 6 Take in A.8.1)
1
, ft, = 1 + -
n n
Then
?>/v = 1 for JV > 2.
On = J
We find the following zero-free region: the angular opening z = rel<t>
where
i. e. | argz| < 1.738 .... This is in some respects much better than the
region obtained in Example 5 (for large z-values), in other respects it is
not as good (for small z-values).
O
468 Chapter X. Zero-free regions
2 Stable polynomials
2.1 Introductory remarks
In the present section we shall also present a continued fraction technique
for solving a problem on location of zeros. But this time it has to do with
one polynomial, not a sequence of polynomials, and the technique is also
completely different to the one used in the previous section. We shall be
aiming at a necessary and sufficient condition for a polynomial P(z) to
have all its zeros in the left half plane 5?(z) < 0. Such a polynomial is
called a stable polynomial, or a Hurwitz polynomial, in honor of Hurwitz,
who solved it for real coefficients. Stable polynomials are important
in the theory of differential equations and its application to vibration
problems. The following simple example illustrates the type of problems
which can be handled by using the results of the present section. The
differential equation
Ad2y dy
+ 4 + 6 +
has the general solution
y = deal +
where a, /3, 7 are the zeros of the polynomial
r3 -f- 4r2 + 6r + 4 = 0
(provided that they all are simple). If neither of the constants Ck is zero,
we have
y{t) —> 0 as t —> 00
iff the polynomial r3 + 4r2 + 6r + 4 = 0 is stable. This can of course be
checked by finding the roots, but it is most useful to be able to check it
without knowledge of the roots. The purpose of the present section is
to present a tool for such questions, a practical test for stability, based
upon continued fractions.
Let Q(z) — anzn + an_i zn~] + ... -f o-o^ am > 0 for m = 0,1,2,.. .n be
the given polynomial to be tested. We define
P(z) = an^zn~x + an
Stable polynomials 469
and find the continued fraction expansion of the form
P{z) _ 1 J_ J_ J_
Q(z) 1 + dlz + d2z + d2z-\ \-dkz"
if it exists. In order to make it dear how this is found, by successive
substitutions, we illustrate by some examples with small n:
n = 2
n = 1 :
P(z) =
P(z) a0
ao 1 + —z
«0
i ,
l-\ Z-\
a\
n = 3 : Q(z) = a:izA + a2z2
P(z) = a2z2 + a0 ,
a2z2 +
Q(z) a3z3 + a222 + ax z + a0
a0
1
03 ((aLa2
i z -\-
a2
If aia2 — ao<i;j = 0 we have
Q(*)
470 Chapter X. Zero-free regions
else we have
P(z) _ 1 1 1
«(*) " 1 + ^z , 4 ¦
2.2 Polynomials with real coefficients
The following theorem uses the described type of expansion in a test for
stability.
Theorem 13 Lei
Q(z) = zn + an-{zn~l + . .. + aiz + a{) B.2.1)
he a polynomial with real coefficients, and let
P(z) = an_] z" + an_:}z"~3 -{-... . B.2.2)
Then Q(z) has all its zeros in the open left half plane if and only if the
test function t(z) = P(z)/Q(z) can be written as a terminating continued
fraction
-\ \-duz
where dj > 0, 1 < j < n.
Proof of the "if- part": Assume that t(z) can be written in the form
:— —— ... —— , where d, > 0 for 1 < j < n.
1 + ^12 + ^22+ +dnz J ~ ~
Let z be an arbitrary complex number with $l(z) > 0. Let H denote the
closed right half plane defined by JR(iu) > 0, where the closure is taken
in C so that oo G //, and let sj be the linear fractional transformations,
defined by
for j > 2 .
djz
Stable polynomials
471
Straightforward computation shows that si(H) is the disk given by
w —
Figure 11.
where x = ^R(z). Since d± > 0 and x > 0 this disk is obviously contained
in the disk given by
1
W
1
2
Furthermore, for 2 < j < n, the following computation shows that Sj(H)
is contained in H: For z — iy we have sj(H) — H. For x = 3l(z) > 0 we
find that Sj(H) is the disk given by
w —
2djX
which is contained in //. Hence s 2 o .. .0 srl(H) C H, and thus
f 1 11
...o sn(H) C \w; w - - < -> .
P(z)
-f— = 5, OS2 O...O5n@)
Since
(the zero refers to the auxiliary variable iu) we have, for 5R(z) > 0,
P(z) 1
B.2.4)
472 Chapter X. Zero-free regions
This shows the finiteness of P(z)/Q(z). Since Q(z) and P(z) are the
canonical denominator and numerator of an approximant of a continued
fraction with partial numerators ^ 0, they can not vanish simultane-
simultaneously. Therefore it follows that Q(z) ^ 0. This step in the argument is
the same as the one used in the previous section. We have thus proved
that the conditions on the continued fraction expansion imply Q(z) ^ 0
for $t(z) > 0, or, in other words: Q(z) is stable.
For the proof of the "only-if"-part we refer to [JoTh80, Sec. 7.4], from
which also the essence of the proof above is taken. ¦
Example 7 For the polynomial in the beginning of this section,
we find
the
expansion
4r2
r
+
+
4
6r
4r2 +
+ 4 ~
6r +
1
4,
\r-\
1
1
and the theorem tells us that the polynomial is stable. This agrees
with the fact that the polynomial has the zeros rj — — 2, r-2 = — 1 + i,
r.j = —l — i\
O
2.3 Polynomials with complex coefficients
We shall state without proof the corresponding theorem for polynomials
with complex coefficients, just to give the flavor of the more general
situation. Also here we refer to [JoTh80, Sec. 7.4] for the proof.
Theorem 14 Let
Q(z) = 2n + an_,zn-1 +an_22n~2 + '- + alz + a0 B.3.1)
be a polynomial with complex coefficients
— ak + iffk ? Ai = 0,1,..., n — 1. B.3.1')
Stable polynomials 473
Let
P{z) = an^.lzn-i + z/3n_22n~2 + ccn-:izn~3 + iPn-^'4 + • • • • B.3.2)
f(z) is a stable polynomial if and only if the test function t(z) =
P(z)/Q(z) can be expressed as a terminating continued fraction of the
form
1 11 1
r~» B.3.3)
c,
and dj, > 0 , j = 1,2,..., n.
Let it finally be mentioned that Hurwitz, who was the first one to solve
the stability problem for polynomials in the real case, established a cri-
criterion for stability in terms of certain determinants of the coefficients.
474 Chapter X. Zero-free regions
Problems
A) Use Worpitzky's theorem to prove that the set \z\ > 1 is a zero-free
region for the Tchebycheff polynomials of the second kind.
B) Fill in the details in the proof that the TchebychefF polynomials of
the first kind have all their zeros in the interval ( — 1,1) of the real
axis.
C) Use Theorem 13 to decide which ones of the following polynomials
are stable:
(a) z3 + 6z2 + llz + 6.
(b) z3 - Iz - 6.
(c) zs + 7z2 + 16z + 10.
(d) z3 + 5z2 + 4z - 10.
In some of the "no-cases" we can see directly that the polynomial
is instable. How? Give a general statement.
D) What can be said about the zeros of the TchebychefF polynomials
of the second kind Un(z) merely by using Theorem 2. Hint: Study
the polynomials Vn(z), defined by
inVn(z) = Un(z).
E) Take in Theorem 2 gn = z, hn = 1 for all n. Compute the zeros of
Bi{z) and ^(z), and check that they are in the "right" region.
F) Let h be an arbitrary number with 5R(/i) > 0. Take in Theorem
2 gn — —1/h and hn — h for all n. Find out as much as possible
about the zeros of B7l(z), and check with the theorem.
G) Let{Gn(w;)} be the sequence in Theorem 4. Show by direct com-
computation, that the real solutions r of the equation
(i.e. all solutions) satisfy the inequality
r< i
and relate this to Figure 5.
Problems 475
(8) Let the sequence {DTl(z)} be defined as in Theorem 5, and let 02
be negative and 03 be positive. Apart from that we know that
a2 > -A- ,
a:i < A+.
Find by computation the set of all possible zeros of the function
(9) (a) Use SafF- Varga's parabola theorem, Theorem 11, to determine
a zero-free region for the polynomials qn(z) with /3n = 2,
an = 1. Sketch the parabola. Compute the zeros of qz{z)
and q:i{z), and indicate them on the same figure.
(b) Use also Theorem 12 to find a zero-free region.
A0) Let Pn(z) denote the nth degree partial sum of the Taylor expan-
expansion of the exponential function ez:
zn
+ 77 + + , n>0.
Let qn(z) be defined by
qn(z) = n\ Pn(z) for n > 0 .
Prove that the sequence {qn(z)} satisfies the conditions in SafF-
Varga's Parabola Theorem, Theorem 11, and use the theorem to
prove that for n > 2
Z2 Zn
in the parabolic region
See finally what you can find out about the zeros of Pn(z) by using
Theorem 12.
A1) Use Theorem 2 to prove that the denominator of the continued
fraction from Theorem 13,
1 , , , 7- , ,7-, dj > 0 , 1 < j < n,
1 + d\z-\-d'2z-\ -\-dnz
has all its zeros in the closed left half plane.
476 Chapter X. Zero-free regions
A2) Prove by computation that any polynomial
z2 -\- az -\- b , a,b real,
with both roots (possibly coinciding) in the open left half-plane is
such that its test function has a terminating continued fraction
1 -f d\ z -j- di z
with di > 0, d2 > 0.
A3) Prove a similar result as in Problem 12 for polynomials of
degree 3.
A4) Verify by direct computation Theorem 12 for N = 2.
Remarks 477
Remarks
1) The result obtained by solving Problem 10 is a very special case of a
whole family of problems that can be solved by using SafT-Varga's
Parabola Theorem. Tn fact: For a given formal power series where
the coefficients satisfy certain conditions (some Toeplitz determi-
determinants / 0) the Pade numerators Um^n or denominators Vm,n, in
both cases with a fixed m > 0 will, properly equipped with fac-
factors C(m,n), D(m,n) satisfy the recurrence relation A.8.1) when
an and Pn are chosen in the right way. With additional condi-
conditions satisfied, an and Cn will be positive and have a difference
Pn — an? bounded below by a positive number. In such cases
the SafF-Varga Parabola Theorem permits us to conclude that the
sequence of Pade numerators (or denominators) is zero-free in a
parabolic region. Partial sums of power series are special cases of
Pade approximants. In Problem 10 we had the partial sums of the
Taylor series for ez at 0, and the factor that made them satisfy the
recurrence relations was n\.
Another interesting application of SafF-Varga's theorem is for gen-
generalized Bessel polynomials, where remarkably good results have
been obtained (a region bounded by a cardioid, where every point
on the cardioid is a point of accumulation for the relevant zeros).
H.- J. Runckel has generalized the Parabola Theorem substantially,
to the complex case, and applied it to Bessel polynomials, Bessel
functions and Lommel polynomials.
Gilewicz has studied recurrence relations where the second coeffi-
coefficient has higher degree. De Bruin has studied M-term recurrence
relations, in particular 4-term relations. For all these results we
refer to the earlier mentioned survey article [BrGR87] and to the
references mentioned there.
2) If, in Subsection 1.5, we change the definition for A+ and A_
slightly, the results will also cover the cases when the set of all n
with an < 0 or the set of all n with an > 0 is empty. The modified
definition:
A+ := max{0, supan} , A_ := max{0, sup(—an)}.
478 Chapter X. Zero-free regions
If the sets of positive and of negative an are both ^ 0, we have
A+ = j4+, j4_ = A_. If the set of negative an is empty, A_ = 0,
S-(8) is the whole plane and S@) — S+@) = a parallel strip. If
the set of positive an is empty, we have A+ = 0, and S ^.@) — C,
S@) = S-@)i another parallel strip. In either case the rectangle
in Figure 6 is replaced by one of the two parallel strips, and the
union of the sets S@) will in one case be as in Subsection 1.4, in
the other case of the same form, except that the omitted slits are
on the real axis. (Expand the disks on Figure 7, either with • fixed
or with • fixed.) In Theorem 5, the parallelogram collapses to a
segment on the imaginary axis if A- = 0 and on the real axis if
A+ = 0.
3) Among other results on stability of polynomials we choose to refer
to the recent papers [IsKi83] and [Isma85].
References
[BrGR87]
[CuWu87]
[Henr86]
[HePf66]
[Hurw95]
[IsKi83]
[Isma85]
M. G. De Bruin, J. Gilewicz and H.-J. Runckel, A Sur-
Survey of Bounds for the Zeros of Analytic Functions obtained
by Continued Fraction Methods, "Rational Approximation
and its Applications in Mathematics and Physics, Pro-
Proceedings, Laricut 1985", (J. Gilewicz, M. Pindor and W.
Siemaszko eds.), Lecture Notes in Mathematics No 1237,
Springer-Verlag, Berlin A987), 1-23.
A. Cuyt and L. Wuytack, "Nonlinear Methods in Numer-
Numerical Analysis", North Holland Mathematics Studies 136,
Amsterdam A987).
P. Henrici, "Applied and Computational Complex Analy-
Analysis", I, II and III, J. Wiley & Sons, New York A974, 1977,
1986).
P. Henrici and P. Pfluger, Truncation Error Estimates for
Stieltjes Fractions, Numer. Math. 9 A966), 120-138.
mm
A. Hurwitz, Uber die Bedingungen unter welchen eine
Gleichung nur Wurzeln mit negativen reellen Teilen besitzt,
Math. Annalen 46 A895), 273-284.
M. Ismail and 11. K. Kim, A Simplified Stability Test for
Discrete Systems Using a New z-Domain Continued Frac-
Fraction Method, IEEE Trans. Circuits Sys., Vol. CAS-30 (July
1983), 505-507.
M. Ismail, New z-Domain Continued Fraction Expansions,
IEEE Trans. Circuits Sys., Vol CAS-32 A985), 754-758.
479
480
[Jaco89]
[Jofh80]
[Lange86]
[Runc84]
[Runc86]
[SaVa76]
[WalM8]
[Waad88]
Chapter X. Zero-free regions
L. Jacobsen, Orthogonal Polynomials, Chain Sequences,
Tree-term Recurrence Relations and Continued Fractions,
"Proc. of the Conference on Computational Methods and
Function Theory, Valparaiso 1989", (St. Ruscheweyh, E.
B. Saff, L. C. Calinas, R. S. Varga eds.), Lecture Notes
in Mathematics 1435, Springer-Verlag, Berlin A990), 89-
101.
W. Jones and W. J. Thron, "Continued Fractions: Ana-
Analytic Theory and Applications", Encyclopedia of Mathe-
Mathematics and its Applications 11, Addison-Wesley Publish-
Publishing Company, Reading, Mass. A980). Now distributed by
Cambridge University Press, New York.
L.J. Lange, Continued Fraction Applications to Zero Loca-
Location, "Analytic Theory of Continued Fractions II", (W. J.
Thron ed.), Lecture Notes in Mathematics 1199, Springer-
Verlag, Berlin A986), 220-262.
II.-J. Runckel, Zero-free Regions for Polynomials with Ap-
Applications to Pade Approximants, "Constructive Theory of
Functions, Proceedings of the International Conference on
Constructive Theory of Functions, Varna 1984", Publishing
House of the Bulgarian Acad. Sci., Sofia A984), 767-771.
II.-J. Runckel, Pole- and Zero-free Regions for Analytic
Continued Fractions, Proc. Amer. Math. Soc. 97 A986),
114-120.
E. B. Saff and R. S. Varga, Zero-Free Parabolic Regions for
Sequences of Polynomials, SIAM J. Math. Anal. 7 A976),
344-357.
H. S. Wall, "Analytic Theory of Continued Fractions", Van
Nostrand, New York A948).
H. Waadeland, Some Recent Results in the Analytic The-
Theory of Continued Fractions, "Nonlinear Numerical Meth-
Methods and Rational Approximation", (Annie Cuyt ed.), Rei-
del Publ. Co., Dordrecht, Holland A988), 299-333.
Chapter XI
Digital filters and
continued fractions
About this chapter
Linear system theory is a field where rational approximation is an im-
important tool, and techniques from Pade theory have been quite useful
in many different ways. In the present chapter we have tried to give a
little taste of this. We have been rather restrictive, first of all by limiting
the description and discussion to digital filters, essentially stable digital
filters. And out of the many applications of Fade and continued fraction
theory we have only included two: Stability test and model reduction.
Again, as in other chapters, for instance the number theory chapter, we
see an example where "old mathematics" (the Schur algorithm) proves
useful in "new theory". This is the case for both applications included.
481
482 Chapter XL Digital filters and continued fractions
1 Filters and their representation
1.1 Some introductory examples
Example 1 Let x be a real-valued, continuous function of a real vari-
variable t, defined on some interval of R. Let furthermore aj0, SB), X2v • -be
values of the function at equally spaced, increasing values of t. Without
loss of generality we may assume that x(t) is defined on [0,oo), and
that xn = x{n), n = 0,1,2, Define the sequence {yn}^-o by
2/o = 0, 2/nfi = 2/n +-
Then we have
^ ^ y'2 = 2
and generally for n > 2
Vn = ^o + 2ii + • • • + 2an_l + xn) ,
i.e. yn is the trapezoid formula approximation to the integral
/ x(t)dt,
Jo
with sub-intervals of length 1. Similarly we find, that with
zq = 0 zn+\ — zn + xn+\ , n > 0,
zn is the Riemann sum
with ti = i, i = 0,1,2,.. .,7i, t* = f,-, i > 1, for the same integral.
( At = 1.)
Simpson's formula may also be described in a similar way:
= 0 Wn+, = Un + -
o
Filters and their representation 483
un is then the Simpson approximant (with sub-intervals of length 1) to
the integral
p2n+2
/ x(t)dt.
<0
Example 2 Let x be a real-valued, continuous function of a real vari-
variable t, denned for all t G R- Let {»n}2foo be measured values of x,
possibly containing noise (the word being used intuitively). Let
be denned by
oo
— oo
1
Vn = ^
0
Tn this case the transformation {xn} —> {2/«} is used as a "smoothing
process" for measured values of functions. Other averaging processes
can also be used, generally
Vn = Yl aix"+i» where a, > 0, ^ a,; = 1.
<0
Tn these examples a sequence {xn} was given, and another sequence {yn}
was computed from formulas
oo oo
Vn =
k=—oo A;=l
where in the examples all c^ and djt, except finitely many, are 0. Often,
as in Example 1, xp = yp = 0 for p < 0. Such a formula is often referred
to as a digital filter. The word /i/Jer comes from electrical engineering,
where filters are used to transform signals from one form to another,
in many cases to remove noise. The examples above merely indicated
applications within mathematics itself (although strongly directed to-
towards applications). But the scope of applications is very wide, and
includes such diverse fields as astronomy, economics, medicine, radar
technology, seismology and speach processing. It generally is concerned
with extraction and enhancement of information contained in a sequence
484 Chapter XL Digital filters and continued fractions
of measurements of continuous waveform phenomena. In many of the
applications the variable t is time but not in all. For a short survey of
applications (key words and some comments) we refer to the introduc-
introductory section in the article [JoSt82]. As for books on the subject we refer
to [Hamm77] and [OpSc75].
We shall not go into specific applications. The problems to be discussed
here will be common to several applications of digital filters, and will
illustrate how analytic theory of continued fractions can be used in this
field.
We shall in the rest of the chapter use a much more restricted definition
of a filter than the one given by the formula in the present section.
1.2 Digital filters
As mentioned in the previous section many different problems and situa-
situations give rise to the concept of a digital filter. A digital filter is a device
mapping a sequence of complex numbers into a sequence of complex
numbers. It is a linear mapping. Before presenting a proper defini-
definition we need to introduce some notation and basic concepts concerning
sequences.
We shall let / denote the set of all sequences {dn}^Lu °f complex num-
numbers. With the standard operations, addition:
A-2.1)
and multiplication by a scalar.
c{an} = {can} , A.2.2)
/ is a linear space. We shall need two additional operations, convolution:
{an} * {&„} := I ? akbn-k I , A.2.3)
and unit delay:
D{an} := {a'n} , where a[} = 0, a'n = an-i for n > 1. A.2.4)
Filters and their representation 485
With eacli sequence A = {a^l^Lo we associate a formal power series
a(z) by
oo
a(z) = Y^anZ~n . A.2.5)
This mapping is usually written in the following way:
a(z)»-oA. A.2.6a)
The mapping A o—• a(z) shall be referred to as the z-transform and
often be written
A m a(z). A.2.6b)
The operations on the sequence correspond to operations on the formal
power series in the following way:
A -f B o—• a(z) -f- b(z), cA o—• ca(z),
DA o—• z~* • a(z), A * B o-1* a(z) • b(z).
Here a(z) - b(z) denotes the Cauchy product of the two formal power
series:
F0 + bxz~] + b2z~2 + •-
x + (a2b0 + ai6, + ao62J:~2 + • • • •
Observe that DA also can be written
A * B , where ? = @,1,0,0,0,0,...).
In many cases it is of advantage to operate on the formal power series
rather than on the sequences.
In the case of convergence a(z) represents a function, holomorphic in
some \z\ > r (also at oo). In this case the Cauchy formula gives us the
inverse transform:
an = ^—: f a(z)zn~ldzi
Air i Jc
where C is a circle around the origin with radius > r, traversed coun-
counterclockwise.
486 Chapter XL Digital filters and continued fractions
Certain sub-spaces of / are of special importance, for instance the sub-
space / , consisting of all {an}^_0 for which
limsup|ari|" < oo.
(It is well known and easily proved that / in fact is a linear space.) The
main importance of / lies in the fact that the formal power series a(z)
represents a function, holomorphic at z — oo, if and only if A G / -
Example 3
A,1,1,...) o-'« 1 +2T1 + ••¦ = —— for|z|>l.
At "~~ L
z2
z
A,2,3,...) o^'« 1+ 22T1 + 32T2 + . ..= -—-— for|2f|>l.
\z )
1 1 - z~] z~2 z
) l + ~2~ + ~y + '"' = zlnYI^ for|2f|>l.
-O
We are now ready for the definition of digital filters. In this "theoretical
part" we shall use the concept for the mapping itself, not for some
"device" producing the mapping. Following [JoSt82] we define:
Definition A digital filter F : I—> I is a mapping of sequences
onto sequences {yn} according to formulas
N M
Vn + Yl hkVn-k = ]T akxn-k , n = 0,1,2,..., A.2.7)
k=l k=0
where a0, a\,..., as\j, by, 62,..., b^ are given constants, at\j / 0; 6jv ^ 0
and xn = yn = 0 for n < 0.
Remark: M = 0 or N = 0 is permitted. In the latter case the sum
on the lefthand side is empty, and hence 0, in which case the filter is
called nonrecursive. In all other cases it is called recursive. The sequence
X = {xn} is called the input, and Y = {yn} the output.
Filters and their representation
487
Example 4 (a) N = 0, M — 2: Tn this case the recurrence relations
are:
-f
-f
For arbitrary n > 2 the recurrence relations can be written in
matrix form (n = 4 in the illustration):
2/o
2/i
2/2
2/3
. 2/4 .
—
a0
a,
«2
0
0
0
a0
^2
0
0
0
ao
«i
«2
0
0
0
a0
a,
0
0
0
0
a
(b) JV = 2, M = 2: In this case the recurrence relations are
2/0
2/2 +
f &22/O
2/n
For n > 2 the recurrence relations can be written in matrix form
(n = 4 in the illustration):
0
0
0 0
1 0
bi 1
hi 6,
0 6,
0 0
0 0
0 0
1 0
6, 1
2/o
2/2
2/3
a0
^2
0
0
0
ao
av
a2
0
0
0
a0
ai
«2
0
0
0
CLr\
0L\
0
0
0
0
a0
-O
488 Chapter XL Digital filters and continued fractions
Definition For a given digital filter (F) the rational function
is called the transfer function of the filter. (Keep in mind that a^j ^ 0,
bN # o.;
The reason for this name is that the transfer from input to output can
be done by using the function h. With
A = (ao, Oi,.. .,aM»0,0,0,0,...)
B = A,6,,...,6^,0,0,0,...)
X = (xu,xi,x2,...) G /
Y = (t/u, ?/i, ?/2» • - •) ^ I (since X E / , see remark below)
the filter can be written
B*Y = A*X.
By the z- transform we get
b(z) - y(z) = a(z) • x(z)
=
which is the same as
y(z) = h(z) - x(z). A.2.9)
This gives us an algorithm for computing the output from a given input.
Remark: Observe that X E /l => x(z) is holomorphic in a neighbor-
neighborhood of z = oo =$> ?/(z) is holomorphic in a neighborhood of z — oo ^
Definition Ifh(z) is the transfer function, then the inverse z-transform
sequence H = {hn}}
25 called the shock response of the filter.
Filters and their representation 489
Observe that
= ? hnz~n (by "long division").
The reason for this name is that the input
X = A,0,0,0,...) ("shock")
has the sequence
H = (hu,huh2>...)
as its output.
1.3 Stable filters
Let Zcx; be the linear space of bounded sequences. A filter with the
property that a bounded input gives a bounded output is called a stable
filter. This is an important property. A useful theorem for deciding the
possible stability of a filter is the following, stated in [Henr86, Vol. 2] as
a problem (with reference to Martin Gutknecht):
Theorem 1 The following properties are equivalent:
(a) The filter is stable.
(b) The poles of the transfer function h(z) are all located in the open
unit disk \z\ < 1.
(c) Ifh(z) •-* {hn}, then E?Lo \hn\ < oo.
We shall establish the proof in three steps, the proofs of (a) => (b),
(b) =?> (c) and (c) ^ (a), by operating on the ^-transforms of {scn}, {hn}
and {yn}. A tool in our proof is the following:
490 Chapter XL Digital fillers and continued fractions
Lemma 2 Let g(w) be rational, and holomorphic in the open unit disk
U, where it has the power series expansion
g(w) = d0 -f d^w -f d2w2 + ¦ ¦ • .
If g has a pole of order > 1 on the circle \w\ = 1, the sequence {dn} is
not bounded.
Proof of Lemma 2: Being a rational function, g(w) can be written
as a linear combination of terms of the forms (where r, s are natural
numbers):
q(w\
, \P\ = 1, deg(g) < s ,
- py
and possibly a polynomial.
The coefficient dn will then be the ^"-coefficient we get by expanding
all the terms separately, followed by adding the expansions. The contri-
contributions of the first and third types add up to a function, holomorphic in
some disk \w\ < 1 -f f, € > 0, and their contribution to dn will therefore
—» 0 when n —> oo. Without loss of generality we may restrict ourself
to the study of the terms of the second type. We rewrite the term:
q(w) {-P)sq{w) qo + qiw + • - - + qnwm _i0
~~ ¦. P = e
- py
(i-pwy
This has a partial fraction decomposition of the form
i-pw
where we assume s > 2 and AH ^ 0. The coefficient of wn in the power
series expansion of this expression is
Filters and their representation 491
which is equal to A[ent0 for 5 = 1. For s > 2 it is easily seen to tend
to oo when n —> oo. (No asymptotic cancelling is possible, since for
2< k < s
tk - 2 + n\ k_l
—. ^-—-=- >0 whpnn->oo.)
/k-l + n\ k -l + n '
\ n i
The proposition is thus proved for the case when we only have one term
of the second type or even in the case when we have more such terms,
but only one with maximal s for the function. In case we have more,
say p, terms of the second type and with s maximal, the dominant term
in the coefficient for zn is of the form
where As ^ 0 for all v and all Qv are distinct. Since
v
limsup
n —> oo
U=\
(else lim??=1 As' exp(inOu) = 0, which is not possible, see Problem 6)
and
s -f n — 1
oo
n
a subsequence of {dTl} converges to infinity, and Lemma 2 is proved.
In the application we have w — 1/z. We now proceed to the
Proof of Theorem 1:
Proof of (a) => (b): With the input X = A,0,0,...) the output is
H = {hn}, the shock response. This is then a bounded sequence, and
hence h(z) is holomorphic for \z\ > 1 (also for z = oo). Assume that
h(z) has a pole on \z\ = 1, say for z — ei0. Then
9(>)
z-
492 Chapter XL Digital filters and continued fractions
where g(z) is holomorphic for \z\ > 1 and g(el°) ^ 0. Take X =
e2l°,...), which is a bounded sequence. Then
oo -.
Aez
n=0
and
Z9(z)
y(z) = h(z) • x(z) =
(z-ei0J'
y(z) thus has a pole of order at least 2 at z = el°. Therefore we know
from Lemma 2 that {yn} is not a bounded sequence, which contradicts
the assumption on stability of the filter. Hence (a) ^ (b) is proved.
Proof of (b) => (c): The function h(z) must be holomorphic in \z\ >
1 — 6 for some e > 0, which implies absolute convergence of the series
expansion for \z\ > 1 — e/2, in particular for z-values on the unit circle.
(Remember, h(z) is a rational function and has only finitely many poles.)
Hence ? \hn\ = M < oo.
Proof of (c) ^ (a): Let X = {xn} be a bounded sequence, |scn| < c
for all n. Since
¦i II \ — < h \ sk < Of \
\ynj — \"rjj * i*nj 3
i.e.
rt
2/n == / J %khn—k ?
fc=O
we find
n OO
|2/n| ^ / |x^||Ain_a-| < c ^ ^ |/i^| = c • M. < 00 .
k=0 k=0
Theorem 1 is thus proved. ¦
Remark: In the cases where we have the power series expansion at oo
for the transfer function, the stability can be checked by (c). But there
are cases when the transfer function is given a quite different represen-
representation, in which case other criteria may be of use. We shall return to
this in Subsection 2.2.
Filters and their representation
1.4 Graph representation of filters
493
Thinking of the independent variable as time, the realization of a digital
filter requires that past and immediate values of the input, and past
values of the output be available. This requires the possibility of delay or
storage of the past values. Furthermore, we need means for multiplying
by coefficients and adding the results. We illustrate these operations in
Figure 1. Keeping in mind that the unit delay in the sequence {xn}
•••) —» @, a; 1,
corresponds to multiplication of the z-transform x(z) by z~x, we indicate
the delay by using a "z^-box".
X,
X
n
a
X
—fr
a
X
•xn
n-\
Figure 1.
This is perhaps a good place for some remarks on notation, in order to
avoid confusion: It is beyond the scope of the present chapter to discuss
the sampling of values from some function. From now on we shall take
the sequence {xn} (or other sequences) for granted, and forget about the
"underlying function" from which the numbers xn are sampled. Hence,
when we use the symbol x(z) (or a(z), b(z) etc.) we shall mean the
z-transform of the sequence {xn} (or {an}, {bn} etc.). We shall use
upper case letters X, A, D as symbols for sequences: X = {xn} etc. In
some cases we shall need (as already seen in the description of graphs
of filters) symbols like Xi, X2,. - .for different sequences, in which case
the z-transform shall be denoted xi(z), 22BI • ¦ • •
494
Chapter XL Digital filters and continued fractions
Example 5 The following difference equation fits into the definition of
a recursive digital filter:
Vn + biyn-i + b2yn-2 = o-o^n •
This is an inhomogenous recurrence relation of order 2. The block di-
diagram representation of this digital filter is as shown in Figure 2. In
x
Vn-2
Figure 2.
order to understand this illustration we write the relation in the follow-
following form:
yn = al}xn + (-6
_O
Example 6 The digital filter
N
M
k=l
k=0
has a block diagram illustrated in Figure 3.
-O
This illustration is only one way of many ways to present the filter
(transfer function).
Filters and their representation
495
Vn-N
Figure 3.
A related way of implementing a digital filter is by using a directed graph.
A directed graph consists of two types of elements, points, called nodes,
and simple directed curves from one node to another, called branches.
To each node there is associated a node sequence Xi, or equivalently, by
the z-transform, a formal power series sb,-(z) (possibly a function). The
node sequences are influenced by the other node sequences through a
transmittance sequence T,-j, or equivalently, a transmittance function.
The transmittance sequences in question are all of the form
, ---,0),
where at most one of the numbers a,-,-, &;,-, is
0. The node sequences
496
Chapter XL Digital filters and continued fractions
are interrelated in the following way:
ij *
where the sum is taken over the whole range of node indices, and where
i also ranges over the same set. For the 2-transform this can be written
where tij(z) is one of the three functions; 0 (no influence), ai7 ^ 0
(multiplication), b{jZ~^ (delay and multiplication). We only draw the
arrows (branches) between the nodes where the transition function is not
0. For a directed graph there is one particular node where no branch
ends, the source node, and one where no branch starts, the sink node.
In order to compare the block diagram and the directed graph we shall
look at an example.
Example 7 We use as example the recurrence relation
JJn = aVn-
which is a first-order difference equation, since we only have n and n— 1
A block diagram for this relation is shown in Figure 4.
Figure 4.
With 6 = 0 and a = 0 respectively we would get the two block diagrams:
Filters and their representation
497
x
n
J
a
4
z~
yn
-1
yn-\
Figure 5.
By combining the two diagrams, using the delay (with z l) for X as
well as Y, we get the diagram in Figure 6.
x
n
a
Vn
Figure 6.
A directed graph for the same recurrence relation is shown in Figure 7.
In this graph we have node sequences X{, and their z-transforms X{(z).
There is a sink node with node sequence Y, z-transform y(z), and a
source node with node sequence X and z-transform x(z). Observe that
the recurrence relation by the z- transform is turned into the following
equation for formal power series x(z) and y(z):
y(z) — az y(z) + x(z) + bz x(z).
498
Chapter XL Digital filters and continued fractions
x(z)
xi(z)
Figure 7.
This gives
showing that
is the transfer function. Keep in mind that the arrows go in the direction
of influence.
One way to check this is to establish that it has the right transfer func-
function. We find:
= x(z) + ax;i(z),
x2(z) — xx(z),
= z~lx2(z),
y(z) = x2{z) -f 63:3B:) .
Simple computation shows that
x2(z) - x(z)
y(z) =
az~1x2{z)
Filters and their representation
499
and hence
-O
We conclude this section with an introductory example for a particular
filter to be discussed later. Observe how we can work our way from node
to node, starting with the source node A), ending at the sink node D).
Example 8 For the directed graph in Figure 8 we find:
xx
+
+
X\
Xi
+
+
Z3
Figure 8.
x2(z) =
xi(z) =
1
x3(z)
Simple computation shows that
500 Chapter XL Digital filters and continued fractions
(from the two first equations). Since the arrow from X>i to X% is drawn,
c-2 ^ 0, although the case C2 = 0 is easily dealt with (trivial). If this is
inserted into the last equation we get
The case of particular interest is such that C\ and c-2 are complex numbers
with |ci| < 1, |c2| < 1 and d^ is the positive number d^ — y/\ — \C[\2.
We shall maintain these conditions in the following. In this case we have
where the transfer function is given by
which is a terminating continued fraction (or an approximant of some
continued fraction).
O
In a later section we shall extend this example. We shall here only
indicate the next step: From ?2B) to X:i(z) we have the transfer function
C2j i.e. x,i(z) = c? • x-2{z). Assume that this transfer function is replaced
by another one (possibly given by some directed graph). Let T(z) be
the new transfer function. Then the transfer function h\(z) is replaced
by the function
-1 1
In some interesting cases it is natural to write T(z) in the form T(z) =
C2 + To(z)t and the transfer function
can be regarded as a modified approximant of some continued fraction.
The Schur algorithm 501
2 The Schur algorithm
2.1 An old algorithm
In 1917 and 1918 a paper appeared [Schurl8], in which functions holo-
holomorphic and bounded in the open unit disk were studied. The author
was I. Schur, and the paper has proved to be of interest and of influence
in several ways. We shall not even try to indicate the variety of prob-
problems upon which this paper had an impact, merely show a connection
to digital filters and continued fractions. We first recall two basic facts
from the theory of analytic functions of a complex variable:
Schwarz' lemma: Let f be holomorphic in the open unit disk U,
\f{w)\ < 1 in U and /@) = 0. Then
\f(w)\ < \w\ inU
with equality if and only if f(w) = elOw.
(Schwarz' lemma can be found in most textbooks in complex analysis,
see for instance [Ahlf53, Chapter III, Thm. 13].)
A simple mapping property. Let a be a complex number, \a\ < 1.
Then the linear fractional transformation
1 — aw
maps the closed unit disk U one to one and conformally onto itself.
Following Schur we shall study the family E of functions /, holomorphic
in U and mapping U into E/, f(U) C U. (Observe that the function
taking only the value 1, or elfl, is in E.) For a given /o 6 ? we have the
power series expansion
fo(w) = c0 + cxw -f c2w2 -\ , |u;|<l. B.1.L)
502 Chapter XI. Digital filters and continued fractions
Here |co| < 1. If \cq\ = 1 the function reduces to the constant cq — e*°°.
If |co| < 1, we put
co=:7o- B.1.2)
Then the function g\, defined by
t \ Mw) ~ To
1
is also in E (from the mapping property of SI). Furthermore <7i@) = 0,
and hence the Schwarz Lemma applies, and \g\(w)/w\ < 1, i.e. the
function
/,(») = !fW)]\, B-1.3)
w(l7/(w))
is again a function in E. (The removeable singularity at 0 causes no
problem). The power series expansion of /i starts with
If |7i| = 1, we have
which means that the function we started with was
(This follows easily from B.1.3) by putting f\(w) = el°.)
If |7j I < 1, we construct a second function /2, defined by
=
This is again a function in E, and we can go on. Either we come to
some fn which has a constant value of modulus 1, or we get an infinite
sequence {/„ }g° of functions in E. The forward and backward recurrence
relations are
r / x /n(wOw . 7n + Wn+lW ,o -. ^
fn+1(w)= V _ , /„(«;)=—— r v 7 B.1.5)
w(! ifW) I + 7W(w)
Here 7n is the constant term in the power series of frl(w). We thus have
two cases:
The Schur algorithm 503
1. The algorithm produces an infinite sequence of functions in E. In
this case all 7n have absolute value < 1. A permitted special case
is that for some k the function /* reduces to the value 7^, in which
case fi(z) = 0 for all i > k + 1.
2. The algorithm produces only a finite number of functions, all in E,
the last one a constant with absolute value 1. In this case |7fc| < 1
for all k < n, whereas |7n| = 1. (n = 0 is permitted.)
It can be proved, that the case 2 occurs if and only if /o(w) is of the
form (Blaschke product)
f
fn(w) = e f[ it"++g1^ , 0<K-|<l, |«| = l, B.1.6)
alternatively written
where
n
P(w) = JJA -f Wiiv) = 1 + ife, «; + ••• + knwn
i=\
is a polynomial of at most degree n, and where P(w) is the polynomial
1 -f k\ w + h knwn. We shall not give the proof, although it is rather
simple. We have already seen, that the statement holds for n — 0 and
n = 1. See also Problem 10.
Schur calls his algorithm "kettenbruchartig", continued fraction like, and
indeed it is. It must be, since we get fn from /n+i by linear fractional
transformations, where {/«} acts like a tail sequence. We rewrite the
recurrence relation
in the form
504 Chapter XL Digital filters and continued fractions
This gives
for all ?i in case 1, and also in case 2 if n < N, where N > 2 is the
smallest number for which |7n| = 1. For n = N we have
/u(™) = 7o for N = 0 ,
(l-|7o|2)™ 1 A - Itat-iI2)^
= 7o -t
generally for N > 2. Hence, on one hand case 2 occurs if and only if
fu(z) is of the form
On the other hand, f[)(w) is in case 2 equal to a terminating continued
fraction of the type above. It is not hard to see, that if a function /(J
is primarily given by such a continued fraction, this function can be
written in the product form above.
Continued fractions of the form
- l7ul> 1
7o H1
jw +7+ liw +72+-
where |7n| < 1 for all n were first studied by Wall [Wall48]. They, or the
terminating ones with the last 7n being of absolute value = 1 are called
positive Schur fractions in honor of I. Schur. They have many interesting
properties, for instance convergence and correspondence properties. See
for instance [J0NT86] and [Wall48]. They are furthermore connected to
many different areas, some of which are mentioned in the remarks. We
shall in the next subsection discuss the connection between terminating
Schur fractions and digital filters.
The Schur algorithm 505
2.2 Schur fractions and digital filters
We go back to Example 8 in Subsection 1.4- Figure 8 shows the directed
graph of a digital filter with transfer function
+c2
In the final remark of this example the transfer function c-z was replaced
by another transfer function T(z), in which case we would get the trans-
transfer function
1
If T(z) is given by a directed graph of exactly the same type, we would
have
C2Z
and the transfer function for the "combined graph" would be
-fc.3
Again we could replace C3 by a transfer function of the same type as the
one in Example 8, and we could even go on for an arbitrary number of
steps. The graph is illustrated in Figure 9, where we have renumbered
the nodes for obvious reasons. Instead of merely 4 nodes, as in example
8, we have 4JV nodes, where N is a positive integer. We maintain the
condition |c,-| < 1, except for the last one, for which we assume |ca"+i| =
1, and dk = y/\ - \ck\2, k < N.
506
Chapter XL Digital filters and continued fractions
Figure 9.
It follows immediately from the above considerations that the trans-
transfer function is equal to the terminating continued fraction of the type
above, only with 1/cjv+j as the last partial fraction. From the product
representation in Subsection 2.1 it follows that the poles of the transfer
function all are located in the open unit disk, and from Theorem 1 it
follows that the filter is stable. These results deserve the status of a
theorem:
Theorem 3 Let F be a digital filter represented by a directed graph
of the form shown in Figure 9 with \ck\ < 1 for k — 1,2, ...,7V and
|c/v+i| = 1, and let hp^(z) be the transfer function. Then the filter is
stable, and the transfer function is given by
hN(z) = cx
-1
B.2.1)
This result, due to Jones and Steinhardt [JoSt82j, may seem rather spe-
special, since the graph is the special one in Figure 9. But the next theorem,
also due to Jones and Steinhardt [JoSt82], and being "an almost con-
converse" of Theorem 3, shows that it is of much more general interest that
it may look like.
Theorem 4 Let F be an arbitrary stable digital filter, and let h(z) de-
denote its transfer function. Then one of the following holds:
The Schur algorithm 507
a) h(z) = a constant.
b) There exists a unique finite sequence {cnYni\ with \ck\ < 1, k =
1, 2,..., JV, |cjv+i| = 1 and a positive /3, such that
h{z) =
A - \cN\2)z
c) There exists a unique sequence {cn}™=l with |cfc| < 1, k = 1,2,
and a positive /3, such that
h(z) = C ¦ lim
N>
C/v -f- C{\[Z -f~ 1 /
Outline of proof: h(z) = constant is obviously a (trivial) possibility.
Else, since the filter is stable, h(z) has all its poles in the (open) unit
disk, and from Property (c) in Theorem 1 and the maximum principle
it follows that g*(w) = h(l/w) is bounded in U. With
P = sup \g*(w)\,
u
the function g*(w)/P satisfies the "Schur-conditions" in Subsection 2.1,
and permits repeated use of the Schur algorithm. In case 2 this gives, by
replacing w by z~l, the b-case of the theorem. In case 1 the algorithm
never stops, and one is led to a non-terminating continued fraction,
whose approximants are ail rational functions and the modified approx-
imants
z
cxw
where |?/v| = 1, are holomorphic in \w\ < 1. From Theorem 5.11 in
[Schurl8] (normality of the sequence of modified approximants combined
508 Chapter XL Digital filters and continued fractions
with correspondence) it follows that for any sequence {tn} with \tn\ = 1,
in particular for tn = 1, the sequence of modified approximants converges
to one and the same function. This modification ties the c-case to the la-
case of the theorem. The convergence is locally uniform on \w\ < 1, and
the limit function is holomorphic and of absolute value < 1 for \w\ < 1.
Inserting w = z~l leads to the c-part of Theorem 4.
3 Model reduction
3.1 General remarks
What we have been discussing in this chapter belongs to (and is a very
little part of) what is called linear system theory. One important prac-
practical problem in this theory is the model reduction problem. In vague
terms it means to replace one system <S, having a rational transfer func-
function of high degree, by another system <S(), with a transfer function of
lower degree, such that So in a relevant sense approximates S. A proper
definition of linear systems, as well as a survey of techniques for model
reduction and a huge bibliography from the field is given by Bultheel
and Van Barel in [BuBa86]. This paper, by the way, really illustrates
the role of continued fractions and Pade approximants in this field.
We shall restrict ourselves to the reduction of transfer functions for stable
digital filters, like the ones discussed earlier in this chapter, but first we
shall discuss some general principles.
Assume that we have some kind of transfer function h(z), and that this
has a Taylor expansion
oo
k
h(z) = g(w) = Y, 9kWk C.1.1)
k=o
near w = 0. Here w may be \jz in some cases, z — a for some constant
q in other cases. One of the requirements for model reduction is often
that as many as possible of the coefficients g^ be preserved, i.e. we are
Model reduction 509
looking for some
oo
g(w) = y2<jkwk C.1.2)
k=l
where g^ = g^ for k = 0,1, 2,..., r — 1, and where r is as large as possi-
possible without violating other requirements (one of which is simplification
compared to the original model). The most natural idea to think of is
to produce, somehow, a sequence {gr(w)} of functions corresponding to
the series for g{w). Then raise the question: Can this be done by dif-
different kinds of continued fraction expansion, or by using some path in
the Pade table for g{w)t One such path is the staircase formed by the
successive approximants of a regular C-fraction (if a regular C-fraction
exists).
3.2 Stable filters with rational transfer function
We now turn to the particular type of linear systems we are dealing with
in this chapter, the digital filter. We shall furthermore assume that it
is stable. Let h(z) be the transfer function and H = {hn} its shock
response,
h(z) = ho + h]Z-1 + h2z~2 + • • • , \z\ > 1. C.2.1)
In replacing this by a reduced model we want to preserve
a) stability
b) /io, hi,..., hr for some r.
Example 9 Let
40 + 2Sz~l + 100z~2
This is, as we soon shall see, a transfer function for a stable filter. Its
(terminating) regular C-fraction expansion is given by
2 (84/500)*-' D72/100)z-1 C00/59)z-] E/59)*-1
~5+ 1 1 + 1 - 1
510 Chapter XL Digital filters and continued fractions
If we try to approximate by using its 2. approximant J2{z) we get
10- 432-l
but this has a pole Zq = 118/25 outside of the unit circle, and can there-
therefore not be the transfer function of a stable filter. This is a disadvantage
by using C-fractions, that you may lose the stability when you choose
an approximant. Some methods based upon Pade-type approximants
instead of Pade-approximants work in some cases.
O
For stable filters we shall do it differently, but still based upon continued
fractions and correspondence. The new idea is to use Schur fractions.
Let the function h{z) serve as the first example.
Example 10 Since the poles of h(z) are complex conjugates, both with
absolute value -y/2/5, h(z) is the transfer function of a stable filter, and
since it is not a constant, it can be represented in the form of Theorem
4b or c. Since it happens to be of the form
2
where Pn is a polynomial, we have the b-case, even with /3 = 1. (See
B.1.6').) In fact, if we expand h(z) in a Schur fraction, we get
The graph is the one in Fig. 9, with
2 1
iV = 2, Cl~5' °2~5
The idea is now to replace the tail
0 ~ ?)«-' 1
5 l,-i +1
Model reduction 511
by 1, and get a function
This corresponds to replacing the rightmost part of Fig. 9 (with 4 nodes)
by one single arrow c2 = I between the two now rightmost nodes. We
get a graph as the one in Fig. 9, but now with
N = 1, Cl = - , c2 = 1
5
representing a stable filter with C.2.5) as its transfer function. We find
\ + z~l 2 21 ,
which has a single pole zq — —2/5. Only the first coefficient coincides
with the one for h(z),
2
K = h0 = - .
5
o
This example illustrated how the reduction can be carried out for stable
filters, assuming that we have a non-constant transfer function. Then,
according to Theorem 4 the transfer function can be represented by using
a Schur continued fraction. By truncating to get the 2rath approximant
(last partial fraction being l/cm+i) we get a rational function corre-
corresponding to the transfer function up to and including the 2~m-term
[J0NT86, Thm. 2.2]. In order to make sure it is a transfer function for
a stable filter we replace l/(cm+i) by 1/1 (or l/eItt for arbitrary real
a), and get a terminating Schur fraction, where we know that the poles
are in the unit disk. (The factor /3 has of course no influence on the
poles.) The price we have to pay for this change, is that we lose the last
coefficient in the correspondence, and hence we only preserve
(In Example 10 we used the approximant of order 2, i.e. m = 1 and
replaced I/C2 by 1/1. We got stability, and the coefficient ho.)
512 Chapter XL Digital filters and continued fractions
In order to make this a practically useful method we need to be able
to transform the given transfer function into a constant times a Schur
fraction, or at least a reasonably long start of such an expansion. One
way is already described in Subsection 2.1, we only have to find /3 =
supig^j h(z) first. If we have the special situation of 4b, which means
that the transfer function is of the form
h{z) =
then it can be handled easier by the Schur-Cohn algorithm. It is rather
easily proved from the Schur algorithm. We quote it without proof, and
refer to [BuBa86] and [JoSt82]. The notation in C.2.3') differs from the
one in C.2.3), in order to conform with [J0NT86]. In the latter a proof
is included. The algorithm goes as follows:
From a given array
dj=aj, j = 0,1,.. .,72, C.2.6a)
being the array of coefficients of the denominator polynomial Qn(z),
we construct for m = n,n— 1,...,1 new arrays ar™ in the following
way:
(m) C.2.6b)
(m-1) 0,)+i - Cn-
i-m+l|
with j = 0,1,..., 771 — 1 and cn+i = a[} /% . Then the c^'s are the
parameters of the terminating Schur fraction of h(z). We shall illustrate
this on an example.
Example 11
" - 16
Model reduction
We see that this
4°
(-0
a) }
a,
4])
a[l)
c4
has the
= 1
= 0
= 0
= 0
- -16
_ _ jl
= 0
= -16
= 0
form C.2
C)
ai
aC)
a{3)
a3
40)
.3') with P
= 0
= 0
= 0
= -16
= 0
= -16
= 1
= 1 and
B)
a^ =
a%] =
n = 4.
: 0
: 0
: -16
: 0
513
In this case the Schur fraction is
^ 0 +0+ 0 +0+ 0 +1"
The cutting procedure described in Example 10 gives the following re-
reductions of our model:
1
(/)-1 +0+ 0
and similarly
o
514 Chapter XL Digital filters and continued fractions
Problems
A) Let / be a real-valued function of a real variable u, defined on
an interval [a,6], and let a = Uq, u\,. .., un = b be equally spaced
it-values, and /(uo)> f{v>\ )>• • -the corresponding values of the func-
function. Find a simple transformation u = g(t), such that with
we have f(uk) =
B) Find the z-transform of the sequence {a«}JJL0, where the sequence
is given by
(a) an = (-1)",
(b) an = i(l + (-in
(c) an — 0.
C) Find the inverse /-transform of the formal series or functions in
the following cases
(a) a(z) = exp(l/z),
(b) a(z) = ln(l + 1/z),
(c) a(z)= 1 + z +
D) Find the transfer function and the shock response for the filters
given for n > 0 by the recurrence relation below, where xm =
ym = 0 for negative m.
(a) yn = 2xrl + zn_i,
(b) yn = 2/n-i +2a;u + a;n_i,
Vn = -yn-\ + 2a;n - «„_,.
E) Draw the block diagram illustrations of the filters in Problem 4.
F) Prove the following theorem: Let a and f3 be two distinct angular
values, and A, D two complex numbers ^ 0. Then
n
Problems 515
G) A digital filter F has the transfer function
h(z) =
(z _ Q\2 '
For which values of ft is F a stable filter? Compute
oo
in all cases when F is stable.
(8) Let F be a filter with transfer function
h(z) =
(p + 1 + q)z2 + 2(p - q)z + (p - 1 + q) '
where p and q are real numbers. Use Theorem 13 in Chapter X to
determine for which p, q the filter is stable. (Hint: Transform the
unit disk to the left half plane by
z — 1
It then turns out that
is a polynomial.)
(9) Use the result in problem 8 to establish a criterion for stability of
filters F with transfer function
(z - I)'2
=
= Az2 + 2Bz + {A - 2) '
where A and B are real mumbers. Illustrate in an A, i?-plane.
A0) Write explicitly fo{w) in case 2 of Schur's algorithm when
(a) |7o| <1
(b) |7o|<l
(c) l7o| < 1, 7i = 72 = • • ¦ = 7* = 0, |7it| = I-
516 Chapter XL Digital filters and continued fractions
A1) Let fo(w) be holomorhic in the unit disk \w\ < 1 and
1 there. Assume that all yn produced by the Schur algorithm
are = 1/2. (We have not established that the 7-values may be
prescribed!) Determine the Schur fraction associated with this
sequence of 7-values.
A2) Use results from Chapter III to prove that the continued fraction
in Problem 11 converges for \w\ < 1. Prove that its value is
w - 1 + \/w2 - w + 1 1
fo(w) — = —(- positive powers 01 w .
w 2
Next prove that f\(w) — fo(w), and hence that fo(w) gives rise
to the sequence {7n}> where all 7n = 1/2 when we use the Schur
algorithm. (f\(w) is here the next function in the Schur algorithm,
not to be confused with the approximants of the Schur fraction.
We do not raise the question about sup
A3) Find the continued fraction produced by the Schur algorithm in
the following cases:
(a) /0(ti/) = 0,
(b) fo{w) = Ci where |cL| < 1,
(c) fo(w) = ckwk where \ck\ < 1,
(d) fo(w) — CkWk where |cjt| = 1.
A4) Let h(z) be the transfer function called h\(z) in Subsection 2.2.
Find the shock response {hn} of the filter. For which c2-values is
the filter stable, if Ci is assumed to have absolute value < 1?
A5) Let F be a digital filter with transfer function
Mz)=\i ~, ¦
Find the Schur fraction expansion of Theorem 3 for h(z), and hence
its sequence of cn-values.
A6) Use the Schur-Cohn algorithm to find the Schur continued fraction
of
(a) the function in Example 9.
Problems 517
(b) the function
h(z) =
In problem 16b find also the first three model reductions as de-
described in Example 11, and compare the start of their power series
expansion (in powers of l/z) with the one for h(z).
518 Chapter XL Digital filters and continued fractions
Remarks
A) In Chapter X we have discussed stability of polynomials, i.e. the
property of having all zeros in the open left half plane. A re-
related concept is the concept of disk-stability of a polynomial, i.e.
the property of having all zeros in the open unit disk. There is
an obvious connection between disk-stable polynomials and stable
digital filters. Combination of Theorem 3 and Theorem 4b leads
to the following result on disk-stability. Let
Q(z) = a0 + ai z + {¦ anzn
be a given polynomial of degree n, i.e. an ^ 0. Then Q(z) is
disk-stable if and only if
z»Q(l/z)
=
has a Schur fraction expansion
h{z) = C| +
C2
z
where \cj\ < 1 for j — l,2,...,n, |cn+1| = 1. (See also [JoSt82].)
B) The positive Schur-fractions and the closely related PC-fractions
(Perron-Caratheodory-fractions) are important in the study of the
trigonometric moment problem, and, in connection with that, poly-
polynomials orthogonal on the unit circle (Szego polynomials) and
Gaussian quadrature on the unit circle. For references we refer
to the remark section at the end of Chapter VII.
References
[Ahlf53]
[BuBa86]
[Hamm77]
[Ilenr86]
[J0NT86]
[JoSt82]
[JoTh80]
L. Ahlfors, "Complex Analysis", McGraw-Hill, New York
A953).
A. Bultheel and M. van Barel, Pade Techniques for Model
Reduction in Linear System Theory: A Survey, J. of Comp.
and Appl. Math., 14 A986), 401-438.
It. W. Hamming, "Digital Filters", Prentice-Hall Inc., En-
glewood Cliffs, New Jersey A977).
P. Henrici, "Applied and Computational Complexs Analy-
Analysis", Vol. T, IT, III, J. Wiley & Sons, New York A974, 1977,
1986).
W. B. Jones, O. Njastad and W. J. Thron, Schur Fractions,
Perron-Caratheodory Fractions and Szegd Polynomials, a
Survey, "Analytic Theory of Continued Fractions II" (W.
J. Thron, ed.), Lecture Notes in Mathematics, No. 1199,
Springer-Verlag, Berlin A986), 127-158.
W. B. Jones and A. Steinhardt, Digital Filters and Contin-
Continued Fractions, "Analytic Theory of Continued Fractions"
(W.B.Jones, W.J.Thron and H.Waadeland, eds.), Lecture
Notes in Mathematics, No. 932, Springer-Verlag, Berlin,
A982), 129-151.
W. B. Jones and W. J. Thron, "Continued Fractions: An-
Analytic Theory and Applications", Encyclopedia of Math-
Mathematics and its Applications, Vol. 11, Addison-Wesley,
519
520
Chapter XL Digital filters and continued fractions
[OpSc75]
[Schurl8]
Reading, Mass. A980). Now distributed by Cambridge Uni-
University Press, New York.
A. V. Oppenheim and R. W. Schafer, "Digital Signal Pro-
Processing", Prentice-Hall Inc., Englewood Cliffs, New Jersey
A975).
I. Schur, Uber Potenzreihen die im Innern, des Einheits-
kreises beschrdnkt sind, Journal fur die reine und ange-
wandte Mathematik 147 A917), 205-232 and 148 A918),
122-145.
[Wall48] H. S. Wall, "Analytic Theory of Continued Fractions", Van
Nostrand, New York A948).
Chapter XII
Applications to some
differential equations
About this chapter
A linear homogenous ODE of order 2 is a three-term linear relation be-
between i/, y' and y". Under certain conditions differentiations and rear-
rearrangements will lead to three-term recurrence relations for the successive
derivatives, and in turn to a continued fraction, where {2/'nV2/ } ls a
tail sequence (right or "wrong" tails). Tf the continued fraction converges
to yjy1 or equivalently: {y^/y^n+l^} is a right tail sequence, then we
have a continued fraction representation of the logarithmic derivative of
a solution.
A Riccati equation has an important invariance property with respect
to linear fractional transformations. This property has given rise to
different continued fraction procedures for solving such equations.
There is a transformation from Riccati equations to second order linear
equations and the converse. This opens up the possibility of treating
second order linear equations the "Riccati way" or the other way around.
The topics of the present chapter are limited to the types of differential
521
522 Chapter XII. Applications to some differential equations
equations mentioned above. The emphasis is on the formal algorithmic
part. For rigorous proofs that the expansions represent the solutions (by
correspondence and convergence) we mostly refer to the sources. But in
some cases, in order to present the underlying idea, we outline a proof.
In some very special cases an "a posteriori verification" may be used to
establish that the result of the procedure in fact represents the solution.
Second order linear equations 523
1 Second order linear equations
1.1 Introduction
Already in Chapter I we have seen an example of a connection between
differential equations and continued fractions. In Subsection 2.2 of that
chapter we saw how a linear second order differential equation gave rise
to a continued fraction, which turned out to be a continued fraction
expansion of /'(*)//(«), / being a particular solution of the differential
equation. This can be done more generally:
Given a second order linear differential equation
y = Po{x)y'+ Qo{x)y", A.1.1)
where ; means differentiation with respect to x, and where Po(x) and
Qo(x) are infinitely differentiate. We differentiate and rearrange:
where
for all x where P^{x) ^ 1. We proceed and find generally
»W = PB(*)if("+1) + Qn(a:J/("+2), A.1.3)
where
Since (if no denominator is 0)
J/_ _
y'
y'
y"
y{n) p . Qn
— -*n
y(n+l) n y(n+l)/y(n+2)
524 Chapter XII. Applications to some differential equations
we get the formal identity
y' Pi +P2+---+;
This suggests to look at the continued fraction
Pn
A.1.6)
for which we ask the questions: Does it converge (for the x-values we are
interested in, e.g. an interval on the real axis, a domain in the complex
plane)? In case of yes: Does it converge to y1 jy for some particular
solution y in that domain? If also the answer to the second question is
yes, we easily find this y, and by putting y = y • u we find a linearly
independent solution in the familiar standard way. This way of looking
at the problem can be found in Perron [Perr57] who referred to A. Steen
[Steen73].
If we have the very special case that A.1.6) converges to a continuously
differentable function / which is "manageable", e.g. we have a closed
form for it, the second question can be answered by using the following
lemma:
Lemma 1 Assume that A.1.6) converges to a differentiate function f
with a continuous derivative f. Then there is a solution y of A.1.1)
with
I = /(*) A-1.7)
if and only if
= QuU2 + f), A-1-8)
where
-Po, A.1.9)
is the value of the first tail of A.1.6).
Second order linear equations 525
Proof of lemma: A.1.8) is equivalent to
1 - Pof = Qo(f2 + /') • A-1-8')
Let y be a solution of the differential equation
y
i.e.
y = Ce1'^ , where Ff(x) = f(x).
Then A.1.8) is equivalent to
i'Y , y"y~{y'f
y " \\yJ y2
i.e. to
1 p? -or
1 - JRJT - VU~ i
y y
and hence to A.1.1). ¦
We shall now illustrate the technique in some cases where the conditions
of Lemma 1 turn out to be satisfied.
Example 1 If Po and Q{) are constants, we are back to Example 3 in
Subsection 2.2 of Chapter I. We shall write them as
Qo = -a.
and assume that a and P are complex numbers of different absolute
values, 0 < \a\ < \P\. The process described leads to the continued
fraction A.1.6) which now takes the form
1 ~aP ~°P . A.1.6')
-(a +fi) + -(a + p) + -(a + p) +• • • *
We know from Theorem 6 in Chapter III that the periodic continued
fraction
P
oo -
r?i -(a
526 Chapter XII. Applications to some differential equations
converges to /(!) = a, and hence that A.1.6') converges to
/ = : '
From this it follows that
(a/3)
i.e. the relation A.1.8) holds. Hence, one solution y of
y =-(a + P)y' - apy" A-1-1')
may be found by solving the equation
? = _!
y P*
and we find
By putting, in the familiar way
y — e~x^-
into A.1.1'), we easily find a second, linearly independent solution
and hence the general solution
There is of course no reason to use this method for differential equations
A.1.1'), since the standard method, taught in elementary courses, is
simpler. The purpose was only to illustrate the procedure and the test
by the lemma in a familiar case.
O
Second order linear equations 527
Remark: In Lemma 1 we have assumed that the continued fraction
A.1.6) converges to some /, and that /(*) is the value of the first tail.
But the Lemma also holds if {<7^}JJLo is any tail sequence for A.1.6).
For Example 1 we have for instance that
a
is a "wrong" tail sequence (if we maintain the condition 0 < \a\ < \{3\).
With g = — I/a and gW = j3 we have
1
a a'
g.gW = -Z: = -afl.^ = Qu(g* + gr),
and we find a particular solution of A.1.1') by solving
- = --,
y a'
i.e. wo find the particular solution
y = C2e-*l«,
and together with what we did with the right tail sequence we get the
general solution.
We do not even need convergence of A.1.6). It suffices to have tail
sequences with smooth elements satisfying the lemma. Taking again
Example 1 as an illustration, but now with 0 < \a\ = \/3\,a / /3, the
continued fraction A.1.6) diverges, but it still has
1 1
,/3,p,/3,/3,... and — —, q, a, q, a,...
a p
as tail sequences, (where both are wrong tail sequences). Since in both
cases the equality A.1.8) holds (with (/, /O) replaced by (<7,<7^)), we
find two linearly independent solutions just as before. (It is easy to
prove, that no other tail sequences than the two mentioned above are
such that the equality A.1.8) in Lemma 1 holds.) As an example, take
a = i,/3 — —i in Example 1. The differential equation takes the form
528 Chapter XII. Applications to some differential equations
The continued fraction A.1.6') is in this case
I li li ll
0+0 + 0 + 0 +•••"
It diverges, but the two "useful" tail sequences are
l/m 6* *"— (• ~~~ 6j m m •
and
• ¦ • •
The general solution is
y = Cie1'-" + C2e~Ia:(= (?! cos a: + C2
The next example is also a familiar one from elementary courses. In this
case we restrict ourselves to real equations and look for real solutions.
The differential equation
y = poxy* + qox2y" , x > 0 , Pu,tfuGR, <|o/0,
is an example of a differential equation with a regular singular point at
the origin, actually the Euler-Cauchy equation. The first step in the
standard solution procedure is to insert y — xr and solve the algebraic
equation for r. Another method is to transform it to an equation with
constant coefficients. But we shall here use the continued fraction tech-
technique just described.
Example 2 Given the differential equation
y = - xy' -6x2y", x > 0 .
We shall see what happens if we use successive differentiations and the
continued fraction method:
y'=-yf - xy" - I2xy" -
i.e.
y'= ~^xy" - 3x2y>"
Second order linear equations 529
This is again an Euler-Cauchy equation, this time for y' instead of for
y, and we proceed:
y = -~xy' - -zV
o o
/// 37 u\ 3 2
From
y
y'
y'
y"
y"
y'"
y"'
y«)
13
2
5
3
37
40^
we arc led to a continued fraction
1 -6z2
-3x
6x2
y'ly"
3x2
y"/y'"
try I C \~,2
y'"/y(<)
C/20)z2
y(*) jyw
2 / 2/5)aj2
somehow associated to y'/y. A natural thing to try now is to establish
general formulas for the elements, and from there on try to find out
whether or not the continued fraction converges, and to what. Here we
shall follow another path. With pn and qn as in
x > 0 , n > 0 ,
the continued fraction is of the form
H *
By an equivalence transformation (divisions by x) we get the continued
fraction
1/g 9o ^ 92
530 Chapter XII. Applications to some differential equations
which, apart from the start, is independent of x. In case of convergence
it converges to r/x for some r ? R. In any case we try out
x
in Lemma 1. We find
• /0) = -(-- Po* ) =
x \r )
1 - rp() = 1 + r
x \r ~ J
and
—9 I ~ %)r\r ~ 1) = "^G* —
Hence y1 jy — r/x for some solution of the Euler-Cauchy equation in this
example, if and only if r is such that
-6r(r- 1) = 1 +r,
i.e. r = ^ or r — ^, and we thus find, from the equations
tA and tA
y x y x
the two linearly independent solutions a:1'*1 and a?1/2, and thus the gen-
general solution
Generally we are led to the algebraic equation
qor(r - 1) = 1 - por,
i.e. the same equation as in the standard method for solving Euler-
Cauchy equations.
O
Example 2 shows that the continued fraction method used on Euler-
Cauchy equations merely leads to the standard method of solving such
equations, and that there is no need to make a "continued fraction de-
detour". The purpose of Example 2 is on one hand to give another illus-
illustration of what the continued fraction looks like in a familiar case, on
the other hand to prepare for a not so familiar case in Subsection 1.2.
Second order linear equations 531
1.2 An "almost77 Euler-Cauchy equation
In the present subsection we shall illustrate the use of the continued
fraction method on a special case of differential equations
y = (pf) + bxs)xy' + (q0 + Pxs)x2y" , x > 0 , A.2.1)
where p(), <fc), b, /3 and s are real. We shall first rewrite the equation and
also impose some restrictions. Observe first that if b — j3 — 0 we are
back to the Euler-Cauchy equation. If the algebraic equation
1 =p»r + qor(r-l) A.2.2)
has two distinct real roots ri, r2, the general solution of the Euler-
Cauchy equation is C\Xr* + C'ix1. We shall here assume this to be the
case. We shall furthermore express Po and <jo in terms of r\ and r<i- We
rewrite A.2.2) (note that go 7^ 0 under our assuptions):
r2
_ (i _ El) r _ 1 = o . A.2.2')
\ QoJ qo
We have
i Po
ri+r2 = 1 ,
Qo
1
from which it follows that
The restrictions to be imposed are (in addition to the assumptions on
ri and 7*2)
6
s = -r2, /3 = -,
r-i - 1
where r\ is required to be ^ 1.
The differential equations to be studied are thus of the form
n+s-r + \ + /j_ + b gA x2 x>(j
\ r.s J \rs 1 — r /
532 Chapter XII. Applications to some differential equations
where r and s are real numbers, both ^ 0, r / 1 and r -+ s ^ 0, and
where 6 is a real number. (We have replaced r-i by — s and r\ by r.)
Later we will put on some further restrictions on r and s. Keep in mind
that for b = 0 the equation A.2.1') reduces to the Euler-Cauchy equation
with the general solution
x>0.
We find from A.2.1') that
y fl + s-r \ \rs l-r
xy1
, A.2.3)
xy"
provided that no dominator is 0. We differentiate A.2.1') and rearrange:
rs 1 — r
:2 b
rs 1 — r
By cancelling the factor
xs+2)y"'
/I 6 A
— +- x")
\rs 1 — r J
we find
f s - r)xy"
where we assume (in addition to earlier assumptions) that s ^ —1. This
is an Euler- Cauchy equation for y'. We have
r lf - ti?Li_±A. A.2.4)
Since we have reached an Euler-Cauchy equation we could proceed as in
Example 2.
Second order linear equations 533
But it is just as easy to go on directly: From A.2.3) and A.2.4) and the
way it must continue, we find that the continued fraction A.1.6) (i.e.
the one associated with y'/y) in this case has the form
+ fa.) x + K
where cn and dn are independent of x. If it converges, it must be to
for some constant c ? C. We assume that c / oo and use Lemma 1,
with / as in A.2.6) and thus with
V 1 — r
We find that
/./(') =
—b-
+ fa-) + (A + ^^ c
+
1 + s — r + c
rs
and
/i i. \
x2-
A.2.7)
1 + s — rj-c
Ki
A.2.8)
The expressions in A.2.7) and A.2.8) are equal if and only if
c = -(s + 1) or c = r - 1. A.2.9)
From Lemma 1 we know that we get solutions of A.2.1') by taking y' jy
equal to A.2.6) with either one of the c-values A.2.9).
534 Chapter XII. Applications to some differential equations
c = —(s + 1) gives
y' -1 -s
V [I + -iL±Ax»] x
a
.»-l
_ __ j 1 — T
X -. ,
x
and hence, since x > 0,
c = r — 1 gives
2/' f*
— = — , and hence y =
y x
By combining the two solutions we find the general solution of the dif-
differential equation A.2.1') for x > 0:
y
= C, (*- + b{[+_SJS) + C2x' . A.2.10)
Example 3 We shall solve the differential equation
where d is a real number / 0.
Simple verification shows us that this is an "almost" Euler-Cauchy equa-
equation in the meaning just described, with r = —1, s — —2 and 6 = 2d:
y = @ + 2dx-2)xy' + (i + dx-2)x2y".
According to A.2.10) the general solution is
O
Second order linear equations 535
1.3 Two further examples
In the examples we have seen so far the continued fraction produced by
the method led to, by convergence or by some skilled or lucky choice of
the zeroth tail value, a closed form of y'/y for two linearly independent
solutions of the differential equation. This is of course very special.
Moreover, in many such cases, as seen by the equations with constant
coefficients, by the Euler-Cauchy equation and by the "almost" Euler-
Cauchy equation, the continued fraction method is in fact dispensable,
except for being illustrations of how the continued fraction method works
in familiar cases.
Of practical value is the continued fraction method only in such cases
when we, on the basis of properties of the given differential equation, can
come up with a statement about the solution, related to the continued
fraction A.1.6). Such a statement may for instance be that the continued
fraction A.1.6) converges to yf/y for some particular solution y of the
given differential equation. (Knowledge of a possible closed form is not
taken into account.) Theorems of this type would be related to theorems
on power series solutions of linear second order ODE's.
The continued fraction method described in this section has been dis-
discussed and applied in many different cases, se e.g. [Khov63] and [Steen73]
and the references therein. But in most cases the method has been car-
carried out formally, with little or no attention paid to questions of justifi-
justification. Several things could be done - and ought to be done. We shall
not go into that here, but in the next example we shall include some
remarks containing the relevant key words.
Example 4 The differential equation
has a regular singular point at x = 0. It can be solved by using the
Frobenius method, which is to put
y = ajs(i40 + -Ai x + A2x2 + ...)
536 Chapter XII. Applications to some differential equations
into the differential equation and determine s and the coefficients An.
In the present case we find for s the equation
4s(s - l) + 2s = 0,
with the solution s = 0 and s — ^, for which we get two linearly inde-
independent formal solutions
y = AQ + Axx 4- A2x2 + ... , Ao / 0 ,
y = x?{BQ + Bxx + B2x2 + ...), Bu / 0 .
But they are more than just formal solutions. The given differential
equation has the form
x2y" tx
where P(x) = |,<5(k) = =y. If P and Q are analytic in \x\ < R in the
complex plane, the series YZnLo An2n and Y^=o^n^n both represent
functions, analytic in \x\ < R. In the present case P and Q are entire
functions, and the same will be the case for the two series. The two par-
particular integrals expressed by means of the two series, hereafter denoted
y\ and ?/#, are analytic in the plane and the cut plane (cut along the
negative real axis) respectively. For Aq ^ 0, yf.\/yA is meromorphic in
the whole plane and holomorphic in a neighborhood of the origin. For
B{) / 0, y'nlyB is meromorphic in the whole plane. It has a pole of
order 1 at the origin.
After these preliminary remarks we switch over to the continued fraction
method. Successive differentiations give
+y =0
4xy'" +62/" +,,' =0
+Dn
We find
, = 2±x,
yf y'
or
y' -1/2
y y"
y 1 + 2x{y"/y')
Second order linear equations 537
and generally
A) -1/2
#(") 2n + 1 +
We are thus led to the continued fraction
-1/2 -x -x -x
1 +3 + 5 + 7 +•••'
Since it is equivalent to the continued fraction
-1/2 z/(l-3) z/C-5) x/{4n2 -
where x/Dn2 — 1) —> 0 as n —* oo, we know by Example 1 in Chapter
II that it converges to a meromorphic function in the whole plane, and
by Worpitzky's theorem in Chapter 1 that the convergence is uniform
in some closed disk around the origin. Furthermore, it corresponds to a
power series
1 x
2 6
with the same start as the Taylor expansion at 0 of y\/y.\. From the
way the continued fraction is constructed it follows that it actually cor-
corresponds to the expansion of y\/yA- From the correspondence- and
convergence-properties it follows that
k _ zi zi z? —
yA 1 + 3 + 5 + 7 +...
in the whole complex plane.
Observe that we have used the knowledge of existence and convergence
of series representations of solutions, but not the series themselves. We
furthermore used convergence (uniform) of the continued fraction to
"something" and correspondence to the "right thing", and were able to
conclude that yf/y = the continued fraction, without having to depend
upon a possible closed form of the continued fraction.
In some cases, for instance in using second order equations in order to
solve Riccati equations, this (i.e. the logarithmic derivative of a solution)
538 Chapter XII. Applications to some differential equations
is actually what we need. In other cases we are looking for the general
solution of the second order equation, in which case a continued fraction
representation is not always the best starting point for further progress.
(To connect it to a power series is a possibility, but in that case a very
natural question is to ask if it had not been just as good to use a power
series method from the very beginning.)
Another use of the method is for finding continued fraction expansions
of functions. For such applications knowledge of a solution is needed.
Sometimes this can be carried out by constructing a differential equa-
equation with a prescribed solution. As an illustration we shall see how
the present example can produce a continued fraction expansion for the
function tanu. If
y = cos y/x , x G C ,
then
2y/xy' = — sin y/x ,
and by differentiation
and hence
4xy" + 2yl + y = 0 ,
i.e. y = cosa/sc is a solution of the differential equation we just have
studied. Actually we have
yA = Ay cos
and consequently
— o —x —x —x
2yfx~ 1 + 3 + 5 + 7 +•
and we find
2 9 9
U —U —U —U
taim = — A.3.1)
1+3 + 5 + 7 +-.. V '
in the whole complex plane.
O
Second order linear equations 539
We conclude this section by showing another example of how a function
satisfying a differential equation can be expanded in a continued fraction.
Example 5 The Bessel function Ju{z) of the first kind of order v is
defined by
It converges for all z (E C, and is a solution of the Bessel equation
1 / v2
w" 4- ~wr 4- 1 - -5- I w = 0.
z V z2
Since
L) 2
and thus
1/2 — V r I \ 2l/ + 1
Z2 * ' Z
the differential equation may be written
B) - ^
which leads to the continued fraction expansion
i/ + 1) - 2(i/ + 2) - 2(i/ + 3) ~ • -
for Jl/+\(z)/Jl/(z). (See Subsection ?.^ in Chapter VI.) This continued
fraction was also known to Bessel himself.
O
540 Chapter XII. Applications to some differential equations
2 Riccati equations
2.1 General Remarks
A Riccati differential equation is a first order ODE of the form
y1 = ao(a:) + a, (x)y + a2{x)y2 , B.1.1)
where ao, ai and a2 satisfy certain smoothness conditions on some real
interval or in some domain in the complex plane. In the history of
continued fractions the Riccati equation has attracted a lot of attention.
Already Euler was interested in the connection between the Riccati equa-
equation and continued fractions. Later, many authors have devoted time
and effort to the Riccati equation, for instance Worpitzky, to name but
one example. Actually, the very first mathematical paper we know of
from him [Worp62], dealt with a continued fraction expansion of a so-
solution of a Riccati equation. There are several reasons for this great
interest. First of all: The Riccati equations are among the very simplest
non-linear ODE's, and they appear in applications, such as general rel-
relativity theory, system theory and acoustics. Next: They are, as we
soon shall see, closely related to second order linear ODE, and finally:
They have a certain invariance property with respect to linear fractional
transformations, a property that makes them fit for continued fraction
types of solutions.
In B.1.1) we may assume that a2(x) is not identically 0 in the interval
or domain we are interested in. Restricting ourselves to an interval
or domain where a2(x) ^ 0, we get, if a2 is differentiable, that the
substitution
u{x)
y(x) = -
a2(x)
leads to a Riccati equation for it, where the coefficient of u2 is — 1.
The verification of this is left to the reader. Essentially without loss
of generality we may therefore in the following assume that the Riccati
equation is of the form
y' = ao(x) + ai{x)y - y2 . B.1.2)
Riccati equations 541
The way to get from a Riccati equation B.1.2) to a second order linear
equation is to replace
-<¦>
and
for some "good" choices of /o, /i, /2, and to replace
y by-.
We then get
h{x)w" + /, (x)w' + fo{x)w = 0 . B.1.3)
We get from B.1.3) to B.1.2) by the opposite substitutions. We have the
word "good' undefined, but the idea is of course to make a substitution
leading to a differential equation we can handle.
We shall illustrate the transformation from Riccati equations to second
order linear equations in two examples.
Example 6 The equation
y'=-l + 2y-y2
is a Riccati equation. (It is also a separable differential equation, with
the general solution
V = 1 + ——p; , including y = 1.)
x + O
We want to transform it to a second order linear homogenous differential
equation. Using the above notation, we put
. _ /»(*) „ /¦(«)
One way is to take /^(ac) = 1, fo(x) — 1 and /i(k) = —2. This gives the
linear equation
w" - 2w' + w - 0 .
542 Chapter XII. Applications to some differential equations
The general solution of this is
w = (Ci + C2x)ex .
From this it follows, when C\ and C2 do not both vanish:
which is equal to 1 for C2 = 0 and to 1 4- 1/(k + C\ /C2) for C2 ^ 0, just
as expected.
O
Example 7 Given the Riccati equation
/ 1 1 2
y = -1 + -2 - -y-y -
Xz X
We want this transformed to a second order homogenous ODE. With
notation from above we put
4
a;2 ' /2(k) a;
Take fa{x) = 1 - 1/z2, f{ = l/x, f2(x) = 1. We then get the differential
equation
// 1 / ( 1
W H W | I 1 ;
X \ X'
which is the Bessel equation of index I. (See Example 5.)
O
We now consider the invariance property: Given a Riccati equation
y' = ao(x) + ax(x)y + a2(x)y2 . B.1.4)
By introducing the new dependent variable w given by
6(x) ' v '
Riccati equations 543
(with obvious smoothness and non-singularity conditions on a, j3, 7, 8),
the equation B.1.4) is transformed to a new Riccati equation
wf = d(j(x) -f a~\(x)w + d2(x)w2, B.1.6)
where do, d\, d<i are easy to compute (Problem 2). The crucial thing is
that the ww'-terms cancel.
For the history of continued fraction solutions to Riccati equations we
refer to [Khov63], and more updated to [Coop88]. See also the remark
section at the end of this chapter. Let us here merely mention some
steps from the early history.
Lagrange had proposed the following method for solving differential
equations by using continued fractions: For a given differential equa-
equation let y be "near" ?0 when \x\ is small. Write
ih B-L7)
and insert this into the differential equation. If y\ is "near" f { we repeat
the process, and if it can be repeated without stop (or terminates), we
are led to a continued fraction expansion
^. B.1.8)
a=l 1
The terms ?„ have (in the cases studied) mostly been of the form anxfin,
where a G C and qn > 0.
This idea can not be expected to work generally. But for Riccati equa-
equations we know at least that the new equation also is a Riccati equation,
so in the present section we shall concentrate on this type of a differen-
differential equation. Euler and Lagrange treated differential equations of the
form
(a + ctx)xy' + (P + P'x)y + 72/2 = 6x , B.1.9)
where a, a', /?, /?', 7 and 8 are constants. Euler also applied Lagrange's
idea to
axyf + Py + 2/2 = xk , fceN, y(Q) — ~P , — ^N, B.1.10)
a
544 Chapter XII. Applications to some differential equations
to find the solution
oo xk
K — „. B.1.11)
n=i nka — 6
Worpitzky [Worp62] was briefly mentioned in the introduction. He stud-
studied the Riccati equation
yf + y2 = ax'2 B.1.12)
and found the solution
oo axm
= 1+ K T rr- B.1.13)
k=i km -f 1
2.2 An old example
The "technical part", i.e. the part where some formal continued fraction
for the solution is created, depends upon a "good" choice of a linear
fractional transformation, such that the new Riccati equation can be
transformed again in a reasonably simple way. In fortunate cases one can
see a pattern. The first example is closely related to equation B.1.10),
but not directly a special case of it.
Example 8 Let k be a real number. We study the Riccati equation
xy'+ ky + y2 = -x2, 2/@) = 0. B.2.1)
We try, heuristically, to find a "good" transformation: Insert the power
series expansion
y = ax + 6k2 -f- • • • .
The left-hand side then takes the form
(k + l)ax + (b(k + 2) + a2)x2 + • • • .
If k is not a negative integer, we find that a — 0 and b = —l/(k + 2).
Then it is natural to try f0 = —x2/(k -f 2) in B.1.7), and thus (almost
following Lagrange)
-x2 I -x2
I
x2-z(x) xl[z(x)]
k + 2 + z I ~ k + 2 + (k + 2J (k + 2)
Riccati equations 545
where (by studying the power series expansion) we have 2@) = 0.
We find, by putting this into the given Riccati equation:
-2x2(k + 2 + z) + x[izr kx2(k + 2 + z) x4 _ 2
(k + 2 + zJ ~Jk + 2 + zJ + (Jb + 2 + 2J ~ "^ '
After multiplication by ?~2(& + 2 + zJ and rearranging terms (out of
which several cancel) we get
xz' + (Jb + 2J + 22 = -z2 , 2@) = 0.
This is a Riccati equation of exactly the same type as the one we started
with, only with k replaced by k -f 2. In the same manner we can use the
transformation
-x2
2 =
+ 4 + u
and get the Riccati equation
xu' + (k + 4)w + u2 = -x2 , u@) = 0 ,
and so on. If we, instead of using 2, w, etc. use the notation y\, 2/2 5 ...,
we find by repeated use of these transformations that
22 2
^~ X X X
y ~ k + 2 + k + 4 + .-.+ k + 2n + yn "
(We recall that k is assumed not to be a negative integer). From this
equality one is led to the function defined by the continued fraction
00 —x2
y= K -——-. 2.2.2
n=\ k -f In
We shall outline the proof that this in fact is the solution with 2/@) = 0:
a) The continued fraction converges in the whole plane to a mero-
morphic function, holomorphic in some neighborhood of x — 0
and with 2/@) = 0. In order to see this one can replace — x2 by ?,
and by an equivalence transformation we get the regular C-fraction
j ±
F+2 (* + 2)(fc + 4) (F+4)(Ar + 6)
1 + 1 + 1 +•••'
whose properties can be established as for the continued fraction
in Example 4.
546 Chapter XII. Applications to some differential equations
b) The continued fraction corresponds to a formal power series solu-
solution with 2/@) = 0 of the differential equation: The formal power
series solution starts with the same term —x2/(k + 2) as the series
corresponding to the continued fraction. Moving from the original
differential equation to the next one (for 7) we have the same sit-
situation, with — sc2/(&-f-4) as the first term, and similarly for u and
all subsequent equations. (Every time we jump to the next tail
of the continued fraction.) The correspondence is an immediate
consequence of this.
c) The formal power series solution represents a holomorphic function
in a neighborhood of the origin, since the continued fraction has
this property, and is thus not only a formal solution, but a solution.
Hence the continued fraction is a solution.
The outline is closely related to considerations in Example 4. Note also
that a comparison between B.2.2) and the continued fraction A.3.2) for
a ratio of Bessel functions leads to an expression for y in terms of such
Bessel functions.
O
Example 9 We can use the previous example to re-establish the ex-
expansion of tana; which we found in Example 4.
The differential equation
y' = 1 + y2, 2/@) = 0, B.2.3)
is separable. It has the solution y = tana?. If we insert a power series
and compare coefficients, we find that it is of the form
x3
y = x ^ -f higher powers of x .
o
Following the procedure of Example 8 we substitute
B.2.4)
Riccati equations 547
and get the differential equation
xz' + z + z2 = -x2 , z@) - 0 ,
which is the one in Example 8 with k = 1. Hence
z =
3 + 5 + 7
and finally
2 2 9
or. IE X X
= t 4-^r ±r Mr ±
1+ o + o + 7 +•••
for all x G C. This expansion was discovered by Lambert, and later by
Lagrange and Euler. The book [Khov63] contains several examples of
expansions found by means of Riccati equations.
Let it briefly be mentioned, that if x is replaced by — ix in B.2.5) we
get, since ztan(—ix) = tanha;,
9 9 9
X X X X
tanhz = - — — — . B.2.6)
1+3 + 5 + 7+••• v }
This also converges in the whole plane.
O
2.3 A new example
Among the newer continued fraction results on Riccati equations we
have chosen to mention one particular result, namely a solution by using
general T-fractions. This was presented by Sandra C. Cooper in her
thesis [Coop88]. See also [C0JM88]. We shall here give a very rough
and brief sketch of the idea.
We remember about the general T-fractions that they interpolate and
possibly approximate simultaneously at 0 and 00. In the method to
be mentioned here two initial value problems for a Riccati equation are
simultaneously solved by one and the same general T-fraction.
548 Chapter XII. Applications to some differential equations
Here the Riccati equation is assumed to be of the form
zAQ{z) + Bu{z)Wu{z) + C0{z)WS[z) - zW^(z) = 0, B.3.1)
where
A{j(z), Bu(z), Co(z) are analytic at z = 0 ,
??o@) is not a positive integer,
B.3.2)
Aq(z), Bq(z),Cq(z) are analytic at z = oo,
A)(oo) = Co(oo) = 0,
, Inn zCo(z) f 0 , ^ Urn A - J50(z)) $ Z" .
oo z —» oo
We seek possible solutions, holomorphic at z — 0 and satisfying the
initial condition Wq@) = 0, and possible solutions, holomorphic in a
neighborhood of z — oo, except for a pole of order 1 at z — oo.
The method aims at a solution, represented by a general T-fraction of
the form
where Fn ^ 0, Gn ^ 0 for n > 1.
The T-fraction corresponds at z = 0 to a power series
(Go + Fx)z - (FiGi + F{F2)z2 + higher powers of z,
and at z = oo to
+ —- + higher powers of z~A .
Observe that these power series have the same form as the expansions
at 0 and at oo of the two solutions we are looking for.
The idea is related to the one we saw in Example 8, where a formal
power series solution was inserted and the initial coefficient determined,
from which the transformation of the variable was decided. The main
difference here is that two series are involved. Let the two series (formal
solutions) be
iz + p2z2 + ... .
Riccati equations 549
We insert them into the differential equation, and use our knowledge of
the expansions of A{), B^, Cq at 0 and at oo:
A / \ @) , @) . a / \ a\ ¦ a2 .
A0(z) = %J + a) 'z + ... , A0(z) = —?-+-^ + ... ,
B0(z) = 4°) ^
(o\ <a\ -v@)
Straightforward computations (comparison of coefficients) lead to for-
formulas for the p- and /^-coefficients in terms of the known coefficients
above, in particular for pi,jp!!_i and p^. Since
Go + Fi = Pi, Co = pli and
we find Go,F\ and ??i such that the two series corresponding to the
general T-fraction coincide with the formal series solutions up to and
including the terms p\z and p*_\Z + p^. Rather than writing up the
formulas (which, by the way, are easily found) we shall illustrate this
process later in an example.
After having found Go, F[ and G\ we introduce the new variable
by
Wi(z) (with W^ifO) = 0) is the solution of a Riccati equation of exactly
the same form as B.3.1), only with subscripts 1 instead of 0. Ai(z),
B\(z), C\(z) are easily expressed in terms of Aq(z), Bo(z), Cq(z) and
Gq. We next introduce a new variable ^2B), defined by
rW B-3-4)
Here is the place where the invariance property of Riccati equations is
used: Under certain mild condition ^f^) not only satisfies a Riccati
equation, but a Riccati equation of exactly the same form as B.3.1):
B2{z)W2{z) + C2{z)W22{z) - zWi{z) - 0. B.3.5)
550 Chapter XII. Applications to some differential equations
Continuing the process we set
Wt(z) = ^ ,
and so on. If we can proceed indefinitely, a general T-fraction B.3.3) is
generated, where
Fnz
and where Wn+\(z) is the solution with Wn+ifO) = 0 of a Riccati equa-
equation B.3.5) with subscript 2 replaced by subscript n -f 1.
In the papers [Coop88] and [C0JM88] conditions (on Ani Bn, Cn) are
established, which ensure that the process can be carried out. Recur-
Recurrence relations for AniBn,Cn are proved, as well as formulas for the
parameters FniGn in the general T-fraction.
The main result of [Coop88] and [C0JM88] is as follows:
Theorem 2 Let
zA0{z) + B»(z)Wo{z) + C0(z)WS(z) - zW&z) = 0 B.3.7)
give rise to the general T-fraction
\ jr FnZ r 1 FxZ F'lZ
K G+
a) If B.3.8) converges uniformly in a neighborhood ofO to a function
W(z), then W(z) is the unique solution of B.3.7), analytic in a
neighborhood of z = 0 and with Wo@) = 0.
b) If (GqZ + j!i(Fnz/(l + Gnz)))/z converges uniformly in a neigh-
neighborhood of z — 00 to a function V(z), then W(z) = zV(z) is the
unique solution of B.3.7) at z = 00 with a simple pole at z = 00.
We shall illustrate the algorithm on an example, picked from [C0JM88].
Riccati equations 551
Example 10 Let a, 6, c be distinct real numbers, not 0 or negative in-
integers. We shall illustrate the first two steps of the algorithm (from Wo
to W2) on the Riccati equation
Z x. Z
x. Z
With notation from the text we get
. , x a(b — c)/c a(b — c) / 9 \
1 — z c >. /
aF-c) /111
= ( + ++
at 2 = 00
(
c \z z
at z = 0
^
in/1 1 l \
a-6) (- + — + — -^ I
•¦¦} at^ =
z + z2
111 \
We insert this into the Riccati equation and compare coefficients to find
that
at z = 0 :
, a(b - c)
z : cpx - pi = 0,
c
a(b-c)
=
+ c)
at ^ = 00 :
552 Chapter XII. Applications to some differential equations
\-\-b~ a
sop_x = , (P-\ = 0 is ruled out)
z° : -aF~c) + (a - b)p*0 + (c + a - 6)pl1 +
c
- cB + 6-a) •
Hence
1+fr-a F+l)(a-c-l) 2 + 6-a
c c(l + c) 1 + c
WithW|(«) defined by
we find that W^i(^) is the unique solution, analytic at z = 0 and with
= 0, of the Riccati equation
+
1 — z I — z
c
JL i2
With W2{z) defined by
Wl{z)= f
l+
2t+c j
we find that ^(-j) is the unique solution, analytic at z = 0 and with
= 0, of the Riccati equation
^ c+l '* (-(c + l) + (a-t-3),)
1 1
Riccati equations 553
Observe that we get the differential equation for W2 by replacing
b by 6 -f 1 and c by c -f 1
in the difierential equation for W\. We find F2 and G2 from Fi and
G\ in the same way, and by repeating the argument we are led to the
continued fraction
+ K w ,
1 + G2
n){a-c-n)
where
C [C + 72 — L)(C 4" 72)
_ n+l+6-a
Lrn — , 71 — 1, Z, O, . . .
c-fn
The general T-fraction found here is limit periodic, Fn—* — 1 and
Gn —> 1 when n —* 00. We may thus conclude from the theorem that
the T-fraction represents both solutions (by correspondence and con-
convergence).
O
554 Chapter XII. Applications to some differential equations
Problems
A) Use the procedure of Example 1 to solve the differential equations
(a) y = yr + 6y",
(b) y = -22/' - by".
In which case does the continued fraction converge? diverge?
B) Find the explicit expressions for flo(aj), dy(x) and do(x) in B.1.6).
C) The differential equation
y = l + y2, y@) = 0
is separable, and can be solved as such. Do this first. But it is
also a Riccati equation. Transform it into a second order linear
differential equation. Solve this, and use the solution to find the
solution of the given differential equation. Compare the solutions.
D) Transform the Euler-Cauchy equation
x2w" + xw' — -w = 0
4
to a Riccati equation, and use the solution of the first equation to
find the solution of the Riccati equation.
E) For the equation B.1.10) with k = 1
axy' + j3y + y2 = x , y@) = -j3 , - g N
we find that the power series expansion of the solution has the
start
x
y = —/3 H + higher powers of x .
ex. — p
This suggests the transformation
a-
Problems 555
Show that this leads to the differential equation
axy\ + (a - P)yx + y\ = z , ^ @) = 0 .
Next try the transformation
x
V\ =
2a - /3 + y2
(also suggested by the power series expansion) to find a Riccati
equation for t/2- Proceed in a similar way to obtain formally the
solution B.1.11) in the case k — 1. Finally, introduce
into the equation
x = t and — =
k
axy -\- py -\- y — x
and use what you have found to establish formally the solution of
form B.1.11) (but with t and a\) for the equation B.1.10) (with t
and
F) Take in the Riccati equation B.1.12), studied by Worpitzky, a — 1
and 772 = 2. We then get
y' + y2 = l •
We are interested in the solution where \jy —> 0 when a? —> 0 (actu-
(actually coth x). Use Worpitzky's solution B.1.13) to establish (again)
the continued fraction expansion B.2.6) for tanha;.
G) Carry out the details of the computations in Example 10.
556 Chapter XII. Applications to some differential equations
Remarks
A) In her dissertation from 1988 [Coop88], Sandra Clement Cooper
has a chapter on "A history of continued fraction solutions to Ric-
cati differential equations". We refer to this, in particular to the
part dealing with recent results, and also to the references in the
thesis.
B) In the method of T-fraction solutions of Riccati equations, strong
conditions had to be imposed on the coefficient functions. By using
the ^-fractions of L. J. Lange instead of T-fractions a related, but
more general method has been establised [Coop89].
C) The paper [Steen73] by A. Steen, referred to by Perron, is in dan-
ish. It is in fact a document of invitation to a celebration at the
University of Copenhagen on April 8, 1873, on the occasion of the
55th birthday of His Majesty King Christian IX. Adolph Steer was
a professor of mathematics, but was also active in many other ar-
areas. He was during a long period Rector (i.e. President) of the Uni-
University of Copenhagen. The mathematical paper [Steen73], used
as an invitation document, was meant to be material for teaching
of graduate students. One may wonder how much the participants
at the party, including the King, could understand!
References
[Coop88] S. C. Cooper, "General T-Fraction Solutions to Riccati
Differential Equations", Dissertation, Colorado State Uni-
University (Spring 1988).
[Coop89] S. C. Cooper, 6-Fraction Solutions to Riccati Equations.
"Analytic Theory of Continued Fractions III, Proceedings,
Redstone 1988", (L. Jacobsen ed.), Lecture Notes in Math-
ematics, Springer-Verlag, Berlin-Heidelberg A989), 1-18.
[C0JM88] S. C. Cooper, W. B. Jones and A. Magnus, General T-
Fraction Solutions to Riccati Differential Equations, "Non-
"Nonlinear Numerical Methods and Rational Approximation",
(A. Cuyt, ed.), D. Reidel Publishing Company A988), 409-
425.
[JoTh80] W. B. Jones and W. J. Thron, "Continued Fractions: An-
Analytic Theory and Applications", Encyclopedia of Mathe-
Mathematics and its Applications 11, Addison Wesley Publ. Co.,
Reading, Mass. A980). Distributed now by Cambridge Uni-
University Press, New York.
[Khov63] A. N. Khovanskii, "The Application of Continued Fractions
and their Generalizations to Problems in Approximation
Theory" (translated by Peter Wynn), P. Noordhoff N.V.,
Groningen A963).
[Perr57] 0. Perron, "Die Lehre von den Kettenbriichen", Band 2,
B. G. Teubner, Stuttgart A957).
557
558 Chapter XII. Applications to some differential equations
[Steen73] A. Steen, "Integration af lineaere Differentialligninger af an-
den Orden ved lljaelp av Kjacdebr0ker", Kobenhavns Uni-
versitet, Kobenhavn A873), 1-66.
[Worp62] J. Worpitzky, Beitrag zur Integration der Riccatischen Gle-
ichung, Greifswald A862), 1-74.
Appendix
Some continued fraction
expansions
About this section
This is a catalogue of some of the known continued fraction expansions.
The list is in no way complete. Still it can be useful, both to find a
continued fraction expansion of some given function and to "sum" a
given continued fraction.
As for the notation in this catalogue, we write /(z) = the continued
fraction for z G D to mean that the continued fraction converges in the
classical sense to f(z) for all z G D. The set D is usually an open set,
such as for instance D = C \ i[—1,1]. The continued fraction will then
normally diverge on the cut i(—1,1), and it may converge or diverge at
the end points z = dbi. We give a reference to only one of the possibly
many books or papers where the expansion can be found. We have not
attempted to find the origin of the various results.
559
560 Appendix. Some continued fraction expansions
1 Introduction
It is evident that not every continued fraction expansion can find room
in a book like this. On the other hand, quite a number of the known
continued fraction expansions can be derived from one another by simple
transformations. We have for instance
Similarly, if/ = 60+ K(an/6n) then y = (/-l)/(/ + l) = 1-2/A + /),
that is
Another simple transformation is maybe most easily described for reg-
regular C-fractions. Assume that f(z) = bo + ~K{anz/l). Then f(z~l) =
bu + J?.(anz~l/I). Equivalence transformations lead to
f(z) = bu f- K(anz/l) =>
/1\ GL\ a-i Q>\\ Q>\ CL5
\zj z + 1 + z + l + zH
&2 GL3 d.\ GL&
a5
where f2 = z. We shall not list equivalent continued fractions like this
separately. Another situation that often arises is the following: We have
• A.1.4a)
60+
1 + l + 1
Then
re \ u aiz a2* a*z i, n AU\
f(iz) = b0 — —r- —— - A.1.4b)
1 — 1 — L — • • •
Of course, every time we have a continued fraction expansion / =
&o + K.(an/bn) with all an,bn ^ 0, we can take its even or odd part and
obtain a "new" continued fraction converging to the same value /. Some
Mathematical constants 561
of these variations will be listed. (In particular if they turn out to be
nice and simple.) We give references to other continued fractions in the
appendix which have the present continued fraction as a special case, or
which can be transformed into the present continued fraction by simple
means.
2 Mathematical constants
ill l liiiilii
+ 7151
B.1.1)
[JoTh80, p. 23]. This is the regular continued fraction expansion of tt.
4 I2 22 32 42
[JoTh80, p. 25], (see also C.6.1)). For the Riemann zeta function we
have
1 7T2 1 I4 2'1 3
2CB) =
[Bern89, p.150].
1 I2 1^2 22 2-3 32 3-4 4^
B.1.4)
[Bern89, p. 153], (see also D.7.32)).
1 I-1 I3 2-* 23 3J 33 43 43
- — — — — — — — — B.1.5)
4+ 1 +12+ 1 +20+ 1 +28+ 1 +36+.-.' v '
562 Appendix. Some continued fraction expansions
[Bern89, p. 155], (see also D.7.37)).
e = -------- , B.1.6)
-1 + 2-3 + 2-5 + 2-7+
[JoTh80, p. 25], (see also C.2.1)).
11.11.1111
6 " 2+l + 2 + l+T+4 + T+l + 6+' (
[JoTh80, p. 23].
2 1111
[Khov63, p. 114], (see also C.2.2)).
[Perr57, p. 57].
[Khov63, p. 114], (see also C.2.2) for z - -1).
The golden ratio:
e = I i I ± ± 1 , B.a.10)
1-3 + 64 10 + 14 + 18+--- v '
1
2 i + i + i+... 3-3-3
[JoTh80, p. 23].
=1-1 i I , B.1.11)
333' v '
Elementary functions 563
Catalan's constant G = J2k2a(-l)k fBk + IJ:
t2 22 22 42 42 62 62
2G = 2-t+t+t+t+t+t+t+...' B-L12)
[Bern89, p. 151], D.7.30).
+
1/2 + 1/2+ 1/2+1/2+ 1/2 + 1/2+1/2 +¦
[Bern89, p. 153], D.7.32).
3 Elementary functions
3.1 Introduction
The elementary functions listed here are all special cases of hypergeo-
hypergeometric functions or ratios of hyper geometric functions. Their nice corre-
corresponding continued fraction expansions are special examples of expan-
expansions for the hypergeometric functions in general. Still, we prefer to list
them separately in this section.
3.2 The exponential function
, _ ,. » \zzzzzzz
e = iF
z \z \z 2z 2z 3z Zz Az
l_l+2-3 + 4-5 + 6-7 + 8-9 +•••
C.2.1)
[JoThSO, p. 207]. (See also D.1.4).) The odd part of this continued
fraction is
2z z2 z2 z2 z2 ,
ez = 1 + _____ : zeC, C.2.2)
2-z+ 6 +10+14+18+-- •' V ;
564 Appendix. Some continued fraction expansions
[Khov63, p. 114].
z z I z 2 z 3 ^
• 1c
[BoSh89, p. 32].
The even part of C.2.3) is
e* = l+-L- -±?- -?I- -*±- -zeC, C.2.4)
1- 2 + 2- z + 3- z + 4 -Z+...1 ' v ;
[J0TI18O, p. 272]. Since ez = l/e~z, we can find 4 more expansions from
C.2.1)-C.2.4) by use of A.1.1). For instance, C.2.4) transforms into
I ^_ X^_ 2z_ Jz_
II + Z2 + Z3 + Z4 + Z+' V ;
[Khov63, p. 113]. An unusual expansion is
_t - 2fc ^±i! *i±^! *l±i! C26)
tan^ Aj+ 3tan0 -f 5tan^ + 7tan0 -\ v ;
for tan2 ^ G C\[-l,0]; i.e. tan^ G C\i[-l, 1], [BoSh89, p. 50]. (See also
C.3.8) with q = ik and z — ztanfl and C.6.8).) Lambert's continued
fraction
ez - e~z z z2 z2 z2
ez -\-e~z 1+3 + 5 + 7+-••' '
[Wall48, p. 349], is easily obtained from C.2.2) by use of A.1.2).
3.3 The general binomial function
1 az_ A + a)z A - a)z B + a)z
T-T+ 2 + 3 + 2 4-
B - a)z C + a)z C - a)z
C.3.1)
+ 2 + 7 +••• v ;
Elementary functions 565
for |arg(z+ 1)| < tt, [J0TI18O, p. 202]. (See also D.1.6).) The odd part
of this continued fraction is
^-a2)
2)*2
2+(l-a)z- 3(z + 2) - 5B + 2) - 7B + )
C.3.2)
for I arg(z + 1)| < tt, [Khov63, p. 105]. C.3.2) is also the odd part of
1-1A + 2)- 2 -3A +
B - a)z B + a)z C - a)z
2 _5(l + 2)- 2 -••
for I argB + 1)| < tt, [Khov63, p. 101].
+ qJ- 2 + C + qJ - 3 + E + aJ
C.3.4)
for 3^B) > -1/2, [Khov63, p. 101].
\<* - I az 1A-a)z 2B - a)z 3C - a
^ ^ ~ l-l + a2 + 2-
C.3.5)
for \z\ < 1, [Khov63, p. 102].
For the general binomial function we have
C.3.6)
Hence the equality A.1.1) applied to these 5 expansions gives us 5 new
ones. To find a continued fraction expansion for
my -
¦
we can use any of the 5 expansions C.3.1)-C.3.5) with 2 replaced by
2/(z — 1). For instance C.3.2) gives Laguerre's continued fraction
z — tX"! "^ "T" "•* ~r ¦ •* i**"
for |arg(^Y + 1)| < tt; that is z G C \ [-1,1], [Perr57, p. 153]. An
interesting special case of for instance C.3.1) is obtained by replacing
566 Appendix. Some continued fraction expansions
1 + z by z and a by —1/z:
B-z- l)(z- 1) Bz + l)(z - 1)
2 + 5.z +
C*-!)(*-!) C«+ !)(*-!)
2 + 7z +-..
for |arg.z| < 7r, [Khov63, p. 109]. The even part of C.3.9) is
= 1 +
Z* + I — SZ{Z + 1J —
V'2
C.3.10)
for | argz| < tt, [Khov63, p. 110].
' _ 2a{a - b)z (a-6J(l2-a2)z2
~ 2 + (a+6-a(a-6))z- 3B + (a + b)z)
(a — 6) B*1 — a )z (a —
ioT 2-j?$1 6 C \ I- !]. (Perr57' P- 2641-
T/ie natural logarithm
log(l + z) = z2Fl(l,l'12;-z)=z [ -
Jo 1
2z 3z 3z 4z
-5 + 2-1-7-1-2 +~9~+-
Z \2Z 1^2 222
C.4.1)
for |arg(l + z)\ < tt, [JoTh80, p. 203]. (See also D.1.6).)
, , x z z 1 z 1/2 z 1 z 2/3
locfl + z) = - - —¦— - - —-—
M ^ 1+Z-1 + 1 + Z-1 + 1+2-1 + 1+2-1 + 1 + Z
C.4.2)
Elementary functions 567
for |arg(l + z)\ < tt, [JoTh80, p. 319 (NB! misprints)]. Here the con-
continued fraction has the form K.(an(z) / bn(z)) where all a<2n(z) = —z,
O"\n-\(z) — 1 and a.in+i(z) = n/(n + 1). C.4.1) is the even part of
C.4.2). The odd part of C.4.2) can be written
- —{
z 2z z 3z 2z 42 3z 5z 42
+ 2 Y2~5"T yT "9" Y
z l-2z 1-22 2-32 2-32
1 + -
3-4z 3-4z
for | arg(l + z)\ < tt. The even part of C.4.1) is
for | arg(l + z)\ < tt, [Khov63, p. 111].
2 l2z 22z 322 42z 522
^ + ^^+ C.4.5)
for |z| < 1, [Khov63, p. 111]. The connection
log(l + z) = - log (rl-) = - log (l - ^) C.4.6)
can be applied to C.4.1)-C.4.5) to get 5 new continued fraction expan-
expansions. For instance, from C.4.1) we get
Iz lz 22 2z 3z 32
_ Y_ 7A +
C.4.7)
for | arg(l + z)| < tt, [Khov63, p. 110], and from C.4.5)
for 5RB) > -1/2, [Khov63, p. 111].
2? IV 2V 3V 4V
1_ 3 _ 5 _ 7 _ 9 _.
568 Appendix. Some continued fraction expansions
for |arg(l - z2)\ < tt, [JoTh80, p. 203]. (See also D.1.6).) Of course,
also other continued fraction expansions for log(l + z) can be used to
derive expressions for log((l -f z)/(l — z)). Notice also that
, z + 1 , 1 + 1/2 2 I2 22 32 42
log —— = log / = - — — — — C.4.10)
z1 5ll/z zZzhz7z§zv ;
for z e C \ [-1,1], [Perr57, p. 155].
3.5 Trigonometric and hyperbolic functions
sin*
tan z = = z
cos z
2222
f
[JoTh80, p. 211], (See also D.1.1).) The odd part of C.5.1) is
tan z — z -\
1 • 3 -1> — bz* - a • (• y - i4z^ -
^ . 1 1 v ' Q . 1 7 r'1
9-ll-13-22z2-13-15-17-30z2 \ ' ' )
for all z E C.
ztt 2 I2-*2 32-z2 52-z2 72-z2 /ornX
tanT = T+-^-+-i-+-^-+^-+...C-5-3)
for all z E C.
[Perr57, p. 35]. (See also D.7.7).) From these expansions one also gets
continued fractions for cot z = 1/ tan z, tanhz = — i tan(z'z) and cothz =
ij tan(zz) by use of A.1.1) and A).
Quite another type of expansion for tan z follows from
.(l-Hztan2)Q-(l-2tanz)tt .y - 1
tanaz = —%-. : ; ; — = —i , C.5.4)
tA f ztanz)rt + A -2tanz)Q y+1 v '
Elementary functions 569
where y = (A + ztanz)/(l — z"tanz))Q can be expanded according to
C.3.8). Combined with A.1.4) we get
a tan z (a2 — I2) tan2 z
tanaz = _____ _ _
(a2 -22) tan2 z (a2 - 32) tan2 z
for | arg(l + tan2 z)\ < tt, [Khov63, p. 108]. Since
C.5.5)
* (coth T -
it follows that
TT7 1T7 7* 12(z24-12^ 22(z24
—— coth — = 1 -\
O O 1_l Q _1_ Ci 1 7 1
C.5.7)
for all z e C, [ABJL, Entry 44].
atanhGr6/2) — 6 tanhGra/2)
atanhGra/2) — 6 tanliGr6/2)
ab (a2 + 12)F2 + I2) (a2 + 22)F2 + 22)
T + 3 + 5 +• • • C-5'8)
for all a, 6 G C, [ABJL, Entry 47].
sinhGT2) — sinhGrz) 2z2 4z'1 + I'1 \z4 + 2'1 4z4 + 34
sinhGrz) + sinhGrz) 1+ 3 + 5 + 7 -|
C.5.9)
for all z G C, [ABJL, Entry 49].
3.6 Inverse trigonometric and hyperbolic functions
arctanz = z2F\{\, 1; |; -z2) = --log .
jL L Z2
z l2z2 22z2 32z2 4z /o x
— C.6.1)
1+3 + 5 + 7 + 9 +••• v ;
570 Appendix. Some continued fraction expansions
for|arg(l + z2)| < tt; i.e. z G C\i((-oo,-l]U[l, oo)), [JoTh80, p. 202].
(See also D.1.6).) The C-fraction for arctan z can also be written
z3 3V 2V 5V 4V 72z2 62z2
arctan. = z - _ + _+_ + _+_+_+_+ C.6.2)
for z e C \ z((-oo, -1] U [1, oo)), [Khov63, p. 117].
z 1 • 2z2 1 • 2z2
1A + z2)- 3 -5A +
3 • 4z2 3 • 4z2 5 • 6z2
arctan z =
7 -.(! + ,.)- 11 -. ^
for z e C \ i((-oo,-l] U [l,oo)), [Khov63, p. 121]. (This follows
from C.6.6) with z replaced by z(\ + z2)/2.) Since arctanhz =
iarctan(—z'z), we also get continued fraction expansions for arctanhz
from C.6.1)- C.6.3). Also expressions for
f Z y/\ - Z2
arcsm z = arctan . , arccos z — arctan -
z
can be obtained. For instance, from C.6.1) we get
arcsin z z l2z2 22z2 32z2 42z2
V
for|
l-z2 l(l-z'2)+ 3 +
arg(l - z2)| < 7T, [Khov63,
arccos z 1 12A — z2)
for 3ft(z) >
arcsin z
-z2
¦ z2 z+ 3z
0, [Khov63, p. 119].
2^1B' ~2' 2' Z
z l-2z2 1-2
p. 118]
22A-
-\- bz
But we
and
also
z2
7 +9A-
32A - z2
f- 7z
have
3•4z2 5¦
z)+... V
i C
+••¦ C
• 6z2 5 • 6z2
C.6.4)
T_3_5_7_9_H_13
C.6.6)
for | arg(l — z2)\ < 7r, [JoTh80, p. 203], and thus, since arccos z =
arcsin v 1 — z2
arccos z z 1 ¦ 2A - z2) 1 ¦ 2A - z2)
1-3-5
3-4(l-z2) 3-4A -z2)
C.6.7)
Elementary functions 571
for 3ft(z) > 0, [Khov63, p. 121]. Obviously similar expressions for inverse
hyperbolic functions can be derived, since arcsinhz = zarcsin(—iz) and
(arccosh;z)/\/V2 - 1 = (arccos z)/y/l - z2. A neat formula can be ob-
obtained from C.2.6) in the following way
/iz + l\ia ,.,12 + 1, , , , xx
I 1 = exp(zalog ) = expBaarctan(l/z))
\iz — 1/ iz — 1
= J^ o^ o^ o^
za + 3z + 52 -I- 7z +'
for | arg(l + 1/22)| < tt, i.e. z ? i[-l, 1], [Wall48, p. 346].
arcsinhz „
2A+ 2*) 4^ 4A + ^) C_6_g)
1+1+ 1 + 1 + 1 +
for ft(z2) > -1/2, [ABJL, Entry 37].
arctanz = Z2jFi(?, 1; |;-z2) =
2A + z2) 3Z2 4A + z2)
C.6.10)
1+ 1 + 1 + 1 + 1 +
for 9R(z2) > -1/2, [ABJL, Entry 38].
S.I Continued fractions with simple values
z 2z Zz .
0 = -a-z+ C.7.1)
1 — a — z-\-2 — a — z-\-6 — a — z-\
for z ^ 0 if a is a non-negative integer; i. e. a E No, [Perr57, p. 279].
1=i±l ?+2 ?+3 z+4
z +Z + 1 + Z + 2 + Z + 3 + v l
for z ^ 0,-1,-2,..., [Bern89, p. 112]. (See also D.1.5) with z = 1,
a = z + 1 and c = z -{• 1.)
_ z + a (z + aJ - a2 (z + 2aJ - a2 (z + 3aJ - a2
JL — lo.i.ol
a -|- a + a + a -{
572 Appendix. Some continued fraction expansions
for a ^ 0 and z/a ^ 0,-1,-2,..., [Bern89, p. 118]. (See also D.1.8)
with z = —1, a = 0, 6 = {z/a) — 2 and c = z/a.)
ah (a + d)(b+d) (a + 2<*)F + 2d)
a
- a + 6 + 5d \mm)
for d f 0, 6/d ^ 0, -1, -2,... and 3ft((a - b)/d) > 0 or a = b. Tt also
holds for d = 0 if |a| < |6|, [Bern89, p. 119]. (See also D.1.6) with
z = 1, a replaced by (a + d)/2d, b replaced by a/2d and c replaced by
(a + 6 + d)/2d.) For 6 = a replaced by a + 1 and d = 1, C.7.4) can be
transformed into
a-2a| 1 (Q+i) {a + 2) (Q+3) C75)
2a+3 - 2a+ 5 - 2a + 7 \ • • J
for a f 0, -1, -2,..., [Perr57, p. 105].
2 (a4-2)(fe+2)^
az=
6-
for |*| < 1,6 ^ 0,-1,-2,..., [Perr57, p. 290]. (See also D.1.8) with z
replaced by —z, a replaced by 6 — a, and b = c replaced by 6 — 1.)
z-t-a + 1 z -Ha z-\-2a z + 3a
z + 1 z- l + z-\-a- 1 + Z + 2a-
for a ^ 0 and z/a ^ 0,-1,-2,... . If a = 0, then C.7.7) is periodic
and converges to the said value (which now is 1) for \z\ > 1, [Bern89,
p. 115]. (See also D.1.5) with z replaced by I/a, a replaced by z/a 4- 1
and c replaced by z/a.) If we instead let z = 1 and replace a by z — 1,
c by z — 3 in D.1.5) we get
z2 + z\\ = z z\\ z + 2 z + 3 z + 4 7
2 z + 1 ~ z 3 + Z2 + Z 1+ ^ +Z+1+ ^ " "
z2 -z + 1 z-
for z ^ 3,2,1,0,-1,-2,..., [Bern89, p. 118]. For z = l,a= z- 1 and
c = z — 4 in D.1.5) we get
z3 + 2z -HI _ z z + 1 z + 2 z + 3 z + 4
(z- lK + 2(z-l) + l ~ z-4+z-3 + z-2 + z-l+ z +.>.'''
for z ^ 4,3,2,1,0,-1,..., [Bern89, p. 118]. We can continue the pro-
process.
f-log(\/2+l) r1 t2dt I2 32 42 72 82 II2 ,
2 6V ; = / = — — — — — C.7.10)
2v^ Jo 1 + f1 3 + 7+11 + 15+19+23+•>• ;
[BoSh89, p. 56]. (See also D.1.6).)
Hypergeometric functions 573
4 Hypergeometric functions
4-1 General expressions
i)F\ (c'z) z z z ,
c—-—1V ' '— = c + D 1
F^c+ljz) c+l + c+2 + c + 3+ K ' '
for all z e C, c 7^ 0, -1, -2,..., [JoTh80, p. 210].
2F0(a, 6+1; z)
_ az_ F + l)z (a+l)z F + 2J (a
1111
1-1-1-1-1 _...
for I arg( — z)\ < tt, [JoTh80, p. 213]. The even part of this one is
2F[)(a,b'1z)
az (a+l)F+l)z2 (a + 2)F + 2)z
"^ F + 1J - 1 - (a + 6 + 3)z - 1 - (a + 6 + 5)z - 1 — ?- }
for |arg(-2)| < tt.
(c - a)z (a
° c + 1 + c + 2
(c-a+lJ (a+ 2J (c-a + 2J
- c + 3 +c + 4- c + 5 +•
for all 2 G C, c / 0, -1, -2,..., [JoTh80, p. 206].
D.1.4)
iFi(a+l;c+l;z) c (a+1J (a + 2)z (a + 3)z
^r~; ;^ — : ——z z D.1.
" *
574 Appendix. Some continued fraction expansions
for all z e C, c f 0, -1, -2,..., [J0TI18O, p. 278].
2F\(a,6;c; z) a(c — b)z F + l)(c — a + l)z
C c+1 - c + 2
(a + l)(c -6 + 1J F + 2)(c - a + 2)z
c + 3 - c + 4
a + 2)(c-b +
c + 5
D.1.6)
for I arg(l - z)\ < tt, c / 0, -1, -2,..., [J0TI18O, p. 199]. The Norlund
fraction has the form.
0+ 1,6+1; c+1;*)
(a + 2)F + 2)(z-z2) (a + 3)F + 3)(* - *2)
for ft(z) < 1/2, c ^ 0, -1, -2, The Euler fraction has the form
c + F-n4
(c - a + 2)F + 2J; (c-a + 3)F + .,.
c + 2 + F-a + 3)^-c + 3 + F- a +
for \z\ < 1, c^ 0,-1,-2,....
Letting 6 = 0 in D.1.2), D.1.5), D.1.6) or D.1.7) and using A.1.1) we
get continued fraction expansions for 2^o(o> 1; z) and 2^i(a> 1; c + 1; ^).
Similarly, a = 0 in D.1.3) or D.1.4) gives continued fraction expansions
Hypergeometric functions 575
for | jFj A; c + 1; z). A different expansion is
_v ^ L __^ 1± L (a i q\
for \z - 1| < 1 and c ^ -1,-2,-3,..., [Bern89, p. 164. NB! Mistake
in the condition]. From this follows after some computation, [Bern89,
p. 165] that
Cl — 1 O >. O O —
1A -c)
zc z + 1- z-z + 3- c-
2B - c) 3C - c)
z+h-c-z+7-c
D.1.10)
for |argz| < 7r and c ^ —1,-2,-3,..., [Bern89, p. 165] (the second
continued fraction is the even part of the first one).
4-2 Special examples with qFi
The Bessel functions of the first kind and order v are
so that by D.1.1)
z 0Fi(i/ + 2; -*
!?1
-2(i/ + 2)-2A/ + 3)-2(i/ + 4)-
for ^ € C, v ? -1, -2, -3,..., [JoTh80, p. 211], D.1.1).
576 Appendix. Some continued fraction expansions
4-3 Special examples with 2-Fo
The connection (see for instance [Wall48, p. 352, p. 355])
1 rOO e—l^a—\ 1 /.OO g-'^-l
implies that D.1.2) - D.1.3) lead to continued fraction expansions for
ratios of such integrals. In particular, the incomplete gamma function
a, z) satisfies
e-lta-ldt ~ e-zza-l2F0(l - a, 1; -1/*), D.3.2)
[EMOT53, p. 266]. Hence, by D.1.2)
e~zza 1-a 1 2-a 2 3-a 3
T{a,z) =
z -y l -yz-y 1 -t-z-t- i. fzi
D.3.3)
z + I +z+ 1 +z+ 1 + *+•¦•
2B-a) 3C - a)
z—a—
for |argz| < tt, [AbSt65, p. 260, p. 263], [Khov63, p. 144], where the
second continued fraction is the even part of the first one.
This (and the expressions to come) are to be interpreted in the fol-
following way: The integral in D.3.2) is taken for real z. Then F(a, z)
is the analytic continuation of this function to the given domain. The
complementary error function erfc z satisifies
/oo
[EMOT53, p. 266], which means that by D.1.2)
_j / 1 2 4 6 8
y — P <
\ 2z + 2z + 2z + 2z + 2z
> —
. - - J
\>2 3-4 5-6 7-8
for 3ft(z) > 0, [JoTh80, p. 219]. (There is a slightly different notation
in [JoTh80].) Again the second continued fraction is the even part of
Hypergeometric functions 577
the first one. If we integrate this complementary error function we get
similar expressions: Let
= e~z , z'oerfc2 = erfcz , znerfc2 = / in~leTfctdt D.3.6)
Jz
for 7i = 1, 2,3, Then
zn~~lerfc2 7$
i"erfcz 2F0(
= _ + _i±_ 2_±_ ___2 D.3.7)
2z + 2^ + 2^ +••• v J
for 3?(^) > 0 by D.1.2), [JoTh80, p. 219]. For the exponential integral
/OO c-t e-z
—dt 2F0(l,l;-i), D.3.8)
[EMOT53, p. 267], we get by D.1.2) and its even part
v e-* 1 1 2 2 3 3 4
z) = _______
} z +1 + ^ + 1 + 2 + 1 + ^ + !
D.3.9)
for | argz| < 7r, [Khov63, p. 145]. Similarly for the logarithmic integral
r fZ di r-n \ 2 112 2
hz = = Ei(log2:) = - - r
Jq log t log Z — 1 — log Z — 1 — log 2
^ I2 22 32 42
1 — log z — 3 — log z — 5 — log 2 — 7 — log z — 9 — log z '
D.3.10)
The plasma dispersion function is
•OO ^,-f2
/.00 c-r-
dt = 2ie~z erfc(-zz)
J-oo t - z
1-2 3-4 5-6 7-8
- 2z2 - 5 - 2z2 - 9 - 2z2 -13 - 2z2 -17 - 2z2
D.3.11)
578 Appendix. Some continued fraction expansions
for S(z) > 0, [JoTh80, p. 219].
a
a « + l a + 2 a^3
6+6 + 6 + 6 +¦--
for 3?F) > 0, [Perr57, p. 297].
4-4 Special examples with \F\
From [EMOT53, p. 255] it follows that
P fe^-^l-ty—Ut D.4.1)
for 5R(c) > 0, 5R(a) > 0. Hence D.1.4), D.1.5) and D.1.10) lead to con-
continued fraction expansions of ratios of such integrals. The error function
is given by
i-2--*2\ [EMOT53,p. 266]
j2),[JoTh80,p. 282].
D.4.2)
Hence,
r*2 2z2 4z2
erf(z) =
4z2 Sz2 12z2
D.4.3)
1 - 2z2 + 3 - 2z2 + 5 - 2z2 + 7 - 2z2 +. • •
for z G C, [JoTh80, p. 208, p. 282].
The error function is related to Dawson's integral
fZ ee = zerf(-zz), [JoTh80,p. 208] D.4.4)
Hypergeometric functions 579
and to the Fresnel integrals
C(z) = r cos (-tA dt, S{z) = f* sin (-tA dt D.4.5)
by
C{z) + iS(z) = / eil *'2 dt=J-A I e~u du
JO V VK Jo
D.4.6)
The incomplete gamma function
o a
(a + 2)z
a -a+l-l-a + 2- a + 3 +a + 4- a + 5 +
^-' az A + aJ B + a)z C + a
a -l + a + 2-2-|-a + 2-3 + a-|-2-4 + fl + z
D.4.7)
for all z ? C, [JoTh80, p. 209], [Khov63, p. 149-150].
The Coulomb wave function
,p) = pL+le-ipCt A7)^1 A + 1 - 117; 2L + 2; 2z» D.4.8)
where CL(v) = ^ exp(-7T7?/2)|r(L + 1 + if])\l{2L 4-1)! for 7? G R, p > 0
and L ? No satisfies
+ 1)(L2 + r;2I/2 (L + 2)((L + IJ + r;2)
BL + I)(i7 + L(L + l)/p) - BL + 3)(t?
(L + 3)((L + 2J + r;2)
3)/p)
D.4.9)
580 Appendix. Some continued fraction expansions
[J0TI18O, p. 216]. It is well known that
00 (z\k 00 k -z
E u\ In = e~z E 71— = —iFi(i;«+ i;«), D.4.10)
^o fc! (a + *)
[Bern89, p. 166]. This means for instance that
°° (-«)* _rW !(!-«) 2B - a) D4U)
za z-\-1 — a — z + 3 — a— 2 + 5 — a
for I arg;z| < 7r. For a = 1/2 and z replaced by z/2 this gives
V ** _ /?] a/2 1 1-2 3-4 5-6
D.4.12)
for I argz| < tt, [Bern89, p. 166. NB! Misprint], and
2\k
+ + +9-2*2 +13--..
for 5R(^) > 0, [Bem89, p. 166] (even part and equivalence transforma-
transformation).
4-5 Special examples with
[* tPdi z'>+1 /p+1 ,^P+1 q\
p+l (O^ + p + lJ^9 {lqJzq
P+1)V B^
for (p+1)/^ / 0, -1, -2, -3,... and | arg(l + z^)\ < tt, [Khov63, p. 127].
Hypergeometric functions 581
Incomplete beta functions are given by
= I* tp-\\-t)i-ldt= — 2Fl{p,l-q]p+l:x) D.5.2)
Jo P
for p > 0,9 > 0 and 0 < x < 1, [EMOT53, p. 87]. Hence, by D.1.6) and
D.1.8)
px 2
1A - g)a (p
p+1- p+2 - p+3
2B - g)x
p+4- p+5
px
p + 1 + (p + q)x -p + 2 + (p + q + l)a; -
D.5.3)
for p ^ —1, —2, —3,... and | arg(l — x)\ < 7r in the first continued
fraction, \x\ < 1 in the second, [JoTh80, p. 217]. Legendre functions of
the first kind of degree a and order m are given by
+l\m/2 „ / l~z\
r 2F! -a,Q + l;l-m;—-
-1/ \ 2 /
- m)
In [Gaut70, p. 55] it is proved that
+ a)(m-a- 1) (m + 1 -f a)(m - a)
_ 1I/2 B2_ 1I/2
+ l -a)
2_ 1I/2
582 Appendix. Some continued fraction expansions
(to + a)(m — a — l)y/z2 — 1 G71 + 1 + cx){m ~~ a)(z2 ~
27712 — 2G71-f l)z
(to 4- 2 + a)(m + 1 - a)(z2 - 1)
2(ra+
(m + 3 + a)(m + 2 - a)(z2 - 1)
2(to+3)z
D.5.4)
for $l(z) > 0. Legendre functions of the second kind of degree a and
order to are given by
r(a-m+l)
f°°
J0
coshmf
plTHTC
2J2
In [JoTh80, p. 205] it is proved that
(a 4-m4-3J (a 4-m +4)
Ba 4- l)z - Ba 4- 9)z -
a + m + 5J (q + to4-6J
- Ba + 13)*
for a ^ -3/2, -5/2, -7/2,..., and z ? [-1,1].
D.5.6)
4-6 Some simple integrals
Ilypergeometric functions can be written in terms of integrals. This
has already been used to some extent in the preceding subsections, and
we refer to [AbSt65] and [EMOT53] for further details. Here we shall
just list some simple examples without bringing in the hypergeometric
functions themselves.
r°° p~ldt 1 I2 92 ^2
/ 7Tr = 7TT 7Z1 7^ IT7 ; M(z) > °' D-6^
Hypergeometric functions 583
[BoSh89, p. 20].
I
OO p-t/z
dt
+ ty
z nz \z (n-fl)z 2z (n + 2)z 32
1+1+1+ 1 +1+ 1 + 1 +•••
z nz2 2(n+lJ2 3(n + 2)z2
(n + 2J-1 + (n + 4)^-1 + G1 + 6J-
D.6.2)
for |arg2| < 7r, n ^ 0,-1,-2,..., [BoSh89, p. 157]. The second con-
continued fraction is the even part of the first one. For Jacobi's elliptic
functions sn ?, cti t and dn t with modulus k we have
»OO
/•CXJ
Jo G *SIitdt =
1 1 . 92 - Ik2 ^ - 42 .
J. 1 tj Oft/ O t:
+ Z2 - 32A + k2) + 22 - 52A + k2) +
for aU 2 e C, |ib| < 1, [Wall48, p. 374],
/•OO
/ e~tzsn2tdt =
Jo
2 2 ¦ 32 ¦ 4fc2 4 • 52 • 6k2
22A-I
for all 2 G C, |ib| < 1, [Wall48, p. 375],
r00 lz _, 1 I2 22A;2 32 42A;2 52
/ e-^cn*cfe=- — — — D.6.5)
Jo z+ z + z -{- z + z + z -\
for all 2 G C \ {0}, |ib| < 1, [Perr57, p. 220],
yOO i i2l2 o2 q2jl2 a2 c2l2
/ € dntut = — — — D.6.6)
Jo z+ z + z + z + z + z • • >
for all 2 G C \ {0}, \k\ < 1, [Wall48, p. 374], and
°°sntcn* _„ ..
Jo
dnt C "'"
3-42-5A:1
2 • 12B - A;2) + 22 - 2 • 32B - k2) + 22 - 2 ¦ 52B - k2) + 22
D.6.7)
584 Appendix. Some continued fraction expansions
for all z e C, |fc| < 1, [Wall48, p. 375]. For k = 1 we have
snt = tanhi and cnt — dni = sech t, [Lawd89, p. 39]. D.6.8)
This can be used in D.6.3)-D.6.7) to derive new expressions.
I
Jo
)
ra ar rch (a + l)r 2rcb (a + 2)r
— — — D.6.9)
2+1+2+ 1 +2+ 1 +..- V '
for a > 0,6 > 0,c > 0, |arg2| < tt, where r = A - c)/(l - r5), [Wall48,
p. 359].
^ -11^^ D 6 1Q)
7o sinhi
for »(z) > 0, [Wall48, p. 371].
te-tz
4-I2 4-I2 4-2
4-22 4-3
2
D.6.11)
for 3?B) > 0, but 2 not real and < 1, [Perr57, p. 30]. For z = y/b we get
y4tc x l2 t2 22 22 32 32
/ rr*=7 T -T t ^T V T" > Perr57, p. 30 .
o coshf 1+1 + 1 + 1 + 1 + 1 + 1+...' L J
D.6.12)
/-°° . , dt z 1 3 5
exp / e tanni— = —: — r — r
Jo t z- 1 + 2B- l) + 2B-l) + 2(z-l)+...
D.6.13)
for $t(z) < 1, [BoSh89, p. 157], (after a change of variable x = \jz and
an equivalence transformation).
4-7 Gamma function expressions by Ramanujan
Ramanujan produced quite a number of continued fraction expansions of
ratios of gamma functions. These ratios have all proved to be connected
Hypergeometric functions 585
to hypergeometric functions. Let us first introduce the notation.
+ b + c)T(a-b+c),
JJ T(a + eb + ec + d) = r(a + 6 + c + d)T{a -b + c + d)
e
T(a 4- b - c + d)r(a - 6 - c + d)D.7.1)
and so on. Then
l-R p l2-q2 22-p2 32-g2 42 - p2
1 + R z+z + z +
for 3fcBr) > 0, [Bern89, p. 156], where
D.7.2)
7 ^ (
R = TI 4-r-f-TW ' TT -7 _^ ^n( ¦ D.7.3)
From this it follows that
z - q 4- 2k - 1
11 - R f°° cosh(qt)e~tz ,
= hm = / ^V dt
p—> o p 1 4- it yo cosh ?
1 I2 - a2 22 32 - a2 42
= - — — — — for ft(z)> 0,D.7.4)
z -\- z -\- z -\- z -\- z -\-m • •
[Bern89, p. 148], [BoSh89, p. 157] and
22-a2 32 42-a2
, f f°° sinh(at)e-^ 1 a I2
tanh < / *—f dt } = - —
I yo * cosh t I z + 2
z + z + z 4
D.7.5)
for »(z) > 0, [Wall48, p. 372], and
tanh \ — I dt > =
a I*-a2 22-a2 32 - a2 42 - a2
D.7.6)
z+ z + z + z + z 4 v '
586 Appendix. Some continued fraction expansions
for tt(z) > 0, [Wall48, p. 371]. Solving D.7.2) for l/R gives
2-g2 2'-P2 32~g2 42 - p2
2 + ? + z + 2 H
for $l(z) > 0, [Perr57, p. 34]. The values p = q = l/2 lead to
D.7.8)
and thus, for z = 4n or z = 4n — 2 where n G N, we have
Bn)
1-3 3-5
1 / 2-4----BiQ
iTT \l-3----Bn-
o o c r >jr
" —- , [Perr57, p. 34], D.7.9)
l j
2-4 Bra)
, [Perr57, p. 34].
8n -
I — t
b
g+1 y0 " \l + tj v ^g + 1 (a+l)(g + 2) (a + 2)(g+3)
a /•> a_1/l-A6 26 + 26 + 26 +•••
Jo l \T+t) dt
D.7.11)
for *ftF) > 0, [Perr57, p. 299]. From this follows directly that also
[
Jo
dt
1 + tJ 1-t Q+l (a+l)(a + 2) (a+2)(g+3)
26 + 26 + 26 +•
to
D.7.12)
Hypergeometric functions
587
for 3ftF) > 0, [Perr57, p. 300]. A formula of the same character as D.7.2)
is
n
4
8
!2-g2 !2-p2 32-g2 32-p
for | arg(z2 - 1)| < tt; i.e. for $l(z) > 0 with z
For p — 0 (or q — 0) this reduces to
@,1], [Bern89, p. 159]
V
4
8
\l-ql I2
8
1 I2-?2 32 32-g
Z2 _
for 3?B) > 0 with z $ @,1], [Bern89, p. 145]. One can also derive the
formula
\
z+ 2z + 2z + 2z
^ j
for Jft(z) > 0, [Bern89, p. 140]. For q = 0 and 2 = 4ra - 1 or z - 4n + 1
for an n ? N, this reduces to
-3 Bn-l)\2
2-4-:-Bn) )
7T
4n + 1 +
l °
I2
52
[Perr57, p. 36].
588 Appendix. Some continued fraction expansions
A formula closely related to D.7.15) is
00 /. cosh2at\ _tzdt
U(
cosh 2t ) t
D.7.18)
z1 + 1 + z2 H
for $t(z) > 0, [Wall48, p. 371]. The most involved of Ramanujan's
formulas of this type is
R-Q Sabcdh
R + Q 1{25, - E2 - 2 • 0 • IJ - 4@2 + 0 + IJ} +
64(a2 - 12)F2 - I2)(c2 - 12)(^ - 12)(/i2 - I2)
- E2 - 2 -1 • 2J - 4A2 + 1 + 1J}
64(a2 - 22)F2 - 22)(c2 - 22)(cf2 - 22)(/i2 - 22)
5{254 - E2 - 2 ¦ 2 • 3J - 4B2 + 2 + IJ} +
, D.7.19)
where S4 = a4 + b4 + c4 + d4 + h4 + 1, S2 = a2 + *>2 + c2 + d2 + h2 - 1,
and
R =
l\
n
E r
[Bern89, p. 163]. The expansion D.7.19) only holds if the continued
fraction terminates. But as a corollary one can prove
1-R _ 2abc 4(a2 - 12)F2 - I2)(c2 - I2)
1 + R zl - a2 - b2 - c2 + 1 -{- 3(z2 - a2 - b2 - c2 + 5) +
4(a2 - 22)F2 - 22)(c2 - 22) ^ ?^
H
Hypergeometric functions 589
for $l(z) > 0, [Bern89, p. 157], where
l\
D.7.22)
Replacing z by z/c and letting c —> oo in D.7.22) leads to
rrl = D-7-23)
ab (a2 - 12)F2 - I2) (a2 - 22)F2 - 22) (a2 - 32)F2 - 32)
+ { + Iz
for 9RB) > 0, [Bern89, p. 155], where
. D.7.24)
In particular
1 11 11- R
b -^o b 1 + R
a 12A2 - a2) 22B2 - a2) 32C2 - a2)
z+ 3-z + 5z + 72
for fR(z) > 0, [Bern89, p. 149]. Moreover,
S («+*)(* +
1 (a + lJF+lJ (a + 2JF + 2J
(a+l)F+l)+ a + 6 + 3 + a + 6 + 5 +•••"'
for a, 6 ^ -1, -2, -3,..., [Bern89, p. 123]. We also have
l-R ab 22-b2 22-a2 42 - 62 42 - a2
l + i2~22-l-a2+ 1 +22-l+ 1 -fz2-l+...
D.7.27)
for *ftB) > 0, [Bern89, p. 158], where
590 Appendix. Some continued fraction expansions
Dividing D.7.27) by a and letting a —> 0 gives
g I
I
b 22-b2 22 42-62
for $l(z) > 0, [Bern89, p. 150]. Of course, dividing by b and letting 6 —> 0
gives
for fft(z) > 0, [Bern89, p. 151].
^ (-1)* 1 1-2 2-3 3-4
-^ ^r = -
D.7.31)
z + 2k z+ z + z + z +•¦¦ v y
fC— I
for »(z) > 0, [Bern89, p. 151].
^^^ D.7.32)
for »(«) > 0, [Bern89, p. 152].
Jo
e~tzdt =
sinhc?
ab 4 - I2(l2c2 - a2)(l2c2 - 62)
l(z'2 + c2 - a2 - 62) - 3(z2 + 5c2 - a2 - 62) -
4-22B2c2-a2)B2c2-62)
+ l3c2_a2_62) _... ' D-7-33)
where the coefficients for c2 are 2k2-\-2k + \ in the denominators [Wall48,
p. 370]. D.7.33) is valid for | arg(l ± c2)\ < v.
m ^"^^^^ CI T1 f~l ft m ft I I I f* —^ ft I f I J f* ^^— ft I
m w I I I 1 I (Xv J ^W X. IX. Vrf ^^^ (X I ^i I ^j L. IJi I
C I P fit — — — — in. I <4-1
Jo sinhci 2+ 32 + 52 -)
for R(c/z) > 0, [WaU48, p. 370].
/•oo e~tzdt
I e dt = D.7.35)
Jo (coshf + asinh()°
1 1-6A-a2) 2F+l)(l-a2) 3F + 2)(l-a2)
a(b+ 2)+ 2 + aF + 4) + z + aF + 6)
Hypergeometric functions 591
for 9ft(
for9ft(
/¦<
a) > 0, [Wall48, p. 369].
/ F I n h- -
Jo \ z
1 4 • lat 4 •:
/I *i f *-* 1 O \ f %\ 1 O \ f *-»
» > 0, [Wall48, p. 370].
~ 1
[ o, 2 -|- 1) — > . ¦ ¦
1 I3
1
1B^2 + 2z + 1)
26
smh2t\e-tzdt-
2(a + l)F+l)(o+6)
^(a + 6+3J +
I3 2:i 23
I6
-3B22 + 22 + 3)-
06
D.7.36)
5Bz2 + 22 + 7) - 7B^ + 2z + 13) D-7.37)
for 3F?(jz) > —1/2, [Bern89, p. 153]. The second continued fraction in
D.7.37) is the even part of the first one.
i _.. \, — i —' i - i — •- i * 2 Q 0 ~t" ZAC
1 1
|_ 1J _ a2 _ 62}2 _
2a6 2A2 - b2) 2A2 - a2)
for 3?B) > 0, [Bern89, p. 158]. Dividing by 2a and letting a -> 0 in
D.7.38) leads to
592 Appendix. Some continued fraction expansions
2A2 - b2) 2 • I2
4B2 - b2) 4-2
4A2 - 62I4
4B^J' <(*-r> D.7.39)
5B2 _ b2 + 13)-7B2 _ b2 + 25)— • •
for $l(z) > 0, [Bern89, p. 158]. Let
_2 /2 + 1
i ( 2a VI r V^
l\/
2
D.7.40)
Then
u-v _ 2a?_ 4a4 + 1' 4a1 + 21 4a4 + 3'
w + v~ 12+ 32 + 52 + 72 +•-- \ ' ' )
for 9ftB) > 0, [ABJL, Entry 48]. Let
u=fli1+fe)}' "=Hr fe)}- D-7-42)
Then
w - v a3 a6 - I6 a6 - 26
w + v 1B22 + 22 + l) + 3Bz2 + 22 + 3) + 5B22 + 22 +
for ftB) > -1/2, [ABJL, Entry 50]. Let
2/ = ((l + 22I/2-l)/2 and r = a/(l + 22I/'2, D.7.44)
where 9ft (A + 22I/2) > 0. Then
i2-2 o2^2 o2-2
D 7 45)
+ a+ ^ ' "
00 / i\*7|2fc+l 7 i2-2
Basic hypergeometric functions 593
for 9ft(z) > 0, [ABJL, Entry 14],
y
r + 2k 2-f
D.7.46)
for R(z) > 0, [ABJL, Entry 15],
E
a + 6 + 4 + a + 6 + 6 +
for JftB) > 0, [ABJL, Entry 17].
D.7.47)
5 Basic hypergeometric functions
5.1 General expressions
-b)qz
c 1
1 — cq + 1 — c^2 -{- 1 —
- bq2)(cq2 - a)qz A - aq2)(cq2 - b)q2z
1 - eg1 -{- 1 - cq3 +
E.1.1)
for |g| < 1, 2 G C, c ^ 1, g, q~2,..., Thm 10 in Chapter VI, [ABBW85,
p. 14].
= bo + K(a»/0n) E.1.2)
for |g| < l,z G C,c ^ I,?,?,- •- where
an = {l-aqn)(l-bqn)cqn-1(l-zabqn/c)z
bn = l-cgn-(a + 6-
594 Appendix. Some continued fraction expansions
Theorem 11 in Chapter VII.
q(l — c) —* —- = A — c)q + (a — bq)z —
(a — cq)(l — bq)qz (a — cg2)(l — bq'2)qz
a — bq2)z— A — cq2)q + (a —
E.1.3)
for \q\ < l,c^l,q-\q-'\... [ABBW85, p. 18].
If we choose 6 = 1 in E.1.1), E.1.2) or E.1.3) we obtain continued
fraction expansions for 2^1 (ai tfi CQ\ 5?z) or 2^1 (a9i 95 C95 95z)-
Two general results by Andrews
G(a,b,c;q) aq + eg bq + eg2 aq2 + eg3
F)-1+ 1+1+1
for |(f| < 1, [ABJL89, p. 80] where
. + (l + qgz)gz A + ag2;)g2; g g
1 + 6g2z + 1 + 6g32 H
= x
H(aua>\ qz\ q)
for |^| < 1, z € C,a = —\ja\a2 and 6 = —\ja\ — l/a-2, where
<?z \ fqz \
—5 9 (—5 5
'~> V°2 ^
00
9H0A-2)
k=U
[ABJL89, p. 79]
Basic hypergeometric functions 595
5.3 q-expressions by Ramanujan
The formula E.2.1) can also be found in Ramanujan's lost notebook
[Andr79, p. 90]. Quite a number of Ramanujan's expressions are special
cases of E.2.1) and E.2.3). We refer in particular to [ABJL] for more
details. From E.1.1) we find that
(-a; g)ooF; g)oo - (a; g)oo(-fr; g)
(-a; g)oo(&; g)oo + (a; q)oo(-b; q)
oo
CO
bq bq2 , .,
j i0 >0 5a
y a a
1-g /&? b_, . 2.a2\
\a'a' ' /
a — b (a — 60)(ag — 6) (a — 6g2)(ag2 — b)q
l-<?3 + 1-g5
for |g| < 1, [ABBW85, p. 14].
oo
_a6)(^2+ 1)+ (l-a6)(g' + l) +
for \q\ < 1, [ABBW85, Entry 12].
If we let a = 0 in E.2.1) we get
<p{cq)
where
00
E.3.1)
(a _ bq)(b - aq) (a - 6g3)F - aq*)
F(b:a) aq aq2 cur
v ' = 1 + — —=¦ —r ; \q\ < 1, E.3.
[ABBW85, Entry 15], where
oo ak k2
F(b] a) = V 7-7 ^7 r- . E.3.4)
. , W H + eg2 eg3 bq2 + eg4 eg
= 1 H , 10.6.5)
T 1+ 1 + 1 + 1 + 1 +' l ^
596
Appendix. Some continued fraction expansions
for \q\ < 1, [ABJL, Entry 56]. If 6 = — c this reduces to
oo
1+1+ 1 + 1 + 1
for \q\ < 1, [ABBW85, p. 22].
G(z) = _
G{qz)
q2z q6z q3z
for \q\ < 1, [ABJL, Formula 9.1], where
Jb=O
t?
1-1
[ABJL, Entry 10].
3; g4)oc
o 1-1 + g2-1 + g1 - 1 + g6-• •'
[ABJL, Entry 11].
(-<?2;g2)ool ? ?l+i g
g4 + g2
(-g;g2)oo 1 + 1+ 1 +1+ 1 +1+-
[ABJL, Entry 12].
(g;g'2)
E.3.6)
A * ' ^
E.3.10)
E.3.11)
oo
{(g3;gG)oo}3
+ i +¦
, [ABJL89, Thm 7], E.3.12)
(g;g5)cx:(g4;g5)oo _ I g
_ I g g! [ARTLRQ
+ 1+1 +...'tABJL89'
(g;g8)oo(g7;g8)c
(g3;g8)oo(g5;g8)
g + g
8
i +1+
+•
E.3.14)
Basic hypergeometric functions 597
[ABJL89, Thm 6].
(a; q)^
+ fa
a A — a)gz A — q)aqz (l —
1+ I + i + I +
' E-3-15)
1+i+
[WalM8, p. 376].
References
[AbSt65] M. Abramowitz and I. A. Stegun, "Handbook of Mathe-
Mathematical Functions", Dover, New York A965).
[ABBW85] C. Adiga, B. C. Berndt, S. Bhargava and G. N. Watson,
"Chapter 16 of Ramanujan's Second Notebook: Theta-
Functions and q-Series", Mem. of the Amer. Math. Soc,
no. 315, Providence A985).
[Andr79]
[ABJL89]
[ABJL]
G.E. Andrews, An Introduction to Ramanujan's "Lost"
Notebook, Amer. Math. Monthly, 86, A979), 89-108.
G. E. Andrews, B. C. Berndt, L. Jacobsen and R. L.
Lamphcre, Variations on the Rogers-Ramanujan Contin-
Continued Fraction in Ramanujan's Notebooks, "Number The-
Theory, Madras 1987" (K. Alladi ed.) Lecture Notes in Math.
Springer-Verlag, 1395 A989), 73-83.
G. E. Andrews, B. C. Berndt, L. Jacobsen and R. L. Lara-
phere, "The Continued Fractions Found in the Unorganized
Portions of Ramanujan's Notebooks". To appear in Mem-
Memoirs of the Amer. Math. Soc, Providencs R.I.
[Bern89] B.C. Berndt, "Ramanujan's Notebooks, Part II", Springer-
Verlag, New York A989).
[BBLW85] B. C. Berndt, R. L. Lamphere and B. M. Wilson, Chapter
12 of Ramanujan's Second Notebook: Continued Fractions,
Rocky Mountain J. Math. 15 A985), 235-310.
598
References 599
[BoSh89] K. 0. Bowman and L. R. Shenton, "Continued Fractions
in Statistical Applications", Marcel Dekker, Inc., New York
and Basel A989).
[EMOT53] A. Erdelyi, W. Magnus, F. Oberhettinger and F. G. Tri-
comi, "Higher Transcendental Functions", Vol. 1, McGraw-
Hill, New York A953).
[Gaut70] W. Gautschi, Efficient Computation of the Complex Error
Function, SIAM J. Numer. Anal. 7 A970), 187-198.
[JoTh80] W. B. Jones and W. J. Thron, "Continued Fractions:
Analytic Theory and Applications", Addison-Wesley, En-
Encyclopedia of Mathematics and its Applications, Vol.11,
London, Amsterdam, Don Mills, Ontario, Sydney, Tokyo
A980). Now distributed by Cambridge University Press.
[Khov63] A. N. Khovanskii, "The Application of Continued Frac-
Fractions and Their Generalizations to Problems in Approxima-
Approximation Theory", P. Noordhoff, Groningen, The Netherlands
A963).
[Lawd89] D. F. Lawdcn, "Elliptic Functions and Applications",
Springer-Verlag, Applied Mathematical Sciences Vol. 80,
New York A989).
[Perr57] Perron, O., "Die Lehre von den Kettenbriichen", Band IE,
B.G. Teubner, Stuttgart A957).
[Wall48] H. S. Wall, "Analytic Theory of Continued Fractions", Van
Nostrand, New York A948).
Subject Index
a posteriori truncation error
bounds, 63, 114
a priori truncation error
bounds, 63, 116
absolute convergence, 128
acceleration of convergence, 25,
160
adjoint recurrence relation, 197
analytic continuation, 25, 174
approximant, 8
asymptotic expansion, 353
asymptotic side condition, 162
asymptotically equal, 202
attractive fixed point, 103, 151
Auric's theorem, 207
Auric's theorem, modified, 267
— B —
backward recurrence algorithm,
45
basic hypergeometric functions,
319, 593
basic hypergeometric series, 318
Bauer-Muir transform, 76
Bauer-Muir transformation, 76
Bessel equations, 542
Bessel functions Ju{z)^ 317,
477, 539, 575
Bessel polynomials, 477
best rational approximation,
408
Binet's formula, 193
binomial function, 47
BirkhofFs method, 204
Blaschke product, 503
block diagram, 494
block theorem, 381
branch point, 174
branched continued fraction,
277, 285
branches, 495
C-fraction, 252
canonical contraction, 83
canonical denominator, 9
canonical even part, 95
canonical numerator, 9
Carleman criterion, 360
Cartesian oval, 142
Catalan's constant, 563
Cauchy sequence, 247
chain sequence, 363, 463
Charlier polynomials, 337
chordal distance, 43
classical approximant, 39, 71
classical value set, 110
classification of linear fractional
transformations, 101
601
602
Subject Index
complementary error function,
576
conditional convergence set, 108
conditional general convergence
set, 109
confluent hypergeometric
function, 311
conjugate transformation, 103
continued fraction, 7
continued fraction algorithm, 7
continued fraction expansion,
15
continued fraction factoring
algorithm, 428
contraction, 83
convergence criteria, 93
convergence of a continued
fraction, 8
convergence set, 108
convolution, 484
coprime numbers, 400
correspondence, 23, 242
correspondence at z = a, 243
correspondence at z = oo, 244
Coulomb wave function, 579
critical tail sequence, 60
cross ratio, 62
— D —
Dawson's integral, 578
determinant formula, 9, 411
difference equation, 71
differential equation, 13, 521
digital filter, 486
Diophantine equations, 410
directed graph, 495
disk-stability, 518
distribution function, 257, 342
diverge, 94
dominant solution, 201
— E —
element region, 123
ellipse, arc length of, 26
elliptic transformation, 103
elliptic type, 159
empty product, 128
empty sum, 128
equivalence transformation, 72
equivalent continued fractions,
72
error function, 578
Euclidean algorithm, 399
Euler fraction, 307
Euler-Cauchy equation, 528
even part, 84
exponential function, 281, 563
exponential integral, 577
extension, 83, 89
factor base, 423
Favard's theorem, 345, 460
Fermat factorization, 420
FG-algorithm, 284
Fibonacci numbers, 46, 193, 406
field, 200, 243
fixed point, 101
formal power series, 200, 242
forward recurrence algorithm,
45
forward stability, 218
Fourier expansion, 343
Fresnel integrals, 579
functional equation, 82
Subject Index
603
fundamental inequalities, 182
G-continued fraction, 226
gamma function, 199, 221
Gauss fraction, 295
Gauss quadrature formula, 351
general binomial function, 564
general convergence, 43, 66
general convergence set, 109
general divergence, 94
generalized continued fractions,
228, 236
generalized hypergeometric
series, 312
golden ratio, 46, 562
Gragg-Warner bounds, 140
Gram-Schmidt-process, 342
greatest common divisor, 399
— H —
Hamburger moment problem,
359
Hankel determinant, 341
Heine's continued fraction, 320
Henrici-Pfluger truncation error
bounds, 139
Hermite polynomials, 443
hermitian PC-fraction, 282
Hillam-Thron theorem, 119
history of continued fractions,
50
Hurwitz polynomial, 468
hyperbolic functions, 568
hypergeometric functions, 18,
292, 573
hypergeometric series, 292
identity function, 101
incomplete beta functions, 581
incomplete gamma function,
230-231, 576, 579
input, 486
integer part, 402
inverse hyperbolic functions,
569
inverse trigonometric functions,
569
iteration, 101
Jacobi continued fraction, 345,
460
Jacobi's elliptic functions, 583
— K —
^-periodic continued fraction,
104
Khovanskii transform, 89
Kronecker delta, 333
Lagrange interpolation
polynomial, 349
Laguerre polynomials, 443
left vertex, 146
Legendre functions, 581
Legendre polynomials, 334, 442
limit circle case, 120
limit ^-periodic, 150
limit periodic continued
fraction, 20, 150
limit point case, 116
linear differential equation, 523
604
Subject Index
linear fractional transformation,
6, 62, 101
linear independence, 196
linear space, 192
linear system theory, 508
linearly independent, 196
logarithm, 17, 177, 566
logarithmic integral, 577
Lommel polynomials, 477
loxodromic transformation, 103
loxodromic type, 151
— M —
M-fraction, 284
minimal solution, 201
model reduction problem, 508
modified approximant, 20, 25
modified continued fraction, 44
modified regular C-fraction, 357
modified Stieltjes fraction, 357
moment, 338
moment functional, 338
moment problem, 353
moment sequence, 338
monic polynomials, 333
Montessus de Ballore's
theorem, 385
multipoint Pade fraction, 388
multivariate Pade
approximants, 389
— N —
Norlund fraction, 304, 574
nested closed sets, 116
Newton-Pade-approximants,
388
nodes, 349, 495
non-trivial solution, 201
nonrecursive filter, 486
norm, 243
normal Pade table, 379
— O —
odd part, 85
order of correspondence, 242
orthogonal, 332
orthogonal polynomial
sequence, 339
orthonormal polynomials, 343
oscillation property, 99
output, 486
oval sequence theorem, 145
oval theorem, 141
P-fractions, 281, 383
Pade approximants, 27, 375
Pade table, 375
Pade type approximants, 389
parabola sequence theorem, 136
parabola theorem, 130
parabolic transformation, 103
parabolic type, 157
parameter sequence, 463
partial denominators, 7
partial fractions, 28
partial numerators, 7
PC-fraction, 282, 387
Pell's equation, 413
period length, 101, 150
periodic continued fraction, 101
periodic tail sequence, 105
Perron-tails, 210
PfafT's transformation, 307
Subject Index
605
Pincherle's theorem, 202, 235
Pincherle's theorem, modified,
265
plasma dispersion function, 577
Pochhammer symbol, 199, 311
positive elements, 96
positive-definite moment
functional, 342
pre value set, 110
Pringsheim's theorem, 30
— Q —
q-hypergeometric function, 319
q-hypergeometric series, 318
qd-algorithm, 284
quasi-definite moment
functional, 341
— R —
Ramanujan's lost notebook, 595
rational approximation, 11
recursive filter, 486
reference continued fraction, 44
reflection property, 62
regular C-fraction, 250
regular continued fraction, 5,
401
regular ^-fraction, 281
regular singular point, 528
regular two-dimensional
C-fraction, 277
repulsive fixed point, 103, 151
Riccati differential equation,
540
Riemann sphere, 62
Riemaim surface, 174
Riemann zeta function, 561
Riemann-Stieltjes integral, 337
right Perron-tails, 210
right tail sequence, 59, 210
right vertex, 146
Rogers-Ramanujan continued
fraction, 273
RSA-cryptography, 419
S-fractions, 138, 257
Saff-Varga's parabola theorem,
464
Schur algorithm, 501
Schur fraction, 504
Schwarz' lemma, 501
Seidel-Stem theorem, 98
separable differential equation,
541
sequence of convergence sets,
109
sequence of value sets, 110
shock response, 488
similar transformation, 103
sink node, 496
Sleszyriski-Pringsheim set, 108
Sleszyriski-Pringsheim's
theorem, 30
solution space, 191
source node, 496
speed of convergence, 63
stable filter, 489
stable polynomial, 468
Stern-Stolz divergence theorem,
94
Stern-Stolz series, 95
Stieltjes fractions, 138, 453
Stieltjes moment problem, 353
Stieltjes-Vitali theorem, 123
606
Subject Index
strong Stieltjes problem, 363
successive substitutions, 21, 259
Szego polynomial, 388
unit delay, 484
unstable computation, 80
T-fraction, 26, 245, 251
tail, 56
tail sequence, 59, 209
Tchebycheff polynomials, 230,
332, 442
TDCF, 277
terminating continued fraction,
251
terminating regular continued
fraction, 401
Thiele interpolating continued
fraction, 248
Thiele oscillation, 105
three-term recurrence relation,
191
transfer function, 488
transformations of continued
fractions, 69
trigonometric functions, 568
trigonometric moment problem,
388
truncation error bounds, 63
truncation error estimate, 182
twin convergence sets, 109
twin value sets, 111
two-point Pade table, 387
value region, 182
value set, 110
van Vleck sector, 108
van Vleck's theorem, 32
vector inverse, 392
vector space, 192
Viscovatov's algorithm, 263
— w —
weight function, 332
Worpitzky disk, 108
Worpitzky's theorem, 35
— Z —
z-transform, 485
U —
uniform convergence set, 108
uniform general convergence
set, 109
uniqueness, 68