/
Автор: Robert A.M.
Теги: mathematics mathematical physics mathematical analysis
ISBN: 0-387-98669-3
Год: 2000
Текст
Alain M. Robert
A Course
in p-adic
Analysis
Springer
Graduate Texts in Mathematics
198
Editorial Board
S. Axler F.W. Gehring K.A. Ribet
Springer
New York
Berlin
Heidelberg
Barcelona
Hong Kong
London
Milan
Paris
Singapore
Tokyo
Graduate Texts in Mathematics
1 Takeuti/Zaring. Introduction to
Axiomatic Set Theory. 2nd ed.
2 Oxtoby. Measure and Category. 2nd ed.
3 Schaefer. Topological Vector Spaces.
2nd ed.
4 Hiltok/Stammbach. A Course in
Homological Algebra. 2nd ed.
5 Mac Lane. Categories for the Working
Mathematician. 2nd ed.
6 Hughes/Piper. Projective Planes.
7 Serre. A Course in Arithmetic.
8 Takeuti/Zaring. Axiomatic Set Theory.
9 Humphreys. Introduction to Lie Algebras
and Representation Theory.
10 Cohen. A Course in Simple Homotopy
Theory.
11 Conway. Functions of One Complex
Variable 1. 2nd ed.
12 Beals. Advanced Mathematical Analysis.
13 Anderson/Fuller. Rings and Categories
of Modules. 2nd ed.
14 Golubitsky/Guillemin. Stable Mappings
and Their Singularities.
15 Berberlan. Lectures in Functional
Analysis and Operator Theory.
16 Winter. The Structure of Fields.
17 Rosenblatt. Random Processes. 2nd ed.
18 Halmos. Measure Theory.
19 Halmos. A Hilbert Space Problem Book.
2nd ed.
20 Husemoller. Fibre Bundles. 3rd ed.
21 Humphreys. Linear Algebraic Groups.
22 Barnes/Mack. An Algebraic Introduction
to Mathematical Logic.
23 Greub. Linear Algebra. 4th ed.
24 Holmes. Geometric Functional Analysis
and Its Applications.
25 Hewitt/Stromberg. Real and Abstract
Analysis.
26 Maxes. Algebraic Theories.
27 Kelley. General Topology.
28 Zariski/SaMlel. Commutative Algebra.
Vol.1.
29 Zariski/Samuel. Commutative Algebra.
Vol.11.
30 Jacobson. Lectures in Abstract Algebra 1.
Basic Concepts.
31 Jacobson. Lectures in Abstract Algebra 11.
Linear Algebra.
32 Jacobson. Lectures in Abstract Algebra
III. Theory of Fields and Galois Theory.
33 Hirsch. Differential Topology.
34 Spitzer. Principles of Random Walk.
2nd ed.
35 Alexander/Wermer. Several Complex
Variables and Banach Algebras. 3rd ed.
36 Kelley/Namioka et al. Linear Topological
Spaces.
37 Monk. Mathematical Logic.
38 Grauert/Fritzsche. Several Complex
Variables.
39 Arveson. An Invitation to C*-Algebras.
40 Kemeny/Snell/Knapp. Denumerable
Markov Chains. 2nd ed.
41 Apostol. Modular Functions and Dirichlet
Series in Number Theory.
2nd ed.
42 Serre. Linear Representations of Finite
Groups.
43 Gillman/Jerison. Rings of Continuous
Functions.
44 Kendig. Elementary Algebraic Geometry.
45 Loeve. Probability Theory I. 4th ed.
46 Loeve. Probability Theory 11. 4th ed.
47 Moise. Geometric Topology in
Dimensions 2 and 3.
48 Sachs/Wu. General Relativity for
Mathematicians.
49 Gruenberg/Weir. Linear Geometry.
2nd ed.
50 Edwards. Fermafs Last Theorem.
51 Klingenberg. A Course in Differential
Geometry.
52 Hartshorne. Algebraic Geometry.
53 Manin. A Course in Mathematical Logic.
54 Graver/Watkins. Combinatorics with
Emphasis on the Theory of Graphs.
55 Brown/Pearcy. Introduction to Operator
Theory 1: Elements of Functional
Analysis.
56 Massey. Algebraic Topology: An
Introduction.
57 Crowell/Fox. Introduction to Knot
Theory.
58 Koblitz. p-adic Numbers, p-adic Analysis,
and Zeta-Functions. 2nd ed.
59 Lang. Cyclotomic Fields.
60 Arnold. Mathematical Methods in
Classical Mechanics. 2nd ed.
61 Whitehead. Elements of Homotopy
62 Kargapolov/Merlzjakov. Fundamentals
of the Theory of Groups.
63 Bollobas. Graph Theory.
64 Edwards. Fourier Series. Vol. 12nd ed.
65 Wells. Differential Analysis on Complex
Manifolds. 2nd ed.
66 W7aterhouse. Introduction to Affine
Group Schemes.
67 Serre. Local Fields.
68 W'eidmann. Linear Operators in Hilbert
Spaces.
69 Lang. Cyclotomic Fields II.
70 Massey. Singular Homology Theory.
71 Farkas/Kra. Riemann Surfaces. 2nd ed.
(continued after index)
Alain M. Robert
A Course in
p-adic Analysis
With 27 Figures
T
Alain M. Robert
Institut de Mathematiques
Universite de Neuchatel
Rue Emile-Argand 11
Neuchatel CH-2007
Switzerland
Editorial Board
S. Axler
Mathematics Department
San Francisco State
University
San Francisco, CA 94132
USA
Mathematics Subject Classification (2000): 11-01,11E95, llSxx
Library of Congress Cataloging-in-Publication Data
Robert, Alain.
A course in p-adic analysis / Alain M. Robert.
p. cm. — (Graduate texts in mathematics ; 198)
Includes bibliographical references and index.
ISBN 0-387-98669-3 (he.: alk. paper)
1. p-adic analysis. I. Title. II. Series.
QA241,R597 2000
512'.74 — dc21 99-044784
Printed on acid-free paper.
© 2000 Springer-Verlag New York, Inc.
All rights reserved. This work may not be translated or copied in whole or in part without the written
permission of the publisher (Springer-Verlag New York, Inc., 175 Fifth Avenue, New York, NY 10010,
USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connection
with any form of information storage and retrieval, electronic adaptation, computer software, or by
similar or dissimilar methodology now known or hereafter developed is forbidden.
The use of general descriptive names, trade names, trademarks, etc., in this publication, even if the
former are not especially identified, is not to be taken as a sign that such names, as understood by the
Trade Marks and Merchandise Marks Act, may accordingly be used freely by anyone.
Production managed by Timothy Taylor; manufacturing supervised by Erica Bresler.
Typeset by TechBooks, Fairfax, VA.
Printed and bound by R.R. Donnelley and Sons, Harrisonburg, VA.
Printed in the United States of America.
987654321
ISBN 0-387-98669-3 Springer-Verlag New York Berlin Heidelberg SPIN 10698156
I -H
F.W. Gehring
Mathematics Department
East Hall
University of Michigan
Ann Arbor, MI 48109
USA
CO/
K.A. Ribet
Mathematics Department
University of California
at Berkeley
Berkeley, CA 94720-3840
USA
Preface
Kurt Hensel (1861-1941) discovered or invented the p-adic numbers1 around the
end of the nineteenth century. In spite of their being already one hundred years
old, these numbers are still today enveloped in an aura of mystery within the
scientific community. Although they have penetrated several mathematical fields,
number theory, algebraic geometry, algebraic topology, analysis, ..., they have
yet to reveal their full potential in physics, for example. Several books on p-adic
analysis have recently appeared:
F. Q. Gouvea: p-adic Numbers (elementary approach);
A. Escassut: Analytic Elements in p-adic Analysis, (research level)
(see the references at the end of the book), and we hope that this course will
contribute to clearing away the remaining suspicion surrounding them. This book
is a self-contained presentation of basic p-adic analysis with some arithmetical
applications.
* * *
Our guide is the analogy with classical analysis. In spite of what one may think,
these analogies indeed abound. Even if striking differences immediately appear
between the real field and the p-adic fields, a better understanding reveals strong
common features. We try to stress these similarities and insist on calculus with the
p-adics, letting the mean value theorem play an important role. An obvious reason
for links between real/complex analysis and p-adic analysis is the existence of
!The letter p stands for a fixed prime (chosen in the list 2, 3, 5, 7, 11, ...) except when explicitly
stated otherwise.
vi Preface
an absolute value in both contexts.2 But if the absolute value is Archimedean in
real/complex analysis,
if x ^ 0, for any y there is an integer n such that \nx\ > \y\,
it is non-Archimedean in the second context, namely, it satisfies
\nx\ = \x+x + --- + xt\ < |jc|.
n terms
In particular, \n\ < 1 for all integers n. This implies that for any r > 0 the subset
of elements satisfying |jc| < r is an additive subgroup, even a subring if r = 1.
For such an absolute value, there is (except in a trivial case) exactly one prime
p such that |/71 < l.3 Intuitively, this absolute value plays the role of an order
of magnitude. If x has magnitude greater than 1, one cannot reach it from 0 by
taking a finite number of unit steps (one cannot walk or drive to another galaxy!).
Furthermore, \p\ < 1 implies that \pn | —> 0, and the p-adic theory provides a link
between characteristic 0 and characteristic p.
The absolute value makes it possible to study the convergence of formal power
series, thus providing another unifying concept for analysis. This explains the
important role played by formal power series. They appear early and thereafter
repeatedly in this book, and knowing from experience the feelings that they inspire
in our students, I try to approach them cautiously, as if to tame them.
* * *
Here is a short summary of the contents
Chapter I: Construction of the basic p-adic sets Zp, Qp and Sp,
Chapters II and III: Algebra, construction of Cp and £2P,
Chapters IV, V, and VI: Function theory,
Chapter VII: Arithmetic applications.
I have tried to keep these four parts relatively independent and indicate by an
asterisk in the table of contents the sections that may be skipped in a first reading.
I assume that the readers, (advanced) graduate students, theoretical physicists, and
mathematicians, are familiar with calculus, point set topology (especially metric
spaces, normed spaces), and algebra (linear algebra, ring and field theory). The
first five chapters of the book are based solely on these topics.
The first part can be used for an introductory course: Several definitions of the
basic sets of p-adic numbers are given. The reader can choose a favorite approach!
Generalities on topological algebra are also grouped there.
2Both Newton's method for the determination of real roots of / = 0 and Hensel's lemma in the
p-adic context are applications of the existence of fixed points for contracting maps in a complete
metric space.
3Since the prime p is uniquely determined, this absolute value is also denoted by |. \p. However,
since we use it systematically, and hardly ever consider the Archimedean absolute value, we simply
write |. |.
Preface vii
The second—more algebraic—part starts with a basic discussion of ultrametric
spaces (Section II. 1) and ends (Section III.4) with a discussion of fundamental
inequalities and roots of unity (not needed before the study of the logarithm in
Section V.4). In between, the main objective is the construction of a complete and
algebraically closed field Cp, which plays a role similar to the complex field C
of classical analysis. The reader who is willing to take for granted that the p-adic
absolute value has a unique extension |. |# to every finite algebraic extension K
of Qp can skip the rest of Chapter II: If K and K' are two such extensions, the
restrictions of |. \k and \.\K' to K D Kf agree. This proves that there is a unique
extension of the p-adic absolute value of Qp to the algebraic closure Q* of Qp.
Moreover, if o e Aut(AVQp), then x i-> \xg\k is an absolute value extending
the p-adic one, hence this absolute value coincides with \.\k- This shows that
o is isometric. If one is willing to believe that the completion Qap = Cp is also
algebraically closed, most of Chapter III may be skipped as well.
In the third part, functions of a p-adic variable are examined. In Chapter IV,
continuous functions (and, in particular, locally constant ones) / : Zp -> Cp are
systematically studied, and the theory culminates in van Hamme's generalization
of Mahler's theory. Many results concerning functions of a p-adic variable are
extended from similar results concerning polynomials. For this reason, the algebra of
polynomials plays a central role, and we treat the systems of polynomials—umbral
calculus — in a systematic way. Then differentiability is approached (Chapter V):
Strict differentiability plays the main role. This chapter owes much to the
presentation by W.H. Schikhof: Ultrametric Calculus, an Introduction to p-adic Analysis.
In Chapter VI, a previous acquaintance with complex analysis is desirable, since
the purpose is to give the p-adic analogues of the classical theorems linked to the
names of Weierstrass, Liouville, Picard, Hadamard, Mittag-Leffler, among others.
In the last part (Chapter VII), some familiarity with the classical gamma function
will enable the reader to perceive the similarities between the classical and the p-
adic contexts. Here, a means of unifying many arithmetic congruences in a general
theory is supplied. For example, the Wilson congruence is both generalized and
embedded in analytical properties of the p-adic gamma function and in integrality
properties of the Artin-Hasse power series. I explain several applications of p-adic
analysis to arithmetic congruences.
Let me now indicate one point that deserves more justification. The study of metric
spaces has developed around the classical examples of subsets of R72 (we make
pictures on a sheet of paper or on the blackboard, both models of R2 ), and a famous
treatise in differential geometry even starts with "The nicest example of a metric
space is Euclidean n-space R72." This point of view is so widely shared that one
may be led to think that ultrametric spaces are not genuine metric spaces! Thus the
commonly used notation for metric spaces has grown on the paradigmatic model
of subsets of Euclidean spaces. For example, the "closed ball" of radius r and
center a — defined by d(x, a) < r — is often denoted by B(a: r) or Br(a). This
notation comforts the belief that it is the closure of the "open ball" having the same
viii Preface
radius and center. If the specialists have no trouble with the usual terminology and
notation (and may defend it on historical grounds), our students lose no opportunity
to insist on its misleading meaning. In an ultrametric space all balls of positive
radius (whether defined by d(x, a) < r or by d(x, a) < r) are both open and
closed. They are clopen sets. Also note that in an ultrametric space, any point of
a ball is a center of this ball. The systematic appearance of totally disconnected
spaces in the context of fractals also calls for a renewed view of metric spaces. I
propose using a more suggestive notation,
B<r(a) = {x : d(x, a) < r}, BSr(a) = {jc : d(x, a) < r]
which has at least the advantage of clarity. In this way I can keep the notation
A strictly for the closure of a subset A of a topological space X. The algebraic
closure of a field K is denoted by Ka.
* * *
Finally, let me thank all the people who helped me during the preparation of this
book, read preliminary versions, or corrected mistakes. I would like to mention
especially the anonymous referee who noted many mistakes in my first draft,
suggested invaluable improvements and exercises; W.H. Schikhof, who helped
me to correct many inaccuracies; and A. Gertsch Hamadene, who proofread the
whole manuscript. I also received encouragement and help from many friends and
collaborators. Among them, it is a pleasure for me to thank
D. Barsky, G. Christol, B. Diarra, A. Escassut, S. Guillod-Griener,
A. Junod, V. Schiirch, C. Vonlanthen, M. Zuber.
My wife, Ann, also checked my English and removed many errors.
Cross-references are given by number: (II.3.4) refers to Section (3.4) of Chapter
II. Within Chapter II we omit the mention of the chapter, and we simply refer
to (3.4). Within a section, lemmas, propositions, and theorems are individually
numbered only if several of the same type appear. I have not attempted to track
historical priorities and attach names to some results only for convenience. General
assumptions are repeated at the head of chapters (or sections) where they are in
force.
Figures I.2.5a, 1.2.5c, I.2.5d, and 1.2.6 are reproduced here (some with minor
modifications) with written permission from Marcel Dekker. They first appeared in
my contribution to the Proceedings of the 4th International Conference on p-adic
Functional Analysis (listed in the References).
Alain M. Robert
Neuchatel, Switzerland, July 1999
Contents
Preface v
1 p-adic Numbers 1
1. The Ring Zp of p-adic Integers 1
1.1 Definition 1
1.2 Addition of p-adic Integers 2
1.3 The Ring of p-adic Integers 3
1.4 The Order of a p-adic Integer 4
1.5 Reduction mod p 5
1.6 The Ring of p-adic Integers is a Principal Ideal Domain . ... 6
2. The Compact Space Zp 7
2.1 Product Topology on Zp 7
2.2 The Cantor Set 8
2.3 Linear Models of Zp 9
2.4 Free Monoids and Balls of Zp 11
*2.5 Euclidean Models 12
*2.6 An Exotic Example 16
3. Topological Algebra 17
3.1 Topological Groups 17
3.2 Closed Subgroups of Topological Groups 19
3.3 Quotients of Topological Groups 20
3.4 Closed Subgroups of the Additive Real Line 22
3.5 Closed Subgroups of the Additive Group of p-adic Integers . . 23
3.6 Topological Rings 24
3.7 Topological Fields, Valued Fields 25
x Contents
4. Projective Limits 26
4.1 Introduction 26
4.2 Definition 28
4.3 Existence 28
4.4 Projective Limits of Topological Spaces 30
4.5 Projective Limits of Topological Groups 31
4.6 Projective Limits of Topological Rings 32
4.7 Back to the p-adic Integers 33
*4.8 Formal Power Series and p-adic Integers 34
5. The Field Qp of p-adic Numbers 36
5.1 The Fraction Field of Zp 36
5.2 Ultrametric Structure on Qp 37
*5.3 Characterization of Rational Numbers Among
p-adic Ones 39
5.4 Fractional and Integral Parts of p-adic Numbers 40
5.5 Additive Structure of Qp and Zp 43
*5.6 Euclidean Models of Qp 44
6. Hensel's Philosophy 45
6.1 First Principle 45
6.2 Algebraic Preliminaries 46
6.3 Second Principle 46
6.4 The Newtonian Algorithm 47
6.5 First Application: Invertible Elements in Zp 49
6.6 Second Application: Square Roots in Qp 49
6.7 Third Application: nth Roots of Unity in Zp 51
Table: Units, Squares, Roots of Unity 53
*6.8 Fourth Application: Field Automorphisms of Qp 53
Appendix to Chapter I: The p-adic Solenoid 54
*A.l Definition and First Properties 55
*A.2 Torsion of the Solenoid 55
*A.3 Embeddingsof Rand Qp in the Solenoid 56
*A.4 The Solenoid as a Quotient 57
*A.5 Closed Subgroups of the Solenoid 60
*A.6 Topological Properties of the Solenoid 61
Exercises for Chapter I 63
2 Finite Extensions of the Field of p-adic Numbers 69
1. Ultrametric Spaces 69
1.1 Ultrametric Distances 69
Table: Properties of Ultrametric Distances 73
1.2 Ultrametric Principles in Abelian Groups 73
Table: Basic Principles of Ultrametric Analysis 77
L3 Absolute Values on Fields 77
1.4 Ultrametric Fields: The Representation Theorem 79
1.5 General Form of Hensel's Lemma 80
Contents xi
1.6 Characterization of Ultrametric Absolute Values 82
1.7 Equivalent Absolute Values 83
2. Absolute Values on the Field Q 85
*2.1 Ultrametric Absolute Values on Q 85
*2.2 Generalized Absolute Values 86
*2.3 Ultrametric Among Generalized Absolute Values 88
*2.4 Generalized Absolute Values on the Rational Field 88
3. Finite-Dimensional Vector Spaces 90
3.1 Normed Spaces over Qp 90
3.2 Locally Compact Vector Spaces over Qp 93
3.3 Uniqueness of Extension of Absolute Values 94
3.4 Existence of Extension of Absolute Values 95
3.5 Locally Compact Ultrametric Fields 96
4. Structure of p-adic Fields 97
4.1 Degree and Residue Degree 97
4.2 Totally Ramified Extensions 101
4.3 Roots of Unity and Unramified Extensions 104
4.4 Ramification and Roots of Unity 107
*4.5 Example 1: The Field of Gaussian 2-adic Numbers Ill
*4.6 Example 2: The Hexagonal Field of 3-adic Numbers 112
*4.7 Example 3: A Composite of Totally Ramified Extensions ... 114
Appendix to Chapter II: Classification of Locally Compact Fields ... 115
*A.l Haar Measures 115
*A.2 Continuity of the Modulus 116
*A.3 Closed Balls are Compact 116
*A.4 The Modulus is a Strict Homomorphism 118
*A.5 Classification 118
*A.6 Finite-Dimensional Topological Vector Spaces 119
*A.7 Locally Compact Vector Spaces Revisited 121
*A.8 Final Comments on Regularity of Haar Measures 122
Exercises for Chapter II 123
3 Construction of Universal p-adic Fields 127
1. The Algebraic Closure Qap of Qp 127
1.1 Extension of the Absolute Value 127
12 Maximal Unramified Subextension 128
1.3 Ramified Extensions 129
1-4 The Algebraic Closure Q* is not Complete 129
1 -5 Krasner's Lemma 130
*1.6 A Finiteness Result 132
* 1 -7 Structure of Totally and Tamely Ramified Extensions 133
2- Definition of a Universal p-adic Field 134
2.1 More Results on Ultrametric Fields 134
2.2 Construction of a Universal Field £2p 137
2.3 The Field £2p is Algebraically Closed 138
2.4 Spherically Complete Ultrametric Spaces 139
2.5 The Field £2p is Spherically Complete 140
3. The Completion Cp of the Field Qap 140
3.1 Definition of Cp 140
3.2 Finite-Dimensional Vector Spaces over a Complete
Ultrametric Field 141
3.3 The Completion is Algebraically Closed 143
*3.4 The Field Cp is not Spherically Complete 143
*3.5 The Field Cp is Isomorphic to the Complex Field C 144
Table: Notation 145
4. Multiplicative Structure of Cp 146
4.1 Choice of Representatives for the Absolute Value 146
4.2 Roots of Unity 147
4.3 Fundamental Inequalities 148
4.4 Splitting by Roots of Unity of Order Prime top 150
4.5 Divisibility of the Group of Units Congruent to 1 151
Appendix to Chapter III: Filters and Ultrafilters 152
A.1 Definition and First Properties 152
A.2 Ultrafilters 153
A.3 Convergence and Compactness 154
*A.4 Circular Filters 156
Exercises for Chapter III 156
4 Continuous Functions on Zp 160
1. Functions of an Integer Variable 160
1.1 Integer-Valued Functions on the Natural Integers 160
1.2 Integer-Valued Polynomial Functions 163
1.3 Periodic Functions Taking Values in a Field
of Characteristic p 164
1.4 Convolution of Functions of an Integer Variable 166
1.5 Indefinite Sum of Functions of an Integer Variable 167
2. Continuous Functions on Zp 170
2.1 Review of Some Classical Results 170
2.2 Examples of p-adic Continuous Functions on Zp 172
2.3 Mahler Series 172
2.4 The Mahler Theorem 173
2.5 Convolution of Continuous Functions on Zp 175
3. Locally Constant Functions on Zp 178
*3.1 Review of General Properties 178
*3.2 Characteristic Functions of Balls of Zp 179
*3.3 The van der Put Theorem 182
4. Ultrametric Banach Spaces 183
4.1 Direct Sums of Banach Spaces 183
4.2 Normal Bases 186
4.3 Reduction of a Banach Space 189
4.4 A Representation Theorem 190
4.5 The Monna-Fleischer Theorem 190
*4.6 Spaces of Linear Maps 192
*4.7 The p-adic Hahn-Banach Theorem 194
5. Umbral Calculus 195
5.1 Delta Operators 195
5.2 The Basic System of Polynomials of a Delta Operator 197
5.3 Composition Operators 198
5.4 The van Hamme Theorem 201
5.5 The Translation Principle 204
Table: Umbral Calculus 207
6. Generating Functions 207
6.1 Sheffer Sequences 207
6.2 Generating Functions 209
6.3 The Bell Polynomials 211
Exercises for Chapter IV 212
5 Differentiation 217
1. Differentiability 217
1.1 Strict Differentiability 217
*1.2 Granulations 221
1.3 Second-Order Differentiability 222
*1.4 Limited Expansions of the Second Order 224
1.5 Differentiability of Mahler Series 226
1.6 Strict Differentiability of Mahler Series 232
2. Restricted Formal Power Series 233
2.1 A Completion of the Polynomial Algebra 233
2.2 Numerical Evaluation of Products 235
2.3 Equicontinuity of Restricted Formal Power Series 236
2.4 Differentiability of Power Series 238
2.5 Vector-Valued Restricted Series 240
3. The Mean Value Theorem 241
3.1 The p-adic Valuation of a Factorial 241
3.2 First Form of the Theorem 242
3.3 Application to Classical Estimates 245
3.4 Second Form of the Theorem 247
3.5 A Fixed-Point Theorem 248
*3.6 Second-Order Estimates 249
4. The Exponentiel and Logarithm 251
4.1 Convergence of the Defining Series 251
4.2 Properties of the Exponential and Logarithm 252
4.3 Derivative of the Exponential and Logarithm 257
4.4 Continuation of the Exponential 258
4.5 Continuation of the Logarithm 259
5. The Volkenborn Integral 263
5.1 Definition via Riemann Sums 263
5.2 Computation via Mahler Series 265
xiv Contents
5.3 Integrals and Shift 266
5.4 Relation to Bernoulli Numbers 269
5.5 Sums of Powers 272
5.6 Bernoulli Polynomials as an Appell System 275
Exercises for Chapter V 276
6 Analytic Functions and Elements 280
1. Power Series 280
1.1 Formal Power Series 280
1.2 Convergent Power Series 283
1.3 Formal Substitutions 286
1.4 The Growth Modulus 290
1.5 Substitution of Convergent Power Series 294
1.6 The valuation Polygon and its Dual 297
1.7 Laurent Series 303
2. Zeros of Power Series 305
2.1 Finiteness of Zeros on Spheres 305
2.2 Existence of Zeros 307
2.3 Entire Functions 313
2.4 Rolle's Theorem 315
2.5 The Maximum Principle 317
2.6 Extension to Laurent Series 318
3. Rational Functions 321
3.1 Linear Fractional Transformations 321
3.2 Rational Functions 323
3.3 The Growth Modulus for Rational Functions 326
*3.4 Rational Mittag-Leffler Decompositions 330
*3.5 Rational Motzkin Factorizations 333
*3.6 Multiplicative Norms on K(X) 337
4. Analytic Elements 339
*4.1 Enveloping Balls and Infraconnected Sets 339
*4.2 Analytic Elements 342
*4.3 Back to the Tate Algebra 344
*4.4 The Amice-Fresnel Theorem 347
*4.5 The p-adic Mittag-Leffler Theorem 348
*4.6 The Christol-Robba Theorem 350
Table: Analytic Elements 354
*4.7 Analyticity of Mahler Series 354
*4.8 The Motzkin Theorem 357
Exercises for Chapter VI 359
7 Special Functions, Congruences 366
1. The Gamma Function Fp 366
1.1 Definition 367
1.2 Basic Properties 368
Contents xv
1.3 The Gauss Multiplication Formula 371
1.4 The Mahler Expansion 374
1.5 The Power Series Expansion of log rp 375
*1.6 The Kazandzidis Congruences 380
*1.7 About T2 382
2. The Artin-Hasse Exponential 385
2.1 Definition and Basic Properties 386
2.2 Integrality of the Artin-Hasse Exponential 388
2.3 The Dieudonne-Dwork Criterion 391
2.4 The Dwork Exponential 393
*2.5 Gauss Sums 397
*2.6 The Gross-Koblitz Formula 401
3. The Hazewinkel Theorem and Honda Congruences 403
3.1 Additive Version of the Dieudonne-Dwork Quotient 403
3.2 The Hazewinkel Maps 404
3.3 The Hazewinkel Theorem 408
3.4 Applications to Classical Sequences 410
3.5 Applications to Legendre Polynomials 411
3.6 Applications to Appell Systems of Polynomials 412
Exercises for Chapter VII 414
Specific References for the Text 419
Bibliography 423
Tables 425
Basic Principles of Ultrametric Analysis 429
Conventions, Notation, Terminology 431
Index 435
1
p-adic Numbers
The letter p will denote a fixed prime.
The aim of this chapter is the construction of the compact topological ring Zp
of p-adic integers and of its quotient field Qp, the locally compact field of p-adic
numbers. This gives us an opportunity to develop a few concepts in topological
algebra, namely the structures mixing algebra and topology in a coherent way.
Two tools play an essential role from the start:
• the p-adic absolute value |. \p = |. | or its additive version, the p-adic valuation
onlp == vp,
• reduction mod p.
1. The Ring Zp of /?-adic Integers
We start by a down-to-earth definition of p-adic integers: Other equivalent
presentations for them appear below, in (4.7) and (4.8).
1.1. Definition
A p-adic integer is a formal series £]l>0 ai Pl w*tn integral coefficients a, satisfying
0 < fl,- < p - 1.
With this definition, a p-adic integer a = J2i^oaiPl can ^e identified with the
sequence (aj^o of its coefficients, and the set of p-adic integers coincides with
2 1. p-adic Numbers
the Cartesian product
X = XP= f]{0, 1,..., p - 1} = {0, 1,..., p - 1}N.
In particular, if a = £,.>0 a,- p\b = J],>o ^ P* (with a*, &/ e {0, 1,..., p - 1})
we have
a = b <=> at = fy for all i > 0.
The usefulness of the series representation will be revealed when we introduce
algebraic operations on these p-adic integers. Let us already observe that the
expansions in base p of natural integers produce p-adic integers (ending with zero
coefficients: Finite series are special series), and we obtain a canonical embedding
of the set of natural integers N = {0, 1,2,...} into X.
From the definition, we immediately infer that the set of p-adic integers is not
countable. Indeed, if we take any sequence of p-adic integers, say
a = ^Oip\ b = ^btp\ c = ^dp\ ...,
i>0 i>0 *>0
we can define a p-adic integer x = X^>o X*P' by choosing
thus constructing a p-adic integer different from a, b, c, This shows that the
sequence a, b, c,... does not exhaust the set of p-adic integers. A mapping from
the set of natural integers N to the set of p-adic integers is never surjective.
1.2. Addition of p-adic Integers
Let us define the sum of two p-adic integers a and b by the following procedure.
The first component of the sum is oq + bo if this is less than or equal to p — 1, or
ao + bo — p otherwise. In the second case, we add a carry to the component of
p and proceed by addition of the next components. In this way we obtain a series
for the sum that has components in the desired range. More succinctly, we can say
that addition is defined componentwise, using the system of carries to keep them
in the range {0, 1,..., p — 1}.
An example will show how to proceed. Let
a= 1 = l+0p + 0p2 + ---,
b = (p - 1) + (p - 1 )p + (p - l)p2 + - - ■ -
The sum a + b has a first component 0, since 1 + (p — 1) = p. But we have to
remember that a carry has to be taken into account for the next component. Hence
this next component is also 0, and another carry has to be accounted for in the
next place, etc. Eventually, we find that all components vanish, and the result is
1. The Ring 7*p of p-adic Integers 3
1 _f b = 0, namely b is an additive inverse of the integer a = 1 (in the set of p-adic
integers), and for this reason written b — — 1. More generally, if
*>o
we define
b = cr(a) = £](p - 1 - a{)pl
so that a + b +\ = 0. This is best summarized by a + cr(a) 4- 1 = 0 or even
cr(a) + 1 — —a. In particular, all natural integers have an additive inverse in the
set of p-adic integers. It is now obvious that the set X of p-adic integers with the
precedingly defined addition is an abelian group. The embedding of the monoid
N in X extends to an injective homomorphism Z -> X. Negative integers have
the form —m — 1 = o(m) with all but finitely many components equal to p — 1.
Considering that the rational integers are p-adic integers, from now on we shall
denote by Zp the group of p-adic integers. (Another natural reason for this notation
will appear in (3.6).) The mapping a : Zp -> Zp obviously satisfies
id and is therefore an involution on the set of p-adic integers. When p is odd, this
involution has a fixed point, namely the element a = J2oo ^T~Pl e ^p-
13. The Ring of p-adic Integers
Let us define the product of two p-adic integers by multiplying their expansions
componentwise, using the system of carries to keep these components in the desired
range {0, l,...,p- 1}.
This multiplication is defined in such a way that it extends the usual
multiplication of natural integers (written in base p). The usual algorithm is simply pursued
indefinitely. Again, a couple of examples will explain the procedure. We have
found that -1 = £(p - l)pf. Now we write
_l=(/7_l).£y, -(p-l)J>' = 1'
Hence 1 — p is invertible in Zp with inverse given as a formal geometric series of
ratio p. Since
P - Ylaipi = ^P + ai?2 + '" * ^ l + °P + °P2 + ' *''
i>0
the prime p is not invertible in Zp for multiplication. Using multiplication, we can
also write the additive inverse of a natural number in the form
_w = (_l).m = ^(p_ l)p' -^Tmip1,
4 1. p-adic Numbers
but it is not so easy to deduce the coefficients of — m from this relation. Together
with addition and multiplication, Zp is a commutative ring. When p is odd, the
fixed element under the involution o is
2 P 2 ^P 2 1 — d 2'
but 2 is not an invertible element of Z2, — \ & Z2, and the involution cr = 02 has
no fixed point in Z2.
7.4. 77ze Order of a p-adic Integer
Let a = X^o at Pl be a p-cidic integer If a ^ 0, there is a first index v = v(a) > 0
such that ciy ^ 0. 77h.s zrufex is the p-adic order v = v(a) = ordp(a), And we get
a map
v = ordp : Zp - {0} -* N.
This terminology comes from a formal analogy between the ring of p-adic integers
and the ring of holomorphic functions of a complex variable z e C. If/is a nonzero
holomorphic function in a neighborhood of a point a e C, we can write its Taylor
series near this point
f(z) = Yla»(z -a)"' (a™ * a |z-a| < £)-
The index m of the first nonzero coefficient is by definition the order (of vanishing)
of / at a: this order is 0 if f(a) ^ 0 and is positive if / vanishes at a.
Proposition. The ring 7Lp of p-adic integers is an integral domain.
Proof. The commutative ring Zp is not {0}, and we have to show that it has no
zero divisor. Let therefore a = J2i>oatPl ^ ^, b = J2i>o^iPl ^ 0> anc* define
v = v(a), w = v(b). Then av is thelirst nonzero coefficient of a, 0 < av < p, and
similarly Z?^ is the first nonzero coefficient of b. In particular, p divides neither av
nor bw and consequently does not divide their product avbw either. By definition
of multiplication, the first nonzero coefficient of the product ab is the coefficient
cv+w of pv+w, and this coefficient is defined by
0 < cv+w < p, cv+w = avbw (mod p). ■
Corollary of proof. The order v : Zp — {0} -> N satisfies
v(ab) = v(a) + v(b),
v(a + b)> min(f(a), v(b))
if a, b, and a+b are not zero.
■
1. The Ring Zp of p-adic Integers 5
It is convenient to extend the definition of the order by v(0) = oc so that
the preceding relations are satisfied without restriction on Zp, with the natural
conventions concerning the symbol oc. The p-adic order is then a mapping Zp ->
N U {oc} having the two above-listed properties.
1.5. Reduction mod p
Let Fp = Z/pZ be the finite field with p elements. The mapping
a = //\aiPl ^ flo mod p
defines a ring homomorphism s : Zp —> Fp called reduction mod p. This reduction
homomorphism is obviously surjective, with kernel
{aeZp:a0 = 0} = {E^i^-p1 = pE^ofly+ip'] = P%p-
Since the quotient is a field, the kernel pZp of e is a maximal ideal of the ring
Zp. A comment about the notation used here has to be made in order to avoid a
paradoxical view of the situation: Far from being p times bigger than Zp, the set
pZp is a subgroup of index p in Zp (just as pZ is a subgroup of index p in Z).
Proposition. The group Z* of invertible elements in the ring Zp consists of the
p-adic integers of order zero, namely
Z* = £>p': «o 5* 0}.
Proof. If a /7-adic integer a is invertible, so must be its reduction e{a) in Fp. This
proves the inclusion Z£ c {J2i>oaiPl : ao ^ 0}. Conversely, we have to show
that any p-adic integer a of order v(a) = 0 is invertible. In this case the reduction
£(a) e Fp is not zero, and hence is invertible in this field. Choose 0 < bo < p
with aob0 ~ 1 mod p and write aob0 = 1 + kp. Hence, if we write a =aG-\- pa,
then
a-bo= l+kp+ pabo = 1+ p*
for some p-adic integer k. It suffices to show that the p-adic integer 1 + Kp is
invertible, since we can then write
a ■ fco(l + KP)~l = 1, a"1 = fco(l + kP)~\
In other words, it is enough to treat the case ao = 1, a = 1 + Kp. Let us observe
that we can take
(1 + KP)~l = 1 - Kp + (^P)2 = 1 + Cip + C2p2 + - - •
with integers c, e {0, 1,..., p - 1}. This possibility is assured if we apply the
rules for carries suitably. Such a procedure is cumbersome to detail any further, and
6 1. p-adic Numbers
another, equivalent, definition of the ring Zp will be given in (4.7) below, making
such verifications easier to handle. ■
Corollary 1. The ring Zp of p-adic integers has a unique maximal ideal, namely
pZp —Zp ~Z*. ■
The statement of the preceding corollary corresponds to a partition Zp = Z£ U
pZp (a disjoint union). In fact, one has a partition
Zp - {0} = \\ P*Zp (disjoint union of p*Z* = v~l(k)).
Corollary 2. Every nonzero p-adic integer a e Zp has a canonical
representation a = pvu, where v = v(a) is the p-adic order of a andu e Z* is a p-adic
unit. ■
Corollary 3. The rational integers a e Z that are invertible in the ring Zp are
the integers prime to p. The quotients of integers m/n e Q (n ^ 0) that are
p~adic integers are those that have a denominator n prime to p. ■
1.6. The Ring of p-adic Integers is a Principal Ideal Domain
The principal ideals of the ring Zp,
(/) = pkZp = {xeZp: ordp(x) > /:},
have an intersection equal to {0}:
Zp D pZp D ... D pkZp D ■■■ D f]pkZp = {0}.
Indeed, any element a J= 0 has an order v(a) = k, hence a g (pk+1). In fact, these
principal ideals are the only nonzero ideals of the ring of p-adic integers.
Proposition. The ring Zp is a principal ideal domain. More precisely, its ideals
are the principal ideals {0} and pkZp {k 6 N).
Proof. Let / ^ {0} be a nonzero ideal of Zp and 0 ^ a e / an element of minimal
order, say k = v(a) < oo. Write a = pku with a p-adic unit u. Hence pk —
u~la el and (pk) = pkZp c /- Conversely, for any b e / let w = v(b) > A and
write
This shows that / C pkZp.
■
2. The Compact Space Zp 7
2. The Compact Space Zp
2.1. Product Topology on Zp
The Cartesian product spaces
Xp = f](0, 1, 2, - - -, p - 1} = {0, 1. 2,..., p - 1 }N
i">0
will now be considered as topological spaces, with respect to the product topology
of the finite discrete sets {0, 1,2,..., p — 1}. These basic spaces will be studied
presently, and we shall give natural models for them (they are homeomorphic for
all p). By the Tychonoff theorem, Xp is compact. It is also totally disconnected:
The connected components are points.
Let us recall that the discrete topology can be defined by a metric
fl ifa^b,
*(a,b)= \ ^
I 0 if a = b,
or, using the Kronecker symbol, 8(a, b) = 1 — 8ab. Several metrics compatible
with the product topology on Xp can be deduced from these discrete ones. For
x = (a0, ax,...), y = (Z?0, bu ...) e Xp, we can define
d(x, y) = sup :— = ,
,->o Pl Pv(x~y)
d (x, y) = > nrj-> an(i so on-
Although all metrics on a compact metrizable space are uniformly equivalent, they
are not all equally interesting! For example, we favor metrics that give a faithful
image of the coset structure of Zp: For each integer k e N, all cosets of pkZp in
Zp should be isometric (and in particular have the same diameter).
The padic metric is the first mentioned above. Unless specified otherwise, we
use it and introduce the notation
ti \d(Xj0) = p-v ifx^0(v^ ovdp(x))9
\x = 1
[0 ifx = 0
(absolute values will be studied systematically in Chapter H). We recover the
P-adic metric from this absolute value by d(x, y) = \x — y\. With this metric,
multiplication by p in Zp is a contracting map
d(px,py)= jd(x,y)
and hence is continuous.
8 1. p-adic Numbers
22. The Cantor Set
In point set topology the Cantor set plays an important role. Let us recall its
construction. From the unit interval Co = / = [0,1] one deletes the open middle
third. There remains a compact set
Ci=[0,f]U[f, 1].
Deleting again the open middle third of each of the remaining intervals, we obtain
a smaller compact set
C2-[0,|JU[|,|]U[|, 2]U[|, 1].
Iterating the process, we get a decreasing sequence of nested compact subsets of
the unit interval. By definition, the Cantor set C is the intersection of all Cn.
remove
[1/3 2/3"
The Cantor set
It is a nonempty compact subset of the unit interval / = [0,1]. The Cantor
diagonal process (see 1.1) also shows that this compact set is not countable. If we
temporarily adopt a system of numeration in base 3 —hence with digits 0,1, and
2 — the removal of the first middle third amounts to deleting numbers having first
digit equal to 1 (keeping first digits 0 and 2). Removing the second, smaller, middle
intervals amounts to removing numbers having second digit equal to 1, and so on.
Finally, we see that the Cantor set C consists precisely of the numbers 0 < a < 1
that admit an expansion in base 3:
0.«ia2 - - ■ = — + ^ + """
with digits at = 0 or 2. We obtain these expansions by doubling the elements of
arbitrary binary sequences. This leads to considering the bijection
+■■?:«*»i:w' z^c-
*>o />o
The definition of the product topology shows that this mapping is continuous, and
hence is a homeomorphism, since the spaces in question are compact.
2. The Compact Space Zp 9
Binary sequences can also be considered as representing expansions in base 2
of elements in the unit interval. This leads to a surjective mapping
*">o
i>0
This map is surjective and continuous but is not injective: The numbers J2i>j 2'
and 2j e Z2 have the same image in [0, 1], as is immediately seen (in the decimal
system, a decimal expansion having only 9's after place j can be replaced by a
decimal expansion with a single 1 in place j). In fact, Card <p~l(t) < 2 for any
t e [0,1].
We can summarize the situation by a commutative diagram of maps
f-.Ts!^ C C [0, 1]
II i / g
<p : Z2 -> [0, 1]
The function g identifies contiguous extremities of the Cantor set C and sends
them onto points of the interval having two binary expansions (rational numbers
of the form a/2j). These constructions will now be generalized.
0 1/9 2/9 1/3
WW
\\ \ 1
2/3
1
/ //
/ //
\
< *
u
0 1/8 1/4 1/2 3/4 1
Gluing the extremities of the Cantor set
2.3. Linear Models ofZp
We choose a real number b > 1 and use it as numeration base in the unit interval
[0, 1]. In other words, we try to write real numbers in this interval in the form
ao/b + ai/b2 -\ with integral digits 0 < a, < b. More precisely, fix the prime
p and consider the maps f = fb (= fb,p) : Zp -> [0, 1] defined by the infinite
series in R
* IE-')-»■££;■
W>0
/>0
with a normalizing constant # chosen so that the maximum of \J/ is 1. Since
this maximum is attained when all digits 0, are maximal, it is attained at
10 1. p-adic Numbers
— 1 = J2t>o(P — l)pl 6 Zp, and its image must be 1:
Ep - 1 b~x /7—1
. A bl+l F }\-b~x b-\
namely
b~\
■d =
1
For p = 2 and Z? = 3 we find that # = 2, and we recover the special case studied
in the preceding section, where \J/ furnished a homeomorphism Z2 -> C C [0, 1].
In general, \J/ ~ \J/b will be injective if the p-adic integers
£(p-l)p'andp'eZp
have distinct images in [0, 1]. The first image is
# - (P - !)£ l/*f"+1 = #(/> - 1)^"V(1 - &"1)
= $b-]-\p - 1)/(Z? - 1) = b~j~l.
The second image is # - Z?~i_1. The injectivity condition is thus & > 1, or b > p.
Let us summarize.
Theorem. The maps \J/b (= fa,p) : Zp —> [0, 1] defined for b > 1 by
\i>0 / f x i>0 "
are continuous. When b > p, fa is injective and defines a homeomorphism of
Zp onto its image fa(Zp). When b = p, we get a surjective map tJsp which is
not injective. ■
The commutative diagram given in the last section generalizes immediately to
our present context.
Comment. When b > p, fa gives a linear model of Zp in the interval [0, 1]; the
image is a fractal subset A of this interval. The self-similarity dimension d of such
a set is "defined" by means of a dilatation producing a union of copies of translates
of A. If we denote by E(A) an intuitive — not formally defined — notion ofextent
of A and if kA is a union of m translates of A, this self-similarity dimension d
satisfies
mE(A) = E(kA) = kdE(A),
2. The Compact Space Zp 11
and hence d - log X = logm and d = log m/ log A. In our case, take A = /> so
that m = /7 and the self-similarity dimension of A = \lrb(Zp) in [0, 1] C R is
log pi log Z? < 1. In this way we obtain a continuous family of fractal models of
increasing dimension for b \ p degenerating in the limit to a connected interval.
It may be useful to look at symmetric models obtained by replacing the digits
dj e {0,1,2,..., p — 1} by symmetric ones in {—^—-,..., ^^-}. Define
v(k) = k- ^— (0<k<p-l).
We can choose the normalization constant & of the map
in order to have
min^-' = —1. max^-' = +1.
(When p = 2, v(£) = (—1)*+1| = ±|, and the corresponding expansion has
fractional digits.) The involution o induces a change of sign in the image. When
p ^ 2 it has the origin as fixed point. Here is a picture of centered linear models
ofZ3 whenZ? \ 3.
• ii ■ i i
■ ■■ ■! •
• ■! ii i
i y i 1 i !
• ii ii i
i ii ii i
• ii it i
i i i . i i y
-1 -1/2 -1/4 0 1/4 1/2 1
A centered linear model of Z3
2.4. Free Monoids and Balls ofZp
Let B<r(a) denote the ball defined by d(x, a) = \x — a\ < r in Zp. It is clear that
this ball does not change if we replace its radius r by the smallest power p~n that is
greater than or equal tor. If the p-adic expansion of a is ao+aip-i [-anpn-i =
sn + pn+xa, the ball does not change either if we replace its center by sn. This ball
is fully determined by the sequence of digits (of variable length giving the radius)
ao> &i, >,an, and we associate to it the word
ao«i •cin e Mp
m the free monoid generated by S = {0, 1, ..., p - 1}.
Conversely, to each (finite) word in the elements of 5 — say a$a\ -an — we
associate the ball of center a = a0 + axp + - - ■ + anpn and radius r = p~n. We
get m this way a bijective map between Mp and the set of balls of Zp\ Observe
that a ball BSr(a) defined by d(x, a) < r is the same as a ball B<r(a) for some
r' >r.
12 1. p-adic Numbers
The monoid Mp has several matrix representations
Mp -> Gl„(Zp).
For example, when n = 2, we can take
-U
s,-*T' = \n .) (seS={0,l,...,p-l}).
Indeed,
TV
and more generally,
™-(s t)(s t)-(» "**)•
Ia0Ial " a« ~ I 0 1 /
Observe that in this representation the length of a word corresponds to the order of
the determinant of the matrix. In terms of balls, the radius appears as the absolute
value of the determinant, whereas a center of the ball is read in the upper right-hand
corner of the matrix. With the preceding notation
B^r(a) = B<r(sn) ^¥,-.fl„(E^)<-> ^0+1 sj\ .
Euclidean models of the ring of p-adic integers will be obtained in the next section
by means of injective representations
Mp -> G1„(R).
Since Mp is free, such representations are completely determined by the images
of the generators, namely by p matrices Mo,..., Mp^\.
2.5. Euclidean Models
Let V be a Euclidean space, namely a finite-dimensional inner product space over
the field R of real numbers. Select an injective map
v : S = {0, 1, 2,..., p - 1} -* V, v(S) = E C V,
and define the vector mappings (using vector digits)
v(a()
;>o i>o
Since Zp — LL0<E5(<3o + pZ*P), we have
;>o i>o °
2. The Compact Space Zp 13
For large enough values of b, the image F = Fv%b = ^fv^Lp will also be a disjoint
union of self-similar images. In this way we get a construction of spatial models
4>(ZP) by iteration (similar to the construction of the Cantor set as an intersection
of compact sets). ^
More explicitfy, let us denote by E the convex hull of E in V. As is known,
this is the intersection of all half spaces containing E. It is also the intersection
of those half spaces containing £ and having for boundary a hyperplane touching
the configuration. Let X be an affine linear functional on V such that
A < 1 on E, X(v) = 1 for some v e E.
Choose & = b — 1. Then
\ i>0 / i>0 U
so that the image F of ^ is also contained in the convex hull of E: F C E = Kq.
Moreover, by choice of the constant #,
From the self-similarity representation of F we get a better approximation
Iterating this inclusion in the self-similarity representation of F we get an even
better approximation:
Eventually, this leads to a representation of the fractal F as the intersection of
a decreasing sequence of compact sets Kn. Several pictures will illustrate this
construction.
(2.5.1) Take, for example, p = 3, V = R3 with canonical basis e0, ei, e2, and
v(k) = ejt. Then the corresponding vector maps Vt> : Z3 —> R3 are given by
;>0 "
Let us choose the constant # such that
*<°> = »£^r = *.
namely & £.^o \/bi+l = &/(b - 1) = 1. In this case, the image of * is contained
ln tne P^ne x + y + z = 1. Since the components of the images VKfl) are positive,
e lrtlage of the map *I> is contained in the unit simplex of R3 (convex span of the
14 I. p-adic Numbers
basic vectors). More precisely, the mappings Vt> are injective for b > 2, and hence
give homeomorphic images — models — of Z3 in this simplex. When b = 2, the
image is a Sierpinski gasket — hence connected — in this simplex. In general, the
image is a fractal having self-similarity dimension log 3/ log b.
Models of Z3: Sierpifisky gasket
(2.5.2) Take now p = 5, V = R2, and the map v defined by v(0) = (0,0), v(l) =
(1,0), v(2) = (0,1), v(3) = (-1,0), v(4) = (0, -1). With a suitably chosen
normalization constant &, the components of an image V(a) — (jc, y) will satisfy
— 1 < jc + v < 1 and — 1 < x — y < 1. The image of V^ is a union of the similar
subsets *(* + 5Zs) (0 < it < 4). Observe that *(5Z5) = b~x^{Z5) and that these
subsets are disjoint when Z? > 3. In this case, the image is a fractal of self-similarity
dimension log 5/ log b. In the limit case b = 3 the image is connected.
,f*4*$
«|» «|* «g» 4$* *|» «|» «|» A «g»
•JilTiiJi
TTT
****
TTT
**+*
Model of Z5 as planar fractal
2. The Compact Space Zp 15
(2.5.3) It is interesting to refine the preceding construction by addition of an
extra component. Take p = 5 as before but V = R3 with v' of the form
v'W = (#),WeR3,
ho = 0, h\ = hi = —hi = — h$ = ft > 0.
The corresponding vector maps Vt> have images in a tetrahedron bounded by an
upper edge parallel to the jc-axis and a lower edge parallel to the y-axis (hence
two horizontal edges: Choosing h suitably, we get a regular tetrahedron). These
edges give linear models of Z2, and the vertical projection on the horizontal plane
(obtained by omitting the third component) is the previous construction. But now,
the vector maps Vt> are already injective for b > 2, and in the limit case b = 2 the
image is a well-known connected fractal, parametrized by Z5. As in (2.2), these
vector mappings furnish commutative diagrams
*i
b ■
<l> = *•
2 •
z5
**(Zs) ~>
if S Z
<KZ5)
Model of Z5 as space fractal
(2.5.4) Take p = 7, 1; : {0, 1, 2,..., 6} -> R3 given by v(0) = 0 and
v(l) = (1,0,-1) i/(2) = (0,1,-1) i/(3) = (-1,1,0)
f(4) = (-1,0,1) i;(5) = (0,-1,1) v(6) = (1,-1,0).
With a suitable normalization constant, all the image points will remain in the cube
-1<jc<1, -\<y<l, - 1 < z < 1-
Ane components of an image also satisfy x + y + z = 0, and hence are
situated in this plane, intersecting the cube in a regular hexagon. For b > 3 we get
16 1. p-adic Numbers
interesting models of Z7 in this hexagon. In the limit case b = 3, a connected
fractal parametrized by Z7 appears.
(2.5.5) We can give a 3-dimensional model refining the preceding one. Still
with p = 7, take the canonical basis e^ e2, e3 of R3 and consider the vector map
corresponding to the choice v(0) = 0 and
i/(l) = e, i/(2) = e2 i/(3) = e3
v(4) = -ei v(5) = -e2 v(6) = -e3.
The image of the corresponding vector map Vt> : Z7 -> R3 is a fractal model
contained in the octahedron
W + M + kl<i
(provided that we choose a correct normalization constant #). A suitable projection
of this model on a plane brings us back to the preceding planar example (contained
in a hexagon).
The preceding constructions are similar to the IFS (iterated function systems)
used for representing fractals: They stem from affine Euclidean representations of
the monoid of balls of Zp. In fact, in this section only translations and dilatations
are used (rotations will also occur in IL4.5 and II.4.6).
Models of Z7
2.6. An Exotic Example
There is an interesting example connecting different primes. We can add formally
(i.e., componentwise) two 2-adic numbers and consider this sum in Z3. We thus
obtain a continuous map
3. Topological Algebra 17
We can make a commutative diagram
Z2XZ2 ^> Z3
i i
CxC -±» C + C
n n
F0,1]2 -±* [0,2].
Recall that the left vertical map is given by
CCrf.Ei,*)-(££.££)
and hence the diagonal composite is
G><,i>2')~2i;^-
Consequently, this composite has an image equal to the whole interval [0,2].
Hence addition C x C —► [0, 2] is also surjective. A good way of viewing the
situation is to make a picture of the subset C x C in the unit square of R2 and
consider addition (x, y) h» (x + y, 0) as a projection on the x-axis. The image of
the totally disconnected set C x C is the whole interval [0, 2].
1
1
0
i
■ ■
■ ■
IBIS
IBIS
■ ■-.
■ ■ '•.
■ p
■ ■-
•-pp \
iuzi\
■ ■-
■ ■
"■-. ■■-
'-■■
\ ■■
'*■■
■ ■-
""PP "-.
■■".
'■mm "-.
■ ■•
•-■*■■
1
A
'A
A
JA
2
^
A projection of C x C
3. Topological Algebra
3.1. Topological Groups
Definition. A topological group is a group G equipped with a topology such
that the map (x, y) i-» xy_1 : G x G -> G is continuous.
18 1. p-adic Numbers
If G is a topological group, the inverse map x h» x~l is continuous (fix x —e
in the continuous map (x, y) h» xy~J) and hence a homeomorphism of order 2
of G. The translations jcbajc (resp. x h» xa) are also homeomorphisms (e.g.,
the inverse of x h» ax is x »->- a"1*). A subgroup of a topological group is a
topological group for the induced topology.
Examples. (1) With addition, Zp is a topological group. We have indeed
c'6fl + pnZp, b' eb + pnZp =*a'-b'ea-b+ pnZp
for all n > 0. In other words, using the p-adic metric (2.1), we have
\x ~a\ < |p"| = p~\ \y-b\< \pn\ = p"" => |(x - y) - (fl - fc)| < p~\
proving the continuity of the map (x, y) h» x — y at any point (a, Z?).
(2) With respect to multiplication, Z* is a topological group. There is a
fundamental system of neighborhoods of its neutral element 1 consisting of subgroups:
1 + pZp D 1 + p2Zp D-Ol+ p"Zp D - - -
consists of subgroups: If a, /? e Zp, we see that (1 + pnP)~} = 1 + p72/?' for some
/?' e Zp (as in (1.5)), and hence
£i=l + Pn<x, b=l + pnp=> ab~l = (1 + p"oO(l + pnp) =l+pny
for some y eZp. Consequently,
a' e a(l + p"Zp), V e b(l + p"Zp) ==> a'b"1 e ab~\\ + pnZp) (n > 1),
and (x, y) »->> x_y_1 is continuous. As seen in (1.5), 1 + pZp is a subgroup of index
p — 1 in Z*. It is also open by definition (2.1). With respect to multiplication, all
subgroups 1 + pnZp (n > 1) are topological groups.
(3) The real line R is an additive topological group.
If a topological group has one compact neighborhood of one point, then it is a
locally compact space. If a topological group is metrizable, then it is a Hausdorff
space and has a countable fundamental system of neighborhoods of the
neutral element. Conversely, one can show that these conditions are sufficient for
metrizability.l
Let G be a metrizable topological group. Then there exists a metric d on G that
defines the topology of G and is invariant under left translations:
d(gx,gy) = d(x,y).
Specific references for the text are listed at the end of the book.
3. Topological Algebra 19
A metrizable group G can always be completed, namely, there exists a
complete group G and a homomorphism j : G ~+ G such that
• the image j(G) is dense in G,
• j is a homeomorphism G -> j(G),
• any continuous homomorphism f :G -> G' into a complete group G' can be
uniquely factorized as f = g o j : G ~> G -» Gf with a continuous
homomorphism g :G ~+ G\
3.2. Closed Subgroups of Topological Groups
As already observed, a subgroup of a topological group is automatically a
topological group for the induced topology.
Lemma. Let G be a topological group, H a subgroup of G.
(a) The closure H of H is a subgroup of G.
(b) G is Hausdorff precisely when its neutral element is closed.
Proof, (a) Let <p : G x G ~> G denote the continuous map (x, y) h» xy~l. Since
H is a subgroup, we have <p(H x H) C H and hence
<p(H x //) = <p(H x H) C <p(H x H) C H.
This proves that M is a subgroup.
(b) Let us recall that a topological space X is Hausdorff precisely when the
diagonal A# is closed in the product space X x X. In any Hausdorff space the
points are closed, and thus
G Hausdorff => [e] closed
=> Ac = <P~l(e) closed in G x G
=> G Hausdorff.
The lemma is completely proved. ■
Proposition. Let H be a subgroup of a topological group G. If H contains
a neighborhood of the neutral element in G, then H is both open and closed
in G.
Proof. Let V be a neighborhood of the neutral element of G contained in H. Then
for each h e H, hV is a neighborhood of h in G contained in H. This proves
that H is a neighborhood of all of its elements, and hence is open in G. Consider
now the cosets gH of H in G. Since translations are homeomorphisms of G,
these cosets are open in G. Any union of such cosets is also open. But H is the
complement of the union of all cosets gH ^ H. Hence H is closed. ■
20 1. p-adic Numbers
Examples. The subgroups pnZp (n > 0) are open and closed subgroups of the
additive group Zp. The subgroups 1 + pn Zp (n > 1) are open and closed subgroups
of the multiplicative group 1 + pZp.
Let us recall that a subspace Y of a topological space X is called locally closed
(in X) when each point y eY has an open neighborhood V in X such that Y n V
is closed in V. When this is so, the union of all such open neighborhoods of points
of Y is an open set U in which Y is closed. This shows that the locally closed
subsets of X are the intersections U C\ F of an open set U and a closed set F
of X. In fact, Y is locally closed in X precisely when Y is open in its closure Y.
Locally compact subsets of a Hausdorff space are locally closed (a compact subset
is closed in a Hausdorff space). With this concept, the preceding proposition admits
the following important generalization.
Theorem. Let Gbea topological group and H a locally closed subgroup. Then
H is closed.
Proof. If H is locally closed in G, then H is open in its closure H. But this closure
is also a topological subgroup of G. Hence (by the preceding proposition) H is
closed in H (hence H = H) and also closed in G by transitivity of this notion. ■
Alternatively, we could replace G by //, thus reducing the general case to H
locally closed and dense in G. This case is particularly simple, since all cosets gH
must meet H: g e H for all g e G, namely H = G.
Corollary 1. Let H be a locally compact subgroup of a Hausdorff topological
group G. Then H is closed. ■
Corollary 2. Let Y bea discrete subgroup of a Hausdorff topological group G.
Then T is closed. ■
The completion G of G is also a topological group. If G is locally compact, it
must be closed in its completion, and we have obtained the following corollary.
Corollary 3. A locally compact metrizable group is complete. ■
3.3. Quotients of Topological Groups
As the following statement shows, the use of closed subgroups is well suited for
constructing Hausdorff quotients. Let us recall that if H is a subgroup of a group
G, then G/H is the set of cosets gH (g e G). The group G acts by left translations
on this set. When H is a normal subgroup of G. this quotient is a group. Let now
G be a topological group and
n : G -► G/H
3. Topological Algebra 21
denote the canonical projection. By definition of the quotient topology, the open
sets U' C G/H are the subsets such that U = n~l(U') is open in G. Now, if U is
any open set in G, then
n~\7TU) = UH = [Juh
is open, and this proves that nU is open in G/H. Hence the canonical projection
n : G —> G/H is a continuous and open map. By complementarity, we also see
that the closed sets of G/H are the images of the closed sets of the form F — FH
(i.e., F = 7T~l(Ff) for some complement Ff of an open set Uf C G/H). It is
convenient to say that a subset A c G is saturated (with respect to the quotient
map tt) when A — AH, so that the closed sets of G/H are the images of the
saturated closed sets of G (but it is not a closed map in general).
Proposition. Let Hbea subgroup of a topological group G. Then the quotient
G/H {equipped with the quotient topology) is Hausdorff precisely when H is
closed.
Proof. Let n : G ~+ G/H denote the canonical projection (continuous by
definition of the quotient topology). If the quotient G/H is Hausdorff, then its points
are closed and H = n~l(e) is also closed. Assume conversely that H is closed in
G. The definition of the quotient topology shows that the canonical projection tt
is an open mapping. We infer that
7Z2=tzxtz:GxG-> G/H x G/H
is also an open map. But Ker(7T2) = HxHcGxG. Hence 7t2 induces a
topological isomorphism
3f: (G x G)/(H x H) -> G/H x G/H.
To prove that G/H is Hausdorff, we have to prove that the diagonal
A = {(x,x):xe G/H]
is closed in the Cartesian product G/H x G/H. Since the map Jf is a homeomor-
phism, it is the same as proving that the inverse image A of this diagonal is closed
in (G x G)/(H x H). This inverse image is
A = {(£, k) mod H x H : gH = /:#}
= {(g, k) mod # x H : k~lg e H}.
But R = {(g, k) : k~lg e H] C G x G is closed by assumption: It is an inverse
image of the closed set H under a continuous map. This closed set R is obviously
saturated, i.e., satisfies
R = R(H x H).
22 1. p-adic Numbers
This proves that its image Rr — A in the same quotient is closed, and the conclusion
is attained. ■
Together with the theorem of the preceding section, this proposition establishes
the following diagram of logical equivalences and implications for a topological
group G and a subgroup H.
G/H finite Hausdorff «=^ H closed of finite index
iy iy
G/H discrete <^=> H open
iy iy
G/H Hausdorff «=► H closed
3.4. Closed Subgroups of the Additive Real Line
Let us review a few well-known results concerning the classical real line, viewed
as an additive topological group. At first sight, the differences with Zp are striking,
but a closer look will reveal formal similarities, for example when compact and
discrete are interchanged.
Proposition 1. The discrete subgroups of R are the subgroups
aZ (0 < a G R).
Proof. Let H ^ {0} be a nontrivial discrete subgroup, hence closed by (3.2).
Consider any nonzero h in H, so that 0 < \h\ (— ±h) e H. The intersection H D
[0, \h |] is compact and discrete, hence finite, and there is a smallest positive element
a G H. Obviously, Z - a C //. In fact, this inclusion is an equality. Indeed, if we
take any b e H and assume (without loss of generality) b > 0, we can write
b = ma -h r (m e N, 0 < r < a)
(take for m the integral part of b/a). Since r = b — ma e // and 0 < r < a,
we must have r = 0 by construction. This proves b = ma gZ-g, and hence the
reverse inclusion H C Z • a. ■
Corollary. The quotient of R by a nontrivial discrete subgroup H ^ {0} is
compact. m
Proposition 2. Any nondiscrete subgroup of R is dense.
Proof. Let H c R be a nondiscrete subgroup. Then there exists a sequence of
distinct elements hn e H with hn—>heH. Hence e„ = \hn — h\ e H ande„ —> 0.
Since // is an additive subgroup, we must also have Z - sn c H (for all n > 0),
and the subgroup // is dense in R. ■
3. Topological Algebra 23
Corollary, (a) The only proper closed subgroups of R are the discrete
subgroups aZ (a € R).
(b) The only compact subgroup of R is the trivial subgroup {0}. ■
Using an isomorphism (of topological groups) between the additive real line
and the positive multiplicative line, for example an exponential in base p
f f-> p\ R -+ R>0
(the inverse isomorphism is the logarithm to the base p) we deduce parallel results
for the closed (resp. discrete) subgroups of the topological group R>0.
Typically, we shall use the fact that the discrete nontrivial subgroups of this
group have the form paZ (a > 0) or, putting 6 = p~a, are the subgroups
6Z = [6m : m e Z}
for some 0 < 6 < 1.
3.5. Closed Subgroups of the Additive Group of p-adic Integers
Proposition. The closed subgroups of the additive group Zp are ideals: They
are
{0}, pmZp (meN).
Proof. We first observe that multiplication in Zp is separately continuous, since
\x'a - xa\ = \a\\x' — jc| -> 0 (*' -► jc).
Since an abelian group is a Z-module, if H c Zp is a closed subgroup, then for
any h e H,
ZH c H =» ZpacZaCH = H.
This proves that a closed subgroup is an ideal of Zp (or a Zp-module). Hence the
result follows from (1.6). ■
Corollary 1. The quotient ofZpbya closed subgroup H ^ {0} is discrete. ■
Corollary 2. The only discrete subgroup of the additive group Zp is the trivial
subgroup {0}.
^oor Indeed, discrete subgroups are closed: We have a complete list of these
(being closed in Zp compact, a discrete subgroup is finite hence trivial).
Alternatively, if a subgroup H contains a nonzero element h, it contains all multiples of h,
and hence WDN-/i.In particular, H 3 pnh -> 0 (n -> oo). Since the elements
Pnh are distinct, H is not discrete. ■
24 1. p-adic Numbers
3.6. Topological Rings
Definition. A topological ring A is a ring equipped with a topology such that
the mappings
(x, y) \-> x + y : A x A —► A,
(x, y) h+ x ■ y : A x A —► A
are continuous.
The second axiom implies in particular that y h* — y is continuous (fix x = — 1
in the product). Combined with the first, it shows that
(jc, y) \-> x — y : A x A —> A
is continuous and the additive group of A is a topological group. A topological
ring A is a ring with a topology such that A is an additive topological group and
multiplication is continuous on A x A.
If A is a topological ring, the subgroup Ax of units is not in general a
topological group, since x h+ x~l is not necessarily continuous for the induced
topology (for an example of this, see the exercises). However, we can consider the
embedding
x \-+ (x, jc_1) : Ax -> A x A,
and give A x the initial topology: It is finer than the topology induced by A. For this
topology, A x is a topological group: The continuity of the inverse map, induced by
the symmetry (jc, y) h* (v, x) of A x A, is now obvious. Still with this topology,
the canonical embedding A x c->- A is continuous, but not a homeomorphism onto
its image in general.
Proposition. With the p-adic metric the ring Zp is a topological ring. It is a
compact, complete, metrizable space.
Proof. Since we already know that Zp is a topological group (3.1), it is enough to
check the continuity of multiplication. Fix a and b in Zp and consider x = a -f- h,
y =b + k'm Zp. Then
\xy - ab\ = \(a + h)(b + k) - ab\ = \ak + hb- hk\
< max(M, |6|)(|A| + |*|) + |A||*| -> 0 (\h\, \k\ -> 0).
This proves the continuity of multiplication at any point (a, b) e Zp x Zp. m
Corollary 1. The topological group Zp is a completion of the additive group
Z equipped with the induced topology. m
3. Topological Algebra 25
To make the completion process explicit, let us observe that if x
is a p-adic number, then
xn = ^ aip1 e N
0<i</i
defines a Cauchy sequence converging to x.
Corollary 2. The addition and multiplication of p-adic integers are the only
continuous operations on Zp extending addition and multiplication of the
natural numbers. m
3.7. Topological Fields, Valued Fields
Definition. A topological field K is afield equipped with a topology such that
the mappings
(x,y)^ x + y : K x K -^ K,
(x, y) h+ x • y : K x K ~> K,
x ^ x'1 : Kx -► Kx
are continuous.
Unless explicitly stated otherwise, fields are supposed to be commutative. A
topological field is a topological ring for which K x = K — {0} with the induced
topology is a topological group. Equivalently, a topological field is a field K
equipped with a topology such that
(x, y) m» x — y is continuous on K x K,
(x, )) h> x/y is continuous on K* x Kx.
Except for the appendix to Chapter II, we shall be interested only in valued fields:
Pairs (K, |. |) where K is a field, and |. | an absolute value, namely a group
homomorphism
extended by |0| = 0 and satisfying the triangle inequality
l* + yl<l*l + lyl (x,yeK)9
or the stronger ultrametric inequality
\x + v| < max(|x|, |j|) (x, y e K).
In this case d(x, y) = |* — y\ defines an invariant metric (or ultrametric) on K,
d(x, y) = d(x — a,y —a) = d(x — y, 0) (a, x, y e K).
= Ei>ofl«P'"
26 1. p-adic Numbers
This situation will be systematically considered from (II. 1.3) on, and in the
appendix of Chapter II we shall show that any locally compact topological field can
be considered canonically as a valued field.
Proposition 1. Let K be a valued field. For the topology defined by the metric
d(x, y) = \x — v|, K is a topological field.
Proof. The map (jc, v) m» x — y is continuous. Let us check that the map (x, y) h+
xy~l is continuous on^x x Kx. We have
x -h h x hy — kx
y + k y y(y + k)'
Hence if v ^ 0 is fixed, \k\ < |y[/2, and c = max(|jt|, |y|),
\x -\~h x
y±k
<2c!^Uo (|*|,l*l-0).
\y2\
This proves that AT is a topological field. ■
Proposition 2. Let K be a valued field. Then the completion KofK is again a
valuedfield.
Proof. The completion K is obviously a topological ring, and inversion is
continuous over the subset of invertible elements. We have to show that the completion
is afield. Let (xn) be a Cauchy sequence in K that defines a nonzero element of the
completion K. This means that the sequence \xn\ does not converge to zero. There
is a positive £ > 0 together with an index N such that \xn \ > e for all n > N. The
sequence (l/xn)n>N is also a Cauchy sequence
1
XnXrr
<s \xn-xm\-+0 (n9m-+oo).
The sequence (l/xn)n>^ (completed with l's for n < N) defines an inverse of the
original sequence (xn) in the completion K. ■
4. Projective Limits
4. L Introduction
Let x = ^2t>oaiPl be a P-adic integer. We have defined its reduction mod p as
s(x) = ao mod p e ¥p. We can also consider the finer reduction a$ + a\p mod p2
or more generally
£n(x) = ^aip1 mod pn e Z/pnZ.
4. Projective Limits 27
By definition of addition and multiplication of p-adic integers, we get homomor-
phisms
£n:Zp-> Z/pnZ.
Since xn = J2i<n a'Pl ~* x (w "^ °°)» we w°uld also like to be able to say that
the rings Z/pnZ converge to Zp. This convergence relies on the links given by the
canonical homomorphisms
<pn : Z/pn+1Z -> Z/pnZ
and the commutative diagram
Z/pn+lZ
£n+l <Pn
/ En \
Zp -+ Z/pnZ
which we interpret by saying that Zp is closer to Z/pn+lZ than to Z/pnZ.
Before proceeding with precise definitions, let us still consider an example
emphasizing a similar situation for sets. Consider the finite products £„ = Yli<n ^
of a sequence (X,-)i>o of sets. We would like to say that these partial products
converge to the infinite product E = Ylt>o %i anc* thus consider this last product
as limit of the sequence (En). For this purpose, we have to formalize the notion of
approximation of E by the En. This relation is given by the projections
pn : E -+ En
omitting components of index i > n. In a sense, these projections are composed
of infinitely many arrows — each (p3 : Ej+X —► Ej omitting a component — as
in the chain of maps
/?,,:£—►--•—> En+2 —> En+\ —^ En-
One can consider that any set X, given with a family of maps /„ : X —► En which
have the same property as above, is an upper bound of the sequence (En). A limit of
the sequence would then be a least upper bound. Thus the limit would be an upper
bound (£, (pn)) such that every upper bound (X, fn) is obtained by composition
with a map / : X —► E as follows:
fn= pnof :X -+ E -+ •■• -+ En+2 -► En+i -► En.
This factorization plays the role of remainder after division of /„ by all maps
(Pj : Ej+i -► Ej for j > n:
fn=<Pn° fn+l = (Pn O ^+1 O /n+2 = ifri ° /•
These preliminary considerations should motivate the following definition.
28 1. p-adic Numbers
4.2. Definition
A sequence (En, <pn)n>o of sets and maps <pn : En+\ -^ En (n > 0) is called a
projective system. A set E given together with maps ^n\E —> En such that
\J/n =<pno i//n+Y (n > 0) is called a projective limit of the sequence (En, <pn)n>o
if the following condition is satisfied: For each set X and maps fn : X —> En
satisfying fn — (pn o fn+\ (n > 0) there is a unique factorization f of fn through the
set E:
fn = ^nof:X^E^En (n > 0).
The maps <pn : En+X -> En are usually called transition maps of the projective
system. The whole system, represented by
Eo *- E\ <—'"<— En <—-••,
is also called an inverse system. "The" projective limit E = lim En is also placed
at the end of the inverse system:
En <r- En+\ < lim En
■■■ fn
The hypothesis fn = <pn o fn+\ can be iterated, and it gives
fn—<pno fn+l =<pno <pn+i o fn+2
= (<Pn O (pn+l O - - - O <pn+k) O fn+k+\ = lJ/nO f
for k > 0. Hence / behaves as a limit of the /) (y —> oo) and \}rn as a limit
of composition of transition mappings <p„ o #?„+] o - ■ • o ^+£ when A: —> oo. The
factorization condition is a universal property in the sense that it must hold for
all similar data. Finally, it is obvious that if (£, (^«)/i>o) *s a projective limit of a
sequence (En, <pn)n>o, it will still be aprojective limit of any sequence (En, <pn)n>k,
since we can always define inductively tyn-i = (Pn~\°i/n f or n < /:. In other
words, projective limits do not depend on the first terms of the sequence.
4.3. Existence
Theorem. For every projective system (En, <pn)n>o of sets, there is a projective
limit E = limEn C Jl En with maps i//n given by (restriction of)projections.
* n>0
Moreover, if(Ef, \l/'n) is another projective limit of the same sequence, there is
a unique bijection f : E' —> E such that \j/fn = \j/no f
4. Projective Limits 29
Proof. Let us prove existence first. For this purpose, define
E = {(*„): <pn(xn+{) = xn for all n > 0} C \\ En-
The elements of E are thus the coherent sequences (with respect to the transition
maps <pn) in the product. If x e E, we have by definition
<Pn(Pn+l(x)) = Pnto;
hence for the restrictions \J/n of the projections pn to £,
The set £ with the maps -tyn can thus be viewed as an upper bound of the sequence
En with transition maps <pn. Let us show that this construction has the required
universal property. For this purpose consider any other set E' with maps TJr'n : Er —>
En satisfying <pn o ^+1 = \J/rn, and let us show that there is a unique factorization
of jj/rn by i//n. It is clear first that the \J/rn define a (vector) map
(rn)-E'^Y\En, yy+Wn{y)).
The relations ^(j) = <PnWn+i(y)) show that the image of the vector map (^)
is contained in the subset E of coherent sequences. There is thus a unique map
f : E' -+ E C Y\En having the required properties \J/rn = i//n o /, and this one
is simply the vector map {}J/'n) considered as having target E. All that remains
is to prove the uniqueness. If both (E, (^n)) and (£', (^)) have the universal
factorization property, there is also a unique map /': E -> E' with \J/n = ^'n o f.
Substituting this expression in ^/'n = \J/n o /, we find that
and /' o / is a factorization of the identity map Ef —> E'. Since we are assuming
that (E',\l/'n) has the unique factorization property, we must have /' o / = idf'.
One proves similarly that / o /' = id^. ■
Corollary. When all transition maps in a projective system (£*„, <pn)n>o cire
surjective, then the projective limit (E, (^/n)) also has surjective projections \//n,
and in particular, the set E is not empty.
Proof. By construction of E in the product |~[ En, it is enough to show that if one
component xn e En is given arbitrarily, then there is a coherent sequence with
this component in En. It is enough to choose xn+l e En+\ with (pn(xn+\) = xn
(this is possible by surjectivity of (pn) and to continue choices accordingly. The
(countable!) axiom of choice ensures the possibility of finding a global coherent
sequence with prescribed nth component. ■
30 1. p-adic Numbers
4.4. Projective Limits of Topological Spaces
When the projective system (En, <pn)n>o is formed of topological spaces and
continuous transition maps, the construction made in the previous section (4.3)
immediately shows that the projective limit (£, \J/n) is a topological space equipped
with continuous maps jj/n : E —► En having the universal property with respect to
continuous maps. Any topological space X equipped with a family of continuous
maps fn : X -* En such that /„ = <pn°fn+i (« > 0) has the factorization property
fn — \]/n o f with a continuous function f : X —► E. Indeed, this factorization
is simply given in components by the fn and is continuous by definition of the
product topology (and the induced topology on the subset lim En C TJ En). When
the topological spaces En are Hausdorff spaces, the subspace lim En is closed: It
is the intersection of the closed sets defined respectively by the coincidence of the
functions pn and <p o pn+\. For future reference, let us prove a couple of results.
Proposition 1. A projective limit of nonempty compact spaces is nonempty and
compact
Proof. Let (Kn,<pn) be a projective system consisting of compact spaces. The
product of the Kn is a compact space (Tychonoff's theorem), and the projective
limit is a closed subspace of this compact space. Hence lim Kn is compact. Define
K'„ = <pn{Kn+X) D Kl = (pn{<Pn+l{Kn+2)) (= <Pn(K+l) ) D ■ • ■ .
These subsets are compact and nonempty. Their intersection Ln is not empty in
the compact space Kn. Moreover, <pn(Ln+\) — Ln, and the restriction of the maps
<pn to the subsets Ln leads to a projective system having surjective transition
mappings. By the corollary in (4.3), this system has a nonempty limit (with surjective
projections). Since lim Ln C lim Kn, the proof is complete. ■
Corollary. A projective limit of nonempty finite sets is nonempty. ■
Proposition 2. In a projective limit E = lim En of topological spaces; a basis
of the topology is furnished by the sets tf/~l(Un), where n > 0 and Un is an
arbitrary open set in En.
Proof. We take a family x = (jcf-) in the projective limit and show that the
mentioned open sets containing x form a basis of neighborhoods of this point. If we
take two open sets Vn C En and Vn-\ C En-\, the conjunction of the conditions
xn e Vn and *„_! e Vn_\ means that
Call Un the open set Vn D <p~\(Vn-\) of En. Then the preceding condition is still
equivalent to x e \l/~l{Un). By induction, one can show that a basic open set in
4. Projective Limits 31
the product — say Y\n<N ^ x Yin>N &" — nas an intersection with the projective
limit of the form ^^l(UN) for some open set Un C £V ■
Corollary. The projective limit of the sequence of initial partial products En —
Y\i<n %i °fa sequence of topological spaces is (homeomorphic to) the
topological product Yli>o Xi of the family. ■
Proof. The canonical projections n«>o ^i —> En furnish a continuous bijective
factorization Yioo ^i —> lim E„9 which is an open map by definition of the open
sets in these two spaces. *~~ ■
Proposition 3. Let A be a subset of a projective limit E = lim En of topological
spaces. Then the closure A of A is given by
Proof. It is clear that A is contained in the above mentioned intersection, and that
this intersection is closed. Hence A is also contained in the intersection. Conversely,
if b lies in the intersection, let us show that b is in the closure of A. Let V be a
neighborhood of b. Without loss of generality, we can assume that V is of the form
\J/~l(Un) for some open set Un C En. Hence i/n{b) C Un. Since by assumption
b g ^~l(^n(A)), we have i/n(b) G i/n{A), and the open set Un containing b must
meet \l/n(A): There is a point a G A with ^fn{p) £ Un. This shows that
aeAmif-l(Uny
In particular, this intersection is nonempty, and the given neighborhood of b indeed
meets A. m
Corollary l.lfK is a compact subset of a projective limit E = lim En, then
Corollary 2. A subset A of a topological projective limit is dense exactly when
all its projections \J/n (A) are dense. ■
4.5. Projective Limits of Topological Groups
It is also clear that ifa projective system (Gn, <pn) is formed of groups Gn andhomo-
morphisms (pn : Gn+\ —> Gn, then the projective limit G = limG„ is nonempty
since it contains the neutral sequence (e, e,...). It is even a group having this
sequence as neutral element, and the projections \J/n : G -^ Gn are group homo-
morphisms. The universal factorization property holds in the category of groups.
An interesting case is the following. Let G be a group and (Hn) a decreasing
sequence of normal subgroups of G. We can then take Gn = G/Hn and (since
32 1. p-adic Numbers
Hn+\ C Hn), <pn : G/Hn+i -> G/Hn the canonical projection homomorphism. The
projective limit of this sequence is a subgroup of the product
G=\myG/HncY\G/Hn
together with the restrictions of projections \//n : G —► G/Hn. Since the system of
quotient maps fn:G -> G/Hn is always a compatible system, we get a
factorization / : G -> G such that fn = \J/no f. It is easy to determine the kernel of this
factorization /:
ker/ = f~l (Pker^) - p)ker/„ = f] Hn.
In fact, we have the following general result.
Proposition. Let G = lim Gn be a projective limit of groups, and let \frn:G —> Gw
denote the canonical homomorphisms. Then H ker jjr„ = {e} w reduced to
the neutral element and G is canonically isomorphic to the projective limit
lim (G/ ker \]/n\
Proof. Let G' = f] ker jj/n and consider the embedding / : G' —► G leading to
trivial composites ^ = \Mc = &n ° /- Since the system (Gr, ^) obviously
admits the trivial factorization g : Gr —► G (constant homomorphism with image
e G G), we have / = g by uniqueness. This proves that the embedding / is trivial,
namely Gr = {e}. Of course, one can also argue that since the projective limit G
consists of the coherent sequences in the product J~[ Gn, with maps \//n given by
restriction of projections, D ker \//n consists only of the trivial sequence. ■
4.6. Projective Limits of Topological Rings
It would be a tedious task to give a list of all structures for which projective
limits can be defined. One can do it for rings, vector spaces,..., and one can mix
structures, for example by looking at topological groups, topological rings, and
so on. Just for caution: A projective limit of fields is a ring, not a field in general
(because a product of fields is not a field). Coming back to the case of a group
G (having no topology at first), in which a decreasing sequence (Hn) of normal
subgroups has been chosen, we can consider the projective limit of the system of
discrete topological groups Gn = G/Hn. Let again G = lim G/Hn and identify
G with its image in G. Then G is dense in G, which can be viewed as a completion
of G. More precisely, the closure //f of //,- in G is open and closed in G, and these
subgroups form a basis of neighborhoods of the identity in G. The subgroups //,
make up a basis of neighborhoods of the neutral element in G for a topology, and
G is the completion of this topological group. At this point one should recall that
a topological group admitting a countable system of neighborhoods of its neutral
element is metrizable.
4. Projective Limits 33
Similarly, if A is a commutative ring given with a decreasing sequence (/„) of
ideals and transition homomorphisms <pn : A/In+l —► A/In, the projective limit
A ~ limA/In is a topological ring equipped with continuous homomorphisms
(projections)^ : A —> A //„. By the universal factorization property of this limit,
we get a canonical homomorphism A -> A that is injective when P| /„ = {e}, and
in this case A can be identified with the completion of A for the topology of this
ring, having the /„ as a fundamental system of neighborhoods of 0.
4.7. Back to the p-adic Integers
We apply the preceding considerations to the ring Z of rational integers and its
decreasing sequence of ideals /„ = pnZ. The inclusions pn+xZ C pnZ lead to
canonical transition homomorphisms
tpn : Z/pn+1Z —► Z/p»Z.
The next theorem gives a second equivalent definition for p-adic integers.
Theorem. The mapping Zp -> lim Z/pnZ that associates to the p-adic
number x = J2a*Pl tne sequence (*/i)w>i of its partial sums xn = ^Zi<n^iPl mod
pn is an isomorphism of topological rings.
Proof. Since the transition homomorphism <p„ is given by
^fli/?1 mod pn+l i-* ^fliP1 mod pn,
the coherent sequences in the product ]~[ Z/pnZ are simply the sequences (xn) of
partial sums of a formal series J^>o a*Pl (® <ai — P~^\ an(* these are precisely
the p-adic integers. The relations
*i=flo» x2=a0 + aip, x3 — ao + aip~\-a2p2, ...
and conversely
*2 ~~ *1 *3 "" *2
a0 = *l, fll = 1 fl2 = ^ ,
show that the factorization Zp —► limZ///IZ is bijective, and hence an algebraic
isomorphism. Since this is a continuous map between two compact spaces, it is a
homeomorphism, whence the statement. ■
One can note that the homomorphisms Z —> Z/pn+lZ -> Z/pnZ furnish a
limit homomorphism Z —> lim Z///*Z, which can be identified to the canonical
embedding Z —> Zp. The map
^a,/?1 mod /?" h-> ^fli/?1 mod //'Zp
34 1. p-adic Numbers
obviously defines an isomorphism Z/p"Z —► Zp/pnZp, and in particular,
Zp/pZp = Z/pZ = Fp.
More generally, the same argument shows that
Zp/pnZp ~ Z/pnZ.
On the other hand, the restriction of the reduction homomorphism Zp -> Z/pnZ
to the subring
Z(p) = {fl/Z? :a eZ9 0 ^ Z? e N and Z? prime to p} C Q
is already surjective and has kernel pnZ(P), hence defines an isomorphism:
Zlp)/p"Zlp) = Z/p"Z.
Starting with the subring Z(p) C Q, we see that Zp appears also as a projective
limit limZ(p)/pnZ(P) and hence as a completion of this ring Z(p).
Comment. The presentation of the ring Zp of p-adic integers as a projective
limit of the rings Z/pnZ shows that one can choose any system of representatives
for Z mod pZ and write a corresponding expansion for any x e Zp in the form
x = YlsiP* W^tn aH digits Si e S. In particular, when the prime p is odd, it can
also be useful to choose the symmetrical system of representatives
O = [ 2~, . . . , U, . . . , ~~2~\-
In practice, we always choose a system of representatives S containing 0 in order to
allow finite expansions x = J] siPl - F°r example, if we choose the representative
p e S instead of 0 e 5, the representations
p • 1 + 0 - p + ^Sip* = 0 • 1 + 1 • p + ^T ^p1
«>2 i>2
are not permitted, since 0^5.
4.S. Formal Power Series and p-adic Integers
Let us derive yet another presentation of p-adic integers. We denote by Z[[X]] the
ring of formal power series in an indeterminate X with rational integral coefficients.
A formal power series is just a sequence (an)ne^ of integers an e Z. Addition is
made coefficientwise,
(an) + (b„) = (c„) with c„ = an + fc„ (n > 0),
and multiplication according to
(an) ■ (bn) = (cn) with cn = ^2 aibj (n ^ °)-
4. Projective Limits 35
These composition laws appear naturally if we use the notation / = f(X) =
X^>o GnXn for the sequence (flw)weN- In this way we identify polynomials to
formal power series having only finitely many nonzero coefficients: Z[X] C Z[[X]].
We shall use formal power series rings over more general rings of coefficients and
shall study their formal properties when needed (VI. 1).
Theorem. The map
£>x* ^ !>/>' -■ zt™ -* zp
is a ring homomorphism. It defines a canonical isomorphism
Z[[X]]/(X -p)^ Zp,
where (X — p) denotes the principal ideal generated by the polynomial X — p
in the formal power series ring.
Proof. Let us consider the sequence of homomorphisms
/„ : Z[[X]] -► Z/pnZ, ^fl/X' m> J^fl,-// mod pn.
Since these maps /„ are obviously compatible with the transition homomorphisms
<pn defining the projective limit, we infer that there is a unique homomorphism
/ : Z[[X]] -► limZ/pnZ = Zp
compatible with the /„. If x = ]£ at Pl *s anY /?-adic integer, then x = /(£ ai X' )>
and this shows that / is surjective. We have to show that the kernel of / is the
principal ideal generated by the polynomial X — p. In other words, we have to
show that if the formal power series ^2aiX* *s sucn that Yli<naiPl e Pn^ ^or
every n > 0, then this formal power series ^aiX1 is divisible by X — p. For
n — 1 the condition implies a^ = 0 mod p9 hence a0 = P&o for some integer aG.
Then, for n = 2 we get
<z0 + flip = 0 mod /?2 => a0 4- «i = 0 mod /?,
and we infer that there is an integer ct\ such that aG + a\ = /tq^. Let us go on:
(a0 + «!/?) + fl2/?2 = 0 mod p3 => <x\p2 + a-ip1 = 0 mod p3,
which gives of] + ai = pa-i for some integer #2. Generally, for n > 1,
/^Ofo-i + anpn =a0~\-alp-\ h anpn == 0 mod pn+l
furnishes an integer an with a„_i + an = pan. All these relations can be
summarized by
Oo = P&0, <*n = P<*n - tt/i-1 (" > 1),
36 1. p-adic Numbers
or still more concisely by
a0axX + a2X2 + • - • = (p - X)(a0 + axX + a2X2 + ■•■),
namely
This concludes the proof. ■
5. The Field Qp of p-adic Numbers
5.7. The Fraction Field ofZp
The ring of p-adic integers is an integral domain. Hence we can define the field of
p-adic numbers as the fraction field of Zp
Qp = Frac(Zp).
An equivalent definition of Qp appears in (5.4).
We have seen that any nonzero p-adic integer* e Zp can be written in the form
x = pmu with a unit u of Zp and m e N the order of jc. The inverse of x in the
fraction field will thus be 1/jc = p~mu~l. This shows that this fraction field is
generated — multiplicatively, and a fortiori as a ring — by Zp and the negative
powers of p. We can write
Qp=Zp[l/p],
The representation \/x = p~mu~l also shows that l/x e p~mZp and
is a union over the positive integers m. These considerations also show that a
nonzero p-adic number x e Qp can be uniquely written as jc = pmu with m e Z
and a unit u eZ*\ hence
meZ
is a disjoint union over the rational integers m e Z. The definition of the order
given in (1.4) for p-adic integers can now be extended to p-adic numbers x eQp.
If 0 ^ x = pmu with a unit u e Z*, then we define
ordp(jc) = vp(x) = vp(pmu) = m e Z.
(When the reference to the prime p is not needed, we simply denote this order by
v(x) =ord;c.) Hence
v-1(m) = pmZp-pm+1Zp = pmZ*p.
5. The Field Qp of p-adic Numbers 37
We have
v(x) > 0 «=» x e Zp,
and this equivalence is valid even when x = 0 with the usual convention v(0) =
-hex) > 0. If x = a/b (a eZp9 O^b e Zp\ then v(x) = v(a) - v(b) e Z, and
the basic relation
v(xy) = v(x) + v(y)
holds for all x, y e Zp (even when xy = 0 with the convention m -h oo = oo -h
co = oo). The p-adic order is a homomorphism
v : Q£ = ]J />mz; -► z
meZ
Moreover, if jc = pvu is a nonzero p-adic number, with u a p-adic unit, we can
write u = Y1 &iPl £ Zp with a0 ^ 0 (0 < fl,- < p — 1), and
* = E^" = 2>^'
is a sum starting at the integer v = ordx e Z, possibly negative.
As in (1.4), we may compare these expansions to the Laurent expansions of
meromorphic functions (in the complex plane, near a pole). The index of the first
nonvanishing coefficient is the order of the power series.
By convention, the order of the zero power series is +oo. Hence the relation
v(x +y)>: min(u(x), v(y))
holds in all cases.
Comment. If Z(P) C Q denotes the subring consisting of rational numbers having
denominator prime to p, we have similar formulas
Q = U^Z<* QX = II^"Z^
m>0 PeZ
since the group Z(*, consists of the fractions having both numerator and
denominator prime to p.
5.2. Ultrametric Structure on Qp
The map jc h+ \x\ = l/pv, where v =ordjc e Z, defines a homomorphism
Q*^(RX)+ = R>0
that we conventionally extend by the definition |0| = 0. This map extends the
previous absolute value on Zp and is called the p-adic absolute value on Qp
38 1. p-adic Numbers
(cf. (2.1), (3.7); absolute values will be systematically studied in Chapter II, cf.
(II. 1.3)). This absolute value has the characteristic properties
|*| > 0 if x ± 0, |jcj| = |*| • |y|, |jc + y\ < max(|*|, |y\).
In particular, we can define a metric on Qp by
d(x,y) = \x-y\.
This distance satisfies
d(x, y) > 0 if x ^ y and d(y, x) = d(x, y)
as well as the triangle inequality in the strong ultrametric form
d(x, y) < max(d(*, z), d(z, y)) < d(x, z) + d(z, y).
This metric is invariant on the additive group
d(x+z,y + z) = d(x,y)
and also satisfies
d(zx,zy) = \z\ d(x,y)
for all jc, j, z e Qp. In particular,
,, , d(x, y)
d(px,py) = .
P
From now on we shall always consider Qp as a metric field, endowed with this
ultrametric distance. By (3.7) Qp is a valued field, and hence a topological field.
Theorem. The field of p-adic numbers Qp induces on Zp the p-adic topology.
It is a locally compact field of characteristic 0. It can be identified with the
completion of Z[l/p] = [apv : a e Z, v e Z}, or of Q, for the p-adic
metric.
Proof. With the metric just introduced Zp is the unit ball centered at the origin in
Qp: For* e Qp we have equivalences
x eZp «=> v(x) > 0 <=> |*| < 1 «=> d(x, 0) < 1.
Similarly, if k > 0, the ideal pkZp is the ball defined by d(x, 0) < p~k. These
balls make up a fundamental system of neighborhoods of 0 in Zp and Qp. Since
the group Zp contains a neighborhood of 0, it is open (and hence closed). In fact,
5. The Field Qp of p-adic Numbers 39
it is a compact neighborhood of 0 in Qp. This proves that the topological field Qp
is locally compact, and hence complete (Corollary 3 in (3.2)). Finally, if
x = ^2,xipl (v = ordjc e Z)
is the p-adic expansion of a nonzero element x eQp, the sequence
Xn = JZ ^
u<i<n
of truncated sums is a Cauchy sequence of Z[l/p] converging to x,
*-*,,= 5^-^iP1 € pnZp,
d(x, x„) = \x- x„\ < p~n -► 0 (n -► oo).
This proves that Z[l/p] is dense in Qp, and this metric space can be viewed as a
completion of the ring Z[ 1 /p] for the induced metric. ■
53. Characterization of Rational Numbers Among p-adic Ones
It is easy to recognize rationals among p-adic numbers if we know their expansions.
The result is similar to the characterization of rational numbers among real numbers
expressed in decimal expansions.
Proposition. Let x = ^aip1 e Qp (i > v(x), 0 < m < p — 1). Then x is a
rational number, Le., x e Q precisely when the sequence (at) of digits ofx is
eventually periodic.
Proof. Multiplying if necessary a p-adic expansion by a power of p, we see that
it is enough to consider the case v(x) > 0, namely x e Zp. If the sequence (at)
is eventually periodic, x is the sum of an integer and a linear combination (with
integral coefficients) of series of the form
Y,ps+i'=psrheQ>
and hence is a rational number. Conversely, suppose that x = ^Xip1 — a/b is
the p-adic expansion of a rational number (as we mentioned, we can assume that
x eZp; hence the summation is made for i > 0). Taking a reduced representation,
a and b will be relatively prime integers, with b prime to p. Adding a suitably
large integer to x, we may assume that x is positive (hence a and b are also
positive). Considering the p-adic expansions of these integers, we are able to write an
40 1. p-adic Numbers
equality
y<£ i">0 k<a
In the left-hand side we have to take into account some carries re according to the
following identities:
b0xe + bixe-i H h bex0 + re=ae + re+lp.
For £ > max(a, /?), we have more simply
boxi + b\Xt-X -\ h bpxt-p +rt= n+ip.
It suffices to compute xt mod p as a function of JC£_i,..., JQ-^ and r^, and then
to take the representative of this class such that 0 < jq < p. This allows the
determination of the carry r^+i by division by p. In other words, starting with the data
(xe-u...,xe-0,re)e(Z/pZf+1
there is an algorithm (taking into account the fixed values of bo,..., bp) furnishing
(xi3 xt-u . - -, xi-fi+u r*+1) e (Z/pZf+l
(the values of xt~\,..., xt-p+\ are simply copied in a shifted position). Since the
set (Z/pZ)P+l is finite, this algorithm will eventually produce a cyclic orbit (as
soon as a vector takes a value already attained, it will produce the next vector
already attained and start a cycle). ■
Corollary. The p-adic integers J] pn and J] Pnl are riot rational ■
5.4. Fractional and Integral Parts of p-adic Numbers
As we have already noticed, any nonzero /?-adic number x eQp can be written
as a series x = J2i>m xiP* startmg at tne index m = v(x) e Z. Let us define
[x] = ^\xipl e Zp : integral part of x,
(x) = z_\Xlpl e 'miP\ c Q -fractional part of jc.
i<0
We thus obtain a decomposition
x = [x] + (x):Qp = Zp+Z[l/p].
If (*) ^ 0, then (x) = apv for integers a and u < 0. This decomposition depends
on the choice of representatives chosen for digits; here 0 < jc,- < /? — 1. With this
5. The Field Qp of p-adic Numbers 41
choice, more can be said of the fractional part as a real number, namely
o<(x> = ev= E ^<(p-i)E7 = '-
Hence the fractional part of any /?-adic number satisfies
<*> e [0,1) 0 Z[l/p}.
Let us consider these representatives modi, namely in Z[l/p]/Z C R/Z.
With the normalized exponential, we can embed the circle R/Z in the complex
numbers:
R -► R/Z -► Cx : t v+ exp(2nit).
This leads us to consider the map (systematically considered by J. Tate, whence
the notation)
t : Qp ^ Cx : x h+ cxp(2ni(x)).
For example, if v(x) = —1, namely x = k/p + y withO < k < p — 1 and v e Zp,
then
r(x) = cxp(2jtik/p) = f *,
where f = exp(2ni/p) is a primitive /nh root of unity in C. The image of all
elements x e Qp with v(x) > — 1 is the cyclic subgroup of order p in Cx:
p-lZp/Zp = t(P-1Zp) = iip C Cx.
It is useiul to introduce some notation. The cyclic subgroup of rath roots of unity
in C will be denoted by
fJim = {zeC:zm = l}.
The union of all these cyclic groups is the group of all roots of unity (in C)
fi = [J fJLm — [z e C : zm = 1 for some integer m > 1}.
m>l
With respect to the prime p, we have a direct product decomposition
I1 = Akp)" Pp00 >
where fx^p) is the group of roots of unity of order prime to p, and /xpoc the group
of roots of unity having order a power of p: /nh power roots of unity. Hence fxpoc
is the p-Sylow subgroup of the abelian torsion group //. It is the union of the
42 1. p-adic Numbers
increasing sequence of cyclic groups
lip C /v C • • • C fipk C - - •,
/v* = U/v c cx-
Proposition. The map r : Qp —> Cx, jc h> exp(27r/(x)) is a homomorphism.
It defines an isomorphism Qp/Zp = /xp^ o/rfte additive group Qp/Zp w/r/i r/ze
^roM/7 ofpth power roots of unity in the complex field C.
Proof. Let us compute the difference
(x + y)-(x)-(y)=x + y-[x+y]-(x- M) - (y - [y]).
It is equal to [x] + [y] — [x + y] € Zp, and hence (x + y) — (x) - (y) e
Z[l/p] flZp = Z. This proves that
exp(27n-[(x + y)-(x)-(y)])=l
and r(x + y) = t(x) + r(y). The map t is a homomorphism. Its kernel is defined
by
kerr = {x e Qp : (x) e Z}.
But (x) e Z means x = [x] + (x) e Zp, so that kerr = Zp. The image of r
consists of the complex numbers of the form
exp(2nik/pm) = exp(2jri/pm)k.
Since exp(27Ti //?™) is a root of unity of order pm, these roots of unity generate —
when m varies among natural integers — the subgroup [ip^. m
In particular, we have
x e p~kZp <=> pkx e Zp <=> t(jc)p* = 1 <=> t(jc) e fipk.
Comment. It is possible to give the factorization of rational numbers into p-
integral and p-fractional components independent of the construction of p-adic
numbers. Indeed, any rational number has the form
x = pv - (v e Z, a and b prime to p).
b
When v = — m < 0, namely when x g Z(P), we can use the Bezout theorem to
express the fact that pm and b are relatively prime,
(pm,b)= 1 =apm +fib\
5. The Field Qp of p-adic Numbers 43
hence multiplying by x yields
a aa 8a n n
x = ^rb = T + ^eZ^ + Z[l/p]-
This gives an elementary description of the decomposition
Q = Z(p) + Z[l/p]
induced by the decomposition Qp = Zp + Z[l//?].
5.5. Additive Structure ofQp and Zp
Let us start with the sum formula Qp = Zp + Z[l/p] proved in the last section.
Observe that this sum is not direct, since
Z„nZ[l/p]=Z.
The various embeddings that we have obtained are gathered in the following
commutative diagrams giving the additive (resp. multiplicative) structure of Qp
(resp.Qp.
z(p)
yx
/
\
/
\
Q
z
(i)
\
\
/
m/P\
pz
Zp
zpx
/
\
Qp
z
Qpx
(i)
\
/
m/p]
pz
If we embed Z in the direct sum Zp © Z[l/p] by means of m t-> (m, —m) and
call T the image, then the addition homomorphisms
Z(p) © Z[l/p] -* Z(p) + Z[l/p] = Q,
Zp ©Z[l/p] -* Zp + Z[l/p] = Qp
have kernel T and furnish isomorphisms
(z(p)©Z[i/p])/r = Q,
(zp©Z[i/p])/r = Qp.
44 1. p-adic Numbers
Thus we have the following diagrams with vertical short exact sequences.
Z
4
Z(p) -+ Z(p)©Z[l//7] <- Z[l/p],
I
Q
z
4
Zp -* Zp©Z[l/p] <- Z[l/p].
4
Here is another pair of diagrams describing the inclusion relations between the
various abelian groups of numbers that we have considered:
Z[l/p] -► Q -► Qp Pz ^ Qx ^ Q*
u u u u u u
Z ^ Z(p) ^ Zp, (1) ^ Z*p) -+ Z*.
Comment. The subgroup Zp of Qp admits no direct complement. Indeed, for any
subgroup T of Qp
mZp = {0}=>r discrete in Qp => T = {0}.
In a sense, the subgroup Z[l//?] is the best near supplement that one can take, and
we have unique sum decompositions with two components:
xeZp, v g [0, 1) n Z[l/p].
But this system of representatives [0, 1) Pi Z[l/p] is not a subgroup.
5.6. Euclidean Models of Qp
It is easy to give Euclidean models of the fields Qp extending the models of Zp
given in (2.5) if we only observe that the inclusions of additive topological groups
—Zp D Zp and Zp D pZp
are similar. In other words, a dilatation of ratio p of the Euclidean model of Zp
gives a model of (l/p)Zp. Iteration gives a model of
An illustration shows a piece of Q7, with central portion Z7.
6. Hensel's Philosophy 45
A piece of Q7 with Z7 as central portion
6. Hensel's Philosophy
6.1. First Principle
Let us explain the first principle in a particular case. Let P(X, Y) e Z[X, Y]
be a polynomial with integral coefficients. When speaking of solutions of the
implicit equation P = 0 in a ring A, we mean a pair (x,y) e A x A = A2 such
that P(x, y) = 0.
Proposition. The following properties are equivalent:
(i) P = 0 admits a solution in Zp.
(m) For each n > 0, P = 0 admits a solution in Z/pnZ.
(Hi) For each n > 0, rfrere Are' integers an, bn such that
P(an, bn) = 0 mod pn.
™oof. (hi) is a simple reformulation of (11). Now for x = Yli>o at Pl eZJp, define
*" = ]C«<„ flip' mod p" € Z/p"Z. Then if (*, y)eZpx Zp, then
^(*„ >n) = Ptx, y) mod ^Zp e Zp/pnZp (= Z/p"Z),
and hence (1) => (if). Conversely, to prove (n) =» (1) let us consider the finite sets
*n = {(*, y) G Z/z^Z x Z/p"Z : P(x, v) = 0}.
46 1. p-adic Numbers
Reduction mod pn furnishes a map <pn : Xn+i —> Xn, and the projective system
{Xn,ipn)n>\ has a projective limit X = limXw C Zp x Zp. The pairs in X
furnish solutions of P = 0 in Zp, and the result follows from (4.4) (Corollary of
Proposition 1). ■
Generalizations. Instead of a single polynomial P in two variables, one can
consider an arbitrary family (P,),G/ of polynomials having a finite number m > 2
of indeterminates and their common zeros. The above result shows similarly that
the algebraic variety defined by the equations Pi = 0 (/ el) will have points with
coordinates in Zp precisely when it has points with coordinates in all rings Z/pnZ
(/! > 1).
6.2. Algebraic Preliminaries
Proposition. Let A be a ring and P e A [X] be any polynomial. Then there are
polynomials Pi and P2 e A[X, Y] such that
P(X +h) = P(X) + h - PL(X, h) = P(X) + h - P\X) + h2P2(X, h).
Proof. Let us write the polynomial P explicitly as a finite sum P(X) = Y^^nXn
with some coefficients an e A. Then
P{X + h) = ^an{X + hf = ^an(Xn + nX"~lh + h\ . •))
= ^anXn+h Y^nanXn~l +h2 - P2(X,h);
hence the result. ■
63. Second Principle
The idea for improving approximate solutions will now be given in its simplest
form. Take a polynomial P e Z[X] and an integer x such that P(x) = 0 mod p.
We can look for a better approximation x of P(X) = 0 in the form of an integer
such that P(x) = 0 mod p2. Without loss of generality, we may assume that x is
an integer ao between 0 and p — l. We are looking for an integer x = ao + a\p
(again with 0 < a\ < p) such that P(x) = 0 mod p2. But we have just seen that
we can write
P(a0 + aip) = P(a0) + P\a0) alP + (aipf ■ b
for some integer b. By assumption, P(a0) = pt, and the desired congruence holds
mod p2 if t + P'(ao) ■ a\ = 0 mod /?. We can suppose r ^ 0 (there is nothing to
prove otherwise). When P'(^o) # 0 mod /? we can take a\ hee —t/P'(ao) mod p
and
pr P(oq)
x = ao + a\ p = ao = x
P'(ao) P\ao)
6. Hensel's Philosophy 47
exactly as in the classical Newton approximation method. With this choice, we have
P{x) = P(a0 + ax p) = 0 mod /?2.
We shall occasionally use the notation
P(JC)
A^Ot) = x —
for the Newton map. It is obvious that x = Np(x) can be/^r from x when P'(x)
is 5W2^//.
Newton's method
6.4. The Newtonian Algorithm
In this section we show that even when the derivative vanishes mod /?, we can
still construct a better approximation of a root of P = 0, but we have to be less
demanding concerning its location.
Proposition. Let P e ZP[X] and x e Zp be such that P{x) = 0 mod pn. If
* = v{P\x)) < /i/2, then x = W/>(jc) = jc - P(x)/P'(x) satisfies
(1) P(x) = 0 mod pn+l (a definite improvement).
(2) x ~ x mod pn~k (a controlled loss),
(3) v(P'(x)) = i;(P'(jO) (= A:) fart invitation to iteration).
Proof. Put />(*) = /?"}> forsomey e Zp, and />'(*) = pku forsomeunitw e Z*.
% definition of Jc,
x — x
PW
-/? yu e p Zp
Un the other hand, still by choice of Jc, the first two terms of the Taylor expansion
of the polynomial P at the point x cancel each other:
PW , 9
P{x) = P(x) - ~ P\x) + (* - x)2 ■ f.
P'(x)
48 1. p-adic Numbers
By (6.2) the t in the last term belongs to Zp. Hence
P(x) = (Jc - xf • t e p2n~2k Zp = p"- pn~2k Zp c pn+lZp
(recall that 2k < n). It only remains to compute the order of P'(x). For this, we
use a first-order Taylor expansion of P' at the point x (6.2):
p'(jt) = P'(X + (Jc - x)) = P'(jc) + (Jc - jc) - s
= PkU + /?"-*£ - J = p*(« + pn~2kZS) = p*U.
Since « — 2& > 0, and since u is a unit,
i; = u + pn-2*ZJ gm + pZp C Z*,
which proves v(P'(x)) = k as claimed. ■
Theorem (HensePs Lemma). Assume that P e ZP[X] andx e Zp satisfies
P(x) = 0modpn.
If k = v(P'(x)) < n/2, then there exists a unique root f of P in Zp such that
f = x mod pn~k and v(P'tf)) = v(P\x)) (= *).
Proof. Existence. Let xq — x and construct an improved root x\ eZp,
xY == x0 mod pn~k and P^) = 0 mod pn+\ v(P'(xi)) = v(P'(x0)) (=k).
Similarly, we can find an improvement x2 of the approximate root xi in the form
of a p-adic integer satisfying
jc2 = Xl mod p71*1"* and P(jc2) = 0 mod p"+2.
Iterating the construction, we get a Cauchy sequence (xn)„>0 having a p-adic limit
f satisfying P(f) = 0 and f = x mod p71"*.
Uniqueness. Let f and ?y be two roots of P satisfying the required conditions:
In particular,
tj = £ modp"-*,
and since n > 2k, we have « — k > & + 1, and a fortiori
?7 == f mod p*+I.
Now,
P{rft = Pg) + f(f )(? - f) + (17 - Hfa
6. Hensel's Philosophy 49
for some p-adic integer a. Hence
(?7-?)(/3,(?) + (?7-^)=0.
But
£U) + (u^jfr =f 0,
order k order > k + I
so that the only possibility is ?7 — f = 0, and uniqueness follows. ■
Note that the uniqueness part of the proof shows that f is the unique root
satisfying the a priori weaker congruence f = x (mod pk+x).
6.5. First Application: Invertible Elements in Zp
Let us consider the first-degree polynomial P(X) = aX — 1, where a ^ 0 is a
p-adic integer. In order to be able to find an approximate root mod /?, we have to
assume that a g pZp (in the p-adic expansion of a, the constant term «o ^ 0).
When this is the case, Pr(X) = a and k = v(P'(x)) = 0, and any root mod p can
be improved to a root mod pn (n > 2). Eventually, we find a genuine root in Zp9
which means that a is invertible in this ring. Thus we have another "proof of the
implication
a eZp- pZp ==» a e Z*.
However, this proof is deceptive, since Newton's method assumes a priori that we
know how to divide: In the first step we are led to replacing x by
P(x) ax-\ 1
x = x = x = — (!)
P\x) a a
Numerically, it is better to apply Newton's method to the rationalfunction f(X) =
l/X - a, for which f\X) = -l/X2. Hence
x = Nf(x) = x - 4^- = x + x2/0c) = 2x - ax2.
With this function, Newton's method uses ^polynomial, and no division is required
to evaluate the successive approximations of the inverse.
6.6. Second Application: Square Roots in Qp
Consider now the quadratic polynomials P(X) = X2 — a, where a is a /?-adic
integer. It is obvious that such an equation can have a root x in Zp only if v{a) =
is even. Then if we divide a by a suitable even power p2m of p, we
are brought back to the case v(a) = 0, namely a e Z*. Since P'(x) = 2x, we see
that the case p = 2 has to be treated separately.
50 1. p-adic Numbers
Case p odd. Hensel's lemma will apply as soon as we can find an approximate
root mod p. But we know that in the cyclic group F* squares make up a subgroup
of index two. The quadratic residue symbol of Legendre distinguishes them:
+1 if a is a square mod p,
—1 if a is not a square mod p.
Let us choose an integer 1 < a < p that is not a square mod p. Then the three
numbers a, p,ap have no square root in Qp. They make a full set of representatives
for the classes mod squares
Q;/(Qp2 = (PZ/P2Z) x (Z*/(Z*)2) ^ Z/2Z x Z/2Z.
Since every quadratic extension of Qp is generated by a square root of an element
(every quadratic extension of a field of characteristic 0 is generated by a square
root), we see that we obtain all quadratic extensions of the field Qp for p > 3 —
up to isomorphism — in the form of the three distinct fields
QP(Va~l Qp(Vp), Qp(Vw).
Case/? = 2. Observe that Z£ = 1 + 2Z2, since the only possibility for the nonzero
constant digit is 1. Now we have
a e Z£ is a square <£==> a e I + 8Z2.
Proof. If a = b2 e Z* for some b = 1 + b{l + b222 + - - • = 1 + 2c, then b2 =
1 + 4(c + c2), and since c = c2 mod 2Z2, we have b2 e 1 + 8Z2 as claimed.
Conversely, if a = 1 mod 8Z2, we can apply Hensel's lemma to the resolution
of the equation X2 — a = 0, starting with the approximate solution x = 1. By
assumption, this is an approximate solution mod 23 (n = 3 > 2k = 2 is suitable).
We get an improved solution jc,
x2 = a mod 23 but x = x mod 22 only,
since w — & = 3 —1 = 2. By iteration, we get an exact root f = 1 mod 4 satisfying
x2 = am Z2. ■
We have
Q27(Q2X)2 = (2Z/22Z) x (Z2X/(Z2X)2).
Since
Z2X = 1+2Z2 = {±1}-(1+4Z2),
we also have
Z2X/(Z2X)2 = {±1} x (1+ 4Z2)/(1 + 8Z2),
©
6. Hensel's Philosophy 51
so that finally
Q2 /(Q2)2 = Z/2Z x Z/2Z x Z/2Z-
There are — up to isomorphism — seven quadratic extensions of the field Q2- They
are obtained by adjoining roots of elements in the nontrivial classes of Q* /(Q£ )2-
If we choose the elements
-1, ±(1+4) = ±5, ±2, ±2-5,
we get the seven nonisomorphic quadratic extensions
Q2(VrT), Q2(V±5), Q2(x/±2), Q2(V±l0).
Examples. (1) Since 32 == 1 mod 8, x = 3 is an approximate root of x2 — 1=0.
Newton's method leads to the improvement x = 7, which is an improved solution
mod 16, but we only have 7 = 3 mod 4 as the theory predicts (and there is no
exact root f = 3 mod 4, since the only roots are f = ±1).
(2) Since a = — 7=1— 8 = 1 mod 8, we obtain
V=7gZ2xcQ2.
(3) The preceding considerations prove that the equations
X2 + l = 0andX2-3 = 0
have no solution in Q2. The polynomials X2 + 1 and X2 — 3 are irreducible in
Q2[X].
We shall determine later the structure of the multiplicative group 1 + 4Z2.
6.7. Third Application: nth Roots of Unity in Zp
Letf beanyrootofunityinQp,sayf" = l.Then/ii;(f) = r;(l) = 0andv(£) = 0.
This proves that all roots of unity in Qp lie in Z* C Q*. In particular, each root of
unity has a well-defined reduction mod p, e(f) e F*. Let us show that the group
Z* contains roots of unity in each class mod pZp, i.e. above each element of F*
The polynomial P(X) = Xp~l-l has derivative Pf(X) = (/7-l)X^2.Forany
unit x eZ*, k = v(P'(x)) = 0, and the simplest case (6.3) of the approximation
method applies. Since the polynomial Xp~l — 1 has p—l distinct roots in the
field F^, namely all elements of F*, Hensel's lemma furnishes p—l distinct roots
in Z*. This shows that the field Qp of p-adic numbers always contains a cyclic
subgroup of order p—l,
consisting of roots of unity.
52 1. p-adic Numbers
Proposition 1. When p is an odd prime, the group of roots of unity in the field
Qp is fip^.
Proof. We have to prove that the reduction homomorphism e : /x(Qp) -> F* is
bijective. It is surjective by Hensel's lemma. So assume that f = 1 + pt e kerf
(r e Zp) is a root of unity, say f has order n > 1,
f" =(1+^ = 1-
Hence «/?r + ©P2^2 H + Pntn = 0, or
This shows that r = 0 (when p\n) or /? | n. In the second case, replace f by f p and
n by «//?: Starting the same computation, we see that t = 0 or p2 \ n (original
n), and so on. Finally, we are reduced to the case n = p. In this case, the above
equation is simply
and since p > 3,
/n)pt + - - - + p^r*-1 = p + p2(- - -) ^ 0.
This proves that t = 0 in all cases and f = 1.
'♦GD'
When /? is odd, /? — 1 is even and —1 belongs to /zp_i. The number —1 will
have a square root in Qp precisely when (p — l)/2 is still even, namely when
p = 1 mod 4. We have
yf^\ e Qp <=> 4|p-l <=> /? = 1 mod 4.
A number i = V—T can thus be found in Q5, Q13, ...
Proposition 2. The group of roots of unity in the field Q2 is /z2 = {±1}.
Proof. We have
-1 = l+2 + 22 + --- g 1+2Z2
and
{±1} = /x2cZ2x = 1+2Z2.
On the other hand, F£ = {1}, and the only roots of unity in Z£ have order a power
of 2. But —1 is not a square of Z£ (6.6), and there is no fourth root of 1 in Q2.
6. Hensel's Philosophy 53
To summarize, we give a table.
Field
Q2
Qp
p odd prime
Units
Z2X = 1+2Z2
z;di+pzp
index p-1
Squares
1+8Z2
index 4
inZ*
index 2
inz;
Roots of unity
M2 = {±1}
Number of
quadratic
extensions
7
3
6.& Fourth Application: Field Automorphisms of Qp
It is possible to determine all automorphisms of the field Qp (over the prime
field Q). For this purpose, we need a lemma.
Lemma. Letx e Q* Then the following properties are equivalent:
(i) x is a unit: x eZ*
(11) xp~~l possesses nth roots for infinitely many values of n.
Proof. If x is a unit, then x ^ 0 mod pZp and xp~~l = 1 mod pZp. Let us put
a = xp^1 and consider the equation P(X) = Xn — a = 0. It has an approximate
root 1 mod p, and when n is not a multiple of p, P'(l) = n does not vanish mod
p. By Hensel's lemma, there is an exact solution of this equation, namely there
exists an element § e Zp such that fn = a = x*7"1. This proves (1) => (11).
Conversely, if xp_1 = y£, we have
(p - l)v(x) = /i • v(y„)9
and « divides (p — l)t>(x)- This can happen for infinitely many values of n only if
v(x) = 0; hence x is a unit (we are assuming x ^ 0 from the outset). ■
Theorem. 77ze only field automorphism of Qp is the identity.
Proof. Let <p be an automorphism of the field Qp. By the algebraic characterization
of units of Q *, the automorphism <p must preserve units. Hence if x e Q * is written
in the form x = pnu (where n = v(x) and u e Z* is a p-adic unit), we shall
have
P(*) = ^(P11") = <P(p"Mu) = pn(p(u)
and v(<p(x)) = n = v(x). This shows that the algebraic automorphisms of the field
Qp preserve the /?-adic order: They are automatically continuous. Now, if y e Qp
is an arbitrary element, we can take a sequence of rational numbers rn e Q with
54 1. p-adic Numbers
rn —► y. For example, we can take these rational numbers by truncating the p-adic
expansion of y. Now, since the automorphism <p is trivial on rational numbers,
(p(y) = <P( Mm yn) = lim <p(yn) = lim yn=y. ■
/i->oo n-^-oo m->oo
Note. The preceding theorem is similar to the following well-known result:
The only algebraic automorphism of the real field R is the identity.
Indeed, if <p is a field automorphism of R, we have (p(x2) = (p(x)2 for all x, and
hence <p(y) > 0 for all y > 0 (write y = x2), and then also
(p(u) < cp(v) for all « < i;
(put y = v — u). This means that these algebraic automorphisms automatically
preserve the order relation <. Since they must be trivial on the prime field Q, they
must be trivial. In detail: If t e R and a, b e Q, then
a < r <b =» a — <p(a) < (p(t) < (p(b) = b.
Thus we see that
\<p(t)-t\ sb-a
is arbitrarily small; hence <p(t) — t — 0.
Comment. Let us stress that in both the p-adic and the real cases, we are
considering purely algebraic automorphisms over the prime field Q: The proofs show
that they are automatically continuous, and hence trivial. But there are infinitely
many automorphisms of the complex field C: Only two of them are continuous,
namely the identity and the complex conjugation. For example, the nontrivial
automorphism
a + byfly-> a-b4l (a,beQ)
of the field Q(\/2) extends to any algebraically closed extension of this field;
in particular it extends to C. This extension is a discontinuous automorphism
ofC.
Appendix to Chapter 1: The p-adic Solenoid
The fields R of real numbers and Qp of p-adic numbers can be linked in an
interesting topological group, the solenoid. We present a couple of constructions
and properties of this mathematical structure.
Appendix to Chapter 1: The p-adic Solenoid 55
AJ. Definition and First Properties
The canonical group homomorphisms
<pn : R/pn+1Z -+ R/pnZ, x mod pn+lZ i-» x mod pnZ (n > 0)
make up a projective system (R/pnZ, (pn)n>o of topological groups.
Definition. The p-adic solenoid S^ is the projective limit Sp = limR/p^Z of
the projective system (R/pnZ, (pn).
By definition, the solenoid Sp is a compact abelian group equipped with
canonical projections
Ife : Sp-► R/p"Z (/!>0)
that are continuous surjective homomorphisms. In particular,
^ = ^0:Sp^ R/Z
is continuous and surjective, and the solenoid can be viewed as a covering of the
circle. The kernel of this covering is obviously ker ty = limZ//?"Z = Zp, and we
have the following short exact sequence of continuous homomorphisms,
0 -> Zp -+ Sp -* R/Z -► 0,
presenting the circle as a quotient of the solenoid, or the solenoid as a covering of
the circle with fiber Zp. Also observe that
pnZp = ker(^) c Zp = ker(^) c Sp.
Alternatively, one could define the solenoid as the projective limit of the system
having transition homomorphisms
tp'n : R/Z -> R/Z, x mod Z h> px mod Z (/i > 1).
A2. Torsion of the Solenoid
We recall the following well-known fact:
For each positive integer m > 1 rfrere is a unique cyclic
subgroup of order m in the circle: It is m~~lZ/Z C R/Z.
Proposition 1. For each positive integer m > 1 prime to p the solenoid Sp has
a unique cyclic subgroup Cm of order m.
Proof. Let us denote temporarily by C^ the cyclic subgroup of order m of the
circle R/pnZ (it is the subgroup m~~1Z/pnZ). Since the transition maps
<pn : R/p"+IZ —> R/p"Z
56 1. p-adic Numbers
have a kernel of order p prime to m (by assumption), they induce isomorphisms
Cm+1 —► Cm. The projective limit of this constant sequence is the cyclic subgroup
Cm C Sp. To prove uniqueness, let us consider any homomorphism a : Z/raZ —►
Sp. The composite
$n ocr : Z/mZ -► Sp -► R/pnZ
has an image in the unique cyclic subgroup Cm of the circle R/pnZ. Hence cr has
an image in Cm, and this concludes the proof. ■
Observe that this unique cyclic subgroup Cm of order m (prime to p) of Sp has
a projection $(Cm) in the circle given by
ijf{Cm) = m~lZ/Z C R/Z.
Since $~l(m~lZ/Z) = Cm x Zp, the cyclic group C, is the maximal finite
subgroup contained in ilr~l(m"1Z/Z).
Proposition 2. The p-adic solenoid Sp has no p-torsion.
Proof. Let o : Z/pZ —> Sp be any homomorphism of a cyclic group of order p
into the solenoid. I claim that all composites
(pn o ^+i o cr : Z/pZ -► Sp -> R/p"+1Z -+ R/pnZ
are trivial. Indeed, the composite
^„-h o cr : Z/pZ ^Sp^ R/p"+lZ
must have an image in the unique cyclic subgroup of order p of the circle R/p"+lZ,
and this subgroup is precisely the kernel of the connecting homomorphism <p„ and
-tyn o o = tpnityn+i o cr). Consequently, there is no element of order /? in Sp (and
a fortiori no element of order pk for & > 1 in Sp). ■
A3. Embeddings ofK and Qp in the Solenoid
Theorem. The p-adic solenoid contains a dense subgroup isomorphic to R. It
also contains a dense subgroup isomorphic to Qp.
Proof. The projection maps /„ : R —> R/p^Z are compatible with the transition
maps of the projective system defining the solenoid
fn = VnO /„+! : R "* R/pn+1Z -* R/pTZ.
Hence there is a unique factorization / : R —> Sp such that
fn = ^nof:R^Sp^ R/p"Z.
Appendix to Chapter 1: The p-adic Solenoid 57
If x ^ 0 G R, as soon as pn > x we have fn(x) ^Oe R/pnZ and consequently
f[x) ^ 0 G Sp. This shows that the homomorphism / is injective (this also
follows from (1.4.5), since f]w>i ker /„ = fl„>i P"z = i0))- Tne density of the
image of / follows from the density of the images of the /„ (1.4.4, Proposition 3)
(in fact, all fn are surjective). Consider now the subgroups
Hk = f-\p-kZ/Z) dSp {k > 0).
We have //0 = Zp by definition, and this is a subgroup of index pk of Hk:
Hk = lim p~kZ/pnZ = p~kZp (k > 1).
Hence
Qp = f-l<Z[l/pyZ) ^U^~\p-~kZ/Z) = \jHkc Sp.
The density of this subgroup of Sp follows from the density of all images
VUQp) = Z[l/p]/pnZ c R/pnZ
(1.4.4, Proposition 3). ■
Corollary. The solenoid is a {compact and) connected space.
Proof. Recall that for any subspace A of a topological space X we have
A connected, A c B C A ==» B connected.
In our context, take for A the connected subspace f(R) c Sp, which is dense in
the solenoid. The conclusion follows. ■
Let us summarize the various homomorphisms connected to the solenoid in a
commutative diagram.
Li ^-> Lip = Lp
\ ■** ^
R <-+ Sp <- Qp
R/Z = R/Z <- Qp/Zp
A.4. The Solenoid as a Quotient
The sequence of continuous homomorphisms
fn:RxQp-> R/pnZ, (t, x) i—> t + ^T^// mod pnZ
i<zn
(if x = J2v<i<oeaiP1' v = ordp(x)) is compatible with the sequence of
connecting homomorphisms defining the projective limit Sp. Hence there is a unique
58 1. p-adic Numbers
factorization consisting of a continuous homomorphism
/:RxQp^Sp, (r, x) h> t + x
having composites $n ° f — fn- Alternatively, the two injective continuous ho-
momorphisms 71 : R -> Sp, j2 : Qp -> Sp furnish a unique continuous
homomorphism
which coincides with the preceding one (we are identifying the product and the
direct sum). This homomorphism / will therefore be called the sum
homomorphism.
Lemma. The kernel of the homomorphism f defined above is the subgroup
ker/ = T = {(a, -a): a e Z[l/p]} C R x Qp.
It is a discrete subgroup of the product R x Qp.
Proof. If f(t,x) = 0, we have in particular fo(t,x) = ^0 ° f(t,x) = 0 e
R/Z, namely t + £,-50 arf e Z> f e -£i<o arf+ Z c Z[l/p]. Similarly,
f„(t,x) = Ogives
r + J] fl,y e p"Z (/i > 1).
This proves that the p-adic expansion of the element teZ[l/p] is given by
t = — IimJ]/<:/1 a,-/?1 in Qp. Hence t = — x e Qp. Conversely, it is obvious
that T C ker/. Let us show that the (closed) subgroup T is discrete. For this it
is enough to show that a suitable neighborhood of 0 in R x Qp contains only the
neutral element of T. Consider the open set
(-1,1) xZpcRxQp.
If a pair (a, —a) is in T Pi (—1, 1) x Zp, then the p-adic expansion of a e Z[l/p]
must be of the form £/>0 ai pl - But we have seen (1.5.4) that in the /?-adic field Qp,
the intersection Z[l/p]DZp = Z contains only the rational integers. In particular,
a eZ f) (-1, 1) ={0}.Hence
rn((-U)xZp) = {0}cRxQp,
and the proof is concluded. ■
Theorem. The sum homomorphism f : R x Qp —> Sp furnishes an
isomorphism /': (R x Qp)/ rp = Sp both algebraically and topologically.
Proof. Since all maps /„ are surjective, the map / has a dense image (1.5.4).
Moreover, using the integral and fractional parts introduced there,
/(r, x) = f(t + (x), x - (x)) = f(s, 30,
Appendix to Chapter 1: The p-adic Solenoid 59
where s eR and y = x — (x) = [x] e Zp. Going one step further, we have
fts, y) = f(s - [s], y + M) = /(«, z),
where u = s — [s] e [0, 1) and z = y + [s] e Zp. This proves
bn / = /(R x Qp) = /([0, 1) x Zp).
A foniori, the image of / is equal to /([0, 1] x Zp), and hence is compact and
closed. Consequently, / is surjective (and /' is bijective). In fact, the preceding
equalities also show that the Hausdorff quotient (recall that the subgroup Tp is
discrete and closed) is also the image of the compact set fi = [0,1] x Zp and
hence is compact. The continuous bijection
/':(RxQp)/rp^Sp
between two compact spaces is automatically a homeomorphism. ■
Corollary 1. The solenoid can also be viewed as a quotient o/RxZp by the
discrete subgroup Az = {(rn, —m): m E Z}
/': (R x Zp)/Az = Sp.
Proof. Since the restriction of the sum homomorphism / : R x Qp —► Sp to the
subgroup R x Zp is already surjective, this restriction gives a (topological and
algebraic) isomorphism
/r:(RxZp)/kerr = Zp.
But
ker/' = (ker/) fl(Rx Zp) = Az = {(m, -m) :meZ}. m
These presentations of the solenoid can be gathered in commutative diagrams
of homomorphisms:
Z[l/p] Z
/ \ / \
R QP R Zp
\ i/ \ i/
&p Sp
Corollary 2. 77ze solenoid can also be viewed as a quotient of the
topological space [0, 1] x Zp by the equivalence relation identifying (l,x) to
(0,x + l) (xeZp).
60 1. p-adic Numbers
Proof. This follows immediately from the previous corollary, since the
restriction of the sum homomorphism to [0, 1] x Zp is already surjective, whereas its
restriction to [0, 1) x Zp is bijective. ■
Comment. This last corollary gives a good topological model of the solenoid:
One has to glue the two extremities of the cylinder [0, 1] x Zp having basis Zp
by a twist representing the unit shift ofZp. This gives a model for the solenoid
as a very twisted ropel On the other hand, it is clear that instead of the subgroup
r = Z[l/p] consisting of the elements (a, —a) (a e Z[l/p]) we could equally
well have taken the diagonal subgroup A, image of
a h> (a, a): Z[l/p] -^Rx Qp,
the isomorphism (R x Qp)/A = Sp now being given by subtraction.
A.5. Closed Subgroups of the Solenoid
Lemma. Let o : Cpm —► Cp«-i be a surjective homomorphism between two
cyclic groups of orders pm and pm~l. Then the only subgroup H C Cpm not
contained in the kernel of a is H = Cpm.
Proof. Recall that any subgroup of a cyclic group is cyclic and that the number of
generators of Cn = Z/nZ is given by the Euler ^-function (p(n). In particular, if
n = pm is a power of p, the number of generators is
<p(pm) = Pm~\p -l) = pm- pm-1.
Consequently, all elements not in the kernel of a surjective homomorphism of a
cyclic group of order pm onto a cyclic group of order pm~l are generators of the
cyclic group of order pm (the kernel has order pm~1). ■
Proposition. For each integer k > 0, there is exactly one subgroup H^ c Sp
having a projection of order pk in the circle: ^(//*) = p~~kZ/Z C R/Z. This
subgroup is /& — ^~~l{p~~kZ/Z) C Sp.
Proof. We can apply the lemma to each surjective homomorphism
p~kZ/pn+lZ -+ p~kZ/pnZ
in the sequence of connecting homornorphisms defining the solenoid as a projective
limit. The projective limit of these cyclic groups is p~~kZp. ■
As a preliminary observation to the following theorem, let us assume that the
solenoid contains a cyclic subgroup H of some finite order m > 1. Taking a
generator* of H and n large enough so that jfrn(x) ^ 0, we see that the restriction of this
homomorphism Vr„ to //must be injective. A fortiori, the restriction of ^n+\ (and all
isNforN > n) to //must be injective. The restriction of <p„ : R//?"+lZ -> R/pnZ
Appendix to Chapter 1: The p-adic Solenoid 61
to i/n+i(H) must be injective. Hence H = ^/n+\(H) has no element of order p
and m is prime to p.
Theorem. The closed subgroups of the solenoid Sp are
(1) Cm, the cyclic subgroup of order m relatively prime to p (m > 1),
(2) Cm x pkZp, where m is prime to p and k e Z,
(3) Sp itself {connected).
Proof. Let H be a closed subgroup of the solenoid Sp. Since H is compact, its
image ^(//) is a closed subgroup of the circle R/Z. The only possibilities are
ir(H) = n~xZ/Z cyclic of order n > 1,
or
j/(H) = R/Z is the whole circle.
(1) The easiest case is the second one,
^(//) = R/Z is the whole circle,
in which case \fr„(H) C R/z^Z must be a closed subgroup offinite index. Hence
it must be open in this circle. By connectivity, irn(H) C R//?nZ. Since this must
hold for all n > 1, we conclude that
H = H = Plf~\UJT)) = Sp
and H — Spin this case.
(2) If f (//) = {0}, then H c ^(0) = Zp c Sp, and we have shown in (3.5)
that the only possibilities are
H = {0}, pkZp for some integer k > 0.
These possibilities occur in the list for Cm = {0} (m = 1).
(3) We can now assume that ^(//) = a~~lZ/Z is cyclic and not trivial. Write
a = pfi • m with & > 0 and m prime to p. By the Chinese remainder theorem (or
the p-Sylow decomposition theorem) this cyclic group is a direct product of the
cyclic subgroups m~lZ/Z and p~~kZ/Z. If k > 1, the above lemma shows that
^+l(//) must contain an element of order /?*+I. As in the proposition, we see
that H contains ^~\p~kZ/Z) = p~kZp C Sp, and finally // = Cm x /T*ZP. If
fc = 0, two possibilities occur: Either ^n(H) is cyclic of order m for all«, or there
is a first n such that this group irn(H) contains an element of order p. In the first
case H = Cm, while H = Cm x pnZp in the second. ■
A. 6. Topological Properties of the Solenoid
We have seen in (I. A.4) that the solenoid Sp can be viewed as a quotient of the
cylinder [0, 1] x Zp, and an image of [0,1) x Zp. This leads to considering the
62 1. p-adic Numbers
second projection of this product as a (discontinuous) map (r, x) h* x. This map
has continuous restrictions to all subspaces [0, n] x Zp (0 < n < 1). It furnishes
continuous retractions of these subspaces onto the neutral Zp-fiber of the solenoid
Recall that we have a continuous surjective homomorphism iff : Sp -> R/Z
leading to a presentation of the solenoid by the short exact sequence of continuous
homomorphisms
0 -► Zp -► Sp -► R/Z -» 0.
The subspaces ir~l([0, rj]) (0 < rj < 1) have continuous retractions on the fiber
Zp, simply since ^"^([G, /?]) is homeomorphic to [0, n] x Zp. The following
statement is then an immediate consequence of these observations.
Proposition 1. Let U be any proper subset of the circle R/Z. Then the sub space
if~~l(U) CSP of the solenoid is homeomorphic to U xZp. The map
(t, X) = (t- [t], X + [t]) H> (0, X + [*])
furnishes by restriction a continuous retraction of ijf~l([0, n]) c Sp onto the
neutral fiber Zp C Sp (0 < 77 < 1). ■
The solenoid has still another important topological property that we explain
and prove now.
Definition. A compact and connected topological space K is called
indecomposable when the only partition of K in two compact and connected subsets is
the trivial one.
Proposition 2. The solenoid Sp is an indecomposable compact connected
topological space.
Proof. Let us take two compact connected subsets A and B covering Sp. We have
to show that if A ^ Sp, then B = Sp. Thus we assume A ^ Sp from now on:
B ^ 0. Since we have
/2>1
for every compact set K, the assumption A ^ Sp leads to jJrn(A) ^ R/pnZ for
some integer n = n0 and hence also for all integers n > n0 (the transition maps
<pm are surjective). It will suffice to show jJr„(B) = R/pnZ for all n > no. Take
such an n and an element b e B. Then
V-\b)cR/pT+1Z
has cardinality p > 2, and the restriction of ^>„ to the connected set
Exercises for Chapter I 63
is not injective. The proof will be complete as soon as the following statement (in
which the situation and notation are simplified) is established.
Let a > 1 be any integer, <p : R/aZ -+ R/Z the canonical projection, and C a
connected subset of R/aZ containing two distinct points s ^t with <p(s) = <p(t).
Then <p(C) = R/Z.
In terms of the restriction <p\c of the map <p to C, we have to prove
<p\c not injective =» <p\c surjective
under the stated assumptions. It is obviously enough to do so when C ^= R/aZ,
In this case, take a point P & C C R/aZ and consider a stereographic projection
from the point P of the circle R/aZ onto a line R. This is a homeomorphism
f:R/aZ-{P}^R.
The image /(C) of the subset C is a connected subset of the real line containing
the images of two different congruent points mod Z. Since any connected set in the
real line is an interval, this proves that /(C) contains the whole interval J linking
these two different congruent points. Hence C contains a whole arc / of the circle
having image <p(I) = R/Z. ■
Exercises for Chapter 1
1. Compute the squares of the following numbers
6, 76, 376, 9376, ....
Show that one can continue the sequence in a unique way: For example, the number
743 740081787109376
appears in the 18th position. Define the limit
a := 2_]ai 10' = - - - a^a^a^a^a^axa^ = ■ - ■ 109376
i>0
as a 10-adic integer: a € Zio- Give the 10-adic expansion of — 1.
Observe that by definition a2 = a, and find the four solutions 0, 1, a, ft of x2 = x
in Zio- What are a + /3, a/3 ?
Prove that Zio = Z5 x Z2. {Hint. Consider the map x h-> (ax, fix).)
2. (a) Give the 5-adic expansion of the integers 15, —1, —3. The integers 2, 3, 4 are
invertible in Z5: Give the 5-adic expansions of the inverses. Give the expansion of
^inZ7.
(b) What is the p-adic expansion of \ if the prime p is odd?
(c) If / is a positive integer, give the expansion of 1/(1 — pf) in Zp.
(d) More generally find the expansion ofl/m in Zp when the integers is not divisible
by p. (Hint Let / be the multiplicative order of p mod m so that pf — 1 = wm.
Then use \/m — —n/(\ — pf).)
64 1. p-adic Numbers
3. (a) Show x e pnZp <=> —x € pnZp and so ordp(—x) = ordp(x)-
(&) Check as in (1.5) that if a € Zp, then (1 + pna)~l - 1 + p"a' for some a' e Zp.
(c) Using the p-adic metric, reformulate (b) in the form
if 0 <r < l,then
\x - 11 < r, | J — 11 < r =^ |jcj - 11 < r.
(J) Let ex denote the involution introduced in (1.2). Show that cr(B<r (a)) = B<r(cr(a)).
4. Show that there is a square root of 2 in Zj. (Compute the first coefficients in a =
<2q + a\l -\-ail1 H iteratively using a2 = 2; do not be surprised if no regular pattern
appears: The same happens for the computation of the decimal expansion of \/2 in R;
cf also (1.5.3).)
5. (a) Solve the equation x2 = 1 in all Z/2PZ (az > 1). Guess the result by making a small
table with the first values n < 4 or 5.
(Hint. Consider separately the cases n = 1, 2, > 3. When w > 3, observe that if
x2 = 1, then x is the class of an odd integer 2k + 1 (0 < * < 2W_1), and 4*(* + 1)
has to be divisible by 2n. In (VII. 1.7) we show that the unit group in Z/2nZ is a
product of two cyclic groups (n > 3), from which the result also follows.)
(b) Solve the equation x2 = 1 in Z2.
6. (a) Let N be a positive integer. Show that the subset {N, N + I, N +2,...} is dense
inZp.
(b) For which values of a and b € Zp is the subset a + bN dense in Zp?
(c) Show that the subset {—1, —2, —3,...} is dense in Zp.
7. Let jp : Q -> Qp denote the canonical injection.
(a) Determine the subring j~l(Zp) of the field Q (this subring is simply written Q D
Zp = Z(p)). What is jp\Zp) n Z[l/p]?
(b) Show that
f| j;l(Zp) = Z.
p prime
(this equality is sometimes simply written PL(Q n Zp) = Z).
8. Let X be a nonempty set and E = XN the set of sequences in X. For two different
sequence a = (an), b = (bn) let us put
d^ b>> = -^ inrr = - ■
min{fl : an ^ £„} v
{a) Show that J defines an ultrametric distance on E.
(b) Show that E is complete for the preceding metric.
9. The distance between two subsets A, B of a metric space is defined by d(A, B) =
infaeA,beB d(a, b). Show that if the metric d is ultrametric, then
'd(a.b) ifr <d(a,b),
d(BMa),BMb))={
0 if r > d(a,b).
More generally, the distance of two disjoint balls B, B' is equal to the constant value
of d(x, xr) forx e B, xf e B'.
Exercises for Chapter 1 65
10. Let A' be a (commutative) field and let ^[[X]] be the ring of formal power series
/(*) = En>o"nXn. ChooseO < 6 < 1 andforg(X) = £n>0^X" ^ /(X), define
d{f{x\ g{X)) = e™{n'a^b»].
Show that d defines an ultrametric distance on #[[X]] for which this space is complete.
Show that the space of polynomials K[X] is dense in #[[X]], and hence this is a
completion of the space of polynomials. The ball {/(X) : d(f(X), 0) < 6n] is the ideal
(Xn) = X"K[[X]]. The fraction field K((X)) = A[[X]][X"1] consists of the Laurent
series £1>v anXn (v e Z). It is a completion of the ring K[X, X-1].
11. Let 6 > I and for any nonzero polynomial / € R[X] define |/| = 6^f. Extend this
definition by |0| = 0 and \f/g\ — \f\l\g\ for a rational fraction f/g e R(X). Show
that this defines an ultrametric absolute value on the field R(X).
12. Let E be a compact metric space and / : Z2 —► E be a continuous surjective map. For
each ball B C Z2 of positive radius, let AB = f(B) be the compact image of B in the
space E. Observe that
AB = AB' U AB» \iB = B!U B",
f| AB = [f(x)}.
Bbx
Conversely, recall that Mi denotes the free monoid generated by two letters, say 0 and
1, and V(E) denotes the set of parts (power set) of E. For any map tp : M2 -> V(E)
having the properties
(a) (p(0) = E, (p(w) = tp(wO) U (p(wl) (w € M2),
(b) S((p(wn)) -> 0 when the wn are the initial segments of an infinite word,
(c) fl (p(wn) ^ 0 when the wn are the initial segments of an infinite word,
show that there exists a continuous surjective map
/ : Z2 -> E such that f(Bw) = (p(w).
13. Let E be a compact metric space. Show that there exists a continuous surjective map
/ :Z2 —> E. In other words, the metric space is a topological quotient of the space
Z2. {Hint. Let (tfi)i<j<* be a covering of E by closed sets of diameter < I. If k > 1
call A0 = tf0 U ■ ■ ■ U Kt and Ax = Ki+l U--UKk with, e.g., I = [k/2]. If £ > 1,
start again and define similarly shorter unions Aoo, A01 such that Ao = Aoo U Aoi-
This leads to finitely many words wt so that Kt = AWj. Proceeding similarly for each
of them, show how to define a map tp : M2 —> E having the properties listed in the
previous exercise.)
Conclude that all spaces Zp are homeomorphic to Z2-
Give an explicit continuous surjective map Z2 —> {1/az : n > 1}
14. Let E be a compact metric space. Show the equivalence
(i) there is a continuous surjective map / : [0, 1] -> E,
(ii) E is path-connected.
(Hint. Use the previous exercise to construct a continuous surjective map /o: C ->
E. where C is the Cantor subset of the unit interval, and extend /o through the
missing intervals — this is possible if the space E is path-connected.)
66 1. p-adic Numbers
In particular, for every compact, convex subset A' of a (real or complex) Hilberr
space, there is a continuous surjective /:[0, 1] -> E ("space-filling curve" or
Peano curve).
15. Let £ be a compact metric space with the following properties:
(a) E is totally disconnected.
(b) E has no isolated point (hence is not a singleton set!).
Show that E is homeomorphic to Z2.
16. Show that the planar fractal image of Z5 is path-connected when it is connected (cf.
picture in text).
17. Construct a planar model of Q7 using v : {0, 1,... 6} -> C defined by
v(0) = 0, v(j) = ejm2*i/6 (1 < j < 6).
Observe the appearance of the von Koch curve in the image of
*7={X>*7" l<fl„ <6|cQ7.
U>o J
18. (a) Give an example of a discrete subset of [0, 1] C R that is not closed.
(b) Prove that if A is a discrete subset of a Hausdorff topological space X, then A is
open in A (the same is true for any locally compact subset in a Hausdorff space).
(c) Let G be a topological group that is Hausdorff, and T a discrete subgroup. Prove
directly that T is closed in G (cf. 1.3.2).
(d) Let G be a group having more than one element, let Gj denote the topological
group G with the discrete topology, and let Go denote the topological group G
with the topology having only 0 and G as open sets (not Hausdorff!)- Prove that
r = Gd x {e} is a discrete subgroup of the topological group Gd x Go- What is
its closure?
19. (a) Let # be a normal subgroup of a topological group G. Prove that the subgroup H
is also normal.
(b) For any topological group G, the quotient G/{e} is a Hausdorff topological group.
(c) Let H be a closed subgroup of a locally compact (topological) group G. Prove that
the space G/H is locally compact.
(d) Let G be a locally compact totally discontinuous group, so that the connected
component of the neutral element in G is {e}. Prove that any neighborhood of the neutral
element contains a clopen subgroup. (Hint. Start with a compact neighborhood K
of e. There is a clopen neighborhood U of e contained in K. Since U is compact
and disjoint from the closed set F = G — U, there is a symmetric neighborhood
W of e such that UW D FW = 0 and hence
£/Wc<TttOcC Fc = £/.
By induction W" C J7 W" C £/. The subgroup generated by W is open and
contained in (J.)
20. Here is an example of a topological ring A that does not induce on its units A x a topology
compatible with the group structure (cf. (1.3.5)). Let H be a complex Hilbert space with
orthonormal basis (ef-)/>o- Hence the elements of H are the series x = £/>o x-t^i such
Exercises for Chapter 1 67
that xi e C and £i>o l*i I2 < oo. Consider the sequence of continuous operators Tn in
H defined by
Tn.ei»\ei, ^^
Prove that for every jc € H, \\ Tnx — x ||2 -> 0, and hence T^jc —► jc and 7n -> / for the
strong topology on the ring A of bounded operators on H. But T~* -f> I for the strong
topology (consider the vector x = J2n>\ €n/n)-
21. Let K be an ultrametric field.
(a) Show that if K is locally compact, then all balls of K are compact (and conversely).
(b) Two balls of K having the same radius r > 0 are homeomorphic.
{Hint. Consider separately the cases |A'X| discrete or dense; remember that all
spheres are clopen, and if necessary, use a bijection (0, r] D | K x | -> (0, r) n | # x |.)
22. Let G be a group and
G = G0 D G\ D G2 D ■ - - D Gn D - - ■
be a decreasing sequence of normal subgroups of G. Show that there is a unique group
topology on G for which (Gn)n>o is a fundamental system of neighborhoods of e. For
this topology, the Gn are clopen subgroups and
G Hausdorff «=* pj G„ = {<?}.
/2>0
When this is the case, show that G is metrizable. {Hint Note that G/Gn is discrete and
metrizable. One can embed G in the countable metrizable product \\ G/Gn.)
23. Let A = M2(ZP) be the noncommutative ring of 2 x 2 matrices having coefficients
in Zp. Show that A is a topological ring (for the product topology). The units in A
constitute a group A* = Gl2(Zp):
g e Gl2{Zp) *=>ge M2(ZP) and detg € Zx.
Show that Gl2(Zp) is a topological group with the topology induced from A. Let Gn C G
denote the normal subgroup consisting of matrices g = (gij) congruent to the identity
matrix mod pn,
gij = Sij mod p"Zp
{8tj — lifi = j and = 0 if i ^ j is the Kronecker symbol). Show that the Gn form a
fundamental system of neighborhoods of the identity in Gl2(Zp).
24. Let (An)n>o be a decreasing sequence of subsets of a set E. Consider the canonical
inclusions An+1 c An as transition homomorphisms. Show that the intersection A =
nw>o ^« together with the inclusions A -> An has the universal property characterizing
the projective limit lim An and hence may be identified with it: Iim An = O An.
25. Let (Xn, <Pn)n>o and (y„, ^n)n>o be two projective systems. One can consider canoni-
cally (Xn xyn,^„x ^n)n>o as a projective system. Prove
lim(Xw x Yn) = IimX/2 x limy„.
68 1. p-adic Numbers
26. Let a e Z be a rational integer. Show that X2 -f X + a =0 has a root in Q2 if and only
if a is even.
27. (a) In which fields Qp does one find the golden ratio (root of x2 = x + 1)?
(6) How many solutions of X4 + X2 +1 =0 are in Q7 ? (Either make a list of solutions
mod 7, or consider y = X2 and solve in two steps.)
28. (a) Show that if a € 1 + j?Zp and the integer az is prime to p, then there is an Azth root
of a inQp.
(b) Give an example of a e 1 + pZp having no pth root in Qp.
(c) Show that if a € I + p3Zp, then a has a pth root in Qp.
29. Let Az be a positive integer, v = ordp«; hence « == pvn' and (/?,«') = 1. For integers
a, b eZ, prove
a = & (mod AzZ(p)) <<=» a =s & (mod pvZ).
{Hint. Observe that nZ(p) — pvZ(P) and nZ(p) I1Z = pvZ.)
30. Let /? and q be distinct primes.
(a) Prove that the fields Qp and Q9 are not isomorphic.
(b) Prove that the fields Qp and R are not isomorphic.
(c) Prove that the fields Qp(iAg-i) and Qg(pLp-i) are not isomorphic.
(//inf. Look at roots of unity. Observe that for each prime p, the field Qp has an
algebraic extension of degree 4, which is not the case of the field R. For part (c),
use the lemma in (6.8).)
31. Let p and q be distinct primes. What is the projective limit
limR^^Zx^Z)?
2
Finite Extensions of the Field
of p-adic Numbers
The field Qp is not algebraically closed: It admits algebraic extensions of arbitrarily
large degrees. These extensions are the p-adic fields to be studied here. Each one
is a finite-dimensional, hence locally compact, normed space over Qp. A main
result is the following: The p-adic absolute value on Qp has a unique extension to
any finite algebraic extension K of Qp.
1. Ultrametric Spaces
1. L Ultrametric Distances
Let {X, d) be a metric space. Thus X is equipped with a distance function d : X x
X —> R>0 satisfying the characteristic properties
d(x, y) > 0 *=> x ^ y,
d(y,x) = d(x,y)9
d(x,y)<d(x,z)+d(z,y)
for all x, y, and z e X. For r > 0 and a e X v/e define1
B<r(a) = {x e X : d(x, a) < r]
= dressed ball of radius r and center a,
!Let me use this unconventional terminology in this section only. From (II.2) on, I shall rely on the
reader for a proper distinction between "open" and "closed', balls.
70 2. Finite Extensions of the Field of p-adic Numbers
B<r(a) = {x e X : d(x, a) < r]
= stripped ball of radius r and center a.
Hence B<r(a) is empty if r — 0, and the stripped balls form a basis of a topology
on X: In particular, all stripped balls are open.
Definition. An ultrametric distance on a space X is a distance {or metric)
satisfying the strong inequality
d(x, y) < max (d(x, z), d(z, y)) ( < d(x, z) + d(z, y))
for all x, y, and z € X. An ultrametric space (X, d) is a metric space in which
the distance satisfies this strong inequality.
The following results are valid in ultrametric spaces.
Lemma 1. {a) Any point of a ball is a center of the ball.
(b) If two balls have a common point, one is contained in the other
(c) The diameter of a ball is less than or equal to its radius.
Proof, (a) If b e B<r(a), then d(a, b) < r and
x e B<r(a) <=> d(x, a) < r %=£r d(x, b) < r <=> x e B<r(b)
proving B<r(a) = B<r(b). The case of a dressed ball is similar.
(b) Take, for example, a common point c of the balls B<r(a) and B<rf(b). By
the previous part, we have
B<r(a) = B<r(c) and B^Q?) = B^(c).
Now, it is clear that #<r(c) C B<Ac) if r < r\ while B<Ac) C B<r(c) if r' < r.
All other cases are treated similarly. Part (c) is obvious. ■
It is immediately seen by induction that ultrametric distances also satisfy the
strong inequality for finite sequences xi, x2, ...,xn e X:
d(xi, xn) < max (d(xux2), d(x2, x3),..., d(x„-u xn)).
Consider a cycle containing n > 3 distinct points: jc,- (1 < i < «), xn+i = jci. We
may assume d(x\, *„) = max, <n d(xt, xI+1): Renumber these points if necessary,
and observe that d(xn, x„+1) = d(xn, x^) — d(x\, xn). Since
d(xi, xn) < max (d(xi, x2), - - -, ^fe-i, x„))
by the ultrametric inequality, it follows that
d{xux„) = d(Xi,Xi^i)
1. Ultrametric Spaces 71
for at least one index 1 < i < n — 1. In other words, the cycle has at least two pairs
of consecutive points with equal maximal distance. In particular, in a set a, b, c
of cardinality 3, at least two pairs have the same (maximal) length. A picturesque
way of formulating this property is this:
In an ultrametric space, all triangles are isosceles {or equilateral), with at
most one short side.
Here is an image of the situation. Let x be the earth and y, z be two stars in a
galaxy not containing the earth, so that d(x, y) > d(y, z). Then we consider that
d{x, y) = d(x, z) (this is the distance of the galaxy containing y, z to the earth).
In other words, ultrametric distances behave as orders of magnitude.
Let us denote by Sr(a) = {x e X : d(x, a) = r] the sphere of center a and
radius r > 0. Then if a ball B does not contain the point a, it lies on the sphere
Sr(a), where r = d(a, B)
if B = B<s(b\ then r = d(a, b) > s and B C Sr(a),
and similarly,
if B = B<s(b)9 then r = d(a, b) > s and B C Sr(a).
Let us reformulate these properties in the form of another lemma.
Lemma 2. (a) Ifd(x, z) > d(z, y), then d(x, y) = d(x, z).
(b) Ifd{x, z) ^ d(z, y), then d(x, y) = max (d{x, z), d(z, y)).
(c) Ifx e Sr{a), then B<r(x) c Sr(a) and
Sr(a)= (J B<r(x). m
xeSr(a)
Balls within a ball
The stripped balls are open in any metric space: By definition, they make up a
basis of the topology. Similarly, the dressed balls are closed in any metric space.
In an ultrametric space we have some other peculiarities.
72 2. Finite Extensions of the Field of p-adic Numbers
Lemma 3. (a) The spheres Sr{a) (r > 0) are both open and closed.
(b) The dressed balls of positive radius are open.
(c) The stripped balls are closed.
(d) Let B and Br be two disjoint balls.
Then d(B, B') — d(x, x') for any x e B, x' e B'.
Proof, (a) The spheres are closed in all metric spaces, since the distance function
x m> d(x, a) is continuous. A sphere of positive radius is open in an ultrametric
space by part (c) of the previous lemma.
(b) If r > 0, then B<r(a) = #<r(fl) U Sr(a) is open.
(c) If r > 0, the sphere Sr(a) is open; hence B<r(a) = Bsr{a) — Sr(a) is closed.
If r = 0, B<r(a) = 0 is closed.
(d) Take four points: x, y e B and x', yr e Bf. The 4-cycIe of points x, x', y', y
has two pairs with maximal distance: They can only be d(x, xr) = d(y, y'), since
we assume that the balls are disjoint. AH pairs of points x e B, xr e B' are at the
same distance, and d(B, B') := iniXGB,xfGBf d(x, xf) is this common value. ■
Due to the frequent appearance of simultaneously open and closed sets in ultra-
metric spaces, it is useful to introduce a definition.
Definition. An open and closed set will be called a clopen set.
Lemma 4. {a) A sequence (xn)n>0 with d{xn,xn+\)-^Q (n -> oo) is a Cauchy
sequence.
(b) lfxn -> x ^ a, then d(xn, a) = d(x, a) for all large indices n.
Proof, (a) Observe that if d(xn, xw+i) < e for all n > N9 then also
) < max d(xn+l,xn+i+l) < e
0<i<m
for all n > N and m > 0.
(b) In fact, d(xn, a) = d(x, a) as soon as d(xn, x) < d(x, a). ■
Proposition. Let Q C X be a compact subset.
(a) For every a e X — Q, the set of distances d{x, a) (x e Q) is finite.
(b) For every a e Q, the set of distances d(x, a) (x e Q — {a}) is discrete
in R>0.
Proof, (a) We have just seen that
d(x, y) < d(x, a) => d(y, a) = d(x, a);
hence the function / : x h-^ d(x,a), Q —> R>0 is locally constant and continuous.
Its range is finite: The sets f~l(c) (for c e f{£l)) form an open partition of the
compact set Q.
1. Ultrametric Spaces 73
(b) The map / : x h» d(x, a)y Q — {a} —> R>0 is locally constant as before.
For e > 0, its restriction to the compact subset £2 — B<B(a) has finite range. This
proves that all sets
[s, oo) fl {d(x, a) : x e Q, x ^ a]
axe finite. Hence f(Q — {a}) is discrete in C R>o- ■
Let us summarize.
Properties of ultrametric distances.
(a) Any point of a ball is a possible center of the ball
b e B<r(a) ==> B<r(b) = B<r(a) (and similarly for stripped balls).
(b) If two balls have a common point,
then one is contained in the other.
(c) A sequence (x/2)nGN is a Cauchy sequence
precisely when d(xn, xw+i) —> 0 (n —> oo).
(d) In a compact ultrametric space X, for each a e X,
the set of nonzero distances {d(x, a) : a ^ x e X] is discrete in R>o-
1.2. Ultrametric Principles in Abelian Groups
Let G be an additive (abelian) group equipped with an invariant metric d, namely
a metric satisfying
d(x + z, y + z) = d(x, y) (x, y and z e G).
For* e G, define
|*| = d(x, 0).
Then
1—jc| = d(-x, 0) = d(0, x) = d(x, 0) = |jc|
and
|x + y\ = d(x + y, 0) < d(x + v, y) + d(y, 0)
<d(x.0)+d(y,0) = \x\ + \yl
This shows that x h+ — x and (jc, y) h+ x + y are continuous and G is a topological
group when equipped with the metric d. We shall say that G is a valued group
when such a metric d has been chosen.
Assuming that this metric satisfies the ultrametric inequality, we shall have
similarly
|* + y| = d(x + y, 0) < max (*/(* + y, y), d(\\ 0))
< max (d(x9 0), d(y, 0)) = max (|*|, \y\).
74 2. Finite Extensions of the Field of p-adic Numbers
In particular, all nonempty balls centered at the neutral element 0 e G are
subgroups of G. These subgroups are
£<r(0) = {x e G : \x\ < r] (r > 0),
£<r(0) = {x e G : \x\ < r] (r > 0).
Instead of applying (1.1) to see that the balls B<r(0) and B<r(0) are open and
closed when r > 0, one can observe that these subgroups are neighborhoods of
the origin and use (1.3.2) to reach the same conclusion.
Conversely, if we are given a function G —> R>o : x t-* \x\ satisfying
|jc| >0 for* ^0, |-*| = |jc|,
\x + y\< \x\ + \y\ (resp. <max(|*|, |y|))
then we can define an invariant metric (resp. ultrametric) on G by
d(x,y) = \x-y\.
The characteristic properties of distances are immediately verified (see the specific
references at the end of the volume). A pair (G, |. |) consisting of an abelian group
G and a function G —> R>o : x \~> \x\ satisfying the preceding properties, with
the ultrametric inequality
\x + y\<max(\x\9\y\) (x,y e G),
will be called an abelian ultrametric group.
The study of convergence for series in a complete abelian group is simpler in
ultrametric analysis than in classical analysis. Let (0,-),->o be a sequence and define
If this sequence of partial sums sn has a limit s, then
an = -Wi ~ s„ -► s - s = 0.
This necessary condition for convergence of the series X!i>o aiIS sufficient in any
complete ultrametric group. Indeed, if 5*w+i — sn = an —> 0, the sequence (5*w)
is a Cauchy sequence and hence converges. Moreover, reordering the terms of a
convergent series, and grouping terms, alters neither its convergence nor its sum.
Proposition. Let {at), gn be a sequence in a complete ultrametric abelian group.
Assume that at —> 0, so that the series X!i>o ai converges: Let s be its sum.
Then
{a) for any bijection a:N^Nw have s = X!i>o a<r(o*
(b) for any partition N = \}j l} we have s = J2j [Jliei ai )■
1. Ultrametric Spaces 75
Proof, (a) For e > 0, define the finite set
7(e) = {i:|fll-|>fi}
and the corresponding sum
For any finite set J D 1(e),
\ J-Ke)
This proves that the family (a{) is summahle. This notion is independent of the
order on N. Explicitly, for e > 0, 77 > 0 we have
\s(e)-s(y])\ <max(e, //),
since 5*(e) — s(rf) is a finite sum of terms having absolute values between e and 77.
In particular, (s(l/n))n>o is a Cauchy sequence, and we call s its limit. If e > 0,
letting n -> 00 in
|5-(e) — 5-(1/az)| 5 max(e, 1/n)
we get
|5*(f) — 5*| < £.
Hence we can say that s(s) —> s when e -> 0. Now, if ^ = flCT(f) is a rearrangement
of the terms of the series and 5*^ = 5Zi<n fl/»tne inequality
\s-s'n\<e
holds when {cr(/) : i < «} contains the finite set 1(e), hence for all sufficiently
large n.
(b) Let 57 = ]Cig/ fl'»so tnat we nave to Prove s — Jlj sj- Take any e > 0 and
define the finite sets
Ij(E) = Ijni(E).
Obviously, the nonempty Ij(e) make a partition of the finite set 1(e), and
s(E) = J2ai=J2\ H a*
/(£) j \ielj(e)
< max Ifl.l < e.
76 2. Finite Extensions of the Field of p-adic Numbers
Finally,
|5--]T5-7-| <max I |j-j(£)|, J] I ]P ai J -J]*; J
j \ \ j \izlj(e) J J 1/
('
< max j |5* — 5*(e)|, max
J2 ai~sJ
< E.
Since this is true for all e > 0, the conclusion follows. ■
Corollary. Let (fli7)i>o,v>o be a double sequence such that for any e > 0 the
set of pairs (i, j) with |fl,7| > s is finite. Then this double family is summable
and
*">0 \v*>0 / v*>0 V>0 /
Proof. The family (fl,7),->o,v>o is summable over the countable set N x N by
hypothesis, and the sum of the corresponding series J] ■ ■ a^ can be computed in
any order. It can also be computed using the two groupings mentioned. ■
Comments (1) Summable families over arbitrary index sets will be considered
later (cf. (IV.4.1)). The above proposition will be generalized correspondingly.
(2) In classical analysis, there is a distinction between conditionally convergent
and absolutely convergent—or commutatively convergent, or summable — series
(of real or complex numbers): This distinction disappears in non-Archimedean
analysis, since the sum of a convergent series can be computed in any order, any
grouping. But in both contexts a grouping in a divergent series may produce a
convergent one: Think of a, = (— 1)', \a,\ = 1 -f> 0; here is a grouping that leads
to a convergent series
(1 - 1) + (1 - 1) + ■ ■ • = 0 + 0 + - - ■ = 0,
and here is another grouping,
1 +(-1 + 1) + (-1 + 1) + --- = 1+0+ ()+•■-= I
leading to a different sum. Or think of the divergent series J]/2>o an where all
an = 1. A suitable grouping of its terms leads to a convergent series:
1 + (1 + - - - + 1) + (1 + ■ - - + 1) + - - - = 1 + p + p2 + - ■
p terms
p2 terms
1. Ultrametric Spaces 77
Basic Principles of Ultrametric Analysis in an Abelian Group
(1) The strongest wins
M>|yl=H* + yl = l*l-
(2) Equilibrium: All triangles are isosceles {or equilateral)
fl-hfr-hc = 0,|c| < \b\ => \a\ = \b\.
(3) Competitivity
fftere w / ^ y such that \ai\ — \aj\ — max |^|.
(4) y4 dream realized
(an)n>o is a Cauchy sequence ^=> d{an, an+\) —> 0.
(5) Another dream come true {in a complete group)
yj >0 an converges ^==> an —> 0.
When J]n>o a" converges, J],2>o lfl« I may diverge but
E/2>o fl* I — SUP \a" I and th* infinite version of {3) is valid.
(6) Stationarity of the absolute value
a„ -> aj^O ==» fftere is N with \an\ = \a\forn > N.
13. Absolute Values on Fields
Definition 1. An absolute value on afield K is a homomorphism
f:Kx-+ R>0
extended by /(0) = 0 and such that f{x + y) < /(*) + /(j) (*, J e #).
The trivial homomorphism f{x) = l(x e Kx) defines the trivial absolute value
on K. We shall usually denote by f{x) = |jc| an absolute value, and by definition,
such a function will always have the characteristic properties
1*1 > 0,
|jc| = 0 <=» x = 0,
\xy\ = W-UI-
l* + y| < l*| + |y|
for all x, j e K. The pair (AT, |. |) is a valued field (1.3.7).
78 2. Finite Extensions of the Field of p-adic Numbers
If y2 = 1 e AT, then |*|" = |*"| = 1 and |*| = 1. In particular, |-1| = |1| = I.
Also, |2| = |1 + 1| < 1 + 1 = 2, and by induction
\n\<n («eN)
(here n = n ■ 1# e K in the left-hand side of the inequality, whereas n e R>0 in
the right-hand side). Also, quite generally,
IjI
By induction
1*1 + *2 + • - - + *„| < |*j| + |*2| + - * * + \Xn\
for every positive integer n.
Definition 2. Art ultrametric field is a pair (K, |. |) consisting of a field K and
an ultrametric absolute value on K, namely an absolute value satisfying the
strong triangle inequality
|* + y\ < max (|*|, |y\) < |*| + Iy\ (x,ye K).
As before, induction shows that
l*i + *2H + *»l < max(|*i|, |*2l, .--,l*«l).
In this case, we have |2| = |1 + 1| < 1 and by induction
|/i|<1 (/ieN).
Hence ultrametric fields have the non-Archimedean property
\nx\ < |*| (n e N).
The following lemma is obvious (cf. (1.2)).
Lemma. All balls containing 0 in an ultrametric field K are additive subgroups.
The dressed unit ball £<i(0) is a subring of K. The balls B<r(0) (r < 1) are
ideals ofB<i(0). The balls £<r(0) (r < 1) are ideals ofB^(0). ■
Proposition. Let x h* \x\be an absolute value on afield K. Then:
(1) d(x, y) — |* — y\ defines a metric on K.
(2) For each exponent 0 < a < 1 , * i-> |*|" still defines
an absolute value on the field K.
(3) Tjf* h-> |*| wa/2 ultrametric absolute value,
then for each positive exponent a > 0, * h^ |jc|a still defines
an ultrametric absolute value on the field K.
1. Ultrametric Spaces 79
Proof. AH statements are obvious except perhaps the triangle inequality, which is
nevertheless a simple exercise. ■
The trivial absolute value defines the discrete metric: d(x, y) = 1 ifx^y.
I A. Ultrametric Fields: The Representation Theorem
Let K be an ultrametric field. We use the general notation
A = [x e K : \x\ < 1}: dressed unit ball,
M = {x e K : |jc| < 1}: stripped unit ball.
Hence
A = A* UM
is a disjoint union, where Ax, the multiplicative group of invertible elements in A,
is the unit sphere |jc| = I.
Proposition. The subset A is a maximal subring of Ky and M is the unique
maximal ideal of the ring A.
Proof. Indeed, if Ar is any subring strictly containing A^ it will contain an element
y such that |y\ = r > 1 together with all its powers yn. Hence B<r« — ynA C A\
and since rn = |yn | —> oo, we see that K = U«>i ynA = A'. Moreover, any ideal
not contained in M contains a unit, and hence coincides with the whole ring A.
This shows that M is the unique maximal ideal of A. ■
Definition 1. A subring A of a field K such that
for every x e K*, x e A or l/x e A
is called a valuation ring of K. A commutative ring A having a single maximal
ideal is called a local ring.
The unit ball in an ultrametric field is a local ring and a valuation ring.
Definition 2. If K is an ultrametric field, its residue field is the quotient k =
A/M of its dressed unit ball, the maximal subring ofKt by its unique maximal
ideal
The residue field parametrizes the stripped balls of unit radius in the dressed
unit ball of K. If S C A is a set of representatives for the classes mod M, then
A = Bsl{0) = \\B^{x\
xeS
Theorem. Let K be a complete ultrametric field, A its maximal subring
defined by \x\ < 1. Choose an element % with |£| < 1 together with a set of
80 2. Finite Extensions of the Field of p-adic Numbers
representatives S C A containing Ofor the classes A/%A. Then each nonzero
element x e K* is a sum
x = ^fl,-?' {m e Z, m e S, am ^ 0)
with m > 0precisely when x e A. The mapx \-+ (sn) where sn = ^2m<i<n a£l
defines an isomorphism A = lim A/%nA.
Proof. The conditions |f | < I, £ e A is not a unit, and £ e M are all equivalent.
Starting with x e A, there is a unique fl0 e S with x — «o e £ A,
x = fl0 + £*i (*i e A).
Repeating the procedure for xx, and so on, we get by induction
x = a0 + flif + ■ - - + fl„-ifB"1 + xn$n
with fl/ 6 S and x„ e A. In the notation of the statement of the theorem, we can
write x = sn + *n£w- Since |xn£w| < |£n| = |£|w -> 0, the sequence (sn)n>0 is a
Cauchy sequence, and the series J]*>o a&1 converges to the element x e A. Since
for any x e K there is an integer k such that \?-kx\ < 1, namely such that ?-kx e A,
the preceding expansion can be derived for this element, and we obtain a series
expansion for x starting at the index i = m = —k. ■
Observe that even when K is not complete, each x e K x has a series
representation as indicated in the theorem, but an arbitrary series
^Tflif' (m e Z, a,- e S, am^ 0)
will — in general — converge only in the completion of K. In other words, even
when K is not complete, we get an injection
A <-» ^=limA/£M.
7.5. General Form ofHenseVs Lemma
Theorem (Hensel's Lemma). Let Kbea complete ultrametric field with
maximal subring A and f e A[X]. Assume that x e A satisfies
\f(x)\ < \f\x)\\
Then there is a root £ e A of f such that |f — x\ = \f(x)/f'(x)\ < |/'(jc)|.
This is the only root off in the stripped ball of center x and radius \ f'(x)\.
Proof. In spite of the similarity with (1.6.4) (particular case K — Qp), we give a
complete proof with absolute values (instead of congruences). The idea is again to
1. Ultrametric Spaces 81
use Newton's method iteratively. Since the polynomials / and /' have coefficients
in the ring A, we have \f(x)\ < 1 and 0 < \f'(x)\ < 1.
First step: Estimates concerning the distance ofx=x — f(x)/f'(x) to x.
The assumption is c := \f(x)/f'(x)2\ < 1. We have
/(*) f(x) _,, .
x — x — — = - f (x),
fix) f'{x?
\x-x\=c\f(x)\.
Similarly
\x-x\2=c\f(x)l
The second-order expansion (1.6.2) of / at the point x gives
/(* ) = f(x) + (x - x)f\x) +(* - xfr (re A: \r | < 1),
=0: Newton's choice
\f(x)\ < F-x\2 = c\f(x)\ < |/(jc)|,
and?is an improved approximation to a root. The first-order expansion (1.6.2) of
/' at the point x gives
f(x) = f(x) + (x-x)s (seA: \s\<l),
\f(x) - f'(x)\ < \x-x\ = c\f'{x)\ < \f'(x)\.
It shows that
lf'(x)l = I fix) + (f'{x) - /'(*)) | = \f{x)\.
strongest
The invitation to iteration is clear.
Second step: Further iterations.
Let now jc = N/(x)
< c|/(*)l _ 2
" l/'^)|2 C '
This iteration furnishes
l/(*)l < c"l/(*)l < cc|/(jc)| < c2+,|/WI,
and since \f(x)\ — c\f'{x)\2 by definition, we obtain
!/(*)! < c4\f'(x)\2.
fix)
fix)2
82 2. Finite Extensions of the Field of p-adic Numbers
We can construct the sequence xo = xn x\ = jc~, xn = * ,... inductively with
jc,-+1 = jq. Define also C\ =*c, ci+\ = c}. The preceding estimates show that
l/(*i)l < d'-i" - -cic|/(x0)| < c2'~l\f(x0)\ = c2'\f'(x0)\2 ^ 0 (i -> oo),
|*2 "*ll = |f -*| < C2|/Vo)l < c|/Vo)l = |*i -*0i,
and by induction,
k-+i - *l < c2' |/'(*0)l < c|/'(xo)l - |*i - x0\ (i > 1).
In particular, |x, — xq\ = \x\ — jc0| = \x — jc| = c I/'(*()) I is constant for j > 1
(these Xj are closer to each other than to xo).
Third step: The limit root £.
The sequence (jc,-)i>o is a Cauchy sequence, so it converges in the complete field
K. Since all iterates jc,- belong to the closed subring A, we have
£ = lim jc,- € A,
It - jtbl = |jc, - x0\ = \x-x\= c\f'(x)\ < \f'(x)\,
/(f) = /(lim jc,) = lim /(jc,) = 0.
n—>oo n—>oo
Fourth step: Uniqueness of the root%.
Let % be as before and rj have the required properties, say rj = £ + h. Hence
\h\ = \f] - £| < \f'(x)\ = |/'(£)|. The second-order expansion (1.6.2) of/ at the
point £ gives
0 - f{ji) = f{S)+hf'{t;) + h2t (teA: \t\ < 1),
o = *( /'(£) + fa) = /*(/'(£) + fa);
strongest ^0
hence /z = 0, i.e., r\ = £. ■
Observe that when the absolute value is trivial, it takes only the values 0 and 1,
the assumption reduces to
0=|/W|<|//(x)l2=l,
and the statement is trivially correct.
1.6. Characterization of Ultrametric Absolute Values
Theorem. Let x \-^ \x\be an absolute value on afield K. Then the following
properties are equivalent:
(0 N S \ for all natural integers n e N.
(ii) The absolute value is bounded onN ■ 1#.
1. Ultrametric Spaces 83
(hi) |1 + *| < I for every x e K such that \x\ < 1.
(iv) x h* \x | is an ultrametric absolute value,
(v) {x € K : \x\ < 1} is a subring of K.
Proof. We proceed according to the following scheme of implications:
(i) => (ii) => (iif) => (iv) => (v) => (iii)
and
O'v) => (i).
Among these implications, several are trivial, namely,
(i) => (i7), (iv) => (v) => (iii) and (iv) => (i).
It only remains to prove two implications. For (ii) => (hi) we can assume \n\ < M
for all integers n and use the binomial formula to compute
|1 +x\n = |(1 +xT\ = |]T ("VI < £ W-
When |jc| < 1, we obtain
ll+jcl" <(/i + l)M,
|1 -f-jc| <(n + l)1/n-M1/n
for all integers w > 1. Since (« -h l)1/n —► 1 as well as Ml/n —► 1 for n —► oo, we
infer |1 -f jc| < 1. To prove (iii) => (iv), we can — without loss of generality —
assume that |jc| > |j| in \x -h J I, |jc| > 0 and estimate this quantity as follows:
\x + y\ = \x\ ■ \l+y/x\ < \x\ = max(|*|, |v|). ■
Corollary. Any absolute value on afield of characteristic p ^ 0 is ultrametric.
Proof. Indeed, any absolute value is bounded on the image of N in a field of
characteristic p, since this image is a finite prime field. The second condition of
the theorem is automatically satisfied. ■
The absolute values that are not bounded on the prime field of K (necessarily
of characteristic zero) are sometimes called Archimedean absolute values: They
have the property that
if x ^ 0, then for each y there is an n e N such that \nx\ > \y\.
1-7. Equivalent Absolute Values
distinct absolute values can define the same topology on a field K. It is not always
Useful to distinguish them.
84 2. Finite Extensions of the Field of p-adic Numbers
Theorem. Let \.\\ and |. |2 be two absolute values on a field K. Then the
following conditions are equivalent:
(0 There is ana > 0 with |. |2 = | -1".
(ii) |. |i and |. |2 define the same topology on K.
(Hi) The stripped unit balls for |. |i and |. |? coincide.
We say that \. h and |. |2 are equivalent absolute values when these conditions
are satisfied.
Proof, (i) => (ii) Since \x — a\2 < r <=$> \x — a\i < r1/a, the stripped balls are
the same for the two topologies. Hence the topologies defined by |. |i and |. |? are
the same.
(ii) => (Hi) Let us observe that
|jc|x < 1 <=> xn -► 0 (for the topology defined by |. |i)
and similarly for |. |2. By assumption we obtain
Mi <1<=>M2<1.
(Hi) => (i) Let us assume Mi < 1 <=^ M2 < 1- Since |1/jc|i = 1/|jc|i and
similarly for |. |2, we see that
Mi > 1 <=* M2 > 1
and consequently
|X|1 = 1<=>M2 = 1-
If I. Ii is trivial, \x\x = 1 for all x e Kx, and the same is true for |. |2, so that
we can take a = 1 in the statement of (i). Otherwise, we can find jc0 € Kx with
l*o I i ^ 1> and replacing xG by 1/jco if necessary, we can assume |jc0|i < 1. Define
= log l^o b
log\xQ\i'
so that |*o|2 = l*ol? by definition. Take then any element x e Kx with |jc|i < 1
and consider the rational numbers r > 0 such that |jc|j < |jc0|i- These rational
numbers r = m/n are those for which
1*17 < l*ol7,
< i.
i
By assumption, these are the same as those for which
\xm\
— < 1,
2. Absolute Values on the Field Q 85
namely \x\% < (jcoIS or \x\i < \xoh- On the other hand, these rational numbers
are precisely those for which
rlog|x|i < logroll (resp. r log|x|2 < log|x0|2)
or
r > log|x0|i/log|x|i (resp. r > log|x0|2/log|x|2)
(all logarithms in question are negative). This proves
log|*oli/log|*|i = log|x0|2/log|x|2,
log|*b/log|x|i = log|xo|2/log|xoli =a-
Hence |jc|? = |jc|", as was to be shown. ■
2. Absolute Values on the Field Q
2.7. Ultrametric Absolute Values on Q
Let us recall that if p is a prime number, we can define an absolute value on
the field Q of rational numbers by the following procedure. If x = pma/b with
a,b,m eZ, b ^ 0, and p prime to a and b, we put
MP = p-m.
In other words, we put \p\p = \/p < 1 and \n\p = 1 for any integer n prime to
p, and extend it multiplicatively for products. Since
Q* = pz x Z*, = {J P^p),
meZ
this defines the absolute value uniquely. This absolute value is an ultrametric
absolute value on Q.
Theorem (Ostrowski). Letx \-+ \x\bea nontrivial ultrametric absolute value
on the field Q. Then there exists a prime p and a real number a > 0 such that
\x\ = \x\ap (xeQ).
Proof. Since the integers generate Q (by multiplication and quotients), the absolute
value must be nontrivial on N. As we have seen, any ultrametric absolute value
satisfies \n\ < 1 (n € N). Hence there must exist a positive integer n with \n\ < 1.
The smallest such integer is a prime p because in any factorization n = n\ ■«?, we
have \n\\ ■ \n2\ = \n\ < 1, and consequently one factor m must satisfy \nt\ < 1.
Let us call this prime p so that by definition
\n\ = 1 for 1 < n < p
86 2. Finite Extensions of the Field of p-adic Numbers
butO < |p| < 1 -1 claim that for every integer m e Z prime top, we have \m\ = 1.
Indeed, if m is prime to p, the Bezout theorem asserts that there are integers u and
v with up + vm = \. Hence
1 = |1| = \up + vm\ < maxflwp), \vm\) < 1.
Since by assumption \up\ = \u\\p\ < \u\ < 1, the maximum must be \vm\ = 1
and hence \m\ = 1 (we know a priori that \v\ < 1 and \m\ < 1). There is now a
unique positive real number a such that
\p\ = a/pr
(indeed, take a = (log|p|)/(log(l/p)) — a quotient of two negative numbers —
independent from the basis of logarithms chosen). Then if the rational number x
is written in the form x = pva/b e Q* with p prime to a and b (i.e., a/b e Z*p)\
we shall have
w = \p\v = d/pr = u\ap
and the theorem is completely proved. ■
2.2. Generalized Absolute Values
Observe that if |. | is an absolute value and a > 0, then |. |a is not an absolute
value in general. For example if |. | is the usual absolute value on Q and a = 2,
then f(x) = \x\2 does not satisfy the triangle inequality
4 = /(2) = /(l + 1) > /(l) + /(l) = 2.
But it satisfies
f(x + y) = \x + y\2 < (\x\ + |^|)2 < (2max{|x|, |v|})2 = 4max(/(x), f(y)).
This is one reason for considering generalized absolute values.
Definition. A generalized absolute value on afield K is a homomorphism f :
K x —>► R>o extended by f(0) — Ofor which there exists a constant C > 0 .smc/z
fix + y) < Cmax(/(jc), /(y)) (jc, y € AT).
Observations. (1) For any generalized absolute value / and any a > 0 (not only
for 0 < a < 1), /" is also a generalized absolute value: Replace C by Ca.
(2) The ultrametric absolute values are those for which the above inequality
holds with C = 1. Moreover, if / is a (usual) absolute value, then
fix + y) < /(*) + /O0 < 2max(/to, /(y)),
and (usual) absolute values are generalized absolute values: The above inequality
holds with C = 2. Let us prove a converse.
2. Absolute Values on the Field Q 87
Theorem. Let f be a generalized absolute value on afield K for which
f(x + y) < 2max(/(x), f(y)) (x, y e K).
Then f is a usual absolute value: It satisfies the identity
f(x + y)<m+f{y) <x,yeK).
Proof. Iterating the defining inequality for generalized absolute values, we find
that
f(ai +a2+a3+a4) <C max(/(ai + ci2), f(a3 + a*))
< C2maxi<1<4 f{at).
More generally, by induction if n = 2r, then
f(ax + • - - + an) < C max /fa).
Since we are assuming that the constant C = 2 can be taken in the preceding
inequalities, we have
f{a\ H h a„) < 2r max /(#/) = n max /(fl,-)-
Now, if n is not a power of 2, say 27""1 < n < 2r, we can complete the sum by
taking coefficients a, = 0 for n < i < 2r and still write
/(tf i H h an) < 2r max /(ar-) < 2n max /(a,-)-
We shall have to use two particular cases of this general inequality:
(1) f(n) < 2n (take at = 1 for 1 < i < «),
<2> / (£l<,<„ «<) ^ 2"maX /(«•) ^ 2" El<i<n /(«/)■
To estimate /(a-f£), we shall estimate /((a+b)n) thanks to the binomial formula
(the nth power of a + £ is a sum of « + 1 monomials)
< 2(«+1)52 / ((")) - mm?-*
< 2(n+ 1)^2Q/(«)7W""'
= 4(n + lX/(a) + /(W.
Let us extract nth roots:
/(« + fc) < 4V"(n + D1/n • (/(«) + /(&)) -* /(fl) + f(b) {n -* (X)). ■
88 2. Finite Extensions of the Field of p-adic Numbers
2.3. Ultrametric Among Generalized Absolute Values
We can give a generalization of (1.6).
Theorem. Let f be a generalized absolute value on afield K.Iff is bounded
on the image of the natural numbers N in Kt then it is an ultrametric absolute
value.
Proof. Let n = T be a power of 2 and consider a sum of n terms ai. As in (2.2),
we see by induction that
f(aY + • • • + an) < Cr max /(fl|-).
Take now x e K and consider the element
0<I<FI V l '
Since this sum has n elements, we have
(/(l +x))n-1 = /((l +xT'1) < C max If ((" ~ ^ • fix')] .
If / is bounded on the image of N in K, say f(k) < A for all k e N, we shall
have
(/(l + x))n~l < CM max (1, /W1"1)
and
/(l +jc) < Cr/('z-1)A1/('z-1)max(l, /(*)).
Letting again n —> co, we obtain
/(1+x) < max (1,/(*)).
If now a ^ 0 and b € K, then /(a) ^ 0 and
/(0+6) = /(fl)/(l+6/0)
< /(A) max (1, f{b/a)) = max (/(a), /(£>)). ■
2.4. Generalized Absolute Values on the Rational Field
The ultrametric absolute values on the rational field Q have been determined in
(2.1). Here, we treat the generalized absolute values.
Theorem. Any nontrivial generalized absolute value on the rational field Q is
either a power of the usual absolute value or a power of the p-adic absolute
value.
2. Absolute Values on the Field Q 89
Proof. Take any nontrivial generalized absolute value / and assume that
/(x + y)<C.max(/(x),/W).
If C < 1, then / is ultrametric, and we conclude by (2.1). Assume now C > 1.
By induction — regardless of the size and number of addends — we can prove
/(«o + • - • + ar) < Cr • max /fo).
Let us fix an integer n > 2 and put A = An = max (/(l),..., /(«)) > 1. Now,
any integer m > 2 can be expanded in base n, say
m = /J 171,-w1 (0 < mi < n, mr ^ 0).
Hence
/(m) < Cr • max /(m^/V)
< Cr A„ • max /(«/ = CrAn max (1, /(«)r).
But mr ^ 0, «r < m, and thus r < logm/ log «, so that we can write
f{m) < AnCl0*m/]0zn - max(l, f(n)]^m/logn)J
/(m)l/logm < Al/log/«Cl/logn ,max(lf /(„)*/>°g«).
Let us replace m by m* (keeping « fixed), so that the left-hand side is unchanged,
and let it -> oo, whence Alnm°gm) ~> 1. We obtain
/(m)l/lQgm £ cl/logn .max(lj /(n)1/10^).
In other words, we have obtained an inequality in which the constant An does not
appear. We can now replace n by nk, and since Cl^k —> 1, we have simply
/(m)l/logm < max(1? /(W)l/k««).
First case: 77zere w fl« integer n > 2 vv/f/z /(«) < 1.
We can use such an integer n in the inequality just found and deduce
/(m) < 1 for every integer m > 2.
Hence / is an ultrametric absolute value by (2.3). Finally, Ostrowski's theorem
(2.1) applies: / is a power of the p-adic absolute value
f{x) = \x\ap (xeQ)
for some real a (determined by the condition f(p) = \p\ap = (l/p)a).
90 2. Finite Extensions of the Field of p-adic Numbers
Second case: We have f(n) > 1 for every integer n > 2.
The general inequality
/(m)1/,ogm < max(l, f(n)1/lo^n)
is now simply
/(m)l/logm < /(„)l/log"
Since we can permute the roles of n and ra, we must even have
/(m)i/iogm = f(n)i/k*nm
Hence f(n)l/}ogn = ea is independent from n. This leads to
/(w) = ealogf,=#ia
for all integers n > 1, and with the usual absolute value
f(n) = \n\a (neZ).
By the multiplicativity property, we also have
f(x)=\x\a (xeQ).
Since 0 < a < oo, the map / is a power of the usual absolute value, and the
theorem is completely proved. ■
Comment. The preceding result shows that for a generalized absolute value / on
the field Q, the only possibilities are
• / is trivial,
• |/71 < 1 for some prime p and f is a power of the p-adic absolute value,
• \n\ > 1 for all positive integers n and f is a power of the usual Archimedean
absolute value.
Observe that the two nontrivial cases can also be classified according to the
value of |2|: If |2| < 1, / is a power of the 2-adic absolute value; if |2| = 1, / is a
power of the p-adic absolute value for some odd prime p; if |2| > 1, / is a power
of the usual Archimedean absolute value.
3. Finite-Dimensional Vector Spaces
3.1. Normed Spaces over Qp
Let V be a vector space over the field Qp. A norm on V is a mapping
||. || : V - {0} ^ R>0
3. Finite-Dimensional Vector Spaces 91
extended by ||0|| = 0 and satisfying the following characteristic properties:
||ax|| = kz|||x|| (aeQp, xeV),
||*-r->>||<max(|M|,||y||) (x,yeV).
In particular, the norms that we are considering are ultrametric norms. A normed
space over Qp is simply a vector space over this field equipped with a norm. A
norm defines an invariant (ultra-)metric on the underlying additive group of V.
Hence a norm defines a topology on V, which becomes an additive topological
group in which scalar multiplications
x\^> ax:V -+ V (a e Qp)
are continuous homeomorphisms.
Examples. (1) Let V = Qp with norm ||jc|| = c\x\ where c > 0 is a fixed,
arbitrarily chosen positive real constant. This example shows that {||t> || : v e V]
can be different from the set of absolute values of scalars, i.e. the absolute values
of the elements of the field Qp. (This is a difference from real and complex normed
spaces).
(2) Let V = Qnp for some positive integer n. Then for x = (jc,-)i<i<ii € V we
can put H^Hoo = supl5l-5/J |jc,-| = maxi</<„ |jc,-|. This defines an ultrametric norm
on V.
Two norms x \-+ \\x\\ and x \-+ \\x\\' are called equivalent when they induce
(uniformly) equivalent metrics on V, namely when there exist two constants 0 <
c < C < oo with
ex
ll'<C||x|L
This happens precisely when the topologies defined by these two norms are the
same (exercise).
Theorem. Let V be a finite-dimensional vector space over Qp. Then all norms
on V are equivalent.
Proof. Let n = dim V and choose a basis (ef)i<f<„ of V. Hence
x = (xt) h> v = ^2xEet = tp(x)
defines an algebraic isomorphism <p : Qnp -* V. On the space Q^, we consider the
sup norm given in the above example. We have to show that the isomorphism <p is
bicontinuous. But
u2x'e*
< maxllx.^ll = max|^|||ej| < max||e/|| ■ maxl^l = CHxHoo,
92 2. Finite Extensions of the Field of /?-adic Numbers
where C = max ||e,-||. (Note that the strong triangle inequality is not really
necessary here since it would be enough to observe that || Ylxiei\\ — Yl \\xiei\\ <
J2 Ik/ll -max|jc,-| = C || x || oo-) This proves that || ^>(x) || < C||x ||oo and
discontinuous. Finally, we show that <p is an open map. Let B = Bsl = {x € Qnp : ||x||oo < 1}
be the unit ball in Qnp. We have to show that <p(B) contains an open ball of positive
radius centered at 0 in V. Denote by Si the unit sphere
Si = {xeQnp:\\x\\00 = l}
in Q" Then Si is a closed subset of the compact set B<\, and hence is compact.
This implies that <p(Si) is also compact. This image does not contain the origin of
V (remember that <p is bijective). Hence the distance from 0 to <p(S\) is positive,
and the minimum is attained for some point <p(xo):
x e S2 => I^COII > ||^(xo)|| = e > 0.
If v e V — {0} has norm ||t>|| < e, the norm of all multiples Xv where |A| < 1 will
also satisfy ||At>|| < s. Hence in particular, if ||t>|| < e, then
Xe K, \X\<1 =>\vg<p(Si).
Since (e,-) is a basis, we can write
Without loss of generality we may assume that the largest component is the last
one:
0^ Kl = max |u, | = 11(^)1100-
With X = \/vn we have Xv = <p((vi/vn)) = <p(w) e <p(Si). The remark made
before proves that this scalar X satisfies |A| > 1, so that
llfe)lloc=l^l = T|T<l.
|A|
This shows that v = <p((vi)) with ||C^)IIoo <\:ve <p(B), where B = £<i(0, Q£).
Consequently,
B<£(V) C <p(B). B
Corollary 1. Let V and W be two finite-dimensional normed vector spaces
over Qp and a : V —► W a linear map. Then a is continuous. ■
Corollary 2. Any algebraic isomorphism of a finite-dimensional normed vector
space over Qp is bicontinuous. 0
3. Finite-Dimensional Vector Spaces 93
Corollary 3. Let V be a finite-dimensional vector space over Qp. A subset
S c V that is bounded with respect to one norm on V is bounded with respect
to any other norm on V. ■
Remark. Observe that the proof could be simplified if we knew that all norms of
elements of V were absolute values of scalars, namely if || V|| = \QPV But this
equality is in general not satisfied.
3.2. Locally Compact Vector Spaces over Qp
There are not many compact normed spaces over Qp. In fact, any nonzero element
x of a vector space generates a line, and the norm is an unbounded continuous
function on this line because
ll**« = |A|||*|| tteQp).
This shows that the only compact normed space is the trivial normed space {0}.
Let us turn to locally compact normed spaces over Qp.
Theorem. If V is a locally compact normed space over Qp, then its dimension
isfinite.
Proof. Let us select a compact neighborhood £2 of 0 in V. Also choose a scalar
a e Qp with 0 < \a\ < 1 (for example a — p with \a\ = \/p will do). The
interiors of the translates x + aQ (x e V) cover the whole space. A fortiori there
is a finite covering of the compact set £2 of the form
£2<z[J(ai+a£2) (for some at e V).
finite
Consider the finite-dimensional subspace L generated by the elements at. By
(3.1), this finite-dimensional subspace is isomorphic to a normed space Qdp, and
hence is complete. Consequently, this subspace L is closed, and in the Hausdorff
quotient V/L (1.3.3) the image A of the set £2 is a compact neighborhood of 0 and
satisfies
ACaA (ora~lA <Z A),
whence a~n A C A by induction. Since \a~n\ —> oo, we see that
AcV/Lc\Ja-nAGA.
In particular V/L is compact: V/L = 0, V = L is a finite-dimensional space. ■
Corollary. In a locally compact normed vector space over Qp, the compact
subsets are the closed bounded sets.
94 2. Finite Extensions of the Field of p-adic Numbers
Proof. The compact subsets of any metric space are closed and bounded (by
continuity of the distance function). Conversely, if V is a locally compact normed
vector space over Qp, it has finite dimension and its norm is equivalent to the
sup norm of this space (3.1). But in Q" any bounded set is contained in a
(compact) product of balls of Qp. Hence the closed bounded sets are compact subsets
ofQJ- ■
3.3. Uniqueness of Extension of Absolute Values
Let K be a finite (hence algebraic) extension of the field Qp. We can consider K as
a finite-dimensional vector space over Qp. Each absolute value on K that extends
the p-adic absolute value of Qp is a norm on this vector space, and we can apply
the results of (3.1).
Proposition. There is at most one absolute value on K that extends the p-adic
absolute value of Qp.
Proof. Let |. | and |. |' be two absolute values on K that extend the absolute value
of Qp. These two norms must be equivalent, and there exist constants 0 < c <
C < oo such that
c|jc| < |xf < C|jc| (jc e K).
Replace x by xn in the preceding inequalities:
c|jc"| < |jc"f < C|jcn|.
Since |. | and |. |' are absolute values, they are multiplicative, and the preceding
inequality is simply
c|jc|n < Mm < C\x\\
or
c1/f,|jc| < |jc|' < c:1/fi|jc|.
Letting « -^ oo, we have cl/n -* 1 and Cl/n -> 1. This proves |jc| = \x\'. ■
Application. Let A' be a Galois extension of Qp and assume that the p-adic
absolute value of Qp extends to K. Then for each automorphism o of K/Qp we
can consider the absolute value \x\' = \ax\. By the preceding proposition, this
absolute value must coincide with the original one. Let G = Gal (K/Qp) and for
each x e K, consider the element
N(x)=Y\cxxeQp.
(T€G
3. Finite-Dimensional Vector Spaces 95
We must have
\N(x)\ =
ceG
cgG
Hence with d = #(G) = [K : Qp] = d\mQp(K),
1*1 =
\N(x)\l,d.
Since N(jc) e Qp, this formula gives an explicit expression for the extension of
the absolute value of Qp (provided that one exists!). This observation can be used
to prove the existence of an extension of the absolute value of Qp.
3.4. Existence of Extension of Absolute Values
Let again K be a finite extension of degree d of the field Qp. The relative norm
(as defined in field theory, not to be confused with a vector space norm!) is a
multiplicative homomorphism (3.3)
N = NK/Qp:K*-+Q;, x^N(x),
which coincides with the dth power on Q *. It can be defined either by embedding
K in a Galois extension L and taking a product over the d distinct embeddings
K c-> L or by using the determinant of the Qp -linear map y \-+ xy of the Qp -vector
space K.
Theorem. Let K be a finite extension of degree d of the field Qp of p-adic
numbers. For each x € K, let £x denote the Qp linear operator y \-+ xy in K.
Then
f(x) = \N(x)\l/d^\dct£x\1/d
defines an absolute value on K that extends the p-adic one. This is the unique
absolute value on K having this property.
Proof. If a e Qp, it is obvious that N(a) = ad whence \N(a)\]/d = \a\, and
the proposed formula is an extension of the p-adic value. The multiplicativity
f(xy) = f(x) - /(y) is a consequence of the multiplicativity of the determinant
(or of the norm). We still have to check the ultrametric inequality. For this crucial
point we use the local compactness of K. Let us choose any norm x \-^ \\x \\ on K
with \\K\\ = \QP\. For example, pick a basis e\,..., ed of K over Qp and use the
sup norm on components in this basis. Since the continuous function / does not
vanish on the compact set ||jc|| = 1, it is both bounded above and below on this
set, say
0 < £ < f(x) < A < oo (||jc|| = 1).
96 2. Finite Extensions of the Field of p-adic Numbers
For x e K* choose AeQp with ||x|| = |A|. Hence the vector x/X has norm 1,
s<f(x/k)<A (Jt^O),
and since f(x/k) = f(x)/\X\,
sW<f(x)<A\k\ (x^O),
e\\x\\ < f(x) < A\\x\\ (x^O).
Thus with a = s~~l we have both ||x|| < af(x) and f(x) < A\\x\\. Suppose now
f{x) < 1 (hence \\x\\ < a). We infer
fd+x) < A||l -f-jc|| < Amax(||l||, ||jc||)
< Amax(||l||,a) = C = Cmax(/(1), f(x)).
If more generally f{y) > /(jc), we can divide by y and apply the preceding
inequality to jc/y, since f(x/y) = f(x)/f(y) < 1. Finally, multiplying both sides
by /(y), we obtain the general inequality
/(^-hy)<Cmax(/(x),/(y)).
This proves that / is a generalized absolute value. Since / extends the p-adic
absolute value, it is bounded on N C Qp C K and is an ultrametric absolute value
by (2.3).
The uniqueness of the extension has already been proved in (3.3). ■
3.5. Locally Compact Ultrametric Fields
In locally compact ultrametric fields K, we shall use adapted notation
R = #<i D P = £<i
instead of
which will still be used in the general — not necessarily locally compact — case.
We are going to prove the following general result.
Theorem. Let K be a field equipped with a nontrivial ultrametric absolute
value and consider the corresponding (ultra-)metric space. Then K is locally
compact precisely when the following three conditions are satisfied:
(1) K is a complete metric space.
(2) The residue field k = R/P is finite.
(3) \K x | is a discrete subgroup of R>o,
hence of the form 0Z for some 0 < 6 < 1.
4. Structure of p-adic Fields 97
Proof. Assume first that the field K is locally compact. Hence there is a compact
neighborhood of 0 in AT. This neighborhood contains a ball £<e(0), where e > 0.
This ball B<e is compact. Using dilatations, we see that all balls £<r(0) of K are
compact. Any Cauchy sequence in K is bounded, hence contained in a compact
ball: It must converge in K. This shows that K is complete (recall more generally
that every locally compact topological group is complete (1.3.2)). Now the residue
field parametrizes the open unit balls contained in the unit ball £<i(0). If this
last one is compact, the preceding partition in open sets must be finite, which
proves (2). Finally, since the open unit ball ^<i(0) is closed in the compact ball
fl<i(0), the continuous function x \-^ \x\ must attain a maximal value over the
compact set B<i(0). Call 6 < 1 this maximal absolute value. The only possible
nonzero absolute values are now the integral powers of 0. Indeed, a multiplicative
subgroup of R>o is either discrete or dense (1.3.4). (Alternatively, one could use
the last property of ultrametric distances mentioned in (1.1) for the compact sets
B^ - B<r, 0 < r < s.)
Conversely, assume that the three conditions are satisfied and choose an element
7T e K with largest possible absolute value less than 1: it e P C R => ttR C P.
The reverse inclusion also holds:
x e P => \x | < |it | => x = jt - x/n (x/n e R) => x e n R.
This proves that P = (jc) = ttR is principal. By the representation theorem
(1.4), the complete ring R is topologically isomorphic to the projective limit
R = lim R/nnR of the finite rings R/ixn R:
R = B<i (0) is isomorphic to R compact.
The field K is locally compact, since it has a compact neighborhood of 0. ■
4. Structure of p-adic Fields
4-1. Degree and Residue Degree
Let K be a finite extension of the field Qp of p-adic numbers. Hence K is locally
compact and complete. Let us choose an element n of maximal absolute value
smaller than 1, say 0 < \n\ = 0 < 1, and come back to the usual notation for
the ring
/? = {*€ K : \x\ < 1}
and its maximal ideal P = ttR. The residue field k = R/P is finite, hence a finite
^tension of ¥p - Zp/pZp. If / - [k : ¥p] = dimFp(k), then
k = Fg, q=#(k) = mVP))f = Pf,
98 2. Finite Extensions of the Field of p-adic Numbers
since there is — up to isomorphism — only one finite field having q elements.
Since the integer p belongs to P, we have
VP = \P\=Oe, \"\ = \P\lle
for some integer e > 1.
Definitions. The residue degree of the finite extension K of Qp is the integer
f = [k:Fp] = dimFp(k).
The ramification index of K over Qp is the integer
e = l\K*\ : |Q;|] = [|*x| : pz] = #(|/^X|/Jpz).
Warning. I hope that the degree / will not occur next to a polynomial /(X) or a
function /, or if it does, let me rely on the reader to distinguish them (using P for
a polynomial could similarly lead to a confusion with the maximal ideal P = it R
in a finite extension K of Qp, and here tt is not 3.14159...!) In the same vein, k
will usually denote a residue field and here, we try to avoid its use as a summation
index.
Let a\ and#2 £ Qp, *i and JC2 e K* be such that
|fll*l| = |fl2*2l (9*0).
Then |jci| = \a2/ci]\ ■ \x2\ e pz \x2\, and the absolute values of X\ and*2 belong
to the same coset mod pz. Consequently, in a finite sum
Y^cuxi (a, eQ£, xt e K*)
of nonvanishing terms, if the |jc, | belong to distinct cosets mod pz, we cannot
have a competition of absolute values, and necessarily £ atX[ ^ 0. This argument
shows that n — [K : Qp] = dim(K) > e. One can also see directly that n > f
(exercise!). Let us prove that n > ef (we even prove n = ef below).
Proposition. In the situation described in this section, we have ef < n.
Proof. Let us choose a family (s1/)i</</ in R such that the images si e k make up
a basis over the prime field ¥p. I claim that the elements
(si7rJ)\<i<f. 0<J<e
are independent over Qp. Consider indeed a nontrivial linear combination
4. Structure of p-adic Fields 99
where Xj = Yli cijsi- Then for each j there is an index £ = £(j) such that
\cij\ > |cl7| for all i,
and Xj/cij = Yli(cu/C£j)s* ~ YliYisi ls a nontrivial linear combination with
coefficients in R (and y£ = 1). Consider this relation mod P: Define yt = Yi m°d
P. Since (£),- is a basis of the residue field k = /?/P considered as a vector space
over its prime field, we have
i
simply because yt — \. Hence
£**. */>.
E^
and |jt/| = \ctj\ e |Q* | is a power of /?. There can be no competition among the
absolute values of the distinct terms Xj7TJ, and this proves
y^ cijSi7rj = 2\,xJ7tJ t£ ®-
This proves the expected linear independence, and hence the inequality stated in
the proposition. ■
Theorem. For each finite extension K of Qp, we have
ef = [K: Qp] - n.
Proof. By the above proposition it is enough to prove the existence of a set of
generators of K over Qp containing ef elements. We shall show that the family
of the Proposition generates the Qp-vector space K. For this purpose we use the
representation theorem (1.4) for the complete field K and the element § = p e
P C R. In this case R/pR is finite with representatives
5 = | J2 cVSi3t' : ° - C*J - P ~ l \ •
Hence one can write any element x e R as a series
x = ^2,ctp* (cteS).
If we write explicit expressions for the coefficients
l<i</.0<.7<:e
100 2. Finite Extensions of the Field of p-adic Numbers
we obtain
£>0 l</</,0<7<e
and if we sum in a different order (only £ can take infinitely many values, and
pl —► 0: The family in question is summable by the Proposition in (1.2)), then
x = J2 E^PJ-*«■*'■
But Cij = J2e cijtPE e Zp and x = J^-- cijSiit*. This proves that the e/ elements
■S/ttj (1 Si < /> 0 < y < p — 1) constitute a spanning set of the field K
considered as a vector space over Qp. Together with the proposition, this concludes
the proof of the theorem. ■
A finite extension K of Qp is said to be
unramified when e = 1, i.e., when [K : Qp] = /,
totally ramified when / = 1, i.e., when [AT : Qp] = e,
tamely ramified when p does not divide e,
wildly ramified when e is a power of p.
In other words, an extension K/Qp is
unramified when p is a generator of the maximal ideal P C /?,
totally ramified when the residue field does not grow.
Comment. Let us come back to the analogy between p-adic numbers and
functions of a complex variable already mentioned in (1.1.4) and (1.5.1), since it is also
responsible for the preceding terminology. Let us explain this in its simplest form.
Let § ^ 0 be a meromorphic function defined in a neighborhood of 0 in C. It is
known that there is a representation
^(Z) = ^2anzn (am^0)
validin a punctured disc 0 < \z\ < e. The smallest index m e Zsuchthat^ ^ Ois
the order off at the origin. This integer is positive when f vanishes and is negative
when § has a pole at the origin. In this way, we see an analogy between the field
L of meromorphic functions defined in a (variable) neighborhood of the origin in
C and the p-adic field Qp consisting of the formal expansions x = X^>m anPn
(m eZ). The functions that are holomorphic at the origin make a maximal subring
Lq of L comparable to Zp in Qp. The local construction of the field L is also a
justification for calling Qp a local field.
Now take an integer e > 1 and consider the change of variable
u \-^ z = ue : C -* C.
4. Structure of p-adic Fields 101
This is a canonical example of a ramified covering of degree e at the origin, in a
topological sense: The inverse image of any z ^ 0 consists of e distinct preimages,
while u = 0 is the only preimage of z = 0. Iff = Yln>m a^ (0 < kl < e) is as
before a meromorphic function in a neighborhood of the origin, we can make the
change of variable z = ue and obtain a new expansion
In this way, the field L is embedded in the field U consisting of convergent Laurent
series in the variable u. There is no function yfz defined in a neighborhood of z — 0
in C, so that the field V = L(zl'e) is a proper extension of the field of convergent
Laurent series L in the variable Z- This extension U is totally ramified over L, with
degree e: It is obviously comparable to the extension Qp(tt) of Qp if n = pl^e.
Observe that with meromorphic functions it is traditional to work with the order-of-
vanishing function ordo(f) = m, instead of a corresponding ultrametric absolute
value |f |o = 0m (for a choice 0 < 6 — \z\o < 1; there is no canonical choice for
6 here).
The rational field Q can similarly be compared to the field of rational functions
C(z), the completions Qp (letting now the prime p vary) corresponding to the
fields of meromorphic functions near a variable point a e C instead of the origin.
4.2. Totally Ramified Extensions
Let us recall the following well-known irreducibility result stated over Zp rather
than over Z.
Theorem 1 (Eisenstein). Let f(X) e Zp[X] be a monic polynomial of degree
n > 1 with f(X) = Xn mod/?, /(0) ^ 0 mod/?2. In other words,
f(X) = Xn+an-lXn-l+..-+a0,
ordfo) > 1 (0 < i < n - i), ordfao) = 1-
Then f is irreducible in the rings Zp[X] and QP[X].
Proof. Take a factorization / = g - h in ZP[X] — or in QP[X]; this is the same
by an elementary lemma attributed to Gauss — say
g = btXl + - - - + b09 h = cmXm + ... + c0.
Hence
£ + m = n, bgcm = 1, b0c0 = a0.
Since «o is not divisible by p2, p can divide only one of the two coefficients bo
and Co. Without loss of generality we can assume that p divides Co but p does not
divide Z?0- Consider now all these polynomials mod p. By assumption / = Xn is
a monomial, so that its factorization / = g • h must be a product of monomials
102 2. Finite Extensions of the Field of p-adic Numbers
and h = Co is a constant. Considering that bgcm = I, the only possibility now is
m = 0 and a trivial factorization. ■
The preceding argument mod p can be made directly on the coefficients. Let
r > 1 be the smallest power of X in h having a coefficient not divisible by p:
p does not divide cr but p divides cr_i, cr_2> • - •, Q)-
The coefficient of Xr in the product of g and /z is
ar = b0cr + fcicr_i + fc2Cr-2 H = b0cr + p(- - ■).
Since b$cr is not divisible by p, the preceding equality shows that p does not
divide ar either. By assumption, this shows that r =n. Sumrning up,
n=ra-f-£>ra>r=:fl
implies m = n and £ = 0. The factorization g h of f is necessarily trivial.
The same proof shows the following more general result.
Let A be a factorial ring with fraction field K, and n a prime of A. Any
polynomial
f = anXn + an-iXn~l H -ffloG A[X] {of degree n>\)
with an not divisible by n, a, divisible by n for 0<i<n — 1, ao not divisible by
it2, is irreducible in the rings K[X] and A[X].
Definition. A monic polynomial f(X) e ZP[X] of degree n > 1 satisfying the
conditions of the theorem, namely
f(X) = Xn modp, /(0) ^ 0 modp2,
is called an Eisenstein polynomial.
Theorem 2. Let K be a finite, totally ramified extension of Qp. Then K is
generated by a root of an Eisenstein polynomial
Proof. The maximal ideal P of the subring R = B<\ of K is principal and
generated by an element it with \7t\e = \p\. Since n = [K : Qp] = e by assumption,
the linearly independent powers (7tl)o^i<e generate K and K = Qp[tc]. The
irreducible polynomial of this element can be factored (in a Galois extension of Qp
containing AT) as
/(X) = Y\& -n") = Xe+J2 OtX1 ±Y\*a
The constant term has absolute value in^jr^) = \n\e = \p\ (by (3.3) all
automorphisms o are isometric), whereas the intermediate coefficients a, satisfy \a( \ < 1
4. Structure of p-adic Fields 103
(each is divisible by one tt° at least, and ax e Zp). Hence these intermediate
coefficients are in pZp as required: / is an Eisenstein polynomial. ■
Examples. (1) In the field Q2, — 1 is not a square (1.6.6), and we can construct the
quadratic extension K = Q2(i) = Qi[Xy(X2 + 1). Since
(1 + l)2 = i2 + 2i + 1 = 2i\
the element i -+-1 is a square root of 2i. With the (unique) extension of the 2-adic
absolute value we have
|i + l|2 = |2i| = |2| = i, 11 + 11=^1,
so that i + 1 is a generator of the maximal ideal P of the maximal subring R of
the field K: P = (1: -h 1)R. The quadratic extension K is totally ramified of index
e = 2, hence wildly ramified. Let jc = 1 + 1. Then jc — 1 = 1 and (jc — l)2 = — 1
shows that jc is a root of the polynomial
X2-2X+2 = (X- 1)2 + 1.
This is an Eisenstein polynomial (relative to the prime 2), and K = Q2O") is also
obtained as a splitting field of this Eisenstein polynomial.
(2) For p ^ 2 let us add a primitive pth root of unity to Qp. In other words, we
are adding to Qp a root of fp - I = 0 with ? ^ 1. Hence f is a root of
<Dp(X) = (Xp - 1)/(X - I) = Xp~l + • - • + X + 1-
This is the pf/z cyclotomic polynomial: It is irreducible, since the change of variable
X — 1 = Y produces
a>p(X) = [(r-hi)^-i]/r = ^-1-hP(--o + p,
an Eisenstein polynomial. Hence we obtain an extension of Qp of degree p — 1
prime to p. We shall prove that it is totally ramified. Since the powers f' are also
roots of the same equation when i is not a multiple of p, the powers f' (1 < i <
P — 1) form a complete set of conjugates of f, and
Obviously, <l>p(l) = 1 + h 1 = p, so that
But all absolute values |1 — f'| are equal by (3.3), since these elements are
104 2. Finite Extensions of the Field of p-adic Numbers
conjugate. The preceding inequality leads to
\p\= n ii-fi^u-tr1-
This proves that n = 1 — f is a generator of P in R: The extension tf = Qp(f)
is ramified with degree n = e = p — 1, hence totally and tamely ramified.
In the course of the preceding deduction we have used the uniqueness of
extension of valuations again. However, in the present context, it is obvious that
i-?i" = (i-?)(i + --- + r~1)
implies |1 —f'| < |1 — f |. But the roles of f and f' may be reversed: ff is also
a generator of the cyclic group [ip of order p when I < i < p — 1, so that £ is
a power (f' y of ff (take y" such that ly = 1 mod p). This furnishes the equality
11 — f' | = 11 — f |. By the way, this proves that
|l+? + --■ + ?
*-i|
i-r"
i-f
and l-hf-h----f~fz~1are units of the maximal subring R c ^ = Qp(f )•
These are the so-called cyclotomic units of tf. Since f = 1 mod P, we have
1 + ? H h f'"-"1 = i mod P.
4.3. Roots of Unity and Unramified Extensions
Let K be a (commutative) field of characteristic 0 and let fi(K) be the multiplicative
group consisting of the roots of unity in K. Since every element of this group has
finite order (by definition), we can apply the Sylow decomposition theorem (or the
Chinese remainder theorem) and write a direct product fi(K) = /jipoc(K) - /z(p)(tf),
where elements in jjlpx(K) have a pth power order and elements in fi^P)(K) have
order prime to p. We shall prove that when K is a finite extension of Q^, the group
/jL(p)(K) is finite, and compute its order. (In the next section we show the finiteness
of the group jjlpx{K).)
In any valued field, all roots of unity are on the unit sphere:
f» = i => |fr = in = in = i => if i = l.
In the case of an ultrametric extension of Qp,
KDA = B<:i DM = B<{,
we see that ji — fi(K) CAX C Kx. By reduction mod M,
e: A-> A/M = k
we obtain £(/z) C &x. To explain the effect of reduction mod M on roots of unity
let us give a lemma.
4. Structure of p-adic Fields 105
Proposition 1. Let K be any ultrametric extension of Qp. Then
Hpco(K) = iJL(K)n(l+M).
Proof. First, if f e n(K) has order a power of p, denote by \ = e(f) G k its
reduction. Then
fpf = 1 => fp/ = 1 G * => f = 1 => f G 1 + M,
since the field A: has characteristic p. Conversely, if r g 1 + M has order n > 1,
write f = 1 + f with 0 ^ |f | < 1. Then
1 = (1 + £)" = 1 +Wf + - -. + f» = I + f(w + f«)
implies n -h f a = 0, and
N = lf«|<|f|<l
implies p | n. If n ^ p, we can replace f by fp, which has order n/p > 1, and
iterate the procedure. Eventually, we see that n is a power of p. ■
Corollary 1. 77ie restriction of the reduction map eW/jl(K) has kernel [ip™ (K).
It is injective on jji(p)(K): The distance between two distinct roots of unity of
order prime to p is 1. ■
Corollary 2. If the residue degree f = f(K/QP) is finite, then the group
JjL(P)(K) of roots of unity having order prime to p in K is finite and
#(/W/0)<p/-1- ■
When K/Qp is finite, the next proposition shows that the order of fJ>(P)(K) is
exactly pf — I.
Proposition 2. Assume that the extension K of Qp is complete with residue
field k algebraic over Fp. Then we have a split exact sequence
(1) -» Hfc(K) -> fi(K) -> *x -» (1).
Jjff/ze residue field is finite, say f = [k : Fp] < oo, f/zen ?/ze cyclic group /i{p)(K)
has order pf — 1.
Proof. Let e : /x —> £x be the group homomorphism obtained by restriction
of the reduction (ring) homomorphism A —> A/M. It will be enough to show
that e induces an isomorphism fi(p)(K) = kx. By the preceding proposition, the
reduction map induces an isomorphism of /x(p)(AT) into kx. We have to prove that
it is surjective. Let a G kx and replace k by the finite field Fp(a) = Fg so that
106 2. Finite Extensions of the Field of p-adic Numbers
a is a root of unity of order prime to p, dividing m ~ q — I = #(ftx). Choose
an element a e A in the coset a (mod M) and consider the solutions x of the
following problem:
Xm - 1 = 0 with x = a (mod M) (i.e., e{x) = a).
Since m is prime to p, and tf is complete, Hensel's lemma (1.4) can be applied,
and this furnishes an element x in Kx with xm = 1; hence jc e /z(p)(tf) and
e(jc) = s(a) = a.
This proves that — when the residue field ft is algebraic — the restriction of the
reduction mod M is an isomorphism fi(p)(K) -> ftx. m
Application. Let K be a locally compact (i.e., finite) extension of Qp and adopt
the usual notation corresponding to this case:
K D R = BS1(K) D P = nR,
k - /?//>, / = [ft : ¥p], q = pf = #(ft).
Then we have canonical isomorphisms
V(P)(K) x (1 + P) ^> Rx (multiplication),
Li(P)(K) -> ftx (reduction mod P).
With a choice of jt, we also have an isomorphism
jrz x H(P)(K) x(l + P)4 tfx (multiplication).
We infer that if ps is the highest power of p such that K has a root of unity of
order ps, then
fip~(K) = fi(K) Pi (1 + P) has order p5.
The p-adic logarithm will furnish a way of analyzing more precisely the structure
of the abelian group 1 + P (cf. V.4.5).
It is useful to relativize the definitions of ramification index and residue degree
as follows. Let K C L be two finite extensions of the p-adic field Qp and denote
by R the maximal subring of K, P its maximal ideal, ft — R/P (residue field of
K) as before. Introduce the maximal subring Rl of L, the maximal ideal Pt of
RL, and kL — Rl/Pl (residue field of L). We can define
e = e(L/K) = [\Lx\ : \KX\],
f = f(L/K) = [kL:k] = dimkkL,
n = [L:K] = dim* (I).
4. Structure of p-adic Fields 107
Then n = ef simply because this relation holds for both index and degree over
QP
ri = e'f (where ri = [L : Qp],...)
n" = e"f" (where n" = [K : Qp],...),
and we can divide these relations,
#! = — = -■-£
#l" *" ' /""
Theorem. Ltff K (Z L be two finite extensions ofQp. Then there is a unique
maximal intermediate extension K C Kur C L that is unramified over K.
Proof. If the residue field kL of L has order qLi we have seen that Lx contains
a cyclic subgroup /x(p)(L) of order ^ — 1 consisting of the roots of unity having
order prime to p in L. More precisely, if q = #(£) and / = f(L/K) is the
residue degree of the extension, then qL — qf. The unramified extensions of
K contained in L correspond one-to-one to the extensions of k = Fq in kL. This
correspondance is order-preserving, hence the uniqueness of a maximal unramified
extension. Explicitly,
Km = K(fi(p)(L)) = K(tiqL-i) C L. ■
4.4. Ramification and Roots of Unity
Let us keep the notation introduced in the preceding section for the group of roots
of unity in the extension K of Qp.
Theorem. Let f be a root of unity in the field K having order p* (r > 1).
Then |f - 1| = \p\lM^ < 1, where <p(p') = pT~~1(p - 1) denotes the Euler
(p-function.
Proof. (1) Case t = 1, the root f has order p. In this case fp = 1 but f ^ 1 and
? = 1 + f (If I < 1) is a root of the polynomial (Xp - l)/(X - I):
(I -l. t\p __ i i
0=- ^ = -r(p$ + p£2x + Sp) (\x\<l).
Hence
p(l+$x) + $?-] = 0,
and since |f | < 1 and |jcJ < 1, we have |1 + px\ = 1,
\$p-l\ = \-pQ+px)\ = \p\,
k-l| = l?l = lp|1/<p_I)<l-
108 2. Finite Extensions of the Field of p-adic Numbers
Since this absolute value occurs frequently in p-adic analysis, let us introduce a
special notation for it:
I = |p|<|p|^r :=rp <1,
so that
r2 = \ = |2|, rp > ± (p odd prime).
(2) General case: The order of f is precisely p/+1 (t -h 1 > I). Then zpt has
order p, and by the special case already treated,
|^'-l|=rp<l.
Let us write f = 1 + rj with \rj\ < I, so that
^-l=(l+^f-l=/ + pW
with |v| < 1. Since
\pr)y\<\p\<rp = \l-Spt\,
we see that rp — |1 — fp'\ = \rjp'\ and finally \rj\ = rjp as expected. ■
Location of the 2nth roots of unity on the unit sphere
The appearance of the Euler ^-function is even more natural if we proceed as in
(4.2). Let us give this deduction as a reminder of the properties of the cyclotomic
polynomials. Recall that
Xp - 1
<J>„(X) =
p X-l
denotes the pih cyclotomic polynomial (of degree p — 1).
4. Structure of p-adic Fields 109
Location of the pnih roots of unity on the unit sphere (p = 3 and 5)
Then, it is well known that the p'th cylotomic polynomial (of degree (p(pT) =
pt~l{p — 1)) is given by
vp' i
If £ is a root of unity of order p\ then the other roots of unity having the same
order are the powers fJ of f, where the integer j is prime to p, hence the preceding
cyclotomic polynomial has a factorization
®pt(X) = x(p-1)pl~l + - - • + xpt~x + i
= n <* - *'■>
with a product restricted to the integers y prime to p: There are cp(pT) linear factors
in this product. On the other hand, substituting X = 1, we get
p= n &-&•
But f = 1 (mod P) and
iZ^. = l+f+... + f>-i=y (mod P).
When p is prime to j, we infer |1 - f-7! = |1 — f |, and all factors in the above
Product have the same absolute value,
|p| = |l-f|^), |f_I| = |p|lMp').
110 2. Finite Extensions of the Field of p-adic Numbers
Corollary 1. If the ramification index e = e(K) is finite, then the group {jlp^{K)
of roots of unity in K having order a power of p is finite. More precisely,
#(AV»(ff))<
i-l/p
Proof. In general, if the field K has a root of order p*, the preceding theorem
shows that the ramification index e is a multiple of y>(pT) = p* — p/_1. Hence
pt(l-l/p)<e.
This gives a bound for the order p* < ep/(p — 1), and
#(/V»(/0)< *P
p-1
Observe that the result of this corollary is valid for any valued field K of
characteristic 0, provided that its absolute value extends the p-adic one on Q.
In particular, if e = 1, we have #(/zpoo(tf)) < p/(p — 1),
#0vc(/O) = lifp>3,
whereas # (/z2oc( tf)) < 2 if p = 2. This proves again a result obtained in (1.6.7).
Corollary 2. 77ze grow/? of roots of unity in Qp is precisely
^(Qp) = M(p)(Qp) = Mp-i P odd prime,
MQ2) = W(Q2) = {±1}. ■
Example. Let A^ be the extension generated over Qp by a primitive pth root of
unity and K' the extension of K generated by a primitive root of unity of order p -
Both extensions are totally ramified. The degrees of these cyclotomic extensions
are determined by the previous theory, and a diagram summarizes the situation.
K' = Qp(fa)
degree p | wild
K = QPUP)
degree p — 1 | tame
The element n = £p — 1 has absolute value \tt\ = |p|1/(p_1) generating the group
of values \K*\: P = nR C R C K = Qp(fp). Similarly, the element it' = f p2 -1
has absolute value |jt'| = Ip^/Mp-D generating the group of values | tf/x|:
P' = tt'R' CR'CK' = Qp(fa).
4. Structure of p-adic Fields 111
4,5. Example 1: The Field of Gaussian 2-adic Numbers
The ring of Gaussian integers Z[i] is a square lattice generated by 1 and i = a/-T
in the complex field:
Z[i] = Z©iZcC.
It is known that this ring is a principal ideal domain. We can also embed it in an
algebraic extension of the 2-adic field Q2. Since we have seen that —1 has no root
in Q2, the extension K = Q2(i) has degree 2 over Q2. Observe that (1 + i)2 = 2i;
hence |1 -f i\ = |2|1/2, and this extension is totally and wildly ramified: e = 2.
The general notation gives in this case
K = Q2(i) D R = Z2[i] d P = (1 + i)R.
We shall consider the generator
7T = I - 1 = 1(1 + I)
of the maximal ideal P,
n2 = -2i, \n\ = |2|1/2.
Since the residue field of K is
k = R/P = F2,
we can consider representations with "digits" in the representative system
s = {0,1} CQ2C Q2(0-
Expansions in base b — tt of nonzero elements of K = Q2(i) have the form:
E,->v«.-* to e5, vgZ, o^O),
while elements of Z2[j ] have expansions
E,>o^, to e 5).
A parametrization of Z2[i] is given by the set of binary sequences, hence a bijective
map
<D:SN->Z2[i], fe)^X>fc',
orequivalently,
<S> : 7>(N) -> Z2[i], J h> J2b'-
j
Proposition. The elements of Z2[i] admitting a finite expansion
Eo<;</2fl*^> (where at e S, n e N) in base b are precisely the Gaussian
112 2. Finite Extensions of the Field of p-adic Numbers
integers, and we have
Z[i] = \ ^2 &? : J a finite subset of IS } .
Proof. Since Z[i ] is a ring containing I and b, it certainly contains all polynomials
in b. We have to prove the converse inclusion, namely:
Every Gaussian integer admits a finite representation in base b.
Let F = <!>(S(N)) C Z[i] be the image of the finite binary sequences. It will
be enough to prove that this image is a subgroup of Z[i], since it contains the
generators 1 and b. In other words, we have to prove
Starting with
we infer successively
F + F C F and - F C F.
b = i-l, fe+l=i\
b2 = -2i, b4 = -4,
b\b + 1) = 2,
fc2(fc+l)+l=3,
fe4+fe2(fe + l)+l = 3-4 = -l.
We have obtained the expansions
2 = b2 + fc3,
and more generally,
-1 = i+t2 + b* + b49
2bl =6/+2+6,+3>
-#' = £>,"+£>,"+2+i,+3+£>,"+4.
These expansions give reduction algonthms to prove that for finite subsets J and
JCofN,
JV + JV gF,
J
4.6. Example 2: The Hexagonal Field of3-adic Numbers
Here, we consider the quadratic extension K — Q3(\/^3) of the field Q3 of 3-
adic numbers. Since it is obtained by adjunction of the root of a generator of the
maximal ideal 3Z3 of Z3, it is totally and tamely ramified with index e = 2. This
4. Structure of p-adic Fields 113
quadratic extension contains f = (1 4- \/^3)/2, which is a root of unity of order
6: One can check in succession
?2 = ? - 1. ?3 = ?2 - ? = (? - 1) - ? = -1.
Also observe that if we add a root rj of unity of order 3 to Q3, we obtain a totally
ramified extension of degree 2, for which rj — 1 is a generator of the maximal ideal.
In fact, we can take 77 = f2 and check (with the 3-adic absolute value)
(17 - l)2 = f4 - 2?2 + 1 = -f - 2(f - 1) + 1 = 3(1 - f) = -3?2,
and since 17 + 1 = f2 4- 1 = f, it follows that [77 + 11 = I and
W - H2 = |i? - II2 = |-3f 2|, |i? - 1| = I3I1'2 = VT73.
We shall now take the generator & = \f^b and consider expansions J2ia'bl navmS
coefficients at in a fixed set of representatives (containing 0) of the residue field
k = R/P = R/bR = Z3/3Z3 = F3.
We could take {0, 1, -1} as a set of representatives. However, we shall take S =
{0,1, f}: Indeed by definition 2? = 1 + fc, so that
-1 = -2? + fc = f+fc-3f=f+fc + ffc2 = f (mod ft)
and we can replace the representative — 1 by f. It is easy to check that
2? = 1+6,
2 = ^ + 6 + 62 + f^3,
i + f = £fc + ffc2 + fc3 + ffc4.
These relations show how to compute sums. Finally, a picture shows(\) how the
image of the finite ternary sequences
F = <S>(S(N)) C Z[f ]
J
1
WB*1 A A »
1/,» '_»
A A
'-» A '"» A
A „ -->
A
Finite sums 5^o<i<3'' a'^'
fills in the whole lattice Z[f ].
A A .
A A
bA
*r
L
1
114 2. Finite Extensions of the Field of p-adic Numbers
As in the preceding section, we have obtained unique representations for
the elements of the hexagonal lattice Z[f ], which is the ring of integers in
0(7=3).
Proposition. Let b = V^3, f = (1 + V^A and S = {0, 1, f}. Then the
finite sums Ylj aj& (aj £ S) fill up the hexagonal lattice Z[f ] in C (or in
K = Q3(V==3)). ■
4.7. Example 3: A Composite of Totally Ramified Extensions
Let us consider the following quadratic extensions of Q3:
Ki = QsCV^), K2 = Q3(V3).
They are both totally (tamely) ramified, since |V=3| = |V3| = |3|]/2. Hence
n = e = 2, / = I for both. Let K — K\- K2 denote the composite (in a common
extension). Obviously, V=T = \/^3/-\/3 g tf, and the cyclic group of roots of
unity in K contains /X4. But the residue field of Q3 is F3 = Z3/3Z3; it contains
only the roots of unity ±1. Hence the residue field of K contains the quadratic
extension F9 and its cyclic group of units fig. On the other hand, as we have seen
in the preceding example,
Ki = Q3(VZ3) D Q(V=3) D Z[f 1
where f = f6 = (1 + V—3)/2 is a root of unity of order 6. Altogether, tf contains
/x8 • /x3 = /X24 (Chinese remainder theorem). Both the residue degree and the
ramification index of K must be greater than 1. The only possibility is e(K) = 2,
/(/O = 2 (and h(/0 = 4).
Q3(V3, V=3) = Ch(V3, V^T)
/* \
^1 K2
\ /
Qs
It is interesting to observe that although both K{ are totally ramified over Q3, their
composite K is not totally ramified over Q3. In fact, take an odd prime p and
a positive integer a prime to p that is not a square mod p. Then the quadratic
extensions Qp(^fp) and Qp(^/ap) are nonisomorphic and totally ramified over
Qp. But they generate
Qp(Vp, Vw) = Qp(Jp, VS),
which contains the unramified quadratic extension Qp(^/a) of Qp. The image of
yfa in the residue field of Qp(V^) *s a square root of a mod p. Hence / > 2, and
since e/ = 72 = 2, we have e = 1.
Appendix to Chapter 2: Classification of Locally Compact Fields 115
Appendix to Chapter 2: Classification of Locally
Compact Fields
In this appendix we shall give an approach to the classification of locally compact
(commutative) fields of characteristic 0. This contains our main case of interest,
namely that of ultrametric fields. For this purpose we shall take for granted the
existence of a Haar measure on such a field: On any locally compact group G
there exists a positive Radon measure /i on G — or equivalently a regular Borel
measure /x on G — that is left invariant. Thus we view this measure either as a
(1) positive continuous linear functional
/z:Cc(G;R)^R, / h> /x(/)
on the space of compactly supported continuous functions on G, invariant under
left translations
M/) = / A*)dn(x) = f f(gx)dfi(x) (g e G),
Jg Jg
or as a
(2) o -additive function on a suitable a -algebra of subsets containing the relatively
compact open sets U of G. We also write /x(£/) for the measure of the subset U.
If U is a relatively compact open subset of K, we denote by vol (U) the measure
of U. By left invariance of this measure, we have vol (U) = vol (gU) for any
g eG. The Radon measure can be extended as a linear form on a vector space of
functions containing the characteristic functions of relatively compact open sets
V C G, and if we denote by cpu the characteristic function of U, the two points of
views are linked by the relation vol (U) = fi(<pu)- By abuse of notation, we shall
also write vol (U) = fi(U).
The uniqueness of Haar measures will play an essential role and will be admitted
here without proof:
Let fi and v be two Haar measures on a locally compact group G;
then there exists a positive constant a such that [i = av.
For a general classification of locally compact fields, not necessarily
commutative and in any characteristic, the reader can consult the references given at the
end of this volume.
A.L Haar Measures
Let K be a locally compact commutative field (the general definition of topological
fields was given in (1.3.7)) and let us choose and fix a Haar measure \x on the additive
group K. By invariance, we have vol (U) = vol (U + a) for any a e K.
For any automorphism a of the field K, the invariant measure a(/x) defined
by a(/x)(L/r) = /ji(aU) (for all U in the suitable o-algebra) is proportional to
M, say a(fi) = m(a) • \jl. Since two Haar measures are proportional, this scalar
^faO is independent of the choice of Haar measure. Now take in particular for
116 2. Finite Extensions of the Field of p-adic Numbers
automorphism a an automorphism of the form a : x h> ax where a ^ 0 e K. In
this case we shall simply denote by m(a) the resulting scalar. By definition
vol(aU) = m(a) ■ vo\(U) (a e Kx).
The associativity of multiplication in K gives immediately
m(ab) = m(a)m(b) (a,b e Kx).
Hence m is a homomorphism K* -> (Rx)>0, m(\) — 1 and m{a~l) = m(a)~l.
This homomorphism m is the modulus of K. It is conventionally extended by
m(0) = 0. We shall eventually show that it is a generalized absolute value on K.
A.2. Continuity of the Modulus
Take a compact neighborhood V of 0 in K and choose a e K. Since a V is compact
and the Haar measure is regular, for each s > 0 we can find an open set U D a V
with
voI(£/)<voI(aV) + £.
By continuity of multiplication in K, there is a neighborhood W of a such that
U d WV. Thus forx e W
vol (jc V) < vol (U) < vol (aV) + e,
m(jc) < m(a) + e/vol (V).
Since m(*) > 0 and ra(0) = 0, this inequality proves that m is continuous at the
point 0. It also proves that m is upper semicontinuous at each point a e K. But for
fl^Owe can write m(a)= \/m(a~l), whence m is also lower semicontinuous at
such points. This proves the continuity of the modulus on K.
A.3. Closed Balls are Compact
For r > 0 we denote by Br = {x e K : m{x) < r} a closed ball in tf. Fix again
a compact neighborhood V of 0 in K. We shall prove
Br is contained in a compact set of the form y V.
As a.first step, we construct a sequence (t^^o C V with ttw —> 0. Since
0 • V = {0} C V,
there is a neighborhood U of 0 in K for which we still have U V C V (take Vo
an open neighborhood of 0 in V and choose £/ such that U V is contained in Vb)-
We can find an element n e U H V with 0 < m(7t) < 1. Hence n2 e UV C V>
7t3 = n -7t2 e UV C V, and by induction, itn e V (n > 1). But V is compact, so
that the sequence {7tn) must have a cluster value tt' in V. By continuity of m, m(7r0
must be a cluster value of the sequence (m(7rn)). Since m(nn) = m{jt)n —> 0,
the only possibility is m(7r') = 0 and n' = 0. This proves that the sequence
Appendix to Chapter 2: Classification of Locally Compact Fields 117
(nn) has only one cluster value in the compact set V: It must converge, and
yrn -> 0. Finally, observe that since n e U and UV c V, we have n V C V and
V C fl~l V. We see by induction that the sequence of compact sets Tt~n V increases
monotonically.
Second step: We show that Br CZ7t~NV for some large TV > 1. Since we already
know that Br is closed and n~NV is compact, this will indeed show that Br is
compact. Let a e Br.By the first part nna -► 0, and there is a first integer n such
that 7Tna eV.IfagV, this first positive n is such that izna e V but 7Tn~1a g V.
In other words, 7tna e V — ttV. The set V — n V is relatively compact (in V
compact) and
0£Q:= V -nV.
We can define r' = info m(x) > 0 and choose N > 1 such that m{jt)N - r < r'.
Hence
m(7t)N r </ < m{7tna) = m(7r)nm(a) < m(7rfr (a e Br).
This shows that m(7t)N < m(7t)n and hence n < N. Thus we have a e 7t~nV C
Tr-w v for all a e Br: The ball Z?r is contained in the compact set n~N V. ■
Corollary 1. The balls Br (r > 0) make up a fundamental system of
neighborhoods ofO in K. In particular,
an -* 0 in K ^=> m(a) < 1.
Proof. If V is any compact neighborhood of 0 in K, put r = maxv m(x) in
order to have V C Br. Since 0 is not in the closure of Br — V, the minimum r'
of m(x) on the closure Q of Z?r — V is positive; for 0 < r" < r' it is clear that
£r» C V. ■
Corollary 2. Any discrete subfield of K is finite.
Proof. Let F be a discrete subfield of K. Choose any a e K with m{a) > 1. Then
we have m{a~n) = m(a)~~n -> 0, whence g_/i —> 0, and since F is discrete it
shows a i F. This proves F c Bx. But we know that F is closed (L3.2). Thus F
ls compact and discrete, hence finite. ■
Remark. If the field K has characteristic 0 but is not assumed to be commutative,
we see here that its center is a locally compact nondiscrete (commutative) field.
Indeed, this center is closed and contains the rational field Q by assumption, hence
ls not finite. It is locally compact and not discrete by Corollary 2.
118 2. Finite Extensions of the Field of p-adic Numbers
A. 4. The Modulus is a Strict Homomorphism
We claim that V = m(Kx) is closed in R>0 and m : Kx -► T is an open map.
For r > 0, the compact set m(Br) is simply m(Br) = {0} U (r Pi [0, r]). In
particular, if 0 < £ < r < oo, T Pi [e, r] is closed in R>0- Since the interiors of
the intervals [e, r] cover R>0, we can conclude that V is closed in this topological
space. If V is a neighborhood of 1 in K*, we have now to prove that m(V) is
a neighborhood of 1 in T. It is enough to show that for every sequence (yn) in
T such that yn —► 1, there is a subsequence yni in m(V) (a subset A is not a
neighborhood of 1 in T when there is a sequence yn -> 1 in T and yn & A). Let
us write yn = m(xn) for some elements xn e V. Since V is compact, the sequence
(xn) must have — at least — one cluster point x e V. By continuity of m, m(x)
must be a cluster point of m(xn) = yn —> I. This proves m(x) = 1, namely
x e N := ker(m) C ^ x. But VN is a neighborhood of jc e N. By definition of a
cluster point, for each no there must be an integer n > no with jcw e ViV and hence
yn = m(xn) e m(VN) = m(V). This proves the existence of the subsequence of
(yn) in m(V) as desired.
Corollary. If the field K is locally compact and nondiscrete, the subgroup
m(K*) is either R>0 or of the form {6n : n e Z} = 6Z for some 0 < 0 < 1.
W/zen C = max {m(l + x) • ^ G #i} = I, the second case occurs.
Proof. Since 1 +B\ is a neighborhood of 1 in K x, its image must be a neighborhood
of 1 in r. When C = 1, this neighborhood is contained in (0, 1] and its image
under t \-> t~* is a neighborhood of 1 in T contained in [1, oo). The intersection of
these two neighborhoods of 1 in T is reduced to the single point {1}, thus proving
that T is discrete in this case. ■
In an obvious sense, the modulus m defines the topology of K: Any
neighborhood of an element x e K has the form x + V for some neighborhood V of 0 in
K, and m( V) contains a neighborhood of 0 e T, namely,
there is an s > 0 such that m{x) < s =$> x e V,
which implies that the given neighborhood x + V contains x + Be.
A. 5. Classification
Let us recall the result obtained above (Corollary 2 in A.3): In a nondiscrete locally
compact field, any discrete subfield is finite. Now the discussion of cases can be
made according to the value of the constant
C = max m(\ 4- x) > 1.
It is obvious that
m(a -\-b) < C - max (m{a), m{b)),
Appendix to Chapter 2: Classification of Locally Compact Fields 119
since if 0 ^ m{a) > m(b), we can divide by a so that x = b/a e B\ and
tn(l +b/a) < C, m{a-\-b) < C • m(fl) — C • max(m(fl), m{b)). Hencem defines
a generalized absolute value (II.2.2) on AT. In every case, a suitable power of C
will be less than or equal to 2, and a power of m is a metric defining the topology
of K. This shows that any locally compact field is metrizable.
First case: C > 1. In this case, the field K is not ultrametric; hence it is
automatically of characteristic 0 and contains the field Q. If K is not discrete, Q is not
discrete either (because infinite, by the result just recalled), and the metric induced
by K on Q must be equivalent to the usual Archimedean metric. The completion
R of Q for this metric must also be contained in K. Hence K is a real vector space.
Being locally compact, it must be finite-dimensional. One can show that the only
possible cases are K = R. C (or H: Hamilton quaternions if it is not commutative).
Second case: C = 1. Then K is ultrametric. If we assume K to be of characteristic
0, it contains the field Q, and as before, the induced metric on Q is not trivial. By
the classification of ultrametric absolute values on Q we infer that K must induce a
p-adic metric on Q and contain a completion Qp. Since K is assumed to be locally
compact, its degree over Qp is finite (II.3.2). We leave out the positive characteristic
case (interested readers can find a complete discussion in the specific references
given at the end of this book).
It is easy to see that contrary to the real case, there are extensions of Qp of
arbitrarily large degree (cf. (III. 1.3)).
A.6. Finite-Dimensional Topological Vector Spaces
In order to approach the structure of locally compact fields (having no a priori
norm), we have to give a few general definitions and results concerning
topological vector spaces. Instead of limiting ourselves to the field of scalars Qp, let us treat
the case of arbitrary valued fields: This general context has the advantage of
emphasizing the individual properties needed to establish each result. Thus we shall
consider in this section that K is any ultrametric valued field (II. 1.3), nondiscrete:
1^*1 =£ {1}. In particular, K is a metric space.
Definition. A topological vector space over K is a vector space V (over K)
equipped with a Hausdorff topology for which
the additive group V is a topological group,
the multiplication (a,v)h-> a • v : K x V —> V is continuous.
Let U be a neighborhood of 0 in such a topological vector space. By continuity
°f multiplication at (0, 0). there is £ > 0 and a neighborhood UoCUofO such
that
Ui := {av : a e AT, \a\ <e, v e Uq} C U.
120 2. Finite Extensions of the Field of p-adic Numbers
This neighborhood U\ C U of 0 has the property
a e K, \a\ < 1 =» aU\ C t/i.
Definition. A nonempty subset U in a topological vector space V is balanced
when
ae K, \a\ < 1 =» aU c U.
The balanced neighborhoods of 0 in a topological vector space play the role of the
balls in normed spaces. We have just proved that in a topological vector space
there is a fundamental system of neighborhoods of 0 consisting of balanced
ones.
Theorem 1. A one-dimensional topological vector space V over K is
isomorphic as a topological vector space to K. More precisely, for each 0 ^ v e V,
the map a\-+ av : K —> V is a bijective linear homeomorphism.
Proof. Fix 0 ^ v e V. The one-to-one linear map a t-+ av : K -> V is
continuous, since V is a topological vector space over K. We have to show the
continuity of the inverse, namely
Vs > 0 3 U neighborhood of 0 in V such that av e U =» \a\ <£.
We proceed as follows. If £ > 0 is chosen, we take b e K with 0 < \b\ < £ and a
balanced neighborhood U of 0 in V such that U $ bv ^ 0 (this is possible, since
we assume that V is Hausdorff). Now, ifav e U, then
b
bv = - - aa i U =*>
a
"*—v—' U balanced
> 1 =» \a\ < \b\ < s.
Lemma. A linear form cp : V —* K on a topological vector space V is
continuous precisely when its kernel is closed in V.
Proof. If the linear form cp is continuous, its kernel is closed. Conversely, assume
that the kernel of y is closed. We may assume <p ^ 0 and take v0 e V with
<p(vo) ^ 0. Replace vq by Vo/<p(vo), so that <p(vo) = 1 - The linear variety
{<p = 1} = i;0-hkerp
is closed and does not contain the origin. Hence there is a balanced neighborhood
U of 0 that does not meet this closed subset:
(u0 + kerp)n£/ = 0.
Appendix to Chapter 2: Classification of Locally Compact Fields 121
I claim that cp(U) C #<i, so that ip is bounded, continuous at the origin, and
continuous. Now, if v e U and <p(v) ^ 0, consider the scalar a = l/(p(v). We have
<p(av) = 1 ==> av g U ==> \a\ > 1.
U balanced
This proves \cp(v)\ < 1, as expected. ■
Theorem 2. Assume that the field K is complete. Then a finite-dimensional
topological vector space V over K is isomorphic as a topological vector space
to a Cartesian product Kd. More precisely, for any basis (ef-) of V, the linear
map
(ki)*-+J2kiei:Kd-+V
i
is an isomorphism of topological vector spaces.
Proof. We proceed by induction on the dimension of the vector space V: The
dimension-1 case is covered by the first theorem. Assume that the statement is true
up to dimension d — 1. If dim^ V = d, select a basis e\,..., ed of V and consider
the linear span W of the first d — 1 a. By the induction assumption, the space W
is isomorphic to Kd~l and hence complete and closed in V. The linear form
ip : y^^i^i »-> ^</> V —* K
i
is continuous, since its kernel kerfy?) = W is closed. The one-to-one linear map
Kd = K"-1 xKXwxKed^V
is continuous. Its inverse is
x\-+{x- cp(x)ed , <p(x)ed)
and hence is also continuous. ■
A.7. Locally Compact Vector Spaces Revisited
We have seen in (3.2) that locally compact normed spaces V over Qp are finite-
dimensional. Using the existence of Haar measures, we can now prove the same
statement without the assumption that the topology is derived from a norm.
Theorem. Any locally compact vector space over Qp is finite-dimensional.
Proof (Weil). The proof is based on (A.6): A finite-dimensional subspace of a
locally compact vector space V over Qp is isomorphic as a topological vector space
to a finite product Q^, hence is complete, and hence is closed in V and locally
compact. Let now V be any locally compact vector space over Qp. In particular,
122 2. Finite Extensions of the Field of p-adic Numbers
it is a locally compact abelian group, and we can choose a Haar measure jjl on V.
We can define a modulus homomorphism
mv : Q* -* R>0
as for locally compact fields (A.l). For 0 / a e Qp, the map U h» vol (af7) is
also a Haar measure on V, and by uniqueness, there is a unique positive scalar
my(a) > 0 such that \o\(aU) = mv(a) • vol(U) (for all relatively compact
open sets U C V). For example, If W = Qdp has dimension d over Qp, then
mw(a) = \a\d- Since //* -> 0 in Qp, we have
wv(p)n = mv(Pn) -* 0,
and this proves mv(p) < 1 for all locally compact Qp-vector spaces V. Select
now a /i-dimensional vector subspace W of V. Integrating in succession over W
tmdF= V/W,
f \-> / dfiF(y) / /U + y)d/xw(x),
Jf Jw
we get an invariant Radon measure on V, which we may take for /xy (or we can
change the choice of Haar measure on F to obtain this equality). Hence
/ f(x)d/xv(x)= / dfifiy) I f(x + y)dfiw(x)
Jv Jf Jw
for all continuous functions / with compact support on G. We see that
mv{a) = mw(a) ■ mF(a) = \a\d ■ mF(a),
mv(p) = \p\d-mF(j>)<\p\d,
logmv(p) <d-log|p|,
and by division by log \p\ < 0,
d < logmv(p)/log \p\.
This shows that the dimension d of finite-dimensional subspaces of V is bounded,
and this implies that V itself is finite-dimensional. ■
A.8. Final Comments on Regularity of Haar Measures
Let us consider the Haar measure on the locally compact group G =Rxfe
where the first copy of R has the usual topology and the second copy the discrete
topology. The usual Lebesgue measure /z^ is a Haar measure on R, and we can
take for Haar measure of R^ the counting measure
/x^(A)-#(A) (AcRj).
Exercises for Chapter 2 123
The product of these two Haar measures is a Haar measure on the product R x R^.
The subset A = {0} x R^ has the discrete topology, and
fi(A) = inf fi(U) = oo,
r/opeiOA
simply since each open set U D A contains an uncountable family of open intervals
of positive length. However, a compact set K C A is finite (because discrete), hence
fi(K) = 0, and sup^ compact cA ti(K) = 0 is different from fi(A) = oo. In general,
inner regularity holds only for subsets having /x(A) < oo (and in a suitable algebra
containing the Borel subsets). This pathology disappears in locally compact spaces
that are countable at infinity. This last property holds for all locally compact fields:
we have seen this in characteristic zero in (A.5).
Exercises for Chapter 2
1. Let X be an ultrametric space. Show that the spheres of radius r > 0 in X are the
complements of one open ball of maximal radius r in a closed ball of radius r.
2. Let X be an ultrametric space.
(a) Fix a positive radius r > 0. Show that the condition d(x, y) < r is an equivalence
relation x ~ y between elements of X. The equivalence classes are the closed
balls of radius r, and the quotient space is the uniformly discrete metric space of
closed balls of fixed radius r (the inequality d(x, y) < r also defines an equivalence
relation, for which the equivalence classes are the open balls of radius r).
(b) Fix a e X and assume that {d(x, a): x e X] is dense in R>0. Show that the ordered
set of closed balls containing the point a (with respect to inclusion) is isomorphic
to the half line [0, oo) c R.
(c) Assume that for each x e X, [d(x, y) : y e X} is dense in R>o- Define Tx as the
ordered set of closed balls in X (with respect to inclusion). Prove that this is a tree.
Recall that we denote by 8(A) the diameter of a bounded subset of a metric space,
so that 8B<r = r. We have two natural maps
XxR>0 -* Tx (fl,r) h> B = B<r(a)
Tx
is
R>o
(a,r) h> B = B<cr
i S
8(B) = r
For r > 0, the fiber S-1(r) is the uniformly discrete metric space consisting of
closed balls of fixed radius r. If X is separable, this fiber is countable. For any subset
A C X define Tx(A) as the subset consisting of the (dressed) balls B meeting A.
Prove that this is a subtree of Tx- Take for A successively sets containing only one,
two, or three elements: What are the possible configurations?
id) The metric space Zp can be embedded in an ultrametric space X satisfying the
condition required in (c) (cf. Chapter III). Sketch Tx(Zp) and show that the picture
does not depend on the choice of ambient space X.
Let |. | be an absolute value on a field K.
(a) Prove the triangle inequality
l* + y|a<|jc|a + |y|a (x,yeK, 0 < a < 1).
124 2. Finite Extensions of the Field of p-adic Numbers
(b) When the absolute value is ultrametric, prove the same result for all a > 0.
(c) If a = a0 + £i<i<rt fl< and \ai\ < lfll for 1 < / < n, prove |a| = maxo<,<„ |a,-|.
4. As corollary of the proof of Theorem 1 of (II. 1.4) we see that (with the notation of the
theorem): If AfeA is finite, then A/£nA is also finite and #(A/%nA) = [#(A/f A)f.
More generally, show that in any integral domain A,
#(A/(ab)) = #(A/(a)).#(A/(b))
ifab ^ 0. (Hint. Observe that multiplication by b on (a) — a A leads to an isomorphism
of the A -modules A/(a) and Ab/(ab). Then use the isomorphism A jab A = A/a A x
aA/abA.)
5. (a) Let P(X) = X2 - 2X + 1 € Z[X]. This polynomial has the root x - 1. Find
explicitly the sequence of iterates given by Newton's method starting at an element
x ^ 1: Does this sequence converge in Qp?
(b) Let >4 be the maximal subring of an ultrametric field as in (1.4), and let P(X) e A [X]
be a polynomial having a simple root x = £.
Show that for any x in the open ball of center £ and radius \P'{t;)\ ^ 0 Newton's
method furnishes a sequence of iterates that converges to £.
6. Prove directly the following: If an -> 0 and bn -> 0 in an ultrametric field, then
cn = £o<i</i fl/fc«-/ -* ° and
n>0 n>0 n>0
[/#nf. The assumption implies that the two sequences are bounded, say
Iflil <C \bi\ <C for all i > 0,
and for each given £ > 0 there exists N = Ne such that
|flil<*, \bi\<e (i>N).
For i + j > 2iV, we have |a/£y| < eC, since one index at least is greater or equal to
N.]
7. Show that two norms on a vector space define the same topology when mere exist two
constants c, C such that
c||*|| < \\x\\' < C\\x\\.
(The unit ball for one norm must contain a ball for the other norm; observe mat mis
condition is independent from the ultrametricity.)
8. Let K'/Kbea finite extension of ultrametric fields. Show directly that the residue field
k! of K' has finite degree over the residue field of K and
/ - [*': k) < n = [Kf : K]
(cf.4.1 and 4.3).
9. Let K be a valued field that is an extension of Qp, and let ? € K. Suppose that there
exist integers ao(j), a\(j) an-\(j) € Z (j > I) such that
Ifn +flfi-iO-»n-1 + " ■ ■ +000)1 -► 0 (/ -► oo)-
Exercises for Appendix to Chapter 2 125
(a) Show that |£ | < 1. (If you cannot, glimpse at the proof of Proposition 3 in (IIL2.1)).
(b) Prove that £ is algebraic of degree less than or equal to n over Qp.
(Hint. Consider the nonempty sets Xm c (Z/pmZ)n consisting of the families
(ao mod pm,... ,^-1 mod pm) such that |£" H-fln-i^"1 + ■ • - + aol < \pm\ =
\/pm- Then any element of HmXm ^ 0 gives a polynomial dependence relation
for £ over Qp.)
10. Let s < t and £ a root of unity of order p*, £' a root of unity of order p', both in Q£.
What is the distance |£ - £'|?
11. Let A' be an ultrametric extension of Qp. Prove that if the group /x(A') of roots of unity
in K is infinite, then this field K is not locally compact. (Hint. Can you find a convergent
subsequence?)
12. Show that the quadratic extensions Qs(>/2) and Qs(V5) of Q5 in Q<j coincide, by two
methods:
(a) Use the fact that 6 has a square root in Q5.
(b) X2 - 2 and X2 - 3 are irreducible over F5 (hence /x24 C Qs(\/2), 1^24 C Qs(V^)).
13. Consider the following quadratic extensions of Q7 in Q^
Qi(V^l), Q7(V2), Q7(>/3), Q7(V5), Q7(V6).
By (1.6.6), they cannot be distinct: Give identities. What is the degree of Qj(y/2, V3)
over Q7? (What is the degree of Q(>/2, V3) over Q?)
Exercises for Appendix to Chapter 2
1. Let U be a neighborhood of 0 in a topological vector space V over a valued field K.
Show that
n w
Jt€ff,|Jt|>l
is a balanced neighborhood of 0 contained in U.
2. Let A' be a nondiscrete ultrametric field. Assume that A is not complete and consider
the topological vector space K over K.lfa,b e A' are linearly independent over Ar, the
two-dimensional subspace Ka + Kb is not isomorphic, as a topological vector space,
to A2. (Hint. The one-dimensional subspaces of A'2 are not dense in this space!)
3. Let A be a finite extension of Qp (hence locally compact). A character of K is a
continuous homomorphism x :A->U(l) = {zeCx : \z\ = 1}.
(a) Prove that such a character x is locally constant and takes its values in /xpo=.
(b) If jj/ is a fixed nontrivial character, consider the characters i/a(x) — jj/(ax) (a e K).
Show that a h> ij/a is an injective homomorphism f . K ^> K- where A'r is the
(multiplicative) group of characters of A\ (For a nontrivial character on K, one can
take the composite of the trace Tk/qp and the Tate homomorphism r (1.5.4).)
(c) Define a topology on Ar having for neighborhoods of a given character x the
subsets
VsMx) = {*' 6 K~ : Ix'to - xtol < s}
(s > 0, A a compact subset of A). Show that the above-defined homomorphism
/ : a h-> jj/a is continuous.
126 2. Finite Extensions of the Field of p-adic Numbers
(d) Show that the inverse homomorphism tya h> a is continuous on f(K). Conclude
that this image is locally compact, and hence closed in Kz. (Hint. Use Corollary 1
in (L3.2).)
Comment For any locally compact abelian group G, one can define its Pontryagin
dual
G-* = {x:G->U(l)a continuous homomorphism}
and show that Gz is again a locally compact abelian group with (G~)~ canonically
isomorphic to G. When G = K is the additive group of a locally compact field, one can
show (as above) that K and K~ are isomorphic. This generalizes the known situation
for the field R.
3
Construction of Universal
p-adic Fields
In order to be able to define ^-valued functions by means of series (mainly power
series), we have to assume that K is complete. It turns out that the algebraic
closure Q* is not complete, so we shall consider its completion Cp: This field turns
out to be algebraically closed and is a natural domain for the study of "analytic
functions." However, this field is not spherically complete (2.4), and spherical
completeness is an indispensable condition for the validity of the analogue of the
Hahn-Banach theorem (Ingleton's theorem (IV.4.7); spherical completeness also
appears in (VI.3.6)). This is a reason for enlarging Q£ in a more radical way than
just completion, and we shall construct a spherically complete, algebraically closed
field Qp (containing Q* and Cp) having still another convenient property, namely
\Qp\ = R>o- This ensures that all spheres of positive radius in Qp are nonempty:
B<r(a) ^ B<r(a) for all r > 0. In fact, we shall define the big ultrametric extension
&p first — using an ultraproduct — and prove all its properties (this method is
due to B. Diarra) and then define Cp as the topological closure of Q* in Cp. This
simplifies the proof that Cp is algebraically closed. By a universal p-adic field we
mean a complete, algebraically closed extension of Qp.
In this chapter Qa denotes a fixed algebraic closure of Qp.
1 • The Algebraic Closure Q^ of Qp
1-1. Extension of the Absolute Value
There is a canonical absolute value on Qap. Indeed, the absolute value of Qp extends
uniquely to Q" as the following observation shows. If Ki and A^ are two finite
128 3. Construction of Universal p-adic Fields
extensions of Qp in Qp, the two extensions (II.3.4) of the absolute value of Qp t0
these fields must agree on their intersection K\ f! K2 by uniqueness (II.3.3). Hence
all the extensions of the absolute value of Qp to finite subextensions of Q£ define
a unique extension of this absolute value to Qap. As a consequence, this algebraic
closure is an ultrametric field, and we set
Aa := the maximal subring of Q* : |*| < 1,
Ma := the maximal ideal of Aa : |*| < 1,
ka := Aa/Ma the residue field of Qap.
We shall see below that Qa is not complete, hence not locally compact. Moreover,
the residue field ka is infinite, and |(Qpx | is a dense subgroup of R>o- Hence none
of the conditions of (II.3.5) for local compactness are satisfied!
1.2. Maximal Unramified Subextension
We have seen in (II.4.3) that every finite extension K of Qp contains a maximal
unramified subextension: Since K is complete, the group fi^{K) of roots of
unity in K having order prime to p is isomorphic to the cyclic group kx of order
q — \ = pf — 1, where / is the residue degree of K:
Kur = Qp(^-l) = Qp(^-l) C K.
It is not difficult to generalize this result to the algebraically closed extension
Proposition. The residue field k° of the algebraic closure Qap of Qp is an
algebraic closure of the prime field Fp.
Proof. Since any algebraic element x e Qap generates a finite-dimensional
extension K of Qp, the residue field of K is also finite-dimensional over Fp. This
proves that the residue field of Q£ is algebraic over Fp. Conversely, if £ ^ 0 is
algebraic over Fp, it belongs to the cyclic group Fp(£)x and hence is a root of unity
of order m prime to p. Now consider the cyclotomic extension Qp(/xm) obtained
by adjoining to Qp all roots of unity of order m. Iff ^ rj are two such roots, then
If — r)\ = 1 and the reductions off and r) are distinct (cf. II.4.3). Hence the residue
field of Qp(/xm) contains m distinct mth roots of unity and contains f. ■
We shall denote by F£ = Fpoc = |J/>i *V an algebraic closure of Fp and by
(Qap)ur = Qp(/^(p)) C Q£ the extension generated by all roots of unity having
order prime to p. This is the maximal unramified extension of Qp in Q£.
Corollary- The residue field of the maximal unramified extension ofQp in Qap
is an algebraic closure of the prime field Fp. ■
1. The Algebraic Closure Qap of Qp 129
13. Ramified Extensions
One can give another reason for the fact that the extension Q£ has infinite degree
over Qp. Choose algebraic numbers ne = pl^e (e > 2). We have
KI = IpI = i/p, Ki = (i/p)1/e,
and consequently the ramification index of Qp(jre) is greater than or equal to e.
This proves that Qp has algebraic extensions of arbitrarily large degree. Indeed,
the polynomial Xe — p is an Eisenstein polynomial, and hence is irreducible (II.4.2)
in ZP[X] or QP[X]: This defines an extension of degree e of Qp. More generally,
if K is any finite extension of Qp (contained in Qp, it is locally compact, and we
can choose a generator n for the maximal ideal P of R. The polynomial Xe — n
is an Eisenstein polynomial, hence is irreducible in R[X] and K[X]y whence K is
not algebraically closed. These simple observations show that
I(QPX I D PQ = {Pv ■ v e Q} = (J p^z.
Proposition. The absolute values of algebraic numbers over Qp are fractional
powersofp:\(Qapr\ = p<*.
Proof. If x eQJ-Qp is any algebraic number not in Qp, it satisfies a nontrivial
polynomial equation
J2 aix1 - 0 (a{ e Qp)
of degree n > 2. By the principle of competition, there are two distinct indices,
say i > ji with
\aixi\ = \ajXJ\*0.
Hence
\x\l~j = \aj/ai\epz,
and \x\ e p(1/e)Z (e = i - j > 1). ■
1A. The Algebraic Closure Qa is not Complete
A complete metric space X is a Baire space: A countable union of closed subsets
Xn in X having no interior point cannot have an interior point. In particular, such
a countable union cannot be equal to X. Recall that locally compact spaces and
complete metric spaces are Baire spaces.
Theorem. The algebraic closure QL of Qp is not a Baire space.
130 3. Construction of Universal p-adic Fields
Proof. Let us define the sequence of subsets
Xn = {x e (?p : deg* - [Qp(x) : Qp] = n} CQJ (n > 1)
so that Qap = Un>iXn. It is also obvious that XXn C Xn (X e Qp), Xm-\-Xn c
Xmn, and in particular,
Xn -h Xn C Xn2.
(a) These subsets are closed. If x ^ 0 is in the closure of Xn, say x = limjc^
with a sequence (*,-) in Xn, then for each xt let /)(X) e Qp[X] be a polynomial
of least degree with jc, as a root and coefficients scaled so they lie in Zp and at
least one of them is in Z*. Extracting if necessary a subsequence of (ft), we can
assume that it converges (in norm, coefficientwise), say f• —> /, so / e ZP[X]
has degree less than or equal to n and at least one coefficient in Z*, so f(X) ^ 0.
By the ultrametric property, the convergence f~>f is uniform on all bounded
sets of Q^,. Since the convergent sequence (jcf) is bounded, we have
/(*) - fi(Xi) = f(x) - f{xg) + fin) - f{xg) -> 0.
v v < v v '
-►o -*o
This implies f(x) = lim/)-(*/) — 0 and x e Xn.
(b) The subsets Xn have no interior point. Since for any closed ball B of positive
radius in Qa we have Q£ = Qp • £, such a ball cannot be contained in a subset
Xn, and no translate can be contained in Xn. m
Corollary. The space Qa is neither complete nor locally compact. ■
1.5. Krasner 's Lemma
Theorem 1 (Krasner's Lemma). Let K c Q£ be a finite extension of Qp and
let a e Qap (so that a is algebraic over Qp). Denote by a° the conjugates of
a over K and put r = minfla^fl \aa — a\. Then every element b e B<r(a\ Q£)
generates (over K) an extension containing K(a).
Proof. Take any algebraic element b such that a g K(b). Since we are in
characteristic 0, Galois theory asserts that there is a conjugate aa ^ a of a over K(b)
(the automorphism a fixes K(b) elementwise) and we can estimate the distance
of a to b as follows:
\b-a°\ = \(b-af\ = \b-al
\a-aa\ < max(\a-b\.\b-aa\)^ \b - a\.
This shows that
\b ~a\ > \a-aa\ > r.
Hence if b e B<r(a), namely \b — a\ < r, we have
a e K(b), K(a) c K(b). ■
1. The Algebraic Closure Qap of Qp 131
Examples, (a) Take K = Q2 and a = V—T = 1. Then iCT = —1 and
r = |i-r| = |2i| = |2| = I.
Hence for fc g Q£,
|6-i|<i=»ieQ2(6).
(ft) Take K = Q3 and a = V^. Then a47 = - V^ and
r = \a - a°\ = |2V=3| = |V=3| = |3|1/2 = J\.
Hence for ft g Q3,
\b - v^3i < yr =» v^ g 03(6).
Recall that the norm of a polynomial /(X) = ^<fl anXn is the sup norm on the
coefficients ||/|| = maX|<„ \an\*
Theorem 2 (Continuity of Roots of Equations). Let K be a finite extension
of the p-adic field Qp and fix an algebraic element a e Qap of degree n over
K corresponding to a monic irreducible polynomial f G K[X] (of degree n).
There is a positive s such that any monic polynomial g G K[X] of degree n with
\\g — f\\ < s has a root b e K(a) also generating this field: K(b) = K(a).
Proof. Let us factorize the polynomial g in the algebraic closure Qap of K, say
g(X) = I1(X — bi), and evaluate it at the root a of /:
Y\(a-bi) = g(a) = g(a)-f(a).
With M = max0<,-<„ (\a\l) = max(l, \a\n) we can estimate
J] I* " *'l = teW - /<*)! ^ W8 - f\\ • ^,
hence for one index i at least,
l«-6.-|<IIS-/ll1/n-Af1>'1.
By the preceding theorem, if s > 0 is chosen small enough, then ||g — f\\ < s
will imply K(b{) D #(#) for some i. But the degree of bi is less than or equal to
1, since it is a root of the nth degree polynomial g e K[X]. This proves #(&,-) =
K(fi). m
Corollary 1. Lef / G #[X] be a monic irreducible polynomial, a e Qap a root
off, and (g,-)i6N # sequence of monic polynomials with coefficients in K of the
same degree as f. If gi —> f (coefficientwise), there is a sequence (xt) of roots
of these polynomials such that xi G K (a) for large i and Xi —> a.
132 3. Construction of Universal p-adic Fields
Proof. As soon as \\gi — f\\ < s is small enough, the above result is applicable
and shows that \a — x{ \ is small for at least one root jcf- of g,. More precisely, the
inequality
shows that \a — Jcf-1 -> 0, and the convergence xt -> a in AT (a) follows. ■
Corollary 2. The algebraic closure Qa 0/ Qp is a separable metric space.
Proof. Take aeQap and let / be its minimal polynomial over Qp. Since Q is
dense in Qp, we can find monic polynomials g e Q[X] as close to / as we want
If we choose a sequence gn —> /, the continuity principle for the roots shows
that a is a limit of roots xn of the polynomials gn. This shows that the algebraic
closure of Q is dense in Qap. But this algebraic closure is a countable field since
the set of polynomials of fixed degree with coefficients in the countable field Q is
countable. ■
1.6. A Finiteness Result
In the last two sections of this chapter, let us prove a couple of theorems easily
obtained with the techniques developed above. (We shall not need them in the
sequel.)
Theorem. Let K be a finite extension of Qp and n > 1 an integer Then there
are only finitely many extensions of K of degree n in Qa.
Proof. (1) Let F be an extension of degree n of K and let e be its relative
ramification index, / its residue degree: n = ef. The cyclic subgroup consisting of
roots of unity in F having order prime to p is isomorphic to the cyclic group
of nonzero elements in the residue field of F (11.4,3). These roots generate the
maximal unramified subextension Fur of K in F,
[FUT :K] = f
(II.4.4), and the extension F/Fm is totally ramified of degree e. If the residue
degree / is given, there is only one unramified extension of degree / of K in
Q£. Hence the announced result will be established as soon as the same finiteness
property for totally ramified extensions is established.
(2) Let us show that there are only finitely many totally ramified extensions of
given degree n = e of K. Fix such an extension F and \etK^R^P=^7TR
(conventional notation). By (II.4.2, II.4.4) it is generated by an element having
minimal polynomial
Xn+an^Xn~l+--- +w07r,
1. The Algebraic Closure Qap of Qp 133
which is an Eisenstein polynomial. Its coefficients at belong to P, and uo e R* is
a unit: uo e K and |w0| = 1- The Cartesian product
Pn~l x /?*
is compact, and by continuity of the roots of equations (1.5), each element of
this product has an open neighborhood corresponding to polynomials having
their roots generating the same extension F in Qa This completes the flniteness
proof. ■
1.7. Structure of Totally and Tamely Ramified Extensions
It is possible to improve the result (II.4.2) concerning the generation of totally
ramified extensions.
Theorem. Let K C L C Qap be finite extensions of Qp. Assume that L/K is
totally and tamely ramified of degree e. Then there exists a generator n of the
maximal ideal P of R C K such that L is generated by an eth root ofn in Q£.
Proof. By assumption e = [L : K] is prime to p. The proof will be accomplished
in three steps.
(1) Consider arbitrary generators n of P C R C K and ni of Pl C Rl C L.
Since L/K is totally ramified of degree e, \nL\e = \n\ and nl/n = u is a unit in
Rl- Since the residue degree of L/K is 1, the residue fields are the same, and there
is a unit f of R (one can take a root of unity in K) such that f = u (mod P)l.
Let us write
7t£=7t-h, u = i;+7rLv (v e RL)-
Hence
7VeL = 7T • (f -h 7TLi;) = f 7T + 7T7TLi;.
The element f 7r is also a generator of the ideal P of R. We are going to show that
L is generated by a root of the equation Xe — f 7r. Let us replace the generator it
by 7r' = ^ an(j simply denote it by n again. Thus we assume from now on that
*he generators ttl and 7r are linked by a relation
7TeL= 7T -\- 7T7TLV (v G ftjj.
(2) The polynomial / = Xe — 7r is an Eisenstein polynomial (II.4.2) of /?[X].
Hence it is irreducible over K[X\ We have
f{7TL) = 7TeL - 71 = 7T7TJ.U, |/(7TL)| = |7T7TLi;| < |7r|.
Let us factor/ in Q£:
f(X) = Xe-n= YliX-a,)
134 3. Construction of Universal p-adic Fields
(where f|af- = ±tt). Since / is irreducible, the roots a,- are conjugate and have
the same absolute value in Qp, say |a,-| = c independent of i. Hence
Ce=Y[\*i\ = \7Tl
\<*i\=C=\7T\1'e = \7rLl
and
\nL - cti\ < maxda,-!, |ttl|) = |ttl|.
If we come back to the polynomial /, then
n^-a'>
i<i<c
= i/(^)i<i^i = i^r
shows that at least one of the factors is smaller than \ttl\. Without loss of generality
we may assume
\nL-<xi\ < \7tL\.
(3) The roots of f(X) = Xe - n = 0 are the at = ff-a, where £/ = 1. Since e
is prime to p, we have |f,- — 11 = 1 when £,- ^ 1 by Proposition 1 in (II.4.3). This
proves
\a(-a\ = \a\ = c=\nL\ (i ^ 1),
\nL - a| < |ttl| = 1« - a, | (i ^ 1).
By Krasner's lemma, we infer that
K(a) c K(nL),
and since the element a has degree e, this inclusion is an equality. ■
Example. If we add a primitive pth root fp of unity to Qp, we obtain a totally
ramified extension K of degree p — 1. Hence AT/Qp is tamely ramified and can
be generated by a (p — l)~th root of the generator —p of /?Zp.
For /? = 3, we have seen in (II.4.6) that b = yf^% works:
Q3fe) = Q3(V^3).
2. Definition of a Universal p-adic Field
2.7. More Results on Ultrametric Fields
Let us start with a couple of general results concerning (nondiscrete) ultrametric
fields.
2. Definition of a Universal p-adic Field 135
Proposition 1. Let K be an ultrametric field and K its completion. Then K is
still an ultrametric field and
(a) \K\ = Ijfl,
(b) K and K have the same residue field.
Proof. Let A be the ring of Cauchy sequences in K. The ideal I of A consisting
of Cauchy sequences a = (an) with an —»► 0 (also called null Cauchy sequences)
is a maximal ideal: If an -/> 0, then an / 0 except for finitely many indices n
and a is invertible in the quotient A/1. We can define K = A/1 with a canonical
injection K c-> K given by constant sequences. If a = (an) e A — I is a Cauchy
sequence that is not null, the sequence (\an\) is stationary (stationarity principle),
and we define an absolute value on K by
\a\ = lim \an\ e \KX\ C R>o fora ^ 0 and |0| = 0.
Obviously, the canonical injection K c-> K is an isometric embedding, and we
view AT as a subfield of AT: The absolute value of K extends the absolute value
of K. The residue field k of K parametrizes the open unit balls £<i(tf) (a = 0
or \a\ = 1) contained in the closed unit ball: &x parametrizes the open unit balls
contained in the unit sphere Si = {x e K : \x\ = 1}. Any Cauchy sequence of the
closed unit ball has all its final terms in an open unit ball; hence it corresponds to
a fixed element in the residue field k. ■
An extension L of an ultrametric field K having same residue field kL = k and
the same absolute values \L\ = |K\ is called an immediate extension of K. Hence
the completion of K is an immediate extension of K.
Proposition 2. Let K be a nondiscrete ultrametric field and put
A = {x e K : |jc| < 1} : maximal subring of K
M = {x e K : |jc| < 1} : maximal ideal of A.
Then, either M is principal, or M = M2 and the ring A is not Noetherian.
Proof. By hypothesis T = \KX\ / {1}, and either T fl (0, 1) has a maximal
element 0 or it has a sequence tending to 1. In the first case we can choose it e M
with \tt I = 6, and M = n A is principal. In the second case, for each x e M, namely
1*1 < 1, we can find an element y such that |jc| < \y\ < 1, so that
* = y (*/y) e m2.
Since v and jc/y belong to M, this shows that x e M2, and we have proved M = M2.
In this last case, the subgroup T = | Kx \ is dense in R^.0, and all the ideals
Ir = B<r = B<r(0- K) = {xeK : \x\ < r]
for r e T fl (0, 1) are distinct: The ring A is not Noetherian.
■
136 3. Construction of Universal p-adic Fields
Proposition 3. With the same notation as before:
{a) IfK is algebraically closed, so is the residue field k.
(b) IfL is an algebraic extension ofK, the residue field ki ofL
is also an algebraic extension of the residue field k of K.
Proof. In any ultrametric field, |£ | > 1, \ax\ < 1 (i < n) implies
in>iri>k£'l ('"<")>
Ifl" >|J>£'I,
i<.n
and hence
r + X>ri = l£l">l,
i<.n
r + X^-rX).
This proves that any root of a monic polynomial having coefficients \a-t | < 1 belongs
to the closed unit ball |*| < 1.
(a)LetXn +Yli*znaiXl ek[X] be a monic polynomial of degrees > 1.Choose
liftings at e A of the coefficients, i.e., at =aI- (mod M), and consider the monic
polynomial
Since the field K is algebraically closed, this polynomial has a root x e K. By the
preliminary observation, x e A and x mod M is a root of the reduced polynomial
Xn + Yli<n atXl - This proves that k is algebraically closed.
(b) Let 0 / f e kL and choose a representative x e AL — ML of the coset
£ ^ 0: |jc| = 1. By assumption, this element is algebraic over K, and hence x
satisfies a nontrivial polynomial equation
]T^jc"=0 (n> 1, ai e K).
By the principle of competitivity, at least two monomials have maximal competing
absolute values
\at\ = \atxl\ = \ajxj\ = \aj\ for some/ < j.
Dividing by a(, we obtain a polynomial equation with coefficients \ark \ < L ark. £ ^
and at least two of them not in M. By reduction mod M we get a nontrivial
polynomial equation satisfied by £. •
2. Definition of a Universal p-adic Field 137
22. Construction of a Universal Field Qp
Let R be the normed ring £°°(Qp) consisting of bounded sequences a = (tt,-),-€N of
Q" with the sup norm
IMI =supI-6N|aI-|.
Let us also choose and fix an ultrafilter U on N containing the subsets [n, oo)
(n e N). (Readers not familiar with ultrafilters can find all definitions and properties
used here in the Appendix to this Chapter.) Recall that for each subset AcN either
A e U or Ac = N — A e U. On the other hand (here is the reason for choosing an
ultwfiltet), each bounded sequence of real numbers has a limit along U, and we
put
<p(a) = lim|a£-| > 0.
Proposition 1. The subset J = <p~~l{Q) is a maximal ideal of the ring /?, and
the field Qp = R/J is an extension of the field Qap.
Proof. Let us show that each element a g J is invertible mod J. But if a = (an)
is not in the ideal J, the limit r = <p{a) > 0 does not vanish, so we can find a
subset A e U such that r/2 < |a(-| < 2r (i e A). Define a sequence p — (fit) by
f$i = — for i e A and pt = 0 for i g A.
Since |#| < 2/r (i e A), the sequence & is bounded fi < 2/r and p e R.By
construction 1 — aft vanishes on the set A, hence 1 - a/3 e J. This shows that
a mod J is invertible in the quotient Qp. Consequently, the quotient is a field,
ar*d J a maximal ideal of R. Finally, constant sequences provide an embedding
QJ-SV -
The map <p defines an absolute value on the field Qp: For a = (a mod J) we
put
\a\ = |fl|n = cp(a) = \im \cti\.
ls absolute value extends the absolute value on Qa (considered as a subfield of
p through constant sequence).
r°P°sition 2. The absolute value \. \q coincides with the quotient norm of
R/J, namely for a = {a mod J\
\a\Q = \\a mod J\\R/J := inf \\a - p\\.
138 3. Construction of Universal p-adic Fields
Proof. We have lim^ \yA < sup \yt\(y e R), and hence
lim \at | = lim la, - ft | < sup \at - ft | (P e J),
Mn<l|a-0ll tfej).
This proves
Mo < l|fl|li?/j-
Conversely, if a — a mod J\ then for any subset A e U we can define the sequence
£ = (ft)asft=0 (i € A) and #=<*,■ (i /<= A) so that P e J and ||a - p\\ =
suPi€>\la«l and
ll«ll/?/j < inf sup |af| = lim sup|a, | = lim |a(-| = |fl|n. H
AeU ieA U
From now on we shall simply write \a\ = \a\Q for either the absolute value on the
field Qp or the quotient norm in R/J.
Proposition3. We have
|fi£|=R>o.
Proof. This is a simple consequence of the fact that |Q^ | is dense in R>o- Indeed,
each positive real number r is limit of a sequence (r„) of elements rn e \Qap\, say
rn = l««l («„ G Qp, so that the sequence a is bounded and defines an elements
in the quotient Qp with \a\ = r. ■
Comment. This construction of Qp is reminiscent of nonstandard analysis. Let
X =Qap and in the Cartesian product XN introduce the equivalence relation
(*n) ~ (y„) <=> {n € N : xn = yn} € W.
The quotient *X := XN/~ is an ultrapower of X (as systematically used in
nonstandard analysis, in the construction of superstructures). The subset bX consisting
of classes of bounded sequences is the set of limited elements in this ultraproduct
*X, and the classes of sequences tending to zero (along U) are the infinitesimal
elements lX C bX. The quotient bX/'X = R/J = Qp has more simply been obtained
in one step.
2.3. The Field Qp is Algebraically Closed
Let / e £2P[X] be a monic polynomial of degree n > 1, say
2. Definition of a Universal p-adic Field 139
We show that this polynomial / has a root in the field £2p. Select representative
families for the coefficients:
cik = (dki)t mod J.
We can consider the polynomials
fi(X) = Xn+J2<xkiXi eQap[X].
k<zn
Since the field Q* is algebraically closed, each of these has all its roots in Q*.
More precisely, the product of the roots of f is (up to sign) the constant term aQi
of this polynomial, so that we can choose at least one root £■ with £,- < |aof-|1/n.
The sequence f = (£,) is bounded ||f || < ||floll1/n, £ £ /?, and the class * of f is a
root of/in ^p. ■
2.4. Spherically Complete Ultrametric Spaces
Consider a decreasing sequence (B<rn(an))n>o of closed balls in an ultrametric
space X:
d(aj, an) < r„ for all pairs i >n.
When r„ \ 0, the sequence of centers is a Cauchy sequence; hence it converges if
we assume that the space X is complete. The limit of this sequence belongs to every
B<rn(an) (these balls are closed). In particular, this shows that the intersection of
the sequence is not empty.
At first, it seems surprising that even in a complete space, a nested sequence
of closed balls may have an empty intersection when the decreasing sequence of
radii has a positive limit. Consider, however, the following situation. In the discrete
space N with the ultrametric distance d(n, m) = 1 — 8mn, the decreasing sequence
of closed sets Fn = [n, oo) has an empty intersection (they all have diameter equal
to 1). This space is complete (it is uniformly discrete), and a small modification of
the metric (cf. the exercises) transforms these sets Fn into closed balls of strictly
decreasing radii.
Definition. An ultrametric space X is called spherically complete when all
decreasing sequences of closed balls have a nonempty intersection.
A spherically complete space X is complete: If (jc„) is any Cauchy sequence of
*» consider the decreasing sequence (r„) where rn = supm>„ \xm — xn\ (which
converges to 0). Then (B<rn(xn)) is a decreasing sequence of closed balls having
0r intersection a limit of the sequence.
Comment. Any extension of an ultrametric field K which has the same residue
eld and the same value group (in Rx) is called an immediate extension of K. It
Can De proved that each ultrametric field admits an immediate extension that is
Pnerically complete. For example, there is a spherically complete extension of
140 3. Construction of Universal p-adic Fields
Q* that has residue field Fp™ and value group /?Q. In fact, spherically complete
extensions are maximal elements among extensions having prescribed residue field
and value group.
2,5. The Field Qp is Spherically Complete
Let us consider any decreasing sequence (Bn)n>G of closed balls Bn = B<rn(an)
in the field Qp. The ultrametric inequality shows that such a sequence of balls
decreases if
\an+i — an\ <rn and (r„) decreases.
Take liftings an e R of the centers an e R/J in the following way. Since
\an+\ ~an\ < r„ < r„_i
and since the absolute value is the quotient norm, we can proceed by induction and,
once an has been chosen, successively choose the next lifting an+\ still satisfying
||a„+i — an || < r„_j. Then ||a* —an || < r„_i for all k > n. The ith component will
a fortiori satisfy |a^ — am | < r„_i (k > «). Consider now the diagonal sequence
t = (&) in /? defined by §, = <*„-. Then
II? ~ c*„|| < sup |£ - «m-| < r„_i
because the interval [«, oo) of N belongs to the ultrafilter U, whence for x = £
mod ^7,
|x-fl„| < Ht-cUl <r„_!,
I* ~«„-il <max(|x ~an\,\an - an^\) < r„_i,
namely x e Bn_i- Since this happens for all integers n > 0, we infer x e p| Bni
and the intersection of the given decreasing sequence of balls is not empty.
The field Qp is spherically complete, hence complete.
3. The Completion Cp of the Field Qap
3.7. Definition of Cp
Let us define
Cp = Q£ = closure of Q£ in Qp.
Hence Cp is a completion of Q£:
Proposition. The field Cp is a separable metric space.
3. The Completion Cp of the Field Qap 141
Proof. The algebraic closure Q£ of Qp is a separable metric space (by Corollary
2 in (1.5)) and is dense in Cp. Any countable dense subset of Qap is automatically
dense in Cp: For example Qa is dense in Cp. m
The universal field Cp is not locally compact: |C*| = pQ = {pv : v e Q} is
dense in R>o- We shall use the following notation
Ap = {x e Cp : \x\ < 1}: maximal subring of Cp,
Mp = {x eCp : \x\ < 1}: maximal ideal of Ap.
Hence Mp = M2p, and Ap is not a Noetherian ring (2.1).
3.2. Finite-Dimensional Vector Spaces over a Complete
Ultrametric Field
Let us formulate and prove a generalization of (II.3.1) (cf. Theorem 2 in (II.A.6)
for the most general version).
Theorem I. Let Kbea complete (nondiscrete) ultrametric field and V a finite-
dimensional vector space over K. Then all norms on V are equivalent.
Proof. We use induction on the dimension n of V. Since the property is obvious for
« = 1, it is enough to establish it in dimension n assuming that it holds in dimension
« - 1. Choose a basis (e,-)^-^ of V and consider the vector space isomorphism
V • Kn -> V sending the canonical basis of Kn onto the chosen basis of V.
Considering that Kn is equipped with the sup norm, we have to show that for any
given norm ||. || on V, the mapping <p is bicontinuous. First, for x = (jc,-) e Kn,
we have
llxie! + • - • + x„e„|| < £ tallfe-1| < max |x,-| ■ E||e,-||,
||p(x)|| < C||x|| (C = E|k||)f
which proves the continuity of the map (p. Conversely, let F be the subspace of
generated by the last n — 1 basis vectors. Since the dimension of F is n — 1,
toe induction hypothesis shows that on this subspace, the given norm is equivalent
to the sup norm of the components. In particular, F is complete and closed in V.
lnce e = et ^ F, we can define
rf(e,F)=inf||e-y||>0
yeF
andputy =d(e, F)/||e|| < 1. By the induction hypothesis, there is also a constant
c^ such that
llyll > cF ■ max \xt\ (y = E2</<:„xIeI g F).
142 3. Construction of Universal p-adic Fields
For each v = <p(x) e E — F, say v = £e -f~ y (£ ^ 0, y e F), we can write
v = *(e + y/$)
with
IN = l5l-||e + y/?|| = g|-||e-y/H
>|f|-^(e,F) = |f|-y||e|| = y.||fe||,
and hence
llyll = llv-?e|| <max(||v||,||fe||) < max(||v||, y^lMI) = IMI/y
(since y < 1). This shows that ||v|| > y ||y||. We have thus proved
IMI>y||?e||, ||v||>y||y||,
l|v||>ymax(||$e||,||y||),
and since ||y|| > Cfmaxi>2 |x,-|> we have
llp(x)|| = ||v|| > y maxflf|||e||, cFmax,->2 1^1)
> cmax,->! \xi\ = c- ||x||,
with *i = £ and c = cy = y min(c/r, ||e||). ■
Corollary. If K is a complete (nondiscrete) ultrametric field and L is a finite
extension of K, there is at most one extension of the absolute value of K to L.
Any K-automorphism ofL is isometric.
Proof. Same as in (II.3.3). ■
We can now give Krasner's lemma (L5) in a more general form.
Theorem 2. Let fi be any algebraically closed extension of Qp and K C®
any complete subfield. Select an algebraic element a(e £2) over K and denote
by a° its conjugates over K. Let r = rrmv^fl \a° — a\. Then every algebraic
element b over K, b e B<r(a), generates with K an extension containing
K(a).
Proof. We can proceed as in (1.5), since we now have uniqueness of the extension
of absolute values for finite extensions of K. For any algebraic element b such
that a £ K(b), a has a conjugate a° ^a over K(b) (the automorphism o leaves
all elements of K{b) fixed), and
\b-au\ = \U>-ay\ = \b-al
\a - aa\ < maxfla - b\, \b -aa\) = \b- a\.
3. The Completion Cp of the Field Q* 143
Hence
\b~a\ > \a -a°\ > r.
Taking the contrapositive, \b — a\ < r=>a e K(b) and K(a) C #(£). ■
33. The Completion is Algebraically Closed
Theorem. The universal field Cp is algebraically closed.
Proof. Let L (C £2p) be a finite — hence algebraic — extension of Cp. We can
apply the general form of Krasner's lemma to the extension Cp C Qp, since we
already know that
• the field Cp is complete,
• the field Qap is algebraically closed,
• he field Qp has an absolute value extending the p-adic one.
Assume that L = Cp(a) is generated by an algebraic element a of degree n > 1
and let / e Cp[X] be the monic irreducible polynomial of a. By density of the
algebraic closure Q* of Qp in Cp, we can choose (1.5) a polynomial g e Qap[X]
sufficiently close to / in order to ensure that a root of g generates L over Cp. But
Q£ is algebraically closed, so that g has all its roots in Q£, and this proves that /
has degree \\L = Cp. m
Comment. We have not used the possibility of extending the absolute value of
Cp to finite extensions of this field, since we work in the field Qp constructed
in (2.2). The general possibility of extending absolute values for finite (algebraic)
extensions—where the base field is not locally compact—involves other algebraic
techniques.
3*4. The Field Cp is not Spherically Complete
Proposition. The universal field Cp is not spherically complete.
tttoOF. Here is an argument showing the existence of strictly decreasing sequences
01 closed balls of Cp having an empty intersection (without explicitly constructing
one such sequence!).
Let rn ~~> r > 0 be a strictly decreasing sequence of T = p® = |C£ ]
ro > n > ■ ■ ■ > r„ > - • - > lim rn = r > 0.
n lhe ball B = £<ro(0) we can choose two closed disjoint balls Z?o and B\ with
e same radius r\ < r0. In each of these we can select two closed disjoint balls of
radii'"2<r1,say
B(q and Bt\ closed and disjoint in Bt.
144 3. Construction of Universal p-adic Fields
Continuing these choices, we define sequences of closed balls having decreasing
radii given by the sequence (r„) and satisfying in particular
Bt D Bij D • • • D Btj...k D £,7. .*/ D • - -
(with multi-indices equal to 0 or 1). By construction, two balls having distinct multi-
indices of the same length are disjoint. If (i) = (i"i, /? , - - -) is a binary sequence we
can define
Such an intersection is either empty or is a closed ball of radius r = lim r„ having
for center any element in it, as always in the ultrametric case. In any case, all B^
are open subsets of Cp (this is where r = lim r„ > 0 is used). If two sequences (i)
and (j) are distinct — say in ^ jn — then by construction /?,-,...,-„ and Bjy..jn are
disjoint, and a fortiori B^ C B^...^, B^ c #;,--./„ are disjoint. Since the metric
space Cp is separable, the uncountable family of disjoint open sets (£(i>) can only
be a countable set of distinct open sets (any countable dense subset must meet all
nonempty open sets). This forces most of the B^y to be empty! ■
A pictorial representation of the preceding proof is sketched in the exercises.
3.5. The Field Cp is Isomorphic to the Complex Field C
The result of this section will not be used in this book. It gives the answer to a
natural question, namely: What is the algebraic structure of the field Cp?
Let us start by the determination of the cardinality of the field Cp.
Lemma. The field Cp has the power of the continuum.
Proof. The unit ball of Qp is Zp ~ rL>o(0> 1, -. -, p — 1}, hence has the power of
the continuum c: numeration in base p gives a 1-1 correspondence with [0, 1] C R
except for contably many overlaps, so these sets have the same cardinality. (In
fact, each Zp is homeomorphic to the Cantor set: Exercise 13 of Chapter I.) The
field Qp itself has the same power, since it is the countable union of balls pm^p
(each having cardinality c). All finite extensions of Qp have the same power. The
algebraic closure Q^ of Qp still has the same power (the ring of polynomials in
one variable over Qp has also the power of the continuum). Finally, a countable
product (QpN cannot have bigger cardinality. Such a product contains all Cauchy
sequences of Q" and
Card(Cp)<Card((QpN) = c. ■
Recall the terminology used for field extensions. A transcendence basis of a field
extension Kfk is a family (X)^/ in K such that
3. The Completion Cp of the Field Qap 145
the subfieldk{Xt)lQi C K is a purely transcendental extension ofk,
and K/k(Xi)iei is an algebraic extension.
Here are some general results of Steinitz concerning field theory:
Two algebraic closures of a field k are k-isomorphic.
Every field extension K/k has a transcendence basis.
Two transcendence bases of K/k have the same cardinality.
For example, let Qa be the algebraic closure of Q in Cp and Qb the algebraic
closure of Q in C. Then there is an isomorphism
Q^Q*.
These fields are countable. But the fields Cp and C have the power of the continuum,
hence the same transcendence degree (over the prime field Q or its algebraic
closure).
Theorem. The fields C and Cp are isomorphic.
Proof. Any extension of the rational field Q having the power of the continuum
has a transcendence basis having this cardinality. By the above lemma the
transcendence degrees of C and Cp over Q (or its algebraic closure) are the same, and
we can select transcendence bases (X)iGl in C and resp. (Y)ieJ in Cp (indexed by
the same set). Now, C is an algebraic closure of Q(X,-)IG/ and Cp is an algebraic
closure of Q(y()iG/. Hence these two algebraic closures are isomorphic. ■
As a consequence, we can view the field Cp as the complex field C endowed with
an exotic topology. But the preceding considerations do not lead to a canonical
isomorphism between these universal fields: The axiom of choice has to be used
to show the existence of such an isomorphism.
Reld D £<! d £<, Residue field Nonzero |. | Properties
QP D Zp d pZp Fp pz locally compact
9 w H ' ' ' H [ locally compact
QJDA-3M- *-=K = Fp« P« [algebraically closed
p p p I not locally compact
CP3APDMP F»=F^ p« (algebraically closed
p p p p ** I complete
o -n a _ ,, kr> _ f algebraically closed
uncountable [ sphencally complete
146 3. Construction of Universal p-adic Fields
4. Multiplicative Structure of Cp
4.1. Choice of Representatives for the Absolute Value
Definition. Let G be an abelian group written multiplicatively and n > 2 an
integer We say that
1. G is ^-divisible if for each g e G, there is x e G with xn = g,
2. G is uniquely ^-divisible if for each g e G, there is a unique x e G
with xn = g,
3. G is divisible if it is n-divisible for all n > 2.
A simple application of Zorn's lemma will show the possibility of extending all
homomorphisms having a divisible group as target.
Theorem. Let G be a divisible abelian group. For each abelian group H and
homomorphism (p : Ho —► G of a subgroup Ho C H, there is a homomorphism
ifr : H —> G extending (p.
Proof. Consider all homomorphisms Ho C H' —> G (//' is a subgroup of H
containing Ho) extending a given homomorphism <p : //0 —> G. There will be a
maximal one ^ for the order relation given by continuation: Every totally ordered
set of extensions has an upper bound, defined in the obvious way on the union
of the increasing chain of subgroups. I claim that the domain of such a maximal
homomorphism is the whole group H. Indeed, if the domain of an extension y is
a proper subgroup H' C H, let us show that it is not maximal. For this purpose,
select any element g e H,g g H' and consider the subgroup H" generated by H'
and g, namely the image of the homomorphism
(£, x') h> gV : Z x H' -+ H.
When the only power of the element g that lies in H' is the trivial one, the subgroup
H" is isomorphic to Z x H', and an extension of (pr is given by
<p"{glx') := <p'{x').
If other powers of g lie in H\ the inverse image of Hr by the homomorphism
I m> gl : Z -* H is a nontrivial subgroup mL C Z (m > 0) (in other words, gm lS
the smallest positive power of g in //')- In this case, we choose an mth root ^^
of <p'{gm) e G such that zm = ^'(£™). We can define the extension y?" : H"-* G
by
?V*'):=*V(*').
This is well-defined because if gilx[ = gllx^ (x[ e //'), we have
gl^=x'2{x\T'eH';
4. Multiplicative Structure of Cp 147
hence l\ — £2 = km is a multiple of m and
*>V'~'2) = <p'{gkm) = ?>'(gm)* = (zm)* = zm*,
*>'(x>'(xjr1 = v'(4(^)_1) = *>V'-'2) = ^'-'2,
and finally
zV(*i) = zV(*2>- ■
Remarks. (1) For an additively written abelian group G, divisibility requires that
all equations nx = a (x e G,« positive integer) have (at least) one solution x e G,
hence the terminology. For example, the additive groups Q and R are divisible,
but Z is not a divisible group.
(2) An abelian group G having the extension property mentioned in the statement
of the theorem is called injective group or injective Z-module.
Application. The universal field Cp is algebraically closed; hence the
multiplicative group C* is divisible. The homomorphism <p : Z —> C* defined by
(p(n) = pn e C* has an extension \j/ : Q —* C*. This extension is one-to-one,
since its kernel is a subgroup of Q with ker ^ nZ = {0}. The image of ^ is a
discrete subgroup V C C* isomorphic to the subgroup /?Q C R>o- Instead of \^(r)
we shall often write pr e Cp and ^r(Q) = /?Q. But — although the notation does
not emphasize it — this subgroup /?Q c C * depends on a sequence of choices of
roots of p in Cp and is not canonical. When we consider /?Q as a subgroup of C *,
we have to remember that \pa\ = \jpa > 0. This subgroup is a complement to
the kernel
U(l) = {xeCp:|x| = l}cC;
of the absolute value. In particular, we have a direct product decomposition
c; = r-u(i) = pQxU(i)
(analogous to polar coordinates in Cx) given by
x = r - (x/r) h> (|x|, x/r) (r e T, |x| = |r|, x/r e U(l)).
aince both Ap and Mp are clopen subsets of the metric space Cp, the subgroup
(*) = Ap — Mp is clopen and the preceding product is a topological isomorphism.
4-2- Roots of Unity
nrst analysis of the structure of the group of units
U(1) = AP-MPCC;
148 3. Construction of Universal p-adic Fields
is made by looking at the reduction mod Mp. The restriction of the (ring) homo-
morphism
e :Ap-+ Ap/Mp^Fpoo
(where Fp°c is an algebraic closure of Zp/pZp = Fp) to units gives a surjective
(group) homomorphism (II.4.3)
e:U(l)->F^
with kernel £ (1) = 1 + Mp C U(l), whence a canonical isomorphism
U(l)/(l+Mp)^F^.
In the algebraically closed field Cp, we can find roots of unity of all orders, so
that /jL = /x(Cp) is isomorphic to the group of roots of unity in the complex field.
There is a canonical product decomposition of this group,
fji = /X(p) - fip°o (direct product),
where /X(p) is the subgroup consisting of the roots of unity of order prime to /?, and
fipcc the subgroup consisting of the pth power roots of unity (in Cp).
The restriction of the reduction homomorphism £ gives an isomorphism of this
subgroup /X(P) with Fie, and hence a direct product decomposition
u(i) = /i(p).(i + Mp)cc;.
On the other hand,
/v*c (i+mp)hq;.
Let us recall the more precise result established in (II.4.4).
Theorem. Let f e jjlp^ c Cp be a root of unity having order p* (/ > 1). Then
l?-H = lp|1Mp/)<l (^(p') = Pr""I(p-l))- ■
For a subextension K of Cp, the link with the notation used in (II.4.3) is
fjL(p)(K) = /X(P) n K: roots of unity (in A^) having order prime to /?,
fjipoc(K) = fipoc n K: pth power roots of unity (in A^).
4.3. Fundamental Inequalities
In the preceding section (4.2) — based on II.4.4 — we recalled the estimates for
absolute values of pth powers. Such estimates form a recurring theme of p-adic
analysis, and we give a few more precise forms of these estimates for convenient
reference. The first one is purely algebraic.
4. Multiplicative Structure of Cp 149
Fundamental Inequalities: First form. Denote by I = (/?, 7) the ideal of the
ring Z[7] generated by the prime p and the indeterminate 7. Then
(1 + T)pn - 1 e 7 • /" (/! > 0).
Proof. For n = 0, the assertion is a tautology, and we proceed by induction on
n > 0. Assume that (1 + T)p" = 1 + Tu for some u e /". Hence
(1 + T)p"+1 = (1 + W = 1 + /?7Wv + Tpup
for some polynomial i; e Z[T]. But
p7m e t Pin<z r./n+1,
7^m^ = 7 • 7^~ V E 7 • In+[
(since /? > 2), and the sum pTu + 7^mp belongs to 7 - In+1 as expected. ■
Let us replace the indeterminate 7 by an element t e ApdCp. Since each
element in /" is a sum of terms containing factors plTn~~l for 0 < i < n, the
ultrametric inequality shows that all elements obtained have an absolute value
smaller than or equal to the maximum of | pl t"~~l \, and we see that we have obtained
the following inequality.
Fundamental Inequalities: Second form. Lett e Cp, \t\ < 1. Then
1(1+ t)pn - 1| < |r| - (max(|f|, |p|))» (n > 0)
(cf.(V.4.3)). ■
Other forms are often used (they are not completely equivalent to the preceding
°nes, but also admit useful applications). We mention them briefly.
Third form. Let K be a finite extension ofQp, K D RD P. Then
(1 + P)p" C 1 + Pn+l (n > 0).
" P = nR and \n\ = 6 < 1 (generator of the discrete group \K*\ c R>o),
^en in K the announced inclusion is equivalent to
|r| <6>=>|(l+r)^-l| <6>"+1. ■
nis third form follows from the first one (replace 7 by n) but is less precise than
*e second form because
peP, \p\=9e
and0 = lpp>|p|if^> 1.
150 3. Construction of Universal p-adic Fields
Fourth form. With the same assumptions as in the third form, we have
(1 + tf = 1 + nt (mod pntR)
ift e 2pR (neN, Z or even Zp). ■
If we look at the first term only in the expansion
(1 + tf - 1 - nt = n(n - l)f2/2 + • - -,
we find that for t/2 e pR,
n(n - \)t2 x t
= (n — 1) • nt • - e nt • pR.
2 2
It only remains to check that the next terms are not competitive. Since we shall
not need this form before Chapter VII, we refrain from giving a proof now. It will
be obtained by a general method in (V.3.6). ■
4A. Splitting by Roots of Unity of Order Prime to p
We have a direct product decomposition (4.2)
U(l) = wP)X(l+Mp)
of the multiplicative subgroup defined by |x| = 1 in C* The corresponding
projection U(l) -^ /X(P) is the Teichmiiller character. It can be made explicit in
several forms. Let |x| = 1 and K = Qp(x) have residue degree /. The residue field
k = R/P of K has order q = pf, and the reduction homomorphism £ sends the
given unit x to an element £(x) e F* of order dividing q — 1 (II.4.3). Hence
£{xf~l = 1, xq~l = 1 (mod P).
The fundamental inequality (second form) shows that the pth powers of xq~~l =
1 +1 (f e P C K or t e Mp C Cp) tend to 1:
A fortiori, taking n = fm,
x^
Say x^+l = x^(l + £m) where £m -► 0. Hence xq^ - x*" = x?mem -+ 0,
and the Cauchy sequence (xqm )m>o has a limit f in the complete (locally compact)
field K C Cp. Obviously, C* = £ and
f = lim xqm = x + (x* - x) + (**' - xq) + - • • - x (mod />).
4. Multiplicative Structure of Cp 151
The map
x m» f = co(x) — lim xq
m—>oo
defines a homomorphism U(l) Pi Kx -> /x^_i C K* that corresponds to the
projection on the first factor in the direct product decomposition (II.4.3)
U(i)nrs/iH x(i + P).
It is possible to give a formula working independently from the residue degree of
x e U(l). Indeed, if q is given, the subsequence (xp") has an end tail in (xgm).
We have obtained the following result.
n'
Theorem. Let x e Cp with \x\ = 1. Then the sequence (xp ") converges to the
unique root of unity that is congruent to x (mod Mp) and the homomorphism
a): x y~> ^ = co(x) = lim xpn
m—>oo
corresponds to the projection on the first factor in the direct product
decomposition
U(l) = /z(p)x(l+Mp). ■
4.5. Divisibility of the Group of Units Congruent to 1
In this section we investigate the divisibility properties of the multiplicative group
Proposition 1. The group 1 -h Mp is divisible. For each m > 2 prime to p, it is
uniquely m-divisible.
Proof. It is enough to prove that the group 1 + Mp is p-divisible and uniquely
*" -divisible for each m prime to p.
(l)Letl-hr e 1+Mp and select a root x e Cp of Xp~(l+t): this is possible.
S1nce this field is algebraically closed. Since |jc|p = \xp\ = |1 + 11 = 1, we have
l*l = l:*(EU(l).Now
(x mod Mp)p = xp mod Mp = J e k
lrjiplies x mod Mp = 1. since k has characteristic /?. This proves x = 1 + s e
]+Mp.
(2) Let 1 -+- t e 1 + Mp and select a positive integer m prime to p. We are
Poking for a root of the polynomial f(X) = X™ - (1 + i)- We already have an
aPproximate root j = 1 for which the derivative mXm~l does not vanish mod Mp
(P does not divide m):
/(>>>= l-(l+f) = -r, r(j) = /n, |/'(v)| = 1.
152 3. Construction of Universal p-adic Fields
Thus we have |/(j)//"'^)21 = |—t\ < 1, and Hensel's lemma (II. 1.5) is applicable:
There is a unique root of / in the open ball of center 1 and radius 1. ■
In fact, for each (G/zmC fi(p) C F*™, there is one root x of / with x = f
(mod Mp). These m roots of / are all the roots of this polynomial, and for each
given t; e fim there can be only one root of / congruent to this root of unity f.
For later reference, let us formulate explicitly the following characterization of
the topological torsion of C *.
Proposition 2. For x e Cp we have
x e 1 + Mp «=» xpn -> 1 (n -► oc).
Proof. Ifjc=l+fel+ Mp, the sequence
xPn -\ =(i+f)P"_i
tends to 0 by the fundamental inequality (4.3) (second form). Conversely, assume
that xpn —> 1 (for some x eCp) and take an integer n such that xp" belongs to the
open neighborhood 1 + Mp of 1 in Cp. Since we have proved in (4.1) that there is
a torsion-free subgroup V (= /?Q) of C * and a direct-product decomposition
c;=r-M(„)-(i+Mp),
we see that x e /X(p) ■ (1 -f-Mp). The first component f of x is trivial simply because
it has an order prime to p:
/Gl+Mp=^^" = l=^f = l. ■
Observe that the convergent sequence (xpn)n>o is eventually constant precisely
when x is a pih power root of unity
x e fipoc c 1 +Mp.
Appendix to Chapter 3: Filters and Ultrafilters
A. 7. Definition and First Properties
Let X be a set. A family T of subsets of X is a filter when
l.AeT, B eF=> ADB eT,
ZAeT,A'DA=>A'eT.
If there is a filter on a set X, then this set is not empty by the condition 0. The
condition 1 shows (by induction) that the intersection of a finite family of subsets
At e T is an element of the filter T and in particular is not empty. The intersection
of all elements of T may be empty, in which case we say that this filter vsfree.
Appendix to Chapter 3: Filters and Ultrafilters 153
Example. Let X be a subset of a topological space Y, choose a point y e X — X,
and define a filter JonXas follows:
J7 z= {V HX : Visa neighborhood of y in Y}.
This example is typical, since if T is any free filter on a set X, we can define a
topology on the disjoint union Y = X u {co} by specifying its open sets:
subsets of X and subsets A U {co} {A e T).
The topology induced on X is the discrete one, but co is in the closure of X in Y,
and the filter on X attached to co is precisely T.
A family # C T is a basis of this filter if any A e T contains afie&
Lemma. Let B be a family of nonempty subsets of a set X such that
A eB, B eB =» there exists C eB such that C C A n B.
Then the family of subsets ofX containing elements ofB is a filter having B as
a basis. m
The filter constructed in the previous lemma is called the filter generated by B.
Lemma. Let T be a filter on a set X and let f : X —► Y be a map. Then the
family f{J^ = {f(A) : A e T} is a filter on f(X) and a basis of a filter on Y.
Example. Let T be a free filter on N. Choose for each neNan element An e T
such that n £ An. Hence A = f]lSn<N An e T and [AT, oo) d A, and hence
W, oo) e T. Any free filter on N contains all subsets [N, oo) (N e N).
More generally, let X be an infinite set. Then any free filter on X contains
aH cofinite subsets (i.e. complements of finite subsets) as elements. The cofinite
subsets form the Frechet filter on X.
A-2. Ultrafilters
he inclusion relation for families T C V(X) is an order relation, and if T' D T,
We say that T is finer than T. For example, any free filter on X is finer than the
frechet filter.
In an obvious sense, the subsets of a finer filter T' are smaller than those of T}
definition. Maximal filters are called ultrafilters.
Compare with coffee powder, where finer grinding also provides finer granules!
154 3. Construction of Universal p-adic Fields
Any totally ordered sequence of filters on a set X has a majorant (the union
in V(X)), and by Zorn's lemma, any filter is contained in a maximal one. For
example, the Frechet filter on X is contained in an ultrafilter (necessarily free).
Theorem. Let T be a filter on a set X. Then T is an ultrafilter precisely when
the following criterion is satisfied:
for each Y C X eitherY eTorYc^X-YeT.
Proof. If the condition is satisfied, T is obviously maximal. Conversely, assume
that there is a subset Y C X with Y £ T and Yc £ T. Observe that all A e T
meet Y:
Yc $T ==> Yc?>A=>YnA^0.
AeT
Define T1 D T as follows:
T' = {A' C X : A' d A n Y for some A e T}.
Hence Tr is a filter, and Y e T. Since Y £ T,Tr is stricdy finer than T, proving
that this last filter was not maximal. ■
Corollary 1. Let U be an ultrafilter on a set X. IfA\,..., An is a finite family
of subsets ofX such that Ui<i<n A" e ZY» then there exists at least one index i
for which At eU.
Proof. It is enough to prove the assertion for two subsets (by induction). If A £ U
and B £ U, we infer from the above criterion that Ac e U, Bc e U\ hence
(A U B)c = Ac Pi Bc e U, and A U B £ U. ■
Corollary 2. Let f : X -► Y and let U be an ultrafilter on the set X. Then
f(U) is an ultrafilter on f(X) and a basis of an ultrafilter on Y.
Proof. It is enough to prove the assertion when / is surjective. For any A C F*
either f~l(A) or f~\A)c = f\Ac) belongs toZY; hence
either A = f{f~~\A)) or Ac = f(f~l(Ac)) belongs to U.
By the criterion, f(U) is an ultrafilter on Y. ■
A. 3. Convergence and Compactness
Definition. Let X be a topological space. A filter T on X is said to converge to
a point x e X if it is finer than the filter of neighborhoods of this point, namely*
when each neighborhood ofx in X contains a subset A Gf.
Appendix to Chapter 3: Filters and Ultrafilters 155
For example, the filter of neighborhoods of a point converges to this point. In a
Hausdorff space, a convergent filter can converge to at most one point.
Let X be a compact space. Then for each filter T on X, the family (A)Aep has
nonempty finite intersections; hence by compactness
Q := p| ~a ^ 0.
AeT
If U is any open set containing Q, then
Uc n fi = 0 => £/c H p| Aj = 0 => £/ d p| ^7 d p| Aj ( e J7)
for some finite family (Aj) of subsets Aj e T. This proves that U contains an
element of T and this filter is finer than the filter of neighborhoods of Q.
Theorem. In a compact space, every ultrafilter converges.
Proof. Let U be an ultrafilter on the compact space X and choose x in the
nonempty intersection f]A&u A. The nonempty subsets
Ur\V (U eU,V neighborhood of jc)
generate a filter finer than U, hence equal to U. Hence this ultrafilter converges to
x, and a posteriori
p[A = {X}.
AeU ■
Application. Let U be an ultrafilter on the set N of natural numbers and let (<z„)„>o
be a bounded sequence of real numbers. Then Wmu an exists and
inffl„ < liman < sup<z„.
n U „
Proof. Since the sequence (<z„)„>o is bounded, then
—oo < a := inf<z„ < £ := sup<z„ < oo,
" n
and this sequence defines a map
rt h> an : N -^ \a, ft] c R
taking its values in a compact space. The image of the ultrafilter U is a basis of an
ultrafilter in the compact space [a, fi\\ hence it converges in this space. ■
156 3. Construction of Universal p-adic Fields
A. 4, Circular Filters
Let K — Qp be the spherically complete extension of Qp constructed in (III.2).
Recall that |Q* \ = R>o and the residue field kn is infinite.
To each closed ball B C K we associate a filter Tb on K defined as follows:
If the ball B is a single point {a}, we take for TB the filter of neighborhoods of
this point, generated by the B<B(a) (e > 0).
If B = BSr(a) has positive radius r, we take for Tb the filter generated by the
subsets
A(e, au...,an)= B<r+e(a) - (J B<r_E(ai) (0 < £ < r, a,- e £).
When £ decreases and/or the number of points n increases, these subsets decrease,
and we see that these subsets make up a basis of a filter. The filter Tb generated
by this basis is the circular filter associated to the closed ball B.
By definition, the subset A(£y a\,..., an) contains
r<\x — a\<r + e (x e K)
(observe that this set is independent of the choice of center a e B). Also, for any
b e B there is a S > 0 such that
[x e K : r — S < |x — b\ < r] C A(e, a\,..., an).
Lemma. Let B be a closed ball of radius r > 0 and choose a e B. Then a
basis of the circular filter Tb is given by the following subsets
A'(£, aly..., an) = {r - £ < \x - a\ < r + e] - [J B<r^E{at\
finite
where the at are chosen on the sphere Sr(a) : \x — a\ = r and0 < £ < r. ■
Here, replacing £ by a smaller one, we may even assume that the points a{ satisfy
\ai-aj\ =r(i ^ j).
The preceding definitions can be relativized to a subset X C K = Qp. Assume
XnA^0forallA e TB,
so that Tb induces a filter on X. Then this induced filter TB(X) = TB n X is still
called a circular filter on X.
For example, let X = Cp. When the closed ball B <Z K does not meet Cp, we
have r := d(B, Cp) > 0, and if 8(B) = r, the trace of Tb on Cp is a circularfilter
without a center in Cp.
Exercises for Chapter 3
1. Prove that Qp is not complete by considering the series £, n)=l pnpl/n.
(Hint. Let x be the sum in a completion of Q£ and let A' be the completion of Qp(*)*
Exercises for Chapter 3 157
Show by induction that all p1/" e K for (n, p) — 1, and hence K is not algebraic over
Qp-)
2. Let K be an algebraically closed valued field. Prove that its completion K is also
algebraically closed.
(//*>tf. Let /(X) = X" + an-iX"-1 + h aiX + ao e K[X] and select monic
polynomials fj(X) = X" + ^-.i^X""1 + h ai ^X + ao,; e # [X] converging
coefficientwise to /. Then 8j := ||/)+1 - /j-|| = max/ kj+i - a,j| -> 0 (j -> oo).
Choose inductively a root x7- (in the algebraically closed field K) of // so that (x;); is
a Cauchy sequence (cf. III. 1.5) and hence converges in the completion K to a root of
/. This type of proof also appears in (VI.2.2).)
3. Let X be the real Banach space consisting of sequences x — (jc„)„>o of real numbers
converging to zero with the norm ||x|| = sup |x„| = max |x„|. Consider the sequence
in X defined by
flO = (0), fl„ = (1 + L, 1+ £,..., l + £, 0,0,...) («>L)
so that \\an || — 2 (n > I). Show that with the induced metric, the set A ~{an\n> 0}
is an complete ultrametric space which is not spherically complete.
(Hint The induced metric on A is given by
d(an, an+k) = \\an - an+k\\ = 1 + ^L (n > 0, k > 1).
What is the closed ball of center an and radius 1 + ^?)
4. (a) Let X be a complete metric space having the following property: Any decreasing
sequence of possible values of the distance function converges to 0. Show that X is
spherically complete.
(b) If a complete metric space is not spherically complete, show that we can replace
its metric by a uniformly equivalent one 8 for which it is spherically complete. (Hint:
For given x and y, define <5(x, y) ~ 2", where the integer n e Z is chosen so that
d(*» y) < 2" < 2d(x, y). Then use (a).)
5. Prove that the residue field of £lp is uncountable.
(Hint. Each sequence N -> />t(P) C Q£ leads to a nonzero element of the residue field
*QOfftp.IfN->fo is any map, use Cantor's diagonal procedure as in (1.1.1) to define
an element not contained in the image.)
6- There are many possible choices of copies of p® in Cp. Let tpn denote the homomor-
phismx h> x": C£ -> C£ (n > 1), then kerlim^„ — lim/>t„T gives a parametrization
of choices. (Recall that a countable projective limit of surjective maps is surjective (4.3).)
'■ Let K be an extension of Qp with \KX\ dense in R>o- Recall (exercise of Chapter II)
that the tree 7# is the ordered set of closed balls of K. This tree comes with a projection
^ : T% -> R>0. For r > 0, the fiber 8~l(r) = K/B<r is the uniformly discrete quotient
group of closed balls of radius r.
(a) Show that the maximal totally ordered subsets of T% are isomorphic to either [0, oo)
or to (0, oo): Let us call these subsets maximal branches, and in the first case, we
say that the corresponding branch bears a fruit. The projection by 8 of a maximal
branch is either an isomorphism with the interval [0, oo) or an isomorphism with
an interval (r, oc) (r > 0): The fruit of a branch can lie only above r = 0.
158 3. Construction of Universal p-adic Fields
(b) Show that K is complete exactly when all maximal branches having a projection
containing (0, oo) do bear a fruit, i.e., are isomorphic to [0, oo) by projection.
(c) Assume that the field K is separable, so that all fibers S~~l (r) (r > 0) are countable.
Show that such a field cannot be spherically complete.
(Hint. The set of distinct branches having a nonempty intersection with any
8~l[r'y r"] for some fixed rr < r" is uncountable.)
K K/B^
r = 0 r' r" r = l
Tree of K: Fruits, branches, and holes
(d) Define an action of the 2 x 2 upper triangular matrix group T^(K) C Gl2(#) on
TK (cf. fVI.3.I)). When K = Qpy show that this action is transitive on the subtree
defined by 8 > 0.
8. Let a e&p-Cp and r := d(a, Cp) > 0. Show that the cases Sr(a) n Cp = 0 and
Sr(a) nCp^0 both occur.
(Hint. Choose first a € £lp with \a \ = 1 and residue class a e Icq not algebraic over the
prime field: In this case r = 1 and the sphere S\ (a) meets Cp. On the other hand, select
a decreasing sequence of closed balls B<rn(an), rn \ I having an empty intersection
in Cp and choose a in the intersection of the same balls of £2p: The sphere Si(a) does
not meet Cp.)
9. Let K be an ultrametric field. Assume that both k (the residue field) and | K * | are
countable. Prove that for fixed r > 0, the set of dressed balls of radius r is also countable.
(Hint. Observe that the set of open balls of radius r is countable. Define a surjective
map from the set of open balls to the set of closed balls of the same radius.)
10. Let K be an ultrametric field with | K x | dense in R>0- For real t > 1 let Vt denote the
partition of the closed unit ball A ~ {x e K : \x\ < 1} into its cosets mod the additive
subgroup £<!/, = {x e A : |x| < \/t}. The family (Vt) indexed by t e [0, oo) has the
property
for 5 > t > 1, Vs is strictly finer than Vt.
(The "continuous family" (At\>\ of associated a-algebras is a filtration of the space
A in the sense used in the theory of stochastic processes.)
h
^
fv
^
/
Exercises for Chapter 3 159
11. Let us denote by B<r {r > 0) the additive subgroup |x| < r (in Cp or in any ultrametric
field having dense valuation).
{a) For 0 < r < s, show that the subgroup £<r of B<s has no supplement: B<s is not a
direct product of £<r with another subgroup. In other words, the short exact sequence
0 -+ H = B<r -> G = Bss -> G/H -> 0
does not split. {Hint. For all x e G = Z?<5, pnx -> 0.)
{b) For 0 < r < 5 < 1 show that the multiplicative subgroup 1 -h B<r of 1 + B<s has
no supplement, (//wt If |x| < l,then(l + x)Pn -> 1.)
(c) For 0<52<r<5<l, prove that there is a canonical isomorphism
(1 + B<s)/{1 + £<r) -^ Bss/B<r.
{Hint. Consider the homomorphism xki=x-1 mod £<r.)
{d) We have iip™ n (1 -1- £<rp) = {1}, but the direct product iipx - (1 -1- £<rp) is a
proper subgroup of 1 + Mp. Show that 1 + B<rp has no supplement in 1 + Mp
and more precisely, ixp^ is maximal among the subgroups Hcl+ Mp such that
H n (1 -1- £<rp) = {1}. (Tfinf. The sequence (1 + r)^" —> 1 is eventually stationary
precisely when 1 +1 e iip™.)
12. Prove the first form of the fundamental inequalities by induction, using a — {\ + T)pK
and the factorization
ap - 1 = {a - l)(l + a + - - - + ap~l),
where each ak e 1 + / {k > 1) so that 1 + aH ha^"1 € p + 7 = /. (Observe that
the case n = 1 of the statement is crucial, and the induction step is based solely on it!)
4
Continuous Functions on Zp
The goal of this chapter is the study of continuous functions on subsets of the
p-adic field Qp with values in an extension of Qp. Since Qp admits a partition into
clopen balls x + Zp (x e Qp/Zp = Z[l//?]/Z), it is enough to study continuous
functions on Zp. Thus, we shall typically study continuous functions Zp -> Cp.
Since the natural numbers N form a dense subset of the ring Zp, we shall start by
the study of functions on N or Z and with values in any abelian group.
In classical analysis, real- or complex-valued functions that are continuous on
an interval can be uniformly approximated by polynomial functions (theorem of
Weierstrass). But there is no canonical series representation for them. It is a specific
feature of p-adic analysis that continuous functions Zp -» Cp have a canonical
Mahler series representation. As has been noticed and proved by L. van Hamme,
many systems of polynomials can also be used instead of the binomial system.
This leads us into the umbral calculus, where suitable systems are found.
Due to the granular structure of Zp, the locally constant functions also constitute
a dense subspace of C(Zp; Cp) (these functions correspond to the step functions on
an interval in the classical theory). A basis of this space consisting of characteristic
functions of suitable balls has been devised by M. van der Put.
1. Functions of an Integer Variable
1.1. Integer-Valued Functions on the Natural Integers
A polynomial f(x) e Q[x] can take integral values on all natural integers even if its
coefficients are not integers. For example n2 = n (mod 2) shows that \x2 — ix lS
1. Functions of an Integer Variable 161
such a polynomial. More generally, np = n (mod p) shows that the polynomial
lxp _ I* is also such a polynomial.
P The study of these polynomials is based on the following observation. Each
binomial polynomial
(x\ x(x — 1) • - - (x — n + 1)
0
n; n\
€QM (#i>0)
defines an integer-valued function N -> N. This (and the theorem below) explains
their central role in this chapter.
The first binomial polynomials are
(;)-• (0- (0-T-i-
One can read the sequence of values given by (J in Pascal's triangle: The first
values are 0 (outside of the triangle)
CHQ- er)=°-(;)='■(-:>«•
etc.
In the figure below, we exhibit the values of the binomial polynomials in vertical
columns, with special attention to Q).
1
1
1
1
1
1
1
1
1
1
2
3
4
5
6
7
8
1
3
6
10
15
21
28
1
4
10
20
35
56
G)
1
0
0
0
0
1
5
15
35
70
1
6
21
56
1
7 1
28 8 1
Values of the binomial polynomials as vertical columns
n tne other hand, introduce the finite-difference operator V defined by
(V/)(x) = f{x + 1) - f(x).
J*his is a discrete analogue of the gradient operator, whence the notation; we
^P A for a discrete analogue of the Laplace operator.) This forward-difference
P^ator acts on any function / on N taking values in an abelian group. An abelian
£roup can always be considered as a Z-module, and conversely, any Z-module is
n a^elian group. Thus we shall now consider functions / : N -> M where M is
162 4. Continuous Functions on Zp
a Z-module. The action of the finite-difference operator on the binomial functions
is easily determined: An elementary computation shows that
The binomial polynomials behave with respect to the difference operator as the
polynomials xl/i\ do with respect to the derivation operator:
D(x°) = 0, D[ — ) = = (i > 1).
This analogy will be exploited and generalized.
Theorem. Lex M be any abelian group and let f : N —»• M be an arbitrary
map. Then there is a unique sequence (iw,-)i>o of M such that
/<*) = £ m, (*) = J2 «•■ t) (* e N).
Forx 6 N only finitely many terms of the sum are nonzero, and mi = (V'/)(0).
Proof. Since (*) = 0 for x = 0 and i > 1, we see that mo = /(0) is uniquely
defined. The finite-difference operator can be used repeatedly to bring any coefficient
into the constant term position:
Hence m^ = V*/(0). These computations already prove the uniqueness of the
coefficients m* and show how they have to be computed. Conversely, if the function
/ is given, let us compute the iterated differences V*/(0) e M and define g(x) =
5Zo</<jc ^' /(0)G)> 0> = / ~ £■ The iterated differences of <p vanish at the origin
by construction: <p(0) = 0, <p(\) — <p(0) — 0, whence <p(\) = 0 from which
it is apparent that <p vanishes at the points 0, 1, 2, More formally, one can
establish by induction the general formula
vV(0) = £(-iyY*W - o = v(k) + - - -
The induction hypothesis ^(y) = 0 for all j < k and V*^(0) = 0 implies
?<*) = ~J2 (-w(k)<P(k - 0 = 0 (* > 0).
1. Functions of an Integer Variable 163
Hence (p = 0, as expected. This proves f = g and the existence of an expansion
of the desired form. ■
Comments. (1) The preceding proof shows that the expansion of / is simply / =
Z/>o mi C) ■ T^s series converges pointwise: Although infinitely many coefficients
rm will be nonzero in general, for each fixed x 6 N the sum E,->o mffix) is a
finite sum. Let us introduce the Pochhammer symbol
(x)o= 1, (x),-=x(x--l)-.-(x-i+l) (i > 1),
so that
V(jc)i = j(*),-_i and
The preceding series expansion of / takes the form
;>o l ■
which is strikingly similar to the Taylor-MacLaurin power series of an analytic
function (of a real or complex variable).
(2) The formulas
v*/(0) = £(-i)*-,Y*)/(0
correspond to the formal power series identity
*>0 /7>0
between these two generating functions.
J-2. Integer- Valued Polynomial Functions
We shall denote by L = L(Z) c Q[jc] the Z-module consisting of polynomial
functions taking integer values on the natural integers:
L = {/ € QM : /(N) C Z}.
We have seen in (1.1) that Z[x] C L is a proper inclusion: All binomial functions
belong to L.
Theorem. The Z-module L consisting of polynomial functions f e Q[x]
integer-valued on N is free, with a basis given by the binomial polynomials ( ).
**oof. Let / be an integer-valued polynomial. Obviously, all the iterated
differences of / have the same property, and in particular the coefficients mt of the series
cyf-
164 4. Continuous Functions on Zp
expansion of / are rational integers. On the other hand, the iterated differences
V1 / will vanish identically if the exponent i is greater than the degree of /. Hence
the series 5ZiW|(j) is a finite sum, and the uniqueness of the representation has
been proved in (1.1). ■
Corollary 1. If a polynomial f e Q[x] takes integral values on N, it also takes
integral values on Z.
Proof. It is enough to check this property for the basis of L consisting of the
binomial polynomials. If x — — m is a negative integer, then
(~™ J = -m(-m - 1) • • ■ (-in - i + l)/i! = (-1)1 r
Hence the (:) and all / e L define functions Z -► Z. ■
Corollary 2. If a polynomial of degree d > 0 (w#/i rational coefficients) takes
integral values on d -\~ I consecutive integers, then it takes integral values on
all integers.
Proof. Let / take integral values on the integers a,a + \, ...,a + d and
consider its translate g(x) = f(x — a) which takes integral values on the first integers
0, 1,..., d. Hence the first iterated differences of g at the origin are also integers,
and if / is a polynomial of degree d, so is g. The expansion g = ^2i<zd V'g(O) (j)
shows that g 6 L. ■
Definition. Let Mbea Z-module. The M -valued polynomial functions are those
that have a finite expansion in the basis consisting of binomial polynomials.
Since the polynomial functions with values in M are the finite sums J] w,- (j), the
mapping
Maps(N;M) -+ MN : / ^ (V/(0))i>0
induces a bijection between the polynomial functions and Mm: The subspace of
the product consisting of families with only finitely many nonzero entries.
13. Periodic Functions Taking Values in a Field
of Characteristic p
We shall have to consider the case where the Z-module M is a vector space over
the finite field Fp. To start with, let us take M = ¥p.
Proposition. For i < p\ the functions (:) : Z —> Fpj x J-*- (*) mod p are
periodic of period T = p*. They make up a basis of this space of T-periodic
maps.
w
1. Functions of an Integer Variable 165
Proof. The binomial coefficients are best described by their generating function
(l + i
i>0 \f/
The identity (1 + u)x+T = (1 + u)x - (1 4- u)T combined with the congruence
(I+m/ = l+upt (modp)
leads to
(1 + u)x+pt = (1 + uf • (1 + upt) (mod /?).
Fori < p', the coefficients of w1 in (1 + u)x+p' and in (1 -h«)^ are the same mod p\
hence
CVO-C)(mod/,) ^^
To prove the second part of the statement, consider the linear map
j : Maps^^^F,) -+ FTp, f » (Vf(0))0^T.
If V* /(0) = 0 for 0 < i < T, then / vanishes at the points 0, 1,..., T - 1 (1.1),
hence vanishes identically by T -periodicity. This proves that the linear map j is
infective. Since both spaces Maps^.p^odic (Z; Fp) and FTp have the same dimension
over Fp (even the same number of elements, since this field is finite), it is bijective.
It will be enough to check that the image of the set of binomial polynomials (:) is
the canonical basis of the target
(:) is a polynomial of degree i => V*Q =0 for k > *",
V*G) = CI) vanishes at 0 for k < i, V*Q(0) = 1. ■
Remark. It is not difficult to prove periodicity of the binomial coefficients relative
to nonprime moduli. For example,
ls periodic of period ml.
Theorem. Let M be a vector space over Fp and f : Z -» M a function that is
Periodic of period T = pT (for some t > 0). Then f can be uniquely written
in the form
0<i<
166 4. Continuous Functions on Zp
In other words,
Mapsr.periodic(Z;M) = 0M,-
i
is the direct sum of the subspaces
Mi = [ ; W C Maps(Z;M) (0 < i < T).
Proof. A 7-periodic map on Z is a map on the finite quotient Z/ TZ, and hence
takes only finitely many values. This reduces the proof to the finite-dimensional
case. The map
j : Mapsy.peri^ -> MT, f m> (V/CO)^,-^
is linear and injective. Since both spaces Mapsj-.^nodic and MT have the same
dimension over Fp (even the same number of elements, since this field is finite),
it is bijective. ■
1.4. Convolution of Functions of an Integer Variable
Let A be a commutative ring and /, g : N -► A two functions. We define their
shifted convolution product by1
/**(«)= £ f(i)gU)= £ /(0*(/t-l-0 (*>1)
i+j=n —1 0<i<n-l
and / *g(0) = 0. This is a commutative, associative, and distributive product on
Maps(N, A).
Proposition. The iterated differences of a shifted convolution product are given
by
V"(/ *g) = E*+j=»-i V'/ • V'*(0) + / * V"g (n > 1).
Proof. It will be practical to use the notation f\ for a unit translate of a function
/:
f\(n) = f(n + \) (ii>0)
(the value /(0) is lost). With this notation the difference operator is expressed
by
V/ = /, - /-
^e usual convolution product is defined by / * g(n) — Yli+j=n f(0g(j) (n — 0).
1. Functions of an Integer Variable 167
Let us evaluate the translate of a shifted convolution product:
= /(«)*(0)+ 5Z /<0*(7 + D
i+j=n~\
= f(n)g(0) + (f*gi)(n),
whence
V(/*g) = (f*g)i - f*g = f ■ «(0) + f*Vg.
Iterating the preceding formula, we obtain
V2(/±S) = V(/-g(0)+/*Vg)
= V/.«(0)+V(/*V«)
= V/ ■ g(0) + f ■ Vg(0) + f*V2g.
By induction, we obtain
v"(/*g)= £ v'7-v^(0) + /*v"g,
i+j=n-1
which expresses Vn(/ *g) as a sum of / * Vng and a linear combination of the
finite differences V1"/ (i < n) of/. ■
1-5. Indefinite Sum of Functions of an Integer Variable
If the finite-difference operator V is to be compared to the derivation operator —
pursuing the analogy — we should construct an inverse of it, corresponding to
integration. It is clear that for any function / : N —> A, there is a unique primitive
F '■ N -» A satisfying
VF = / and F(0) = 0.
These conditions indeed imply
/(0) = VF(0) = F(1) - F(0) = F(l)
and then
/(«) = F(/i + 1) - F(n), F(n + 1) = F(/i) + /(/i).
% induction, F(/i + 1) = Eo<,-<» /(')■
Definition. 77ze indefinite sum operator S is defined by
5/(0) = 0 and 5/(n) = ^ /(i) (n > 1).
0<i <«
168 4. Continuous Functions on Zp
If we use the shifted convolution product introduced in the preceding section, we
see that Sf = 1 * /, where 1 represents the constant function 1 on N. In fact,
V(i */)(n) = £/(0 -£ /(i) - /(/i), V(i */) = /.
Examples. (1) Let / = 1 be the constant unit function. Then S\(n) = n.
(2) Let / (= SI) be the identity function N -> N. Then
(3) More generally, let / = (^) be the fcth binomial polynomial N -^ N. We
have seen in (1.1) that V(J = (^1,) and (£) = 0 (* > 1). Hence
SG:.)-G) ^
This property can be read in Pascal's triangle. Consider, for example, the two
consecutive sequences
f2:0, 0,1, 3,6,10,...,
/3:0,0,0, 1,4,10,....
The differences of the second one indeed give the first one.
(4) Consider now f(n) — n2, the square function. In this case
Sf(n) = £V - I2 + 22 + • ■ - + (n - I)2.
Since
'GH--"2-C>
we have
and
6Sf(n) = 2n(n - l)(/i - 2) + 3/i(/i - 1)
= n(n - l)[2/i - 4 + 3] = n(n - l)(2/i - 1).
We have obtained the well-known formula
Sftn) = £V = ln^n ~ l^2n - ^
1. Functions of an Integer Variable 169
This way of proceeding is similar to the general procedure that consists in writing
the binomial series expansion of F = Sf,
F = F(0) + VF(0)Q + V2F(0)Q + V3F(0)Q,
using F(0) = 0 and VF = /; hence
VF(0) = /(0) = 0, V2F(0) = V/(0) = 1, V3F(0) = V2/(0) = 2.
The preceding examples lead to the following result.
Proposition 1. If the function f of an integer variable is given by
i>0 V/
then
F(n) = Sf(n) = Tci(.n ). .
The preceding examples also show that
1*1 =id, 1*1*1(72)= (")>••■> 1*1* ■■-*!(/»)= (")■
k+\ faciors
A few more formulas may be useful. By definition / = V(S/) = V(l * /). Let
us compute S(V/):
5(V/)(n)=l*V/(/i)= J^ V/(*>
= £ tf<* + ]) ~ fW = /(«) - /(0).
" we denote by F0 the projection on constant functions defined by
P0 : Maps(N; A) -> A, / h> /(0) ■ 1
we have obtained S o V = id — Po- Hence the following proposition.
Proposition 2. 77ie indefinite-sum and finite-difference operators are linked by
theformulas
Vo5 = id, SoV = id-P0, VoS-SoV = P0. ■
^e identity S(V f) = f — /(0) - 1 gives a first-order limited expansion of / if
We only rewrite it / = /(0) ■ 1 + S(V/). This point of view has been generalized
bV van Hamme.
170 4. Continuous Functions on Zp
Theorem. For every function f of an integer variable and every integer n > 0,
we have
f = /(O) • i + v/(0) - Q + v2/(0) - Q + - • • + v/(0). r\ + Rn+lf
with the van Hamme form for the remainder Rn+\ f = Vn+1 / * (/7)-
Proof. The case n = 0 has already been obtained: R\f = SVf = 1 * V/. For
«>lwe can use the identity
1+7=71-1
proved in (1.4). Let us apply
V+1(g*/) = J] V'2-VV(0) + 2*V"+1/
i+_/=fi
to the function g(x) = £), for which V's(jc) = (^.) = (p. We find that
v-((;),/) %eC)v>/(o,+(;)±v-,
But the left-hand side is
Vn+1 ((* )*/) = Vn+1(l*l*-.-*l*/) = Vn+l(Sn+lf) = f,
whence the result, since V o S = id. ■
2. Continuous Functions on Zp
2.1. Review of Some Classical Results
Let us recall the basic property of uniform convergence.
Theorem. Let X be a topological space, M a complete metric space, and
(fn)n>o a sequence of continuous maps X —> M. If
d(fm, fn) := supdM(fm(x). fn(x)) -* 0 (m,n -* oo),
xgX
then the sequence (fn)n>o has a limit that is a continuous function f : X —> M-
Proof. Fix momentarily x e X. Then (/n0O)n>o is a Cauchy sequence in the
complete space M; hence it converges. Let f(x) = lim^oc fn(x) denote its limit.
2. Continuous Functions on Zp 171
This defines a function f : X ~+ M. We have to prove that this function is
continuous. But for each positive s > 0 there is a rank N = NE such that
dM{fm(y\ My)) < supdM(M*), /(*)) = d{fm, fn) < *
(m,n> N, ye X). Letting m->oowe infer
dM(f(y), My)) < £ (n>N,yeX),
and hence
d(f,fn) = mpdM(f{y),My))<£ (ri>N).
yex
This proves that the sequence (fn)n>o converges uniformly to / and implies the
expected continuity: let us recall this point. For a, y e X, we write
dM(f(y), f{a)) < dM(f(y), My)) + dM(My), Ma)) + dM(Ma), /(«));
whence
dM(f{y), f(a)) <£ + dM(My), Ma)) + e (/! > TV).
Let us choose and fix an integer n > N.If a e X,the continuity of the function
fn assures us that there is a neighborhood V of a in X such that
y e V => dM(My), Ma)) < *.
The preceding inequality shows that
dM(f(y), f(a)) <3s (ye V),
and hence / is continuous at the point a (for any a e X). ■
Another classical result for continuous functions / : Zp —> R is the following,
u we fix a continuous injective function p> : Zp -» R (for example, a linear
model of Zp (1.2.3) corresponds to such a function), then / can be uniformly
aPproximated by polynomial expressions in <p. Indeed, the algebra of polynomials
ln P is a subalgebra of the algebra of continuous functions over the compact
sPace Zp, which separates points. The Stone-Weierstrass theorem implies that this
subalgebra is dense for uniform convergence.
Finally, let / : Zp -> Cp be a continuous function. Then |/| : Zp ->R is
c°ntinuous, and since Zp is a compact space, sup|/|= max|/| is attained at
S°me Point xeZp. More precisely, f(Zp) is a compact subset of Cp and the
Proposition in (II. 1.1) shows that
{|/(*)l # 0 : x e Zp] is discrete in R>0.
Particular, for every e > 0 there are only finitely many possible real values of
j/(*)| satisfying | f(x)\ >s.
172 4. Continuous Functions on Zp
2.2. Examples of p-adic Continuous Functions on Zp
The definition of a topological ring A shows that any polynomial / e A[X] gives
rise to a continuous polynomial function A -> A. In particular, if / e Cp[X] is a
polynomial with coefficients in Cp, it gives rise to a continuous function Zp —> Cp
by restriction. Since x e Zp implies |x| < 1, any power series J2i>oaix' with
ai 6 Cp and \ai\ -> 0 converges uniformly, and hence defines a continuous
function Zp -> Cp. For any continuous function / : Zp -> Cp, we define its sup
norm by
H/ll = sup |/(x)| = maxl/WI (<oo).
JC€Zp *€ZP
Finally, let us give examples of continuous functions Zp -» Cp of an apparently
different type. If x = J2i>oatPl e Zp, we define/(x) = ^->0fl,-/?21. This defines
a continuous function Zp —> Zp with
l/M - /OOI = I* - y|2.
This estimate shows that / is even differentiate at every point with /' = 0, but
/ is not locally constant. We shall come back to this example later on. If we put
f(x) = Y^ atP™, we have similarly
l/W-/(y)l = l*-yr.
and with f(x) = J] aiPl\ then for any m > 1,
l/(*)-/O0l<l*-yr
if |x — v| is small enough.
2.3. Mahler Series
The binomial polynomials define continuous functions
Q : Z,-> Z„ *-(*).
Since N is dense in Zp, we have || Q || = supN | (£) | < 1. In fact, (J) = 1 proves
that
= 1 (Jfc > 0).
As noted in the previous section, for any sequence (fl,-),->o in Cp with |«,-1 -> 0, the
series J]fc>o fl*G) defines a continuous function / : Zp —> Cp. It is quite
remarkable that conversely, every continuous function Zp —> Cp c«n be so represented-
G
2. Continuous Functions on Zp 173
This result has been obtained by Mahler and will be established below. For
example, it is applicable to all locally constant functions.
Definition. A Mahler series is a series J]*>o fl*C) w^ coeffic^ents \ak I -► 0
in Cp (or Qp).
Comment If a series ]T^>0 ak Q) converges simply at each x 6 Zp, it converges
uniformly on Zp and is a Mahler series. In fact, assume that it converges at the
single point — 1. This implies ak(~kl) -> 0, and since (~!) = (—1)*, we see that
\ak\ = \ak (~kl) | -► 0: the series converges uniformly.
Example. Let t e Mp — namely, t e Cp, \t\ < 1 — and consider the sequence
ak = tk, which tends to 0. The Mahler series ^ >0 f*(*) converges uniformly to
a continuous function / : Zp ~> Cp. Since (1 4- t)n = JIo<A:<n CD'* *°r mtegers
« > 1, the preceding continuous function extends n m> (1 -f- t)n, and it is still
denoted by
(l+0* = iy(*) (*eZ,).
2.4. The Mahler Theorem
Keeping the preceding notation concerning the binomial polynomials fk(x) = (*k)
and the sup norm ||/|| = supz |/(x)| for continuous functions on Zp, we intend
to prove the following general result.
Theorem 1. Let f :ZP —> Cpbea continuous function and put ak = V* /(0).
Then \ak | -» 0, and the series Xjt>o ak (*) converSes uniformly to f. Moreover,
11/11 = supjk>0|a*|.
Proof. Since the function / is continuous, f(Zp) is a compact subset of Cp and
)| has at most 0 as an accumulation point in R>0. Without loss of generality
we may assume / ^ 0 and replace / by ///(jco), where x0 € Zp is chosen with
maximal. Hence we shall assume ||/|| = 1 from now on: The image of /
ls contained in the unit ball Ap of Cp. Let us consider the quotient E = Ap/pAp
\pAp = 5<|P|(CP)) as a vector space over the prime field Fp. Then the composite
f = (/ mod /?): Zp -> Ap -> E is continuous (takes only finitely many values,
ls locally constant) and is not identically zero. Since Zp is compact, it is uniformly
continuous and uniformly locally constant. This means that for t suitably large, <p
ls constant on cosets mod pfZp. Hence <p is 7-periodic on Z, where T = pr with
values in the vector space E. By (1.3) we can write
174 4. Continuous Functions on Zp
Taking representatives a® 6 Ap for the ak, the difference
k<zT
has values in pAp. By the competition principle, at least one \ak\ = 1, and
|flj| <1, max|flj| = l.
By construction ||/ - ^<r a^{j) || = r < ]/?|. If / - £*<r <#(*) is not 0, we
can iterate the procedure on this difference and find S > T and coefficients a\
(k < 5) with
\alk\<r, max|a]|=r,
and
We can even write
|/-Z;c«j?+«2)(;)|=^sip2i
II k<zs W ||
if we agree to define ak = 0 for k > 7. It is obvious that this procedure leads to
convergent series
ak = al + ^ + • • - e Cp, |fljf| < |pn| -> 0,
|a*| < 1 (ft < T), \ak\ <r (T<k<S), etc.,
and also supJt>0 |a*| = suP*<:r \ak\ — \ — || /1|. The proof of the theorem is
therefore complete, since |/ — ]T^>0ak('k) | < \p\m for all positive integers m. ■
Corollary. For any continuous function f : Zp —* Cp, there is a sequence of
polynomials fn 6 Cp[x] that converges uniformly to f. B
Theorem 2. Lef f : N ~> Cp be any map and define ak = V*/(0). 77zen rftf
following properties are equivalent:
(i) \ak\ —> 0 w/ierc /c -> oo.
(«") 77ie Mahler series ^2k>0 ak ( j) converges uniformly.
(Hi) f admits a continuous extension toZp -» Cp.
(iv) / is uniformly continuous (for the p-adic topology on NJ.
(v) II V*/|| -> 0 when k -> oo.
2. Continuous Functions on Xp 175
Proof. Here is a complete scheme of implications.
(/)=r»(n)Wehave
■CDIHG)
k(J <|fl*| (J1 = 1**1 (xeZp),
hence the uniform convergence if \ak\ ~> 0.
(«") => (tit) This is the basic property of uniform convergence reviewed in (3.1).
(Hi) O (iv) On a compact metric space, any continuous function is uniformly
continuous.
(in) => (v) Apply the Mahler theorem to the continuous extension of / to Zp
(still denoted by /):
*>o W
Since VQ) = (^), we have
-S-C:.)
v/
*>1
and by induction
By the same theorem
"'-£<-,)■
l|V'7||=supki4|-»0.
!nparticular, |fl/| = |V->/(0)| < ||VVII -> 0; hence (v) =» (i). ■
2-5. Convolution of Continuous Functions on Zp
As an application of the Mahler theorem, we show that the (shifted) convolution
Product defined in (1.4) for functions of an integer variable N -> Cp extends to
^p -* Cp. In turn, this result allows us to give an explicit estimate for the remainder
ln a finite Mahler expansion. By definition,
f*g(n)= J2 /(')*0").
l/**(«)l 5 max |/(i)«(j)| < ||/||||g||,
and
ll/*sll<ll/llllsll-
176 4. Continuous Functions on Zp
Proposition. Let f and g be two continuous maps Zp -» Cp. Then the shifted
convolution product f *g has a continuous extension Zp —► Cp.
Proof. By (3.4) (Theorem 2, (i) => (hi)), the existence of a continuous extension
°f / *g (initially only defined on N) will follow from V*(/*g)(0) -^ O.Toprove
this convergence, let us come back to the formula (proved in 1.4)
v2w+1(/*g)= £ v7-v^(0) + /*v2w+1g,
1+7=2/*
V2n+l(/*g)(0)- J] V7(0)-V^(0) + (/*V2w+Ig)(0).
1+7=2/1
For any bounded function h, the ultrametric property gives || V/i|| < ||/i||, and we
can estimate
where
v2n+1(/*g)(0) = r,+ r2 + r3,
ri= I] v'7(0)-v2^g(0),
n<i<2n
r2= £ v2n-'7(0) • v^(0),
n<7<2«
73 = (/*V2n+1g)(0),
as follows:
Hil<|V"/l-ll*D.
\T2\ S ll/ll ■ Wg\\,
|r3| < ll/ll ■ Iiv2"+1g|| < ii/H • nv"g||.
Altogether, this shows that
|v2n+,(/*g)(0)| < max(|r,i, ir2|, ir3|)
< max(||V"/ll • ll«|| , ll/ll ■ IIVgII)->0.
Similar estimates can be made for |V2n(/ *g)(0)|, and we prove thereby the
required convergence: V*(/*g)(0) -*■ 0. *
Corollary 1. Any continuous f : Zp -» Cp fo« limited Mahler expansions
f = /(0) + V/(0) • Q + V2/(0) - Q + - - -
+ V"/(0)Q+ /?„+,/ (n>l)
2. Continuous Functions on Zp 177
with the van Hammeform of the remainder
Rn+lf = V"+1/£Q, ll/Wi/ll < l|V"+1/|| -> 0 (n -> oo).
Proof. The announced formulas hold on N by the preceding section. Taking g =
(") in ||/*gll 5 11/11 llgll, we see that they extend continuously to Zp by the
proposition. ■
Another application of the Mahler theorem (or of the possibility of extending
the convolution product to continuous functions over Zp) is given by the following
corollary.
Corollary 2. For any continuous function f : Zp -> Cp, the indefinite sum
Sf = f * 1 of f extends continuously to Zp. More precisely, iff = Xjt>o ^k('^j
is the Mahler expansion of f, then
Sf = l*f = I>(, " A II5/H = ||/||.
Proof. We have noticed that
sQ-^GH+i)-
whence the result.
Corollary 3. The only linear form <p : C(Zp; K) -> K that is invariant under
translation is the trivial one <p = 0.
Fkoof. In fact, we prove that if <p(F) = ^(F^forallF 6 C(Zp; K), where Fx(x) =
F(* + 1), then <p = 0. Indeed, take any / e C(ZP; K). There exists an F e
C(ZP; K) with / = VF = Fx - F (take F = Sf). and thus
<p(f) = <p(Fl-F) = <p(Fl)- <p(F) = 0. ■
Corollary 4. Let a : Zp -> Zp, x i-> — 1 — x, be the canonical involution
(U.2). Then S(f o ct)(jc) = -Sf(-x).
**oof. For integers n, m>lwe have
S/(#i +«) - Sf(n) = f(n) + ■-■ + /(« + m - 1).
y density of the integers n > 1 in Zp and continuity of both sides, we get more
generally
Sf(x + m) - S/(x) - /(*) + - • • + f{x + m - 1) (x e Zp).
178 4. Continuous Functions on Zp
Take now x = ~m in this equality:
Sf(0) - Sfir-m) = /(-«) +•■- + /(-!)
= /(<x(ifi-l)) + --- + /(<x(0))
= S(foo)(m).
Since S/(0) = 0, the result follows.
Example. Let a = 1 -h f 6 l+McCp and take
*>o \*'
Then we have
ax - 1
Sf(x)= (a#l).
a — 1
3. Locally Constant Functions on Zp
3.1. Review of General Properties
When X is a topological space and E any set, a map / : X —► Zs is locally constant
if for each jc 6 X
Vx = {j € X : /(j) = f{x)}
is a neighborhood of x. Equivalently, one can require f~l(e) open in X for each
e e E, or even /_1 (A) open in X for each subset A c E. In other words, locally
constant functions / : X -» Zs are continuous functions when E is endowed
with the discrete topology. On a connected space, a locally constant function is
constant (take x 6 X, put e = /(jc) 6 E, A = E — {e}, and consider the partition
of the connected space X into two disjoint open sets f~l(e) and f~l(A): Since
f~l(e) ^ 0, f~l(A) must be empty and / = e is constant). A locally constant
function /ona compact space X can take only a finite number of values (/W
must be compact and discrete).
Lemma. If X is a compact metric space, a locally constant function f onXtS
uniformly locally constant when there exists 8 > 0 such that
d(x,y)<:8=> f(x) = f(y).
3. Locally Constant Functions on Zp 179
Proof. Give E the discrete metric. Since X is compact, / : X -> E is uniformly
continuous. Hence there is a 8 > 0 such that
d(x, y)<8=> d(f(x), f(y)) < 1 =» f{x) = /()0,
and the conclusion follows. ■
The set of functions X -> E is denoted by ,F(X; £"), and when E = K is a
field, -F(X; A^) = -F(X) is a vector space over K (omitted from the notation if
this field is implicit from the context). When X is a compact ultrametric space, the
locally constant functions X -> K form a Jf-vector subspace .Flc(X) of -F(X).
The characteristic functions of clopen balls of X form a system of generators of
T\X).
32. Characteristic Functions of Balls ofZp
We are interested in locally constant functions on X = Zp taking values in any
abelian group M (this abelian group will typically be an extension K of Qp). Let
us start by the study of the (uniformly) locally constant functions / 6 -Flc(Zp; M)
satisfying
\x~y\<\pj\ = ^j=^nx)=f(y)
for some fixed integer j > 0. These are the functions that are constant on all closed
balls of radius r7- = l/pj- Since the balls in question are the cosets of pjZp inZp,
these functions are the elements of the vector space
Fj = FiZ/ptZ) = HZplp^p) C T\ZP\K\
In fact, we have a partition
0<i<p'
into balls of radius r,-, and
i + pjZp = BSp->{i) (0<i<pj)
ls an enumeration of these balls in Zp. For fixed j the characteristic functions
<Pt j = characteristic function of the ball B<\/pJ(i) (0 < z < pi)
^ake up a basis of the finite-dimensional space Fj. When we let j increase, the
subspaces Fj also increase, and
Pc(Zp-K) = ]jFj.
"fortunately, the previously given basis of Fj has no element in common with the
^is constructed similarly in F7_i. A clever way of constructing coherent bases
180 4. Continuous Functions on Zp
of the spaces Fj — where the basis of Fj extends the basis of F/_i — has been
devised by M. van der Put. Let
Ifc
(pQ0 = 1 characteristic function of Zp (i = 0)
<Piti characteristic function of i + pZp (1 < i < p)
<pit2 characteristic function of/ + p2Zp (p < i < p2), etc.
Generally,
^ = (pt j characteristic function of i + p^Zp if p*~l < * < pJ".
Since absolute values of elements of Zp can only be powers of p, we have
\X\ < T <=> |JC| < -^ (p^-1 < I < Pj),
I pJ
and ij/i = <pij is also the characteristic function of the ball
Bi = {xeZp:\x-i\ <!//}
(with the convention Bo = Zp for i = 0).
On the other hand, the indices i in the range pJ~~l < i < pj are precisely those
that admit an expansion of length j in base p, namely an expansion of the form
/ = *o + hp-\ f- h-iPJ~~l (0 < i£ < p - 1, 0-i ^ °)-
Definition. The length of an integer i > 1 W f/ie integer v = v(i) > 1 .swc/i fto
f/ie expansion of i in base p has digits ii = Ofor £ > v, w/i/fe /v_i ^ 0.
With this definition, the van der Put sequence is defined by
fy = (fi^i) : characteristic function of/ + pv(l)Zp.
Here are the first few functions:
1
P0.1 •
^0,2 -
0*0,3 •
• • <Pp-l,l
■ ■ <Pp-l,2
• • <Pp-l,3
<Pp,2 ■
■ • <Pp2-l,2
<Pp2,3
- w-u
<Poj
The sequence (i/l)j<pj appears at the top of this triangular table of characteristic
functions.
Proposition. The sequence 0l/t)o<i<pj is a basis of F} (j > 0).
3. Locally Constant Functions on Zp 181
Proof. For fixed j > 0, the components of any / e Fj in the known basis <pij
(i < pj) of Fj are the (constant) values of / on the balls i + pjZp:
f = ]T /(f)wj-
i<p*
In particular, for / = fa, the characteristic function of Bg = I + PvZp, we have
a sum of the form
fa = £ Vij,
where the indices that occur are the same as those occurring in the partition
Bi=\J(i + p'Zp).
They are the indices i such that 0 < i < pJ and i = t (mod /?)v (in order to have
i € Bg). These indices can be listed:
i=£, L + p\ l + 2p\ ....
The first one is t itself, and they are all greater than or equal to I. The matrix
of the components of the V"£ in the basis <pij is lower triangular with l's on its
diagonal (all its entries are O's and 1 's). This matrix U has determinant 1 and hence
is invertible: The fa (0 < I < pj) form a basis of Fj. If we write U = I + N,
the matrix TV is lower triangular with O's on its diagonal and hence is nilpotent: A
power of N vanishes. This proves that
U^^I-N + N2 + (-\)mNm if 7V"+1=:0.
In particular, the inverse U~lofU has integral entries: The components of the <ptj
m the basis (fa) are also integers. ■
Here is an even more precise result.
Proposition. If f = ^ &iito € Fj, the coefficients are given by
a0 = /(0) and an = f(n) - f(nJ) (n > 1),
where n_ — n — nv-\ pv~l denotes the integer of length strictly smaller than n
obtained by deleting its top digit in base p.
^oof. We have already observed that /(0) = a0- Fix a positive integer n and
c°nsiderthesum/(n) = J2i<pj «ito(")in which fa(n) = Oor 1. More precisely,
to(«)= ! <=>n e B,
4=> n = / mod pv(l)
4=> the digits of n and i
are the same up to v(i)
i is an initial partial sum of n.
182 4. Continuous Functions on Zp
This shows that
/(n) = flo+ (*) + *„,
whereas
f(n J) = fl0 + (*)-
Hence /(n) — /(n_) = g„ as claimed in the proposition. ■
Corollary. When / = £ fli" Vo* £ ^/ tates ^ values in an ultrametricfield, we
have
||/||= max |fl|-|.
Proof. For each xeZpwe have Vi(*) = 0 or 1: \^i(x)\ < 1 and
\f(x)\ = lj^aiifi(x)\<max\atl
This proves
11/11 = sup \f(x)\< max |fl|-|.
Conversely, a0 = /(0) => |flo| < 11/11, and for n > 1,
Iflnl = l/(n) ~ /(Ml < max(|/(M)|, |/(n_)|) < ||/||,
hence max \an | < || /1|. ■
Since (VOi>o is a basis of Tlc(Zp',K) = U;>o ^j» il is easv to genera*ize
the preceding results to all locally constant functions (taking their values in an
extension K of Qp).
Theorem- Let f : Zp —> K be a locally constant function. Define
a0 = /(0), fl„ = f(n) - /(«_) (n > 1).
Then f = £ a,-^,- is a finite sum and \\f\\ = sup,- |fl,|. •
J.J. 77ze van der Pw/ Theorem
We are now able to give the main result, namely the representation of any
continuous / : Zp —> K where K is a complete extension of Qp.
Theorem. Let f : Zp —> K be a continuous function. Define
ao = f(0),an = f(n)-f(n^) (n > 1).
4. Ultrametric Banach Spaces 183
Then \an \ —> 0, and ^jT, ai V^i converges uniformly to f. Moreover,
H/ll =sup \at\ = max \ai\.
i l
Proof. Since \n — m_| —> 0(n —> oo) and / is uniformly continuous, we have
\an\ = l/(w) — /(i-)l -> 0 (m ^ oo), and the series converges uniformly. The
sum of this series is a continuous function,
g = J2ai^i-
We still have to prove f = g. Since these functions are continuous, it is enough
to show that their restrictions to the dense subset N are the same. The obvious
equality /(0) = a$ — g(0) can be used as the first step in an induction on n. Let
8j = T,i<p> ai&- For n < Pj we have
f(n) — /(n_) = an (by definition)
= coefficient of gj (since m < pJ)
= gj(n) - gj(n-),
f(n)-gj{n) = f(n-)-gj(n_).
This shows that if / and gj agree on {0, 1, 2, n — 1}, they will also agree at
the point n (provided that n < pi). As a consequence, for all integers n e N,
/(«) = lim7- gj(n) = g(n) (with a stationary convergence). As mentioned, this
proves f = g. The equality ||/|| = sup,- \ax | is obtained exactly as in the case /
locally constant. ■
4. Ultrametric Banach Spaces
in this section K will always denote a complete ultrametric extension of Qp.
We have already given in (II.3.1) the formal properties of ultrametric norms on
Qp-vector spaces, and we have studied finite-dimensional such spaces over K.
Here we turn to infinite-dimensional ones.
We shall simply say normed space for ultrametric normed space over AT, and
Banach space for complete normed space.
4-1. Direct Sums of Banach Spaces
he direct sum of a family (£,-),-G/ of normed spaces is the algebraic direct sum of
fois family,
££) Ei = {(xi) : only finitely many jc, ^ 0} C ]~[ Et
quipped with the sup norm on the components,
||x || = sup II*, || = max \\xt || if x = (*,-).
184 4. Continuous Functions on Zp
When (Et )ieI is a family of Banach spaces, it is convenient to consider a completion
of the preceding direct sum. Here is the construction. The support of a family
x = (Xi)iei € Yliei Ei>is h = {i e / : Xi ^ 0} C /, and ||xz-1| -y 0 means
for all £ > 0, the set /*(£) := {*" e I : \\xi || > e] is finite.
If \\xi II —► 0, then the support Ix of the family x is flf /nasf countable, since it is
the countable union of the finite sets Ix{\/n)(n > 1).
Definition. The Banach direct sum of the family (£z)jG/ of Banach spaces Et
is the normed space
a,^n^
iel
consisting of the families x = (jc,) such that ||jc, || —> 0, equipped with the sup
norm
ll^ll = 11(^)11 := sup ||jcf || = max ||^ ||.
i l
This terminology is justified by the following result.
Theorem. The Banach direct sum of a family (E;)ieI of Banach spaces is a
completion of the normed direct sum of the family
Proof. The set of families x such that || jc,- || —* 0 is a vector subspace of the product
Yliei Et* ana< tne algebraic direct sum is dense in it. Let us show that the Banach
direct sum is complete,
ie/ iel
Let n i-> jc(71) = (x[[ ),-G/ be a Cauchy sequence in the direct sum. For each i € U
n i-> jc, is a Cauchy sequence in £,. Let X, be its limit. For given e > 0, there is
an integer NE such that
\\x(n) - X(m)\\ <£ (12, Wl >Ne).
A fortiori, for all i e I,
l|JC(«)_jc(m)|| Se (nm>Ne)
Letting m —> oo, we obtain
\\*jn) - Xi\\ < £ (n>NE,i€l). (*>
Since II*, || < £ outside a finite set J (depending on £ and «), we also have
11^-11 < maxdl^ll, 11^ - x/ll) <£ 0" £ 7).
4. Ultrametric Banach Spaces 185
This proves that the family x :— (*,-) is in 0I-G/£I-. Coming back to the inequality
(*), we see that
||x<»> _ x || = sup ||XW _ x. || < £ (n > ^).
i
This proves x(fl) -> x in 0-G/ £f. ■
Example. When all Banach spaces £,- = £ are equal, the algebraic direct sum is
also denoted by
iel iel
Its completion is the space of sequences in E converging to 0: we denote this
space by c0(/; E). (The notation cG(K) is similar to the classical (c0) introduced
by S. Banach when K = C.) When E = K is the base field, or when / = N, we
drop them from the notation if there is no risk of confusion:
cd(/) = co(/; K\ c0(E) = c0(N; £), c0 = c0(N) = c0(N; ff).
We can now formulate a few consequences of the theorem.
Corollary 1. L£tf Ebea Banach space. Then c0(/; £) is fl completion ofE(I) C
£ for the sup norm. ■
Corollary 2- Lef E be a Banach space. Then the sum map £(/) —> E has a
unique continuous extension X : co(/; E) —> £.
Proof. The sum * = (jcJ i-> X!;G/ *» ■ ^(/) -^ £ is a contracting linear map
£x, I < sup 11^11 = 11x11 (*e£(/)).
II IG/ || '"
11 has a unique continuous extension X. This extension is also a contracting linear
map by density and continuity. Hence we have more generally
lJ2xq <suPll*;ll = Wl (* ec0(I:E)). m
II ie/ || l
^his sum X can be computed using any ordering of the index set / and any
grouping / = ]J j.: The equality for families with finite support extends by
continuity to the completion c0(/; E) (cf. (II. 1.2)).
Corollary 3 (Universal Property of Direct Sums). Let £j denotejhe canonical
ejection of a factor into the direct sum Ej -> 0-G/ Ex c 0IG/£,*. Tfterc
J°r each Banach space E and family (fj) consisting of linear contractions
186 4. Continuous Functions on Zp
fj : Ej —> F, there is a unique linear contraction f such that the following
diagram is commutative:
Ej ^ ©IG#ft C ©IG/F,-
fj\ I ®fi / f
E
Proof. Under the assumptions made,
(*.) ^ {fa): @ieiEi -+ c°(/; E)
is a linear contracting map, and composition with the sum X yields the unique
solution to the factorization problem
i
4.2. Normal Bases
When E and F are (ultrametric) normed spaces over K, we denote by L(E; F) the
normed vector space of continuous linear maps T : E -» F. Recall that a linear
map is continuous precisely when it is continuous at the origin, or, equivalently,
when it is bounded:
\\Tx\\
\\t\\ :==sup -n~ir < °°-
By definition, we have
11^*11 < II71IMI (xeE).
This shows that T is a contraction precisely when || T || < 1.
Comment. The inequality ||Fjc|| < ||r||||jc|| (jc e E) shows that ||7x|| < ||T||
when ||jc|| < 1, and hence sup^u^ \\Tx\\ < ||7||. But contrary to classical
functional analysis, this inequality can be a strict inequality: When 1 ^ ||F||, the unit
sphere ||x|| = 1 is empty, closed and open unit balls coincide, and sup^u^i ^
suP|UII<i- For the operator T = id (and \\E — {0}|| discrete in R>o) we have
sup ||x|| = sup 11*11 <l^||id|| = l.
II* II < I II*II<1
Proposition 1. If F is complete, then L(E\ F) is also complete.
Proof. Let (Tn) be a Cauchy sequence in L(E,F). For each x e F, (Tn(x))IS
a Cauchy sequence in the complete space F, and hence has a limit Tx which
obviously depends linearly on x e F. This defines a linear map T : E -* ?'
Let e > 0 be given. There exists an integer NE such that \\Tn - Tm\\ < £ for &*
4. Ultrametric Banach Spaces 187
n m> N£. Letting m->oowe deduce \\Tx — Tmx\\ < £||jc|| for all n,m > N£.
This proves that the operator T — Tm is continuous (bounded); hence T — Tm +
(7 - Tm) is continuous. Moreover, ||T — Tm || < £ when m > NE. This shows that
|| y _ 7^ || -^ 0, Tm -> T (m -» oo), and everything is proved. ■
Corollary. For any normed space E, the topological dual Ef = L(E; K) is a
Banach space. ■
Example. Let / be any index set and E a Banach space. The vector space of
bounded sequences a = (fl,-)ie/ m E with the norm ||a|| = supf- ||a,-|| is a normed
space, denoted by /°°(/; E) (it is complete: cf. exercise).
The universal property of a direct sum consisting of factors E, all equal to the
same Banach space E and for linear forms cpi : E —> K leads to the following
statement.
Proposition 2. The topological dual of the space co(/; E) is canonically
isomorphic as a normed space to /°°(/; £")•
Proof. If <p is a continuous linear form on c0(/; E), we let <pt = <po et denote the
restriction of tp to the i th factor E in co(/; E) (families having a zero component for
all indices except i). Since \\<pi || < |M|, we get a bounded family (&) e Z°°(/; £")-
Conversely, if (^f) e /°°(/; £')> we can define a linear form ^ = E^f on c0(/; £)
by the formula (a,) i-> £f- ^ite) (a summable series, since the sequence <pi is
bounded and \\at || -> 0). Both maps
^^(^o£,-), (^-)h> x^-
are linear and decrease norms. Hence they are inverse isometries. ■
In other words, the bilinear map
(te),fe))H>X^^fe), c0(I-E)xloo(I;Ef)^K
ls a duality pairing that proves the proposition.
Corollary. The space /°°(/) = /°°(/; AT) w fl Banach space. m
Inthespaceco = c0(/), the family of elements ef- = (^^(Kronecker symbol)
as the following basic property. Each sequence a = (an)n>o e c0 is the sum of a
Unique convergent series
n>0
188 4. Continuous Functions on Zp
and ||a|| = supw>0 \an\ == max„>0 \an\- We say that this family of elements e,- =
(8ij) constitutes the canonical basis of this space (in spite of the fact that it is not
a vector space basis: In linear algebra, linear combinations are always assumed to
be finite linear combinations).
This leads to the following definition.
Definition. A normal basis in an ultrametric Banach space E is a family (e,-),-€/
of elements of E such that
• each x e E can be represented by a convergent series
x = Ylj xiei where the sequence of components \xi\ —> 0,
• in any representation x = X!/ xiei we have
||x||=sup-€/ \Xi\.
A normal basis is sometimes called an orthonormal basis. In particular, for each
convergent series ^7 ;c,er-, the set of nonzero components is at most countable, as
observed earlier. If (e,-)/ is a normal basis, we have ||ef-1| = 1 for each i e I. On
the other hand,
X = J2Xie* = J2 *"*'" =* Xfc1' ~ >7l)e' = °'
j i i
and by the second postulated property of a normal basis,
sup \x{ -yi\ = ||0|| = 0, =^ xt = yi (i e /),
whence the uniqueness of representations in normal bases. All properties of normal
bases are summarized in the following obvious result.
Proposition 3. Let E be an ultrametric Banach space having a normal basis
(ei)isi- Then the mapping (jc,-) \-> ^2iGl *;e,- defines a linear bijective isometry
co(/; K) —> E. Conversely any linear bijective isometry c0(/; K) —> E defines
a normal basis in E, namely the image of the canonical basis ofco(I; K). ■
Example 1. The Banach spaces c0(/; K) supply examples of ultrametric spaces
with normal bases. In particular, when the index set / is finite, we get the (finite)
product spaces Kn with the sup norm (cf. exercises).
Example 2. Let E = C(ZP; K) be the space of continuous functions Zp -* K
(where K is a complete extension of Qp) equipped with the sup norm. The Mahler
theorem (2.4) asserts that the binomial polynomials constitute a normal basis of
E: The map
c0(K) -> E : (ak) i-> ]T^/* = $^<fe
k>0 £>0
G
4. Ultrametric Banach Spaces 189
is a bijective linear isometry. The van der Put theorem (3.3) asserts that the sequence
(jj/j)j>o constitutes another normal basis of E. These two normal bases are quite
different in nature.
4.3. Reduction of a Banach Space
Let K be a complete extension of Qp and E an ultrametric Banach space over K.
We keep the general notation for
• A = BS\(K): maximal subring of K,
• M = £<i(tf): maximal ideal of A,
• k = A/M: residue field of K.
Moreover, we consider the closed unit ball in E, E\ = {v e E : \\v\\ < 1}, as an
A-module and ME\ = {kx : X e M, x € Ej) as an A-submodule (MEi is
obviously an additive subgroup of E\). As a consequence, E = E\/ME\ is a
ft-vector space.
Remark. We have quite generally ME\ C Z?<i(Zi). This inclusion is in general
a tfricf inclusion. For example consider any finite, ramified extension K of Qp as
a Banach space over Qp. Its open unit ball is strictly larger than pBsx(K): The
open unit ball contains an element of norm \p\l/e, while all elements of pB<\(K)
have norms < \p\ < \p\l/e.
Lemma. If either \\E\\ — \K\, or \K*\ is dense in R>0, then ME\ — B<X(E).
Proof. In the first case, if k = ||jc|| < 1, we can write x = X ■ (x/k) e ME\. In
the second case, if ||jc|| < 1 we can choose a scalar k e K with ||jc|| < \X\ < 1
and still write x = A. • (jc/A) e M£i. ■
Proposition. 7f (ei)ieIJs a normal basis of E, then (e, mod ME\)iGf is a basis
of the k-vector space E.
Proof. Define £t — (ef mod ME\) e E. These elements generate E: If x =
(* mod Mf"]) e £", we can write x = X!^^ w^tn ^ I**I — 1 an(^ on^y finitely
many \xi\ = 1, giving rise to a finite linear combination jc = X^ifo' moc* ^)^i-On
the other hand, take a linear combination £\ «,£, =0e£ = E\jME\ (a,- € ft,
only finitely many nonzero such coefficients). We can choose scalars a,- e A c ^
with cxi = (a- mod M) and a, = 0 if a( = 0. By assumption
J2ate< eME* <= fl<i(£),
namely || £\ ma || < 1. By definition of a normal basis,
sup \at\ = X^J < 1
190 4. Continuous Functions on Xp
and \at | < 1 for all i. This proves that all at = (at mod M) — 0 e k and the linear
combination is trivial. The family (£,)/ is free and is a basis of the reduced vector
space E. ■
4.4. A Representation Theorem
With the same notation as before, observe that the closed balls Bsr(E) (r > 0)
and the open balls B<r(E) (r > 0) are A-modules and for any ideal / of A,
Bsr(E)/IBSr(E) is an A/7-module.
In particular, if the ideal / is principal, say / = (£) with |§ | < 1, then
BSr(E)fi;BSr(E) is an A/(f )-module.
Let us generalize the expansion theorem (II. 1.4) to the vector case.
Theorem. Let E be an ultrametric Banach space, f e K, |§ | < 1, and choose
a set of representatives S C B<r(E)for the classes mod £;Bsr(E). Assume that
0 e S. Then every element x e B<r(E) can be represented uniquely as the sum
of a convergent series
Proof. Take for a0 the (unique) representative in S with x—ao el; BSr(E). Hence
x — ao = r\ = §jci for some x\ e BSr{E). One can proceed similarly for x\ and
find elements a\ e S, x2 € BSr(E) with x\ — a\ = i-x2, namely
x =a0 + <3i£ + t2*2-
Iterating the construction, we obtain a series J2i>o a& • which converges to x. For
this part of the proof, the completeness of E is not needed, since the element x,
the limit of partial sums, is known a priori. But when E is complete, every series
X^x)^"?1 w^tn coefficients a, e S is convergent, since \a^l\ < r|£|' —* 0. The
uniqueness statement is immediately verified. Indeed, if X^o^'?' = 0» we ^ave
*">i
hence a0 = 0, since this representative is in 5. By induction, all ax = 0. •
4.5. The Monna-Fleischer Theorem
In a Banach space co(/; ^0, we have
||x|| =sup \x{\ = max |x,-| e |/T|.
/ 7
4. Ultrametric Banach Spaces 191
Hence if an ultrametric Banach space admits a normal basis, we have
\\E\\ = \K\, ||£-{0}|| = |#x|.
Theorem. Let K be a complete ultrametric field with \K*\ discrete in R>0 and
E an ultrametric Banach space over K. Then E admits a normal basis precisely
when\\E\\ = \K\.
Proof. The preliminary comment proves the necessity of the condition.
Conversely, let us show why it is sufficient. Since K has a discrete valuation, its
maximal subring A = R is principal, with maximal ideal M = P = nR. Then
PE\ =ttE\. Let us choose and fix a system of representatives S c R for the
classes mod P, with 0 e S. Also choose and fix a basis (e,-)/ of the k- vector space
E = E\/7tE\ with liftings e,- e E\9 so £,- = e^ mod ttE\. I claim that (e,)/ is a
normal basis of E. Consider first the case of a vector x e E\: ||x|| < 1. The vector
x = (x mod ttE\) can be expanded in the k-basis (£,-)/, say x = J2 ai£i (only
finitely many at ^0). Consider the representatives
af* e S, a™ = a,- (mod P);
hence af* = 0 except for finitely many indices. If ||x|| = 1, at least one \af)\ = 1
andalll^l < 1. We have
ri =x-£a?))e/e £<!(£)-
By the lemma in (4.3), we have B< \ (E) = P E\ = it E\, and the same construction
with the vector 7r_1(x — YLaT**i) e Ei gives a family
0 for finitely many indices only,
such that
x = ^^0)e/+7r^^1)e, + r2 to e ir2*,).
By iteration, we obtain a sequence r„ e iznE\ and convergent series
Qi = a™ + 7TflJ!) + - - - e /? C AT
giving a representation * = X!/ fl*e* witn ai ~^ 0 (f°r fixed j<> only finitely many
Gi ¥" 0). At each step of the iteration we have to choose a scalar kn such that
\\K(7i-nrn)\\ = 1.
This is possible by the assumption || E \\ = \ K |. ■
192 4. Continuous Functions on Zp
4.6. Spaces of Linear Maps
Let AT be a complete ultrametric field and F, F two ultrametric Banach spaces
over K. Assume that E admits a normal basis and fix an isomorphism cq{J) = E.
Then any linear map T : E -> F furnishes a family of f, = Fe7 e F, namely the
image of the normal basis of E — canonical basis of c0(J). When the linear map
T is continuous, this family is bounded
\\fj\\<\\T\\\\ej\\ = \\T\\.
We thus obtain a linear map
L(E;F)->r(J;F) : T h-> (f,-)y.
Proposition 1. /^.sw/ne tfzaf F admits a normal basis and fix an isomorphism
co(J) = E. Then the map
L(E-F)^r(J;F)
defined above is an isometric isomorphism.
Proof. We have already seen that ||(f/)/ll < \\T\\. Conversely,
x = y^*/ej =>• F(x) = y^JCj-fy (this sum converges!),
j j
\\T(x)\\ < sup||*7-f7|| < sup 1^-1 sup \\fj\\ = IM|sup||f7-||
j j j J
whence \\T\\ < sup7- ||f7|| = ||(f/)/ll- Observe that for any choice of bounded
family f7 e F, there is a T € L(E; F) with Tej = f7 (J e 7), so that the map
L(E; F) -> l°°(J; F) is surjective. ■
In particular, for F = AT, we get the following result (cf. Proposition 1 in (4.2)).
Corollary. There is a canonical isometric isomorphism
(c0(J))' = i°°(jy ■
Assume now symmetrically that F has a normal basis and fix an isomorphism
c0(/) = F. The linear maps T : E -+ F = c0(/), x \-+ T(x) = (yv(x)) gi^3
family (^,)/ of linear forms (pt■ \ E -> AT. If F is continuous, so are the linear
forms tpt and ||F(x)|| = sup,- |^(x)|,
||F||=sup||F(x)||/||x||
= sup sup |^(x)|/[|x||
= sup sup |p,-(x)|/||x|| = sup ||^||.
i x^O i
This proves the following proposition.
4. Ultrametric Banach Spaces 193
Proposition 2. The linear map
L(E- c0(D) -> /°°(/; E'):Tv^ fa)
is isometric, but not always surjective. ■
When both E and F have normal bases, we get the following statement.
Proposition 3. When E = c0(J) and F = co(/), we can make canonical
identifications
L(E-F) = Z°V;co(/)) C /°°(/;E') = /""(/-./"V)) - Z°°(/ x J). m
In other words, when normal bases are chosen, continuous linear maps E —> F
are represented by bounded matrices with columns in c0(/) = F.
More particularly, if T is continuous and of rank less than or equal to 1. we can
write
T(x) = <p(x)a = ((P(x)ai)I
for some tp e E\ In this case, ^(x) = <p(x)at, \\<p( \\ = \at | ||^|| -> 0. This proves
that the image of T belongs to the closed subspace c0(/; E'). By linearity, the same
property will hold for any continuous linear map T of finite rank:
Lh(E-c0(I)) -> c0(/; E'):Tb+ fe).
Definition. A completely continuous linear map T : E —> F is a linear map
that can be approximated {uniformly on the unit ball) by finite-rank continuous
linear maps.
If we denote by LCC(E, c0(/)) the space of completely continuous maps E —* F,
thenT m> (^) defines an isometric map LCC(E, co(/)) —* cq(/, E'). It is surjective:
It is enough to check that the image of the finite-rank operators is dense in the target
space. But if (<pt) is an arbitrary sequence of continuous linear forms on E with
fell -^ 0, and £ > 0 is given, there is a finite subset J C / such that \\<pi\\ < e
f°r i & J- Define TJ/t = <pt for i e 7 and Vi = 0 for i £ 7. Then (^,) is the image
°f a continuous finite-rank operator and ||(^,) — 0Ai)|| < £•
Comment. One can show that when A' is a locally compact field, the completely
continuous maps T : E -> F are precisely the linear maps that transform bounded
sets of E into relatively compact sets in F. These transformations are classically
called compact linear maps. In the general case, the distinction between compact
a°d completely continuous operators has been studied in detail and has led to the
definition of compactoids.
194 4. Continuous Functions on Zp
4 J. The p-adic Hahn-Banach Theorem
Let E be a normed space, V c £ a vector subspace, and p>: V —> AT a continuous
linear form
Ml = sup —— < oo.
Is there a continuous linear form 7p : E -> K extending <pl If the answer is positive,
can we find a linear form Ip with the same norm ?
Theorem (Ingleton). Let V be a subspace of a normed space E. When the base
field K is spherically complete, the restriction map
if m> <p = 1r\v, E'-> V
is surjective. Moreover, for each (p e V, it is possible to find an extension ij/ = lp
withM\\ = \\<p\\.
Proof, (a) Let us show first that a continuous linear form on a subspace V ^ E
can be extended to V + Ka (for any a e E — V) without increasing its norm. The
definition of ty = Tp has to satisfy
U(x + ka)\\ < |M| - \\x+ka\\ {xeV,le K).
For A = 0 this is satisfied, since \^|v = (p. When A ^ 0, we may divide by -A
and see that it is sufficient to find a linear form ij/ with
\mx-a)\\<\\<p\\.\\x-a\\ (xeV),
\\<p(x) - 1r(a)\\ < ||p|| - \\x - a\\ :- r* (x e V).
In other words, we have to choose ct = ij/(a)m the intersection of the balls Bx =
B<rx((p(x)) C K. For any pair of points xjeV, <p(x) e Bx and ^(y) e £> are
at distance
\<p(x) - <p(y)\ < ||p|| ■ ||x - y\\ < ||p|| max(||x -a\\,\\y- a\\) = maxfo, ry).
This proves that the smallest among the balls Z?x and By is contained in the largest:
Bx H Z?v ^ 0. Since we are assuming that the field K is spherically complete,
the intersection Plxev &x *s not empty and any a in this intersection is a possible
choice for a = i/(a).
(b) Consider now the set of pairs (V, <pr) consisting of a vector subspace V' D V
and an extension (pf of(p to V with the same norm as <p. This set of pairs is ordered
by the relation
(V", <p") >- (V, ?') <=> V" 3 V and p'V = <P ■
5. Umbral Calculus 195
Any linearly ordered set of such pairs has an upper bound. By Zorn's lemma,
there is a maximal pair. By the first part, this maximal pair is defined on the whole
space E. ■
5. Umbral Calculus
The Mahler theorem (3.4) has been generalized by L. van Hamme. To be able
to give this generalization, we have to briefly review umbral calculus: This term
has its origin in the nineteenth century, when formal computations were used with
little justification. Today, it refers to an algebraic treatment of polynomials, power
series, and identities between them.
5.1. Delta Operators
Let K be a field of characteristic 0 and K[X] the vector space of poly nomials (in
one variable) with coefficients in K. The translations xa (a e K) are the linear
operators in K[X] defined by
(rfl/)(X)=/(X+fl)-
We shall often identify the indeterminate X with a variable x (in K or in an
extension of K: Since K is infinite, there is no danger in identifying formal polynomials
and polynomial functions on K). Since the degree of the zero polynomial is not
defined, let us adopt the ad hoc convention deg(O) = — 1: This allows us to speak
of tiiesubspace of polynomials having degree less than or equal to n for any n > 0.
The unit translation will also be denoted by x\ = E.
Definition. A delta operator is a linear endomorphism 8 ofK[X] such that
(1) 8 commutes with all translations xa (a e K),
(2) 8(X) = c e K x is a nonzero constant.
Proposition. Let S be a delta operator in K[X]. Then
(1) 8(a) = Ofor all constants a e K.
@) iff is a nonconstant polynomial, then deg(<5/) = deg/ — 1.
Proof. By hypothesis, 8(X) = c ^ 0, and by translation,
c = xac = xa8X = 8xaX = 8(X + a) = 8X + 8a = c + 8a.
"ence 8a = 0 for all constants a e K. To prove the second point, it will suffice to
show that
deg(8Xn) = n- 1 (n > 1).
196 4. Continuous Functions on Zp
Fix an integer n > 1 and put SX" = f(X). Then
f(X +a) = raf(X) = raSX" = Sra(X") = S(X + a)"
and for X = 0,
/<«)=£(*)«**<*"~*)<p).
or
/(*) = ^ (^ W"-*)(0) • X*.
We see that / is a polynomial of degree less than or equal to n with a coefficient
of Xn given by 5(1)(0) — 5(1) = 0 (using the first part, already proved). The
coefficient of Xn~l is n8X(0) = nc ^ 0 (the field K has characteristic 0). Hence
f(X) = 8Xn is a polynomial of degree n — 1. ■
Corollary. The image by a delta operator of the subspace of polynomials of
degrees less than or equal ton(n > I) is the subspace of polynomials of degrees
less than or equal ton — 1.
Proof. The dimension of the image of a linear operator is equal to the dimension
of the source minus the dimension of the kernel. The assertion follows from the
proposition. ■
Examples. (1) The differentiation operator D is itself a delta operator. More
generally, if a e AT, the operator xaD = Dxa is a delta operator.
(2) The finite difference operators (recall E = x\: Unit translation)
V = V+ = n - id = E - id,
V_ =id-r_i = t_iV,
and xa V± are delta operators. When a ^ b, the operators xa — xb are also delta
operators.
(3) Any formal power series in D of order 1, namely
defines a delta operator. For example
log(l + D) = D - \D2 + \D3 ,
^-1 = D + ^Z>2 + £Z>3 + ---,
D2/(eD- 1)-Z)-|Z)2 + ..-
are delta operators.
5. Umbral Calculus 197
5.2. The Basic System of Polynomials of a Delta Operator
Let 8 be a delta operator. If / is a nonzero polynomial, there is a polynomial g
such that 8(g) = / (and necessarily deg g = deg / -f 1 by the preceding section).
This polynomial g is determined up to an additive constant, since the kernel of 8
consists of the constants. Replacing g by g — g(0), we see that there exists a unique
polynomial g such that
8(g) == /, g(0) = 0 (normalization),
and the degree of this polynomial is one more than the degree of /.
Definition. The basic system (pn)n>o corresponding to a delta operator 8 is the
system of polynomials such that
1. degprt = n (n > 0),
2. 8pn = npn-x (rc>l),
3. /*) = 1, p„(0) = 0 (n> 1).
Starting with po = 1 there is a unique polynomial p\ (of degree 1) such that
Kp\) = 1 and p\ (0) = 0. Proceeding inductively, there is a unique polynomial pn
(of degree n) such that 8(pn) = «p„_! and p„(0) = 0. Hence the definition
characterizes a unique system of polynomials for any delta operator. Explicit formulas
for computing these polynomials will be given in (5.5) and (6.2). Any basic system
constitutes a AT-basis of the vector space K[X].
For example, the basic system of the delta operator D (derivation) is the system
(*")h>o of powers of x. For the operator 8 = V the basic system is
(x)n = x(x — l)---(x—n-hl) (Pochhammer symbol)
with the convention (jc)0 = 1. We indeed have (1.1)
V(x)„ = h(x)„_i (n > 1),
anc* (x)n vanishes at x = 0 if n > 1. For 8 = V_ the basic sequence consists of
the polynomials pn(x) = jc(jc 4-1) - • • (x + n — 1). For every basic sequence, we
have
Skpn = n(n - 1) • - - (n - k + 1) • /?„_* (ft < «).
h particular,
8npn =nl- p0 = nl
Generalized Taylor Expansion. Let 8 be a delta operator and (pn )„>0 its basic
system in K[X\ Then we have general expansions
f(* + y) = E Mr^ •Pk(y) (/ e ^[X])-
i>0
198 4. Continuous Functions on Xp
We can indeed write the tautology
8k(Pn)(0)
Pk,
where all coefficients of the pk are 0 except for pn, which is 1. For any linear
combination / of the pn we obtain by linearity
f y^ **(/X0)
Replacing / by one translate rx f in the preceding equality, we obtain
= z.n^mpk-
which is the announced generalized Taylor expansion. ■
In particular, if we take for / the polynomial pn, we obtain the following
equalities.
Binomial Identities. Any basic sequence of a delta operator satisfies the
following identities:
Pn(x + y)= 2^ (,. I ■ Pk(x)pn-k(y)-
0<k<n
The binomial identity can be written in the mnemonic way
Pn(x+ y) = "<jKx)+ &))*"
where one must remember to replace powers by indices in the binomial expansion
of the right-hand side.
5.3. Composition Operators
Definition. A composition operator is an endomorphism T of K[X] that
commutes with translations.
We shall determine all composition operators. More precisely, we shall establish
the following result.
Theorem. The following properties of an endomorphism T: K[X] —* K[X]
are equivalent: They characterize composition operators.
(i) T commutes with the unit translation,
(ii) T commutes with all translations.
5. Umbral Calculus 199
(Hi) For all delta operators 8, T can be written as a formal
power series in 8: T = (p(8) e K[[8]].
(iv) T = (p(D) e #[[D]] is a formal power series in the derivation D.
(v) T commutes with the derivation TD = D T.
(vi) T commutes with any delta operator.
Proof. (/) =» (it) Write x\ = E and use the Taylor series expansion around n:
Enf(x) = f(n+x) = J2 f(k\n) - ^-
By the commutation hypothesis T E = E T we infer
T(xk)
EnTf = TEnf = ]T fik)W " —
*>0
Put g = Tf and consider the polynomial in two variables
T(xk)
F(x,y) = g(x + y)-Y,f(k)W'
k>0
k\
We have seen that F(x, n) = 0 for all positive integers n and all x. If x is fixed, the
corresponding polynomial in y has infinitely many roots and is consequently
identically zero. This proves F == 0. Hence xyg - Txyf = 0, or xyTf - Txyf = 0.
Since this is valid for all polynomials /, we see that the operator T commutes with
translations.
(ii) => (Hi) Let T be a composition operator and 8 a delta operator. Write the
generalized Taylor formula using the basic sequence (pk) corresponding to 8:
(first for fixed y and variable x). We have
Let us apply the composition operator T to this polynomial,
and evaluate it at the origin (we now fix x = 0 and consider a variable y):
(Tf)(y) = (tv7Y)(0) = (Tzyf)(0) = £ <ZM2> . a*/(>').
Hence
(7»(0) ,,
200 4. Continuous Functions on Zp
and finally,
v^ (Tpk)(0) .
t = 22 i •5 G * [ph-
The coefficients of the expansion of a composition operator T as a power series
in the delta operator 8 — say T = ^ak8k — are given by ak = (Tpk)(0)/k\. In
particular, let us remember that for the case 8 = D these coefficients are
(7V))(0)
a, —
k\
Since obviously (ii) => (i) and (Hi) => (iv) => (v) => (vi) it only remains to prove
(vi) =^ (H) to accomplish the full cycle of equivalences. Let 8 be a delta operator
and write the generalized Taylor formula for an arbitrary polynomial:
/(* + y) = £aV(*)-/?*(y)/*!,
This shows that all translations can be expressed as formal power series in any
delta operator. As a consequence, if an operator commutes with a delta operator,
it commutes with all translations. ■
For convenience, let us use the following notation for the commutant of a subset
A of the endomorphism ring of K[X]:
Af = {T e End K[X] :TS = ST for all S e A}.
The commutant of A' is the bicommutant — or double commutant — of A: It is
denoted by A" = (Af)f.
Corollary. In the endomorphism ring ofK [X], the commutant of a delta
operator 8 can be identified with the ring K [[£)]]. In particular, this commutant is
commutative, and the bicommutant of any delta operator can be identified vM»>
the ring K[[D]].
Proof. The equivalences (v) o (vi) o (iv) of the theorem show that the
commutant of the derivation, or of any delta operator, coincides with the ring of power
series in the derivation D. This ring is independent of the delta operator in question
and is commutative; hence by (ii) o (v) 4> (vi) we have
{D}' = {tv :yeK}' = {8}' c EndK[X].
5. Umbral Calculus 201
Since {D}' is commutative, {D\ C {D}". On the other hand, the operation that
consists in taking the commutant obviously reverses inclusions, and
D e {D}' =» {D}' d {D}". m
Let T be a nonzero composition operator. We can write 7 = $2 ->v 0j£7 with a
first nonzero coefficient flv ^ 0 (v > 0). In this case, we say that the composition
operator 7 has order v and we write T = DVS = SDV with a composition operator
S of order 0, namely, 5 is invertible. Since the kernel of the operator Dv consists
of the polynomials of degree less than v, we infer
v = dim ker Dv = dim ker 7.
This equality shows that the order of a composition operator is independent of the
delta operator used to represent it as a power series. On the other hand, the delta
operators are the composition operators of order 1.
If T = YlajDJ anc* T' = Ylbj D3 are two composition operators, then T o V
is also a composition operator, and its formal power series is obtained by
multiplication of the formal power series giving T and Tf.
5.4. The van Hamme Theorem
Let T be a continuous endomorphism of the Banach space C(ZP) of continuous
functions on Zp (and values in a fixed complete extension K of Q^). Let us recall
the definitions of the norms
||/||= sup |/(x)|= max |/(jc)|,
sup = max taken on the compact space Zp,
urn = supii7/i!/ii/n= sup ii2Yii.
/¥o 11/11=1
When this continuous endomorphism commutes with the unit translation operator
. = Ti> it also commutes with the (forward) difference operator V = T\ — id and
lts Powers. Hence 7 leaves the subspaces ker Vn c C{ZP) invariant.
Lemma. The subspace kerVn ofC{Zp) consists of all polynomials of degree
strictly smaller than n.
tttooF. The statement is obvious for n — 0 and 1. In fact, if Vn/ — 0, the finite
^fference theory applied to the restriction of / on N shows that this restriction is
a Polynomial p of degree smaller than n. Hence we have / — p on N and also on
^p by continuity and density of N in Zp. ■
202 4. Continuous Functions on Z>p
As a consequence, any continuous endomorphism 7 of C(Zp) commuting with
E (or equivalently, with V) leaves the polynomial subspace
n = K[X] = |JkerVn c C(ZP)
invariant and induces a composition operator in this space. Let us expand this
composition operator as a formal power series in the delta operator V
T|n = £>nV" e/a[V]].
If 7 ^ 0, the order v of 7 is the index of the smallest nonzero coefficient. Since
|| V|| = 1, the ultrametric inequality shows that
||71| <sup|a„|.
On the other hand, the basic polynomial sequence of the delta operator V is the
sequence (x)n — x(x — 1) - - - (x — n + 1), and the coefficients an are given by the
formula
T((x)n)
<Xn = — -
n!
In particular for n = 1, 1^1 = \\T(x)\\ < \\T\\\\x\\ = \\T\\. If we assume ||7|| =
\(Xy I, we see that \an | < \a\ \ for all n > 1. The main step of van Hamme's
generalization of the Mahler theorem can now be given.
Proposition. Let 7 be a continuous endomorphism of C(ZP) that commutes
with V. Assume 7(1) = 0 and \\T \\ = \ce\\ = h so that T induces a delta
operator on K[X] with basic polynomial sequence (/?n)n>o-"
po = 1, degpn = n, T(pn) = npn„x and pn(0) = 0 (n > 1).
Then \\pn/n\\\ = I.
Proof. Let us use the renormalized qn = pn/n\, so that by definition
q0 = 1, dcgqn = n, T(qn) = qn^, andqn(0) = 0 (n > 1).
We have to prove ||qn || = 1 (n > 1). Replacing 7 by 7/als we may assume ct\ = 1>
1 = Ikoll = II7*!|| < ||9l|| = \\Tq2\\ < \\q2\\ < • •
Now by assumption, 7 = V + a2V2 H = V(/ + a2V H ) = WU with
an invertible composition operator U = (/ + a2V + --•), II ^11 = 1 Define
V = U~l = I - a2V H also with || V|| = 1. We claim that there is a suitable
continuous invertible composition operator S, with || 51| = 1, such that
qn = SV"(fn), (*)
5. Umbral Calculus 203
where fn = 0 denote momentarily the binomial polynomials: V/n = /n_i- First,
for any composition operator 5 of order 0, the preceding definition leads to
polynomials with degqn=n. Moreover,
Tqn = To SVn(fn) = VU o SVn(fn),
and since U = V-1 and all the operators in question commute,
Tqn = 5V""1 o V(/„) = SVn~\fn-X) = qn-x.
There only remains to construct a suitable invertible composition operator 5 with
||S|| = 1 so that the formula (*) furnishes polynomials with qn(0) — 0 (n > 1).
Let us take
V
S = / - V— = / - VV'tf,
where V is given by the formally derived power series in V giving V. Namely,
V = 7 + 5^ A.V" =» V = £nA,V"-1.
«>i «>i
Now we have
SV"(/„)=(/-Vy)oVU)
= (Vn-VV,-1V')(/„).
Recall that all the operators are formal power series in V, and V*/„ = fn-k
vanishes at the origin for k < n. The only interesting term is thus the monomial
containing V"/„. But if <p = <p(t) is a formal power series, the formal power series
<p" - t<pn-x<p' = <pn~ (t/n)(<pny = ^ - {t/nW
has a coefficient of tn equal to 0. Since this is the constant term in SVn(fn), this
proves that qn(0) — 0. All operators used in the definition of 5 have norm less than
<*equal to 1; hence ||5|| < 1, \\qn\\ < IISIIIIVIIII/«II = 1- ■
Theorem. Let T be a continuous endomorphism ofC(Zp) that commutes with
^- Assume T(l) = Oand \\T\\ = \T(x)\ = 1. Define the polynomial sequence
q0 = 1, dcgqn = n, T(qn) = qn„u qn(0) = 0 (n > 1).
Hicn ^flch continuous function f e C(Zp) c<zh Z?e expanded in a generalized
Mahler series
vwfc c„ = (T"f)(0) -> 0 and ||/|| = sup„>0 |c„|.
204 4. Continuous Functions on Zp
Proof. Using the notation of the preceding proposition, we have T = VU with U
invertible and \\U\\ = 1. Hence
\Tnf(0)\ < nr"/ll = \\unvnf\\ < nwil -* o
(by the Mahler theorem). It will be enough to establish all statements for the
polynomial functions /, since the general case will result from this by density
and continuity. The generalized Taylor expansion of a polynomial / takes the
form
/ = £>"/)(0) • —^ = X>"/)(0) • q„.
From ||#n|| = 1 follows quite generally
11/11 < sup\cn\,
and from the asserted formula for the coefficients,
k„i - l(r"/)(0)l < lir"/ll < IIrBII11/11 < \\T\\n\\f\\ < ll/ll,
whence conversely sup |cn| < ||/|| and finally sup \cn\ = ||/||. ■
Comment. The generalized Mahler expansion is not valid for the delta operator
D (differentiation): This operator does not extend continuously to all of C(ZP)-
On the other hand, its renormalized basic sequence isqn(x) — xn/n\, and even if a
series expansion f(x) = $^n>oc«xn//1- converges uniformly, ||/|| = sup|/(x)| =
max |/(jc)| is not usually equal to sup |cn|. The delta operator D on tf [X]) has a
power series expansion in V with coefficients
an — D I —- I (0) = coefficient of x in ( I
— constant coefficient of (x — 1) ■ • • (x — n -f l)/n! = (—l)""1 /n.
In particular, ax = 1, but \an\ > 1 when n is a multiple of p, so that the theorem
is not applicable.
5.5. The Translation Principle
To illustrate an important principle we begin with a particular case.
Example. We know that the basic sequence for the delta operator D is the sequence
of powers. The basic sequence corresponding to a translate xaD of D is
pn(x) = x(x - naf~x (n > 1).
5. Umbral Calculus 205
Indeed, we have
Dpn = (x - naf~l +(n- l)x(x - na)n'2
— (x — na)n~2[x — na -f nx — x] — n(x — na)n~2[x — a],
whence
*aDPn = h(* + a — naf~\x + a — a] = npn^x.
To be able to prove the general translation principle we need a couple of easy
results.
Lemma. Let T = <p(D) = ]C«>ofln^n ^ fl composition operator and let Mx
be the multiplication by x operator f \-± xf. Then
TMx-MxT = <p'(D).
Proof. By definition,
(DMX - MxD)f = (jc/y - */' = /,
whence DMX — MXD — I (identity operator). Similarly,
(DnMx » MxDn)/ = (xfin) - x/(n) = nf{n~l\
whence DnMx - MXD" = nD{n~l\ This is the particular case T = Dn of the
expected formula. The general case results by additivity, since for any polynomial
/, (TMX - MxT)(f) is a finite sum
J2<*n(DnMx - MxDn)f = J^annD^f
(only terms with n < deg(/) + 1 really occur). ■
Comment. One can define the Pincherle derivative V — TMX~ MXT of any
composition operator. For T = <p(D) the lemma shows that V = <p'(D). A sim-
uar result has been used for a long time in quantum theory: If Mf denotes the
"Multiplication operator by a polynomial /, then
DMf — MfD = Mfi multiplication operator by the derivative /'.
Observe that in (5.4) we have used a different derivative, namely a derivative with
respect to a series expansion in the operator V. For this reason, it is always necessary
*° specify with respect to which delta operator the derivative of a composition
°Perator is taken.
206 4. Continuous Functions on Z,p
Proposition. Let 8 be a delta operator and write 8 = D<p(D) with an invertible
power series <p. Then the basic sequence of polynomials of 8 is given by
pn=x<p(Dyn(xn-1) (#i>l).
Proof. Since (p(D) as well as cp(D)~n are invertible operators, <p(D)~n(xn~l)]s
a polynomial of degree n — 1 and the polynomial pn = x(p(D)~n(xn~l) has
degree n. Since x divides p„, we obviously have p„(0) = 0. It only remains to
check that 8pn = npn_i- By definition, pn = Mx(p(D)~n(xn~l), so that 8pn =
D<p(D)Mx<p(D)~n(xn~l). Using the lemma, we can write
A/X<p(£>r"(*n_1) = <p(DYnMx(xn-1) - [^(Dn'tx""1)
= <p(DTn(xn) + «[^(D)-n-1](x"-1).
Hence
5pn = Dcp(D)MMD)-»(x"~l)
= D^(D)[^(D)-"(x") +^z[^(D)-"-1](^"-1)]
= (p(D)-n+l(Dxn) + n^(D)-n(Djc'1-1)
= ^(DrB+1(nJc"_1) + n(n - lMD)-n(x"-2)
= N(/))-"+1Mx+n(n - 1)^(D)-"](jc"-2).
Using again the lemma to bring the operator Mx into the first position, we obtain
fyn = [Mxn<p(D)-n+l + (n<p(D)-n+ly + n(n - l)<p(D)-n](xn-2)
= nMx(p(D)-(n-l\xn~2) + [-n(n - l)^(D)"n + n{n - lWDnt/"2)
= /iM^(D)-«»-V2) = npn-L ■
The Translation Principle. Let 8 be a delta operator and (pn)n>o its basic
sequence. Then the basic sequence of the translate delta operator xa8 is given
by po— 1 and
~ x
Pn = Pn(* - na) (n > 1).
x — na
Proof. By the explicit formula of the proposition with 8 = D<p(D), we have
pn = Jch^D)]""^-1)
= XT.^JPr"^"1) = JCT_Bfl[(l/jc)pB]f
from which the translation principle follows. *
Observe that since the polynomial pn is divisible by x (n > 1), pn(x — na>
is divisible by x — na and the polynomial pn is divisible by x: It vanishes at the
origin as required. Several cases of this translation principle are interesting. F°r
example, the case a — —\ leads to the backward difference operator r_i V = V->
while a = — ^ leads to a centered difference operator.
6. Generating Functions 207
Umbral calculus
Delta
operator
(IV.5)
D = d/dx
Basic sequence
of polynomials
(IV.5.2)
Related
sequences
(IV.6.1)
(*")„>o
umbral
operator
(Pn)n>0
Appell sequences
Dpn = «/?„-!
Sheffer sequences
8s„ — nsn-i
n>0
(IV.5.5) translation principle
Binomial identity: pn(x + y) = " (p(x) + p(y))n"
Appell sequences: pn{x + y) = " (p(x) + y)n,"
Sheffer sequences: sn(x + y) = " (s(x) + p{y)TT
cf.(V.5.4), (V.5.5) for the example of the Bernoulli numbers and polynomials.
6. Generating Functions
6.L Sheffer Sequences
In this section 8 will be a fixed delta operator, and (pk)k>o will denote its basic
sequence. Recall the generalized Taylor series (5.2)
„ , y^ ff*/)(0) , ,
£>0
valid for any polynomial / e ff[X].
Definition. A Sheffer sequence {relative to 8) is any sequence of polynomials
(sn)n>osuch that
1. degs„ =nforalln > 0,
2. fa„ = n • sn-iforall n > 1.
*he constant so is nonzero. If (5-K)n>o is a Sheffer sequence, we have
S*sn = n(n - 1) • • - (n - k + 1) - sn_k = (n)* - s„_* (* < ti)-
208 4. Continuous Functions on Zp
The generalized Taylor expansion
gives for f = sn
sn(x + y) = ]T ( W*) - sn-k(y). (S)
This formula generalizes the binomial identity for the basic sequence (pk).
Definition. An Appell sequence is a Sheffer sequence corresponding to the
derivation operator D.
The Appell sequences (pn) are characterized by the relations
1. deg pn = n for all n > 0,
2- Pn =n- pn-iforalln > 1.
The Appell sequences satisfy (5), which is in this case
pn(x + y)= V I " )xk - pn-k{y).
0<A<n W
This identity may be symbolically written
Pn(x + y) = "(x + p(y))nr
where we interpret exponents of the binomial expansion of the right-hand side as
indices.
Proposition. Let S be an invertible composition operator. Then the polynomial
sequence sn — S(pn) is a Sheffer sequence. Conversely, if(sn) is a Sheffer
sequence, the endomorphism S of K[X] that sends the basis (pn) onto the basis
(sn) is an invertible composition operator.
Proof. To check the first statement, we compute 8sn using the fact that 8 and a
commute:
8sn = 8Spn = S8p„ = 5(n/?n_i) = nS(pn i) = ns„~i.
Conversely, for n > 0 we have
S8pn = Snpn_i = n5-n_i = 8sn = 8Spn.
Since the polynomials pn make up a basis of K[X], this proves that 5 and
commute. Hence 5 is a composition operator (5.3).
6. Generating Functions 209
6.2. Generating Functions
Let us still consider a fixed delta operator 8 with basic system of polynomials
(Pn)n>0' Let 5 be an invertible composition operator. The polynomial system
S"lpn = sn is a Sheffer sequence, and we are going to determine more explicitly
the exponential generating function
.zn
VVfr)--- = Fs(x,z)
*—Z ni
«>o
(where z is a new indeterminate, a variable in C or Cp,...). We know that
8 = <p(D) e K[[D]], <p(0) = 0 and <p'(0) ^ 0,
5 = ^r(D) e ff[[D]], ^(0)^0.
By (5.3) the formal power series corresponding to the composition operator rx 5_1
is
*—' /i! z—' «!
n\
On the other hand, the formal power series (in D resp. 8) corresponding to 8 can
be computed as follows. Firstly, we have seen that
r* = J2 Pn(x)— = yV —r = exp(jcD),
and secondly,
r* = rxS~l o 5 = Fj(jc, 5) o j,(D).
since 5 = <p(D), or equivalently D — <p~l(8) (a systematic characterization of
invertibility of formal power series is given in (VI. 1.3) Theorem 1), a comparison
01 the two expressions for rx furnishes
Fs(x,<p(D))-j/(D) = exp(xD).
With the formal power series 8 = <p(D), we can express D = <p~l(8) and come
ack to the above expression:
Fs(x, z) - ^((P~\z)) = exp(jc<p_1(z)),
Fs(x, z) = Y\(x)— = ———- • exp(x<p_1(z)).
*-* n\ T]/{<p-l(z))
210 4. Continuous Functions on Zp
We can deduce several useful identities from this one. For example, derivation
with respect to x leads to
dxFS{X,z) = YJSnWZ-}
• exp(x<p-\z)),
n>0 "'
<P~\z)
H<p~\z))
X <P~l(z)
±~!Sn{0)nl jK<p-1{z)Y
In particular, for the basic sequence sn = pn which corresponds to the identity
composition operator 5 = id, hence to the formal power series ij/ = I,
Ep„(x)— = exp(x<p \z)),
n'
n>l
The first identity gives an algorithm for the computation of the basic sequence
(Pn)n>o- Here are a few examples.
Example 1. Let us consider the delta operator
V = V+ = r-l=eD-l= <p(D),
for which
z = <p(u) = eu - 1, eu = z + 1, u = log(l + z) = <p~\z).
We have
exv(x<p-l(z)) = exp(x log(l + z)) = (1 + z)x
The basic polynomials for this delta operator V are simply the Pochhammer
polynomials
po = 1, pn(x) = (x)n = x(x - 1) • • - (x - n + 1) (n > 1).
Example 2. As with the delta operator V_ = 1—r_ = 1 — <rD, which corresponds
to - = <p(u) = 1 — e~M, m = log 1/(1 — z) = ^_1(z), we have
exrfxp-'fe)) = (1 - Z)-* = £ (T)(-Z)"-
6. Generating Functions 211
The basic polynomials are now
Po=h Pn(x) = x(x + 1) • • - (x + n - 1) = (-l)"(-*)„ (n > 1).
Another example of the general formulas follows (cf. the exercises for the Fibonacci
numbers and the Gould polynomials).
63. The Bell Polynomials
The Bell polynomials Bn(x) can be defined by their generating function
]TB„(*)^=exp[*(e*-l)].
This generating function has the required form for a basic sequence of polynomials
of a delta operator. We can indeed take
u = <p~l(z) = ez — 1 = (r_! o exp)(z)
and hence
z = <p(u) = (log orO(M) = Log(l + w).
This shows that the delta operator 8 that leads to this generating function is
8 = <p(D) = log(l + D) = D - Id2 + \d* - • - -.
The following formulas result from the general theory:
Bn(x + y)= J2 (1l\Bk{x)Bn_k{y\
0<k<n W
J2B'n(0)^ = <p-\z) = e*-h
whence ^(0) = 1 (w > 1). The polynomials B n are monic polynomials having
zero constant term if n > 1. The first ones are
£o = l, B1(x) = x, B2(x) = x + x2, B3(x) = x + 3x2+x3,
B4(x) = x + lx2+ 6x3 + x4.
Tf
u we take the derivative of the generating function (with respect to z), we obtain
^relation
Bn+l{x) = x J^ (?W*).
0<k<n \*/
fom which these polynomials are easily computed inductively.
212 4. Continuous Functions on Zp
Comment. The special values Bn = Bn{\) of the Bell polynomials are the Bed
numbers. They represent the numbers of distinct partitions of the set (1, 2,..., n\
into nonempty subsets. The first ones are
Z>o — 1, B\ = 1, Z>2 — 2, Z>3 = 5, B4 = 15,
where, for example, the five partitions of {1, 2, 3} are
{1},{2},{3}
{1,2}, {3}
{2,3},{1}
U, 3}, {2}
{1,2,3}.
Exercises for Chapter 4
1. (a) Suppose that we define the notion of Banach space E over an ultrametric field K
simply as a complete normed K-vector space. Try to prove that if E is a Banach
space of positive dimension over K, then K is complete.
(b) If you cannot prove (a), think of the following examples: A' is a noncomplete ultra-
metric field and E — K is its completion with the norm given by the extension of the
absolute value. This is a Banach space over K. For example, take K = Q with the
p-adic absolute value and E = Qp as Banach space over Q, or K = Qap (algebraic
closure of Q^) and E = Cp as Banach space over Q£. What is happening?
2. Let (Ei)iG[ be a family of Banach spaces. Define the Banach product of this family as
the normed vector subspace
consisting of the bounded families x = (jc/), equipped with the sup norm
l*| = IC*f)|:=sup|jc,|.
I
In particular, ©rG/£/ is a normed vector subspace of Yliei Ei-
(a) Show that this Banach product is complete and hence is a Banach space. Observe
that /°°(7; E) = Y]iGl E and conclude that /°°(7; E) is complete for any Banach
space E. ^ w
(fc) Show that the dual of 0IG/E£ is canonically isomorphic to Yliei *v
(c) Formulate the universal properties of the direct sum and Banach product as canonical
isometries
Z.(E;f]Ei)= F]Z.(E;E,),
E(0E,;E)S fJE(E(-;E).
[The second isomorphism for E = K gives (fc).]
Exercises for Chapter 4 213
3. Let E = Kn (for some n > 1) with the sup norm. If (sj)i<j<n is a normal basis of
£, show that the matrix having the sj for columns is in Gin(A): It has entries in A and
determinant in A *. Conversely, any normal basis is obtained in this form.
4. When \KX | = \n\z is discrete but the condition ||£|| = |K\ is not satisfied, show that
the norm of E can be replaced by an equivalent one that satisfies it. Take either
M^suptWiAeff, W<WI}
or
|Mr = inf{|A|:Aetf, \k\ > ||x||},
for which
i^iiixn < Hxir < iixii < »xirr < itti-mixii.
Since sup — max and inf = min, these new norms take their values in \K\.
5. Let (ui)iG[ be a family of continuous operators in an unitrametric Banach space E such
that for each x e E,<p(x) := supIG/ ||u/(jc)|| < oo. Show that suplG/ ||t<j|| < oo.
(Hint. Consider the subsets En C E defined by <p < n and use the Baire property for
the union U„>! En ~ E, or copy the proof of the Banach-Steinhaus theorem from any
book on functional analysis !)
6. (a) Assume / e Q[X], /(0) e Z, and that V/ takes integral values on all natural
numbers. Prove that / also takes integral values on N.
(b) Let the polynomial / e Q[X] take integral values on all natural numbers: /(N) c
Z. Prove that / also takes integral values on all integers: /(Z) C Z.
(Hint. Show that /(Z) cQnZp for all primes p.)
7. The maximal number of electrons on atomic layers is given by the following sequence
K :2, L :8, M : 18, N : 32,....
What is the next one? Find a polynomial formula f(n) giving these values.
(Hint. Compute the finite differences to determine the simplest polynomial / taking
these prescribed values.)
8- The maximal number of regions in the plane R2 determined by n lines is given by (make
pictures!)
0 12 3 4 ...
f(n) | 1 2 4 7 11
Find a polynomial formula for f(n).
• ^et f(n) denote the maximal number of regions determined in the unit disk \z\ < 1 (of
the complex plane C) by connecting n distinct points on its boundary \z\ ~ 1 by lines.
Show that this sequence starts as follows:
n
f(n)
12 3 4 5 6 ...
1 2 4 8 16 31 ...
Find a polynomial formula for f(n).
■ Consider the Fibonacci sequence as a function of an integer variable n -> fn:
/o-0. /i = 1, /„+i = /„ + /„-i (»>!)■
Does this function extend continuously to any Zp (p prime)?
214 4. Continuous Functions on Zp
11. What are the Mahler series expansions of the following polynomial functions:
f(x) = 2x2 - 1, g(x) = 4*3 - 3x, f(g(x)).
12. Let xn ~ J2k>o an,k (£) be the Mahler series of the continuous function xn: Zp -► Q
Hence anj = 0 for fc > n. Show that
«n,0 = &n (=1 for « = 0 and = 0 for « > 0),
0<y<A: ^J/
an£ = k(an-i%k +a„_ljjfc_1) (k > 1,« > 1).
Show also that when p is an odd prime,
apk 3= 0 (mod p) (2 < fc < p - 1).
The ank/k\ are the Stirling numbers of the second kind; cf. (VI.4.7).
13. Let / : Zp -* Qp be continuous, given by a Mahler series
-§*e>
/« _
What is the Mahler series of the function xfl
14. Prove the following formula:
(OH-1HU2HC-3)
■ +
(_D»-i
forrc > 1.
15. Show that the series
T.pn{p2n_l)-
converges for all x € Zp, * ^ 1. The sum /(*) defines a continuous unbounded function
ZP-{-l}-*Q„-
16. Let a e 1 -h Mp and m a positive integer prime to p. Show that there is a unique wth
root of a in 1 +Mp.
(Hint. Consider the series expansion (1 + 01/m = £*>() C *m)**0
17. Let / : Zp -► Cp be a continuous function and F = Sf its indefinite sum (IV. 1.5).
(a) Show that there are uniform estimates
\F(n + pv)~F(n)\<£v (n e N),
where £^ -> 0 (v -* oo). (J/wr. In a sum F(« + pv) - F(n) = J2n<i<pv f®
group the indices i in question into cosets mod p*Z. Let C be one such coset and
Exercises for Chapter 4 215
pick one c e C,
x€C xeC
Hence | £xgC /Ml < ma^xGC (I/to - /(c)|, |pw/tol)-)
(b) Show that for every given e > 0, there is an integer v such that
\F(n + V) - F(/i)| < c (n, * € N).
(c) Prove directly (i.e., without the Mahler theorem) that for any continuous function
/ : Zp -* Cp, the indefinite sum F of f extends continuously to Zp. (Corollary
2 in 3.5).
18. Show that the finite sums
X>rfl/ (5>=0, 5>*0)
are delta operators (notations of 5.1).
19. Let us define the Bell-Carlitz polynomials B„ by their generating function
exp (xz + (ez - 1)) - J2 B«Wl-
Hence B£(0) = B„ (=B„(1) cf. (IV.6.3)) are the usual Bell numbers.
(a) Prove that
0<y<n ^-"
Hence these polynomials interpolate consecutive values in the sequence (£n)n>o-
(b) Prove that the sequence (££)n>o is an Appell sequence (IV.6.1).
(Hint. Differentiate the expression found under (a).)
20. Considerthepowerseriesexpansion(l-f—t2)~l = ^Tn>0 anr'2. Show that c^ = /n+i,
where (fn)n>o is the Fibonacci sequence
/0 = 0, /i = 1, /„+!=/„+/«-l <*!>!)■
Define a sequence (pn) of polynomials by the identity
eXp ^ log t _z!_z2J = J2 P»<*)jj,
so that an = pn(\)/n!. Show that this generating function corresponds to the choice
u = cp-\z) = log 1/(1 - z - z2), c"" - 1 = -z - z2, S = ?>(£>).
Show that
'-•w-eK?)-'-'-'
(the operator —V_ is simply given by / »-> f(x — 1) — f{x)).
216 4. Continuous Functions on Z
21. Show that the basic sequence of polynomials corresponding to the translate delta
operator T-E Vis
Pn(x) = x-{x + ne- 1)„_! (n > 1).
The renormalized polynomials qn{x) = pn(x)/n\ are the Gould polynomials
x (x + ne)n x (x + ns\ x (x -h ne — 1\
qn0c)=— :— = ——( ) = -l , -
x+ne n\ x-\~ne\ n ) n\ n — \ J
(Hint. Check by induction that
Vpn = n(x+e){x + e + {n-\)e- 1)„_2 = nxE(pn_i)
and hence t-EVpn = npn^\, then use the translation principle (5.5).) Write explicitly
the binomial identity for the Gould polynomials.
Show that the delta operators r_£V satisfy the condition ||r|| = \ct\ \ — 1 of the van
Hamme theorem (5.4), and hence they give rise to uniformly convergent expansions of
all continuous functions Zp -► K (complete extension of Qp).
5
Differentiation
Calculus in the p-adic domain is rather straightforward. Let us emphasize, however,
that:
• A function with a continuous derivative is not necessarily strictly differentiable.
• The mean value theorem is valid provided the increment is small enough:
• The radius of convergence of the exponential series is rp < oo.
in this chapter the field K will denote a complete extension of Qp, e.g., K = Cp
1- Differentiability
1-1- Strict Differentiability
Let X c K be a subset with no isolated point.
Definition. A function f.X-^Kis said to be differentiable at a point aeX if
the difference quotients (f(x) — f(a))/(x — a) have a limit I = f'(a) asx-^a
(x ^ a) in X.
Equivalently, one can require the existence of a limited expansion of the first
order
f(x) = f(a) + (jc - a)f\a) + (jc - a)<j)(x) where 4>(x) -> 0 (jc -» a).
218 5. Differentiation
Example 1. Let (Bn)n>\ be the sequence of open balls
Bn = [x 6 Zp : \x - p"\ < |p2"|} C{«ZP: \x\ = \p"\]
and / the function on Zp vanishing outside (J Bn (a disjoint union) with values
Ax)=p2n (xeBn).
Then / is constant on each open ball Bn and hence is locally constant outside
the origin. Consequently / is differentiate at each x^O with f'(x) = 0. At
the origin limx_>o(/(^) ~ f(®))/x = linWo f(x)/x exists and is zero, so that
/ is also differentiable at this point with /'(0) = 0. In this example, /' = 0
(identically), /' is continuous, a situation classically denoted by / e Cl, but the
difference quotients
f(y)-f(x) = f(x)-f(y)
y — x x — y
take the value 1 on the pairs x = xn = pn, y = yn — pn — p2n, which are arbitrarily
close to the origin.
Example 2. Let /: Zp -> Zp be the continuous function defined by
x = S£janpn ^ f(x) - J2a»P2n'
n>0 /2>0
Then / is differentiable at all points x e Zp with f\x) = 0. Again /' — 0 e C1,
but / is injective, and hence far from being locally constant.
The preceding examples show that the notion of differentiability at each point of
a set X is not very useful, even if we require these derivatives to vary continuously,
and we shall introduce a stronger condition.
Definition. We say that f is strictly differentiable at a point a e X — and
denote this property by f e Sl(a) — if the difference quotients
*>/(*» y) =
x-y
have a limit I — f'(a) as (x, y) -» (a, a) (x and y remaining distinct).
Classically, i.e., for a function /: / -> R (where / C R is an open interval),
if f\a) exists at each a e I and f' :a \-> ff{a) is continuous, then / is strictly
differentiable at all points a e I. The examples preceding the definition show
that in ultrametric analysis, the situation is different and we have to assume strict
differentiability to get interesting results.
Proposition 1. Let f : X -> K be strictly differentiable at a point a € X with
f\a) ^ 0. Then there is a neighborhood V of a in X such that the restriction
of fjf\a) to V is isometric.
1. Differentiability 219
Proof. Since / e Sl(a), for each e > 0 there is a neighborhood VE of a for which
l*/(*. y) - f'(a)\ < £ if x e V£ and y e V£.
Let us take £ = \f\a)\ (^ 0 by assumption) and V the corresponding
neighborhood. Then
l*/(*. y) ~ /»l < l/'(*)l * 0, if (x, y) € V x V
and there is a competition between the terms <£/(*, y) and /'(«)
l*/(*. >0I = l/'(«)l for (jc, y) e V x V.
Hence \f(x) - f(y)\ = \f(a)\ -\x-y\ for (x, y) € V x V. ■
Corollary. If f e S1^) and /'(a) ^ 0, f/ierc ftere w a neighborhood V of
a e X in which f is injective. ■
Theorem. Assume that the function f is defined in a neighborhood ofaeK
and strictly differentiable at this point with f'(a) ^ 0. Choose an open ball B
containing a such that
a = sup
fix) - f{y)
x-y
f'(a)
< \f'(a)\.
Then f maps each open ball contained in B onto an open ball, namely
B<E(b) CB=^ f(B<e{b)) = *<*(/(£>)) (£' = \f(a)\e).
Proof. Put s = fr(a) ^ 0. As in the preceding proposition, we have
I/W-/O0I
x-y
= \s\ (x^ye B),
and f/s is an isometry in the ball B. This already proves
f(B<£(b)) C B<W£(/aO).
To prove that this inclusion is an equality, we select any c € B<|*|e(/(fc)), namely
\f(b) — c\ < \s\e, and show that the equation f(x) = c has a solution x with
I* — b\ < e. Equivalently, we show that the map <p(x) = x — (f(x) — c)/s has a
fixed point x with \x - b\ < e. Observe first that <p(B<£(b)) c B<£(b):
<p(x)-b=x-b-(f(x)-c)/s
= x-b- (f(x) - /(fc))/5 - (/(*) - c)A,
kp(x) - b\ < max(|x - fr|, |/(x) - /(ft)|/M. |/(fc) - c\/\s\) < e.
220 5. Differentiation
Now we prove that <p is a contracting map with contraction ratio a/\s\ < 1:
, , , , /(*) - f(y)
<p(x) - <p(y) = x - y
s
= ^J / _ fix) - f(y)
s v * - y
|jc — y\ a
\<p(x) - <p(y)\ < cr = — ■ |x - y|.
M M
Since the ball B<£(b) is closed in the complete space K, the mapping ^ has a
unique fixed point in this ball and the theorem is completely proved. ■
Observe that this theorem is a generalization of Hensel's lemma (II. 1.5) (here
/ is not a polynomial): The function / — c has a zero x e B, or f(x) = c, as soon
as |/(b) — c| is small enough for some b e B.
Let us turn to strict differentiability on a subset X having no isolated point.
Since X is a metric space, it is Hausdorff and the diagonal of X is closed in X x X.
The open subset X x X — Ax is dense in the product X x X.
Proposition 2. For f:X -► /f, the following properties are equivalent:
(0 / eS\a)foralla e X.
(ii) The function 4>/, initially defined only onXxX — Ax, admits a continuous
extension & to X x X.
(Hi) f is differentiate at each point a e X and there is a continuous function
a onX x X vanishing on Ax with
fiy) = fix) + iy- x)fix) + (y- x)a(x, y) (x,ye X).
Proof. The implication (i) => (ii) is given by the double limit theorem, which
we recall: Let Xo be a dense subset of a topological space X, Y a metric space,
and f a continuous map Xo —>* Y such that for each x e X
z e Xo and z -+ x implies f(z) has a limit g(x) e Y.
Then the extension g : X —> Y is continuous. (More generally, the conclusion is
valid when the target space Y is a regular space, i.e., a topological space in which
every point has a fundamental system of neighborhoods consisting of closed sets.)
The implication (ii) => (i) is obvious.
Finally, if 4>/ has a continuous extension 4>, it has a unique one by the density
of X x X — Ax in X x X. Since we can write
f(y) = f(x) + (y-x)&f(x,y)
= fix) + (y - x)f'(x) + (y - x)[<l>f(x, y) - /'(*)],
■- j
a(jc,_v)
it is obvious that (ii) <$■ (Hi). *
•
1. Differentiability 221
Definition. We shall say that f is stricdy differentiable on X — notation f e
Sl(X) or even f e S1 —when the conditions of Proposition 2 are satisfied.
When / E S1, f'(x) = 4>(x, x) is continuous and f eCl, but strict
differentiability is a stronger condition, justifying a specific notation. Strict differentiability
furnishes coherent limited expansions, and if
M = sup |4>/(*, y)\ = sup |$(*, y)| < oo,
we have
l/W-/(y)l<Af|jc-yl-
7.2. Granulations
The theorem of the preceding section is particularly interesting when the field K
is locally compact, namely when it is a finite extension of Qp. Let us come back
to the usual notation for this case:
K D RD P = nR, k = R/P = F^.
If/* e \KX\, every ball Bsr(a) is a disjoint union of q open balls Z?, = 5<r(«i) =
B<er(cii) (withfl = \n\ < 1) and any set containing # distinct points jc,- G Bsr(a)
with
l*i-*jl>r (Mj)
contains at most one point in each Bt, hence exactly one point in each Bt.
Proposition. Let K be a finite extension of Qp and f : £1 —> K bean isometry
where Q is some compact open subset ofK. Then f maps the balls contained
in Q onto balls of K.
Proof. If BSr(a) is a ball contained in Q, it is clear that
/(*<r(fl» C Bsr(b) (b = f{a)).
There remains to prove the surjectivity f(BSr{a)) = B<r{b). But if we take a
Partition of BSr(a) consisting of smaller disjoint balls, say B\ = fl<£(az) with
8 = MV, the images xt = /(a*) of chosen points ax e B\ form a system of qv
Points in B<r(/(«)) with
l*i —-^jl = Ifli - flj| > e (i ^ j).
Hence the image f(BSr(a)) contains a point in each smaller ball of the partition
°f f(B<r(a)) into qv balls of radius s = \n \vr < r (y > 0). This shows that
the image of Bsr(a) by / meets all closed balls of positive radius. Hence this
lrnage f(B<r(a)) is dense in B<r(f(a)). Since it is compact, it is closed, and the
Proposition is established. ■
222 5. Differentiation
Definition. A granulation of an open compact set Q c K is a finite partition of
Q into balls Z?<r(a,) of the same radius r > 0.
Since two balls B\, B2 having a common point satisfy either B\ C B2 or Z?2 C
B\, two granulations are always comparable: One is finer than the other. Every
ball of the coarser one is a disjoint union of some power qv of balls of the finer
one. Now observe that qv = 1 (mod p — 1), so that the numbers of balls in
the two granulations differ only by a multiple of p — 1. This number of balls is
well-defined modulo /? — 1.
Definition. For any open compact set Q in a finite extension K of Qp, we define
the type r(Q) e Z/(p — 1)Z of Q to be the class mod(p — 1) of the number of
balls in any granulation of Q.
For example, the type of Zp is p e= 1 and the type of Z£ is p — 1 = 0.
It is obvious that the type is additive for disjoint unions:
r(Q U Q') = r(fi) + r(£T) e Z/(p - 1)Z.
Consequently, to compute the type of any open compact set Q, it is enough to
know the cardinality of any partition of Q into balls (allowing unequal radii). The
following theorem summarizes the preceding comments.
Theorem. Let Q be an open compact subset of a finite extension K of Qp and
f an injective strictly differentiate map Q-± K. If f vanishes nowhere, then
Q and f(Q) have the same type.
Proof. From / e Sl(a) and f'(a) ^ 0 we infer that there is a neighborhood V
(for example an open ball) of a in Q such that any ball in V is transformed by /
into a ball of /(V). ■
Corollary. Let p > 2, and f : Zp -> Z£ be an injective strictly dijferentiable
map with nowhere vanishing /'. Then f is not surjective. *
13. Second-Order Differentiability
With the same notation as in (1.1), we define
*2/(*, y, z) =
x-y
when jc, v, and z are distinct. Since we can also write
a *v * /(*) , f(y) m
*2/(*. y, Z) = + 7 r +
(x - y)(x -z) (y - x)(y - z) (z - x)(z - y)
this function 4>2/ is symmetric in x, y, and z.
1. Differentiability 223
Definition. We say that a function f is twice strictly differentiable at a point
a e X — and denote this property by f e S2(a) — if 4>2/(jc, y, z) tends to a
limit as (x, y, z) —>* (a, a, a), x, y, andz remaining distinct.
Proposition 1. Iffe S2(a\ then f e Sl(a).
Proof. Let us take two pairs (x, y) and (z, t) e X x X — Ax in the vicinity of
(a, a) and estimate the difference
*/(*, y) - */(*, t) = */(*, y) - */(Z, y) + */(*, y) - */(*, t)
= (x - z)4>2/(x, z, y) + (y - t)®2f(y, U z\
If we assume / e S2(a), then 4>2/ will remain bounded in a neighborhood of
(a, a, a), say |4>2/| < M, when the three variables of 4>2/ are close enough to
a. In particular if x, y, z, and t are near enough to a, we have
|*/(*f y) - */(z, 01 < Mmax(|x - z|, |y - f |),
a quantity that tends to zero when (x, y) and (z, t) tend to {a, a). Since the target of
$/ is a complete space, the Cauchy criterion is valid and shows that this function
$/ has a limit as (jc, y) -> (a, a). m
As in (1.1) (Proposition 2), the double limit theorem shows that the following
two properties are equivalent:
(i) f eS2(a)foralla e X.
(ii) The function 4>2/, initially defined only on triples with distinct entries, admits
a continuous extension to X x X x X.
We shall say that the function / is twice strictly differentiable—notation / e S2(X)
or even f e S2 — when these conditions are satisfied.
Proposition 2. /// <= S2, then f e S1.
Proof. We have to prove that the difference quotients
x-y
have a continuous extension across the diagonal of X x X. By assumption, there
ls a continuous function 4>2 that extends 4>2/ to X x X x X, and we have
<bf{x, z) - 4>/(y, z) = (* - y) - $2(x, y, z).
In this expression we let z -> x. We know that 4>/(jc, z) tends to /'(*) and
/'(*) - $/(y, x) = (x-y)- 3>2(*, y,x).
224 5. Differentiation
Since the order of the variables in <£/, 4>2/, and <I>2 is irrelevant, we can write
/'(*) = */(*, y) + (x-y)- *2(*. x, y),
and interchanging x and y,
/'GO = */(y, *) + (y - *) ■ *2(y. y. *)■
Subtracting these expressions, we obtain
/'(*) - f'(y) = (*- y)l*2(x, x, y) + *2(x, y, y)],
*/'(*. y) = ®2(x* x, y) + <M*, y, y).
This shows that 4>/' admits a continuous extension to X x X: /' e Sl. Moreover,
/» = (/'/(«) = */'(«,«) = 2*2(fl, «, a). ■
7.4. Limited Expansions of the Second Order
It is also possible to characterize the second-order differentiability by means of
limited expansions (this will not be used later and may be skipped).
Proposition. In order for a function f to be in the class S2, it is necessary and
sufficient that it admits a limited expansion
fix) = f(y) + (*->)■ <*(y) + (x- y)2P(x, y),
where a and P are two continuous functions.
Proof, (a) Suppose first that / e S2 C S1. In the formula
®if(x, y, z) = (x, y, z distinct),
x-y
we can let z —* y. In the limit, we get
^nx,y,y)=^x^-^y^^f^-f'^ ix*y),
x — y x — y
namely
*/(*. y) = f'(y) + ix- y)$2/(*, y, y).
Coming back to the definition of 4>/, we have
fix) ~ /(y) = (x- y)f'(y) + (x - y)2$2/(*, y, y).
This gives an expansion of the desired form with
a = /' and 0(x9 y) = 5>2/(*, y, y)-
I. Differentiability 225
(b) Conversely, let us postulate the existence of a limited expansion as in the
statement of the proposition; hence
&f(x,y) = a(y) + {x-y)P{x,y) (x * y).
If x, y, and z are distinct, we have
*/(*,*) = a(z) + (*-z)j8(x,z),
*f(y,z) = a(z) + (y-z)P(y,z),
whence by subtraction (and division by x — y),
*2/(*. y, z) = ^—^P{x% z) + 1—1-Piy, z)
x — y x — y
= ty(x,z) + tip(y*z)
(where A. + M = 1)- Let us choose a point a ^ x and subtract the same quantity
P(x, a) = (X + fi)P(x, a) from both members:
*2/(*. J>, z) - j8(x, a) = k[P(x, z) - P(x, a)] + /z[/S(y, z) - j8(x, a)]-
It is clear that
y -»» g and z-± a => /x -> 0 and 4>2/(jc, y, 2) -* /3(jc, a)
(observe that \k\ = 1 as soon as maxflz — a\, \y — a\) < \x — a\). When x, y, and
z -*» a (while remaining distinct), we even see that 4>2/(jc, y, z) -± P(a, a): In
the region U: maxfljt — z\, \y — z\) < |jc — y|, in which |/x| and |A.| are less than
or equal to 1 we have
\*2f{x,y,z)-P(a,a)\ <
max(|0(jc, 2) - j8(jc, a)|, |/3(y, z) - 0(x, a)|, \P(x, a) - j8(a, a)|) -
In this region 4>2/(jc, y, z) -» /S(a, a) (jc, y, and z distinct -* a). Since 4>2/ is
symmetric in its three variables, we can estimate the difference
\*2f(x,y,z)-P{a,a)\
by first permuting the variables in order to bring them into the region in which the
preceding estimates have been made. ■
Caution. A function / on Zp can have a derivative /' e Sl without being twice
strictly differentiable, namely with f &S2. One can think of a function / with
vanishing derivative at each point, hence with /' = 0 e S2, but that is not stricdy
differentiable at a point: We have given an example of such a function, locally
constant outside the origin, in (1.1).
226 5. Differentiation
7.5. Differentiability of Mahler Series
Let / be a continuous function on Zp and choose y eZp. We can write the Mahler
expansion of the continuous function x h* f(x + y) as
/(* + y) = £ c*fr>(f) with C*W = (v*/)(y) -* o.
Theorem. Let f be a continuous function on Zp. Then f is differentiable at y
precisely when
l(vV)(y)/*l ^ o (*^oo).
In this case f\y) = £*>,(-1)*-!(VVX30/*-
Proof. Replacing / by its translate x h+ f(x + y) we see that it is enough to
prove the theorem when y = 0. Now, since cq = /(0), we have
=e^)=ei(::;>
If |q/A:| -> 0 (when A: -»» oo), the Mahler series
represents a continuous function of y e Zp. In particular, /'(0) exists and
^--"-gG-Of-g^'T-
Conversely, if / is differentiable at the origin, the function g defined by g(0) =
/'(0) and g(;c) = (f(x) — f(0))/x for x ^ 0 is continuous on ZP and possesses a
Mahler expansion
§-0
gCO = >,)*(,_ J (where w = V*g(0) -► 0).
We deduce
/(*) - /(0) = *g«
*>0
But
*>0 ^
<;H-»GK)-*+,)G:.K)-
1. Differentiability 227
Hence we can write
f(x)=f(P) + xg{x)
-*+5»(*H:,K))
= c0 + ]T k{yk~l + Yk\k)'
By uniqueness of the Mahler coefficients of /, we deduce ck = k(Yk + Yk~\)
(k > 1) and in particular ck/k = Yk + Yk~\ -± 0 (k-± oo). ■
Comment For any integer A: > 1 we have \k\ = p~~v^k) > \/k, or equivalently,
Vl*l <£. Hence |cfc/fc| < k \ck I, and the condition k \ck\ -> 0 implies \ck/k\ -* 0.
This stronger condition will imply strict differentiability of the Mahler series.
Let us first give a statement concerning Mahler series of Lipschitz functions.
Definition. A function f: X —► K {as in (LI)) is Lipschitz when there exists
a constant M with
\fW-f(y)\<M\x-y\ tx.yeX).
Since the smallest bound M is
||*/||:=sup|*/(xf30|,
Lipschitz functions are also characterized by |4>/| bounded. We shall denote by
Lip(Zp) the subspace of C(ZP) consisting of Lipschitz functions. By definition,
S\Zp)cLip{Zp)cC(Zp).
Proposition. A function f = X!*>oc*(*) e C(Zp) is Lipschitz precisely when
{k\ck\}k>o is bounded, namely
|<£/| bounded <=> the sequence k\ck\ is bounded in R>o-
The proof of this proposition is based on the following lemma.
Lemma. Fork > 1 and p3 <k< pJ+l, we have
<pJ\x-y\ (x,yeZp).
10-0
Comment More precisely, when k is in the quoted interval, its expansion in base
P has the form
k = kQ + kxp + - ■ ■ + kjpJ (0 < ki < p, k; ^ 0)
228 5. Differentiation
(j 4-1 digits), and we call k- the integer ko + k\ p -\ \-kj-\pj l < pJ (at most
j digits). Then
Jfc - it. = JkypJ"f |* - Jfc-| = p~y,
and the statement of the lemma can be written uniformly for all integers k:
10 -(Oh*-'->--i*-*-
This lemma shows that |jc — jy| < \pj\ = p j implies |(*) — (£)| < 1. For
example, if y = x + // for some h > j,
< 1 (forfc<p*).
IGRVOI
This is the /?*-periodicity of the binomial functions (IV. 1.3)
x h+ I J mod p (k < ph)
already exploited in the proof of the Mahler theorem.
Proof of the Lemma. The formal identity (1 + Tf+y = (1 + T)x{\ + T)y leads
to the well-known relation
(ThEoe
(first for positive integers x and y but also by density and continuity for p-adic
integers x and y). Write then
0-rro-&e7j)CKO+£(*7OG^>
Thus
(;KhE(v)Gi>j:^f7r)G*,>
and it only remains to estimate
£K'-%-.)|
1
< max — =
I
15*5* \i\ mini<,-<jk |i|
1. Differentiability 229
It is clear that for p3 < k < pJ+l, the minimum in question is attained for i = pJ
with |i| = \pJ \ = p~j. The lemma follows. ■
Remark. A slightly less precise inequality, namely
would be sufficient for our study of S1 functions.
Proof of the Proposition. Let us write the difference quotients
f(x + h)-f(x)
4>/(* + fc,x) =
2-is*(cr)-©)
k>\ 0<i<*
for h ^ 0. We observe that
■iS-feO©-©)
\ck(x\( h \\ \ck\
\h\ij\k-ij\- \h\ K
uniformly in i (and fixed h ^ 0). The double family
c±(x\( h \ = _£*-(x\( h~1 \
h \i)\k-i) k-i\i)\k-i-l)
is thus summable in any order, and in particular, it is equal to a double series over
the indices i > 0 and j = k — i — 1 > 0. Replacing h by y + 1, we obtain
*/(* + y+ !,*)= £
«,J>0
£i±z±lj *
7 + 1 V"
(y * -D-
Firstly, the ultrametric inequality gives
|*/(* + y + i,*)l < sup
cI-h/+l
7 + 1
and hence
||*/1| - sup |*/(x + y + 1, x)| < sup
jc,v^-1 i,j>0
ci+7 + ]
7 + 1
230 5. Differentiation
In fact, the preceding expansion is a false Mahler series in y because it is not
valid for all values of y eZ. Nevertheless, it holds for all (x, y) e N x N, and
this proves that the coefficients are given by the finite differences on the integers.
Hence secondly,
ci+j+\
7 + 1
< sup |<D/| = ||4>/|| (<oo)
and
Altogether, we have
sup
Ci+j+l
7 + 1
< l|4>/ll-
sup
i.j>0
ci+i+\
j+l
11^/11 (<oo).
In particular, considering only the subset of indices (i, j) for which i+j + l—n,
sup(\cn\,\cn/2\,...,\cn/n\)<\\<i>f\\.
On the left we have
\cn\sup l/\i\ = \cn\ p* (ps <n< //+1).
Call Kn the highest power of p that is less than or equal to n. The preceding
considerations prove that
ll*/ll = sup
ci+j+\
7 + 1
= sup icn\cn\.
Since tcn < n < picn, the proposition follows. ■
Corollary 1. Let f e Lip (Zp) and f = ^cn (n) its Mahler expansion. Then
||4>/1| = SUp Kn\cn\ < 00. fl
The number || 4>/1| does not define a norm on the vector space Lip (Zp) because
3>/ vanishes for constant functions: It is only a seminorm. In order to have a norm*
we take
!l/lh= sup(1/(0)1, ||*/||).
Since /(0) = c0, we define in an ad hoc way the value /c0 = 1 in order to have
||/||l = SUp Kn\cn\.
n>0
1. Differentiability 231
Corollary 2. Let f e Lip (Zp) and Sf its indefinite sum. Then Sf e Lip (Zp)
and
\\fh<\\Sfh<p\\fh.
Proof. We have
H/lli = sup Kn\an\,
|| Sf ||! =SUp/Cn|Gn_i|
n>l
by Corollary 2 in (IV.3.5). Now observe that
whence the assertion. ■
Corollary 3. The map
is an isomorphism between the normed spaces (Lip (Zp), ||. || 1) and 1°°. 77ie
!*„(*) («>!)
1 and//
correspond to the "canonical basis" of £°°.
Here, /cn (highest power of p that is less than or equal to n) is considered as an
element of Zp: Its absolute value is \k„\ = l//cn e R>0.
Proof. Any / e C(Zp) is given by a Mahler series
to W
When / is Lipschitz. we write this series
=£H)eLip(Z'>
with
/
n>0
ll/lli = sup(|/(0)|, W\\)C=l sup kb|c„| = sup |c„/k„|;
/I>0 /2>0
hence the result.
232 5. Differentiation
1.6. Strict Differentiability of Mahler Series
Theorem. For a continuous function f = X!*>oc*(*) e C(Zp), we have
k\ck\^0 (k^ oc)=* feSl(Zp).
Proof. We have
•"">-i>((:)-(D)A-*
and thanks to the lemma (in its weak form),
^h(C)-©)A-j,)
<k\ck\.
If k\ck\ —* 0, 4>/ is a continuous function as a sum of a uniformly convergent
series with continuous terms: The polynomial (*) — (*) in x and y vanishes
identically on the diagonal x = y and is divisible by x — y, whence ((*) — (£)) /(x - y)
is also a polynomial function. ■
It is possible to prove conversely
feSl(Zp)=>k\ck\^0 (*-*oo).
Corollary. Let f e Sx(Zp) and Sf its indefinite sum. Then Sf 6 Sl(Zp). ■
With the preceding results, it is easy to construct examples of continuous
functions on Zp exhibiting various behaviors (as far as differentiability is concerned).
Example 1. Let the Mahler coefficients c* of a continuous function / be
[ pJ ifk = pi,
I 0 if A: is not a power of p,
so that
\Ck/k\ takes alternatively values 0 and 1.
Hence \ck/k\ does not tend to 0, thereby proving that / is not differentiable at the
origin. But 4>/ is bounded, since k\ck\ (taking values 0 and 1 only) is bounded.
Example 2. As in the preceding example, but with
[ pV if k = p',
Ck — \
10 if A: is not a power of p.
2. Restricted Formal Power Series 233
Then
\ck/k\ takes alternatively values 0 and \pj\ -> 0,
so that / is differentiable at the origin. Here \ck/k\ = k\ck\ and / e Sl.
2. Restricted Formal Power Series
2.7. A Completion of the Polynomial Algebra
Recall that in this chapter K denotes a complete extension of Qp. A formal power
series with coefficients in a subring R of the field K is a sequence (an)n>o of
elements of R. However, when we use the product
(dn)n>0 • (bn)n>0 = (Cn)n>0 with Cn = ^ fl.-fej (fZ > 0)
we prefer the series representation f(X) = J]n>o fln^" instead of (an)n>o- The set
of formal power series is a ring and an tf-algebra denoted by /?[[X]]. Recall that
the formal power series ring with integral coefficients has already been considered
in (1.4.8); we shall come back to a more systematic study of formal power series
rings in (VI. 1). The particular formal power series having coefficients an -> 0 are
called restricted formal power series, or more simply restricted {power) series.
The restricted formal power series with coefficients in K form a vector subspace
of K[[X]] denoted by K{X] and isomorphic to the Banach space c0(K) (IV.4.1).
This subspace is a completion of the polynomial space for the Gauss norm — sup
norm on the coefficients —
K[X] C K{X} C K[[X]].
We still call Gauss norm the extension
||/(X)|| := sup |a„| = max \an\ (f(X) = TanX" e K{X}).
Lemma. For two restricted power series f and g, we have
11/511 < 11/1111*11-
^oof. Let f(X) = £n>0fl„X", g(X) = £„20fc„X" be two polynomials. Their
Product h = fg is the polynomial h(X) = £„>o cnXn having coefficients c„ —
£.+,=„ a,bj. Since \c„\ < max,+J=„ \a,\\bj\ < ||/||||g||,
||/«||=max|cn|< H/ll\\g\\ (f.geKlXM
^ence multiplication is (uniformly) continuous in K [X] and extends continuously
t0 the completion K{X} with the same inequality. ■
234 5. Differentiation
Hence K{X] is a ring and a Banach algebra. This is the Tate algebra in one
variable over K.
As usual, we denote by A the maximal subring of K, M the maximal ideal of
A, and k = A/M the residue field of K. The unit ball A{X} C K{X} is a subring,
since
11/11 <1, Ilsll<l=>ll/Sll<l.
From this it follows that the reduction (IV.4.3) of the Banach space K{X}, the
quotient of its closed unit ball by its open unit ball, is the polynomial ring over the
residue field
A{X]/M{X}=k[X].
Let |*| < 1 (x in K or Kr: complete extension of K) and /= ^n>GcinXn a
restricted formal power series. Then \anxn\ -> 0(n —> oo),sothat/(jc) = ^n>0G„x"
converges and / defines a function on the unit ball of K (or Kr)
f:A-> K :x h^ ^ anxn.
n>0
The sup norm of this function satisfies
\\f\\cb(A,K) = sup \f(x)\ < sup \an\ = \\f\\Kix\-
A n>0
In particular, the series £n>o anxn converges uniformly on the unit ball A, and /
defines a continuous function on this ball. The linear map
K{X] -> Cb(A; K): £>„*" i-> /
is a contracting map of Banach spaces.
Example. Let K = Qp and consider the polynomial (restricted formal power
series) X - Xp of norm 1 in QP{X}. Since xp = x (mod p) for all jc e Zp, we
have |xp— x| < |/?| = 1/p whenx e Zp and the norm of the continuous function
x h^ xp — x on Zp is l/p < 1.
Theorem. If the residue field k of K is infinite, the canonical embedding
K{X} -> Cb(A: K) is isometric:
sup\f(x)\ = \\f\\Km.
xgA
Proof. If / == 0, then ||/|| =0 and there is nothing to prove. Otherwise, we can
replace / by f/an where \an\ — \\f\\. Thus we may assume that ||/|| = 1. In this
case the image of / e A{X} in the quotient is a nonzero polynomial
feA{X]/M{X] = k[X],
2. Restricted Formal Power Series 235
and since the residue field k is infinite, we can find a e k* with /(a) # 0. Taking
any a e A* G K* with residue class a in kx, we have
|a| = land|/(a)| = l,
whencesuplJc|5l \f(x)\ = maxw<i \f(x)\ = maxUN1 \f(x)\ = 1 = ||/||. ■
The preceding proof shows more: For/ e /ffX^wehavesup^ |/| = max a |/|
in spite of the fact that the unit ball A is generally not compact. Moreover, the
maximum of \f(x)\ (\x\ < 1) is attained at a point x with |jc| = 1.
Recall that we denote by Ap the maximal subring of Cp (closed unit ball) and
by Mp the maximal ideal of Ap (open unit ball).
Corollary. We have
sup \an | = sup 7 anxn = max \S^ anxn ,
n>0 xeAp[^ { XeAp\^ I
and this maximum is attained on A* = Ap — Mp, which is the unit sphere
\x\ = 1 in Cp. The canonical embedding
K{X}->Cb(Ap-Cp)
is isometric. m
2.2. Numerical Evaluation of Products
Let f(X) = ^n>0anXn and g(X) = ^n>obnXn be two restricted power series.
Their formal product is the power series
h(X) = J2cnX\
where
0<i<n
As we have seen in the previous section, it is again a restricted power series.
Theorem. Let f(X),g(X) e K{X} be two restricted power series and let h(X)
be their formal product. Then h(X) e K{X}, and the evaluation of this formal
product can be made according to the usual product
h(x) = f(x)g(x) (|x| <1).
*t*oof. Replacing anxn by an and similarly bnxn by bn, we see that it is suffi-
Clent to prove the statement for x = 1. With cn = l^o<z<nG^«-i we nave t0
236 5. Differentiation
prove
if Hn>o a" and 5Zn>o bn converge, then ]Tn>0 cn converges and
n>0 n>0 n>0
Because multiplication is continuous,
i>0 j>0 i<W j<N
Let us show now that
I> " X>j ~ I>* ^ 0.
Choose Ns large enough to ensure
lOil <£, \bi\ <£ (i > A/e).
Now the difference
J2aiJ2bJ~lLCk
i<N j<N k<N
is the sum of the terms aibj corresponding to pairs (i, j) in the square 0 < i, j < N
above the diagonal, namely with i + j > N. The contribution of these terms is
less than or equal to eC if C is an upper bound for the coefficients and N > 2Ne
because at least one index i or j will be greater than or equal to NE. This proves
the theorem. ■
Observe that the classical result concerning absolutely convergent series cannot
be applied here, since we only assume \an\ -> 0 and \bn\ —> 0 (but ]T \&n\ and/or
]T \bn | may diverge). On the other hand, due to the ultrametric inequality, it is now
easier to estimate tails of sums!
Corollary. The canonical map K{X} -* Cb(A; K) is a norm-decreasing ho-
momorphism of K-algebras. ■
This isomorphism is isometric when the residue field k of K is infinite (and also
when \K* | is dense in R>o, as we shall see later (VI. 1.4)). The identification of a
restricted formal power series f(X) with the function / that it defines on the unit
ball A will often be made.
2.3. Equicontinuity of Restricted Formal Power Series
Let us still identify K{X} with a normed subspace of C^(Ap; Cp). With A =
B<i(K) as usual,
supl/WI < \\f\\Km ■■= sup \an\ = sup \f(x)\ (f = J2 a»Xn e KWy
xeA /i>0 xgAp
2. Restricted Formal Power Series 237
Proposition 1. The unit ball in K{X} is uniformly equicontinuous. More
precisely, K{X] cLip (A) and ||4>/|| < ||/||, < ||/||/or/ 6 K{X). In particular,
\f(x + h)- f(x)\ < \h\ H/ll if \x\ < 1 and \h\ < 1.
Proof. Write / ~ ]T anXn, so that
fix) - f(y) = J2 ««(*" - y") = (* - y) £ "n(.xn-x + ■■■ + y"-1).
If |jc| < 1 and \y\ < 1, the ultrametric inequality gives \xn~l -\ h yn"1 | < I,
and the result follows. ■
In a similar vein, let us derive the following inequalities.
Proposition 2. If f e K{X}, then
\f(x + y)-f(x)-f(y) + f(0)\<\\fi-\xy\ (W<1, lyl<D-
If moreover f is odd* then
l/(* + y) - /(*) - f(y)\ < ll/ll -to>(* + y)l (M < i. \y\ < *)•
Proof. With the same notation as before,
fix + y) - /(*) - /(y) + /(0) = Y, *■«* + >)" - x" - >")),
n>2
whence the result, since each term (x + y)n — jcn — yn is divisible by xy. When /
is odd,
/(* + 30-/(*)-/(y)= E «n((^ + >7)n-^-^))-
n odd >0
Only the terms with n odd and greater than or equal to 3 remain in the sum, and
for these
{x + yf -xn-yn= xy(x + y)pn(x)
for some integral polynomials pn e Z[X]. Hence ||p„|| < 1. ■
Remark. Although it is uniformly equicontinuous, the unit ball of Cp{X] is not
Precompact in C&(AP; Cp): The Ascoli theorem is not applicable, since Ap is not
locally compact. For example, the infinite sequence (X")„>o satisfies
IX"-Xm || = 1 (n^m)
and hence contains no convergent subsequence. However, if we consider the
restriction of these continuous functions to the (compact) unit ball R of a finite
e*tension K of Qp, the preceding sequence admits a convergent subsequence,
238 5. Differentiation
namely (Xqm)m>o, where q is the cardinality of the residue field of K. In fact, the
subsequence (Xp" ) converges uniformly on every unit ball Z?<i(0; K), provided
that K has finite residue degree over Qp (III.4.4).
2.4. Differentiability of Power Series
The formal derivation operator /* i-^- /*' is continuous and contracting on K{X]
simply since \nan | < !««| —> 0 and
||/'||= sup \nan\< sup \an\ = \\f\\.
We are interested in strict differentiability; hence we look at the differential
quotients
x-y
When / e K{X}, Proposition 1 of (2.3) shows that
n>\ 0<i<n-l
is a formal power series in two variables and coefficients tending to zero. Thus
4>/ has a continuous extension to A x A that is a sum of a uniformly convergent
power series (in two variables). The value on the diagonal is
^f(x,X) = J2^nxn~K
This proves the following result.
Theorem 1. Let f e K{X}. Then f defines a strictly differentiableJunction on
the unit ball A of K: f e Sl(A). The derivative of f is given by the restricted
formal power series
f = */Ia = J^nanX"-1 e K{X}. ■
It is easy to give more precise estimates for the convergence:
*/(*, y) - 2>«a"-1 = X>(*" - yn)/(x - y)- J2na"^1
= Yian(*n~~l + - - - + yn~l - n$n~x) -+ °
when (x,y) —> (§, §). In fact,
2. Restricted Formal Power Series 239
and by (2.3),
<max(|;c'"||y-^U*'-£Wl)
<max(|y- £|, |x-£l) (i.y.feA).
We have obtained
\*f(x,y)-f'G)\<m-max(\y-S\,\x-S\)-
Theorem 2. A restricted formal power series f = ^anXn defines a twice
strictly differentiable function on the unit ball AofK: f e 52(A).
Proof. As we have seen in the proof of the preceding theorem,
<t>f(x,y) = J2a" Z! xiyJ=ai+a2(x +>>)+-•-.
n>\ i+j=n-\
and hence, for distinct x, y, z,
®2f(x,y,z) =
x-y
n>2 i+/=n-l * ^
n>2 Ar+£4/n-n-2
Since \an\ -> 0, this series converges uniformly on A x A x A and represents a
continuous extension of 4>2 /. ■
Generalization. Let us just indicate here that differentiability of restricted power
series is not limited to order two. In fact, one can define higher-order difference
quotients inductively by
®kf(xo,xu...,xk) :=
Xq - X\
(*o ^ xi). The expressions 4^/ are symmetric in their fc ■+• I variables, and an
ea&y computation shows that
*'m"" *"-?(nJC-J
Taking /(*) = xN we obtain
240 5. Differentiation
a sum of all homogeneous monomials of total degree TV — k. In this case <bkf has
an obvious extension to the diagonal (all jc,- = jc):
^ r, . N-k .u f monomials of degree N — k]
kjy ' ' ' \ in k + 1 vanables J
-co---
The equality &kf(x> jc, ..., jc) = f^k\x)/k\ remains true by linearity when / is
a polynomial or even a restricted series. These functions are of class Sk on A.
2.5. Vector- Valued Restricted Series
Let E be an ultrametric Banach space over K. A restricted vector series (with
coefficients in E) is a formal power series
f(X) = J2^Xn,
n>0
where an e £, ||fln || —> 0. We can still define the Gauss norm of such a restricted
series / by
H/ll := sup \\an\\.
n>0
Hence the normed space of restricted vector series (with coefficients in E) is a
Banach space isometric to Co(E). If the indeterminate X is replaced by a variable
jc e A C K, the restricted series ^2n>oanXn gives rise to a continuous vector-
valued function f : A -> £, which we can write as /(jc) = ^n>^xnan (not that
it matters, but we may prefer to write scalar multiplications on the left), for which
sup B/WI!< sup 11^11 = 11/11.
|x|<l n>0
That is, the linear map co(E) -> Ct(A. E) is continuous and contracting.
Proposition. When the residue field k of K is infinite, the canonical map
Co(E) -> Cb(A, E)is an isometry.
Proof. Assume ||/|| = c > 0 and look at the A-module B<C(E): Its quotient
E = £<c(£)/£<c(£) is a vector space over k = A/M. The restricted series /
with coefficients in BSC(E) has a polynomial image / = ^anXn having at least
one nonzero coefficient, since ||/|| = c. We can choose a fc-linear form <p^on
E such that <p(an) ^ 0 for such a coefficient. The scalar polynomial <p o / =
J] a„Xfl = J] <p(an)Xn is not identically zero and there is an element a e k* such
that 0? o /(a) ^ 0. A fortiori f(a) ^ 0 and ||/(a)|| = c for every a e A, a in the
coset a (mod M). ■
3. The Mean Value Theorem 241
3. The Mean Value Theorem
3.1. The p-adic Valuation of a Factorial
Since the formula for the p-adic order of n\ will play an essential role, we review
it.
Lemma. Let n > 1 be an integer and let Sp(n) be the sum of the digits ofn in
base p. Then the p-adic order ofnl is given by
ordp(n!)= £—.
p-\
Proof. We have to compute
ordp(n\) = ^ or&p(k)-
l<k<n
Let us fix an integer k < n say with order ordp(k) = v and write its expansion in
basep:
k = kvpv + • • • + ktpl (y < £, kv ^ 0).
Then
*-l = (p-l) + ... + (p- l)pv~l + (kv - l)pv + - • • + kiPe,
and hence
Sp(k-l) = v(j>-l) + Sp{k)-l.
Equivalently,
v = ordp(k) = -(1 + Sp(k - 1) - Sp(k)).
p — 1
Summing over all values of k < n we obtain a telescoping sum
ord>!) = —L- Y, (! + V* ~ *) " V*)) = —^7^ ~ W)- "
^ l<A:<n P
Alternative Proof. A more traditional way of obtaining the same formula goes
as follows. The number of integers k with fixed v = ordp(k) that appear in the
Product n\ is equal to the number of multiples of pv that are not multiples of pv+1
(and are less than or equal to n), namely
[pv\ [pv+1V
242 5. Differentiation
where [x] denotes the integral part of the real number x. Hence
-H+[7]+[?]+-£[?]-
Let us write n in base p as n — no + n\p + njp2 -\ (a finite sum). Then
raft
1 + "2P + "3P2 + '
Hence
n2-\~n3p-\~n4p2 +
[7H+pbM
Summing all these, we obtain
n + ordp(n!) = Sp(n) + p ordp(n!),
n - 5p(/i) = (p - 1) ordp(«!)
and the result follows. *
3.2. First Form of the Theorem
As already recalled in (2.1) the field K is assumed to be a complete extension ot
Qp (e.g., Cp or £2p). Even for polynomial functions /, the following form of the
mean value theorem,
\f{h)-ftO)\<\h\-\\f'\\
does not hold without restriction. Recall that for polynomials / (or more generally
for restricted power series), we use the sup norm on the coefficients (Gauss norm)-
If the residue field of K is infinite, this norm coincides with the sup norm
on the unit ball of K (2.1).
3. The Mean Value Theorem 243
For example, if f(t) = tp9 we have /'(f) = Ptp~ Whence ||/'|| = \p\ = l/p < 1.
And with /i = 1,
l = l/(l)-/(0)|>|/i|^/'||-l/p.
However, we show below that there is a universal bound (depending on the prime
p but not on the restricted series /) such that the mean value theorem holds for
\h\ <rp. The preceding example can be used to discover the limitation in size of
the increment h. In order to have
l/(^)-/(0)|<|/i|.||/,||
in this particular case, we must have
\h\p<\h\\\f'\\ = \h\\p\9
whence the restriction \h\ < |/?|1/(p-1).
Let us recall that we have introduced a special notation (II.4.4) for this absolute
value:
rp := \p\Wp-» , \p\ <rp<l.
It will play an important part from now on. Observe that
ri = \, rp> -p- (p odd prime),
and also rp / 1 when the prime p increases (whereas \p\ = l/p \ 0).
Theorem. Let f(X) e K{X] be a restricted power series and also denote by
f the corresponding function t h» f[t) = X)n>0 antn on tne un^ ^a^ A of K.
Then
I/U + A)-/(0I<|A|-Il/'II
for all t, he K with \t\ < 1 and \h\ <rp = \p\l/{p~l).
First Proof. (1) Let us establish first the result for a polynomial /. The Taylor
formula permits us to compute the difference f(t -f- h) — f(t) as
/(* + h) - f(t) = £>* - *lp. = h £ ^ • D^fXt),
k>l K' *>' K'
so that
|r|<l=»|/(r + /i)-/(OI<|A|sup
Since
\\Df\\ = H/'ll = sup \kak\ < sup |**| - H/ll,
*>1 *>0
U-I
k\
,Jfc-l JT'l\
\\DK-lf
244 5. Differentiation
we see that ||D|| < 1, ||D*|| < 1 (k > 1). In particular, \\Dk~lf\\ < ||/'|| and
||D^7'||<sup|^| t3? o ( = Ofcr*>deg/).
n>k
The result will be proved if we can show that \hk~l/k\\ < 1 (for all k > 1). This
condition for k = p requires \h\p~l < |/?|, i.e., \h\ < rp. When it is satisfied, we
have
\h\k~l < ipl^-OAp-D < \k[\9
simply since
k - SJk) k - 1
p-\ p-\
(2) Consider now the general case of a restricted series f(t) = ]C*>o aktk.
Without loss of generality we may assume \f(t + h) — /(f)| ^ 0, hence / not
constant. Consider the polynomials fn(t) = X^<n aktk- We have
||/-/n||=sup|^|~^0
k>n
as well as
||/„||= sup \ak\ = ||/||,
||/„'||= sup 11^*11 = ||/'||
fc<n
for all large n. Take f and /i as in the first part. The convergence
Mt + /i) - /„(*) -> /(r + h) - f(t) ^ 0
implies \fn(t + /i) - /„(f)l = \f(t + /z) - /(r)| for all large n. Hence, using such
a large value of the integer n, and using the result for polynomials, we have
\nt+h)- /(oi - \ut+*) - /„(oi < ihi - ii/;ii = iai - ii/i. ■
Second Proof. Let us observe that
so that the operator Dk/k\ transforms polynomials with integer coefficients into
polynomials with integer coefficients: ||D*/*!|| < 1, ||D*|| < \kl\ -> 0. Better
still: If g is any restricted series, it is obvious that || Dk(g)/k\\\ -► 0, since the first
k coefficients of g are destroyed by the operator Dk (while the other coefficients
3. The Mean Value Theorem 245
of g are multiplied by integers under the effect of the operator Dk/k\). Coming
back to the expression
hk~l Dk'lf\t)
we see that the mean value theorem will be proved if we show \hk~] /k\ < 1. For
k = p this condition requires \h\ < \p\ll{p~l) = rp as before. When it is satisfied,
take an integer k, put v = ordp(k), and write k = pvm > pv. We have
hk-l| rk-\
- \pv\ ~ p ]F] [F]
The exponent is
PV~l .._„ . _. . «-l,
li = r -v = (l+ /> + ■-- + pv'1) - v > 0,
and the proof is completed.
Remark. Let E be an ultrametric Banach space over K and
f(X) = y£akXk
k>0
a restricted power series with coefficients in E. Then we can view / as the vector-
valued function
t \-> Yltkak, A~> E
on the unit ball A of K (2.5). Then the mean value theorem immediately furnishes
ll/U + /0-/(OII<W-ll/'ll
forallf,/i e K such that |f| < 1 and \h\ <rp = Ipl1^-1*. In fact, simply replace
m the above proof \ak\ by \\ak\\, \f(t)\ by ||/(0ll» etc. whenever necessary.
'•-?- Application to Classical Estimates
Let us apply the mean value theorem (3.2) to the polynomials f(t) = (1 4- t)p"
& > 1). Since /'(f) = pn{\ + t)p"'\ we have ||/'|| = \pn\ and hence
l(l + 0^-l|<l'M/nfor|f|<v
Recall that the fundamental inequality (III.4.3) in its second form gives
\(\+tf -l\<\t\-(max(\t\,\p\)T,
246 5. Differentiation
hence the same inequality only when \t\ < \p\. Since
1/P = \P\ <rp<\
with strict inequalities for all p > 3, the mean value estimate is sharper for odd
primes (in the indicated region).
r-|Z|
An application of the mean value theorem
With the Newton polygon method (VI. 1.6) we shall be able to compute more
explicitly these absolute values |(1 -h t)p" — 1|. Let us simply observe now that
this absolute value vanishes when (1 + t)?" = 1, namely when 1 + t e [Ly. The
smallest \t\ for which this occurs is 1 + t e iip, and as we have seen in (II.4.4),
this implies |f| = \p\l/{P~l) = rp.
Let us give another application to binomial coefficients. Define successively two
polynomials g and / (having integral coefficients) by
(1 + X)p = 1 + p • g(X) + Xp,
f(t) = (l+tg(X) + XpT,
so that
/(0) =(1+F)n,
Here we consider f :Qp -> E, where E C QP[X] denotes the finite-dimensional
subspace consisting of polynomials of degree less than or equal to np. The mean
value theorem (vector form) leads to an estimate of the norm of
f(p) - /(0) = a + xy - (i + xt = £ ("P)^ - E (")*"*•
3. The Mean Value Theorem 247
Since ||/'|| < \n\, we have \\f(p) — /(0)|| < \np\ and in particular, looking at the
coefficient of X"*
CK)
(7)-
(mod npZp).
On the other hand, when j is not divisible by /?, the coefficient of XJ satisfies
(mod npZp).
This last congruence was obvious a priori, since
(7)-7(7-".W
3.4. Second Form of the Theorem
Let us give a closely related form of the mean value theorem for series converging
in M C A C K. Assume that /(f) = ]C*>ofl*'* converges whenever \t\ < 1.
More precisely, let us assume that the coefficients a^ e K satisfy
Wk\rk —> 0 for all positive real numbers r < 1.
The variable f itself can be taken in the field K or any extension of this field, e.g.,
in Cp. When t g Mp we can consider the restricted series fT e CP{X} defined by
MX) = J^akTkXk.
If If | < 1 we have f(t) = fz(t/r) as soon as the element t g Mp is chosen such
that |?| < |T| < 1. Obviously;
/ (f) = and sup \f (t)\ = ——.
T |r|<|r| lTl
The mean value theorem for the restricted series fT now gives
Equivalently,
l/(r + A)-/(OI<l*|- sup |/'(f)|.
Ul<|r|
All this is valid whenever \h/r \ <rp for some t g Mp. We can find such are Mp
e*actly when |/z| < rp. Let us summarize this result in the case K = Cp, using
the notation
HsIU, := sup \g(t)\ (<oo).
IH<i
248 5. Differentiation
Theorem. Let /eCp [[X]] be a formal power series that converges in the open
unit ball Mp. Assume that ||/'||<i < oo. Then we have
\f(t + h)-f(t)\<\h\.\\f'\u
for all t e Mp and \h\ < rp. I
In this case, the mean value theorem holds in Mp for increments heCp
satisfying \h\ < rp (notice the strict inequality). Examples of this situation are given by
power series / e ZP[[X]] (or more generally in A[[X]]): ||/'||<i < 1. Take for
instance / = J2k>o Xk = 1/(1 - X). For all \t\ < 1 we have \f[t + h) - f(t)\ =
\h\/\(l-t)(l-t-h)\ = \h\(t, h eMp), simply since \l~t\ = |l-f-A| = l
the strongest wins! Here |/'(f)| = 1/|1 - t\2 = 1 and H/'lUi = 1.
3.5, A Fixed-Point Theorem
Theorem. Let K be a finite extension ofQp, R = B<i(K)its closed unit ball
and f e K{T} a restricted formal power series with \\f\\ < 1. Assume
||/'|| < 1 and inf \f(x) - x\ <rp = \p\l^'l\
xeR
Then f has a fixed point in R.
Proof. The function / defines a continuous map from the unit ball R of K into
itself. Since \K x | is discrete in R>0 (we are assuming that the field K is a finite
extension ofQp), there is a point x0 e R with |/(*o)-*ol - V^efine inductively
jcn+i = f(xn) for n > 0. By the mean value theorem,
\xn+i ~xn\ = \f(xn) - /(xn_i)|
<I*n-*n-lHI/r|| < |*n-*n-l|,
and we see by induction that |xn+1 — xn\ < rp (n > 0). If xn — xn-\ for one
positive n, we are done. Otherwise, \xn — jcn_i | ^ 0 for all positive integers, and
as before,
l*n+l -*nl = l/(*n) ~ /(*n-l)l
< \xn -xn_i| - II/'H < |xw -xn_i| < rp.
The strictly decreasing sequence |xn+i — jcn | in the discrete subgroup | K x | C R>o
has to tend to 0: (xn) is a Cauchy sequence. The limit of this sequence is a fixed
point of / in R.
Comment. To show that the hypotheses are necessary, let us consider the function
f{T) -P+le QP[T] C QP{T}. We have
f(T)^pTP-\ ||/'|| = i<rp<l.
3. The Mean Value Theorem 249
This function / has no fixed point in the unit ball Zp of Qp. In fact,
f(x) -x=xp-x-\-l = l (mod p) (x e Zp)
so that infjceZp 1/00 ~~ *l = *> an(* the second assumption of the theorem is not
satisfied. (However, xp—x + 1 = 0 certainly has a root in a suitable finite extension
of Qp, and / has a fixed point in the unit ball of such an extension.)
3.6. Second-Order Estimates
Let us keep the notation of the general mean value theorem (3.2).
Theorem. We have
\f{t +h)- f(t) - f'{t)h\ < \h2/2\ ■ \\f"\\
whenever t,h e K satisfy \t\ < 1 and
\h\ < |y/l\ ifp = 2, \h\ < \p\l/{p-2) if p is an odd prime.
Proof. As in (3.2), it is enough to prove this theorem for polynomials. Let us write
the Taylor series of / at the point t:
f{t + h)- /(f) - f'(t)h = £>* • D*/(0/*!
k>2
= *2E-
k>2
uk-2
k\
hk~
Dk-2f"(t)
k-2 ftt
Dk~2f
j^kik-l) {k-2)\K'
As in the proof of the mean value theorem (3.2) we have
II Dk~2f" II „ l|r>*-2/"'ll
11 J " <ll/"ll (*>2), and " J ■
II (* - 2)!
(1) For p ^ 2 it only remains to prove
hk~2
\ (k - 2)[ |
0 (k -* ex)).
*(* - 1)
< 1 (k > 2).
tor k = p this requires \hp 2\ < \p\, which is the condition given in the theorem.
When it is satisfied, if v — ardp(k) > 1, we have k > pv and \k — 11 = 1, whence
hk"2
k(k - 1)
< ^ < \Pf
250 5. Differentiation
with an exponent
v >
P-l
v > 0
(the linear fractional transformation x h» ^^ increases when x < p —
vertical asymptote — since for x = 0 it takes a value pv~~l > 1 above the horizontal
asymptote). In the case v = ordp(k — 1) > 1, we have k > pv + 1, and the
preceding estimates are satisfied. Finally, when ordp(k — 1) = ordp(k) = 0, we
have \k(k — 1)| = 1, and the proof is complete.
(2) In the case p = 2, we take a factor h2/2 in front of the above Taylor
expansion, and it only remains to prove
uk-2
k(k - l)/2
< 1 (k > 2)
for |/z| < |\^|. For /: = 4 this already requires \h2/2\ < 1, which furnishes
the restriction \h\ < |\^|. Conversely, assume that this condition is satisfied. For
v = ord2(k) > 1,
k > 2V and \k{k - 1)/2| = |2|
v-l
whence
NA:-2
fc(it - l)/2
< |/Z|2 < |2|2v-,-l-(v-D
\2\e.
Since we are assuming v > 1, the exponent e is equal to 2V l — v > 0. One can
treat the case v = ord2(£ — 1) > 1 in a similar way. ■
Comment. The condition on the absolute value of the increment \h \ is less
restrictive for p = 2, but the inequality is also weaker in this case, since the denominator
2 in \h2/2\ is important (it is irrelevant for odd primes p).
Corollary. Let K be a finite extension ofQp, R its ring of integers with maximal
ideal P. For neN {or even n eZp), we have
(1 + xf = 1 + nx (mod pnxR)
as soon as x e 2pR.
Proof. We take f(T) = (1 + 7T, so that
f"(T) = n{n - 1)(1 + Tf~2 (n > 2),
|iril = |n(n-l)| (n>0).
4. The Exponential and Logarithm 251
For |x| < \p\l/{p-2) (resp. < \y/2\ if p = 2) we have
|(l+x)n-l-^|<
liril < |/uc|
Since \x/2\ < |/?| when jc e 2/?/?, the preceding inequality furnishes the expected
statement. ■
This corollary gives the fourth form of the fundamental inequality, mentioned
in(III.4.3).
4. The Exponential and Logarithm
4.1. Convergence of the Defining Series
Theorem. The series y£,k>i(—l)k~1xk/k converges precisely when \x\ < 1.
The series X^o**/^ converges precisely when \x\ < rp — \p\l/(p~~l\
Proof. Since \k\ = 1 for all integers k prime to p, in order to have convergence
of the first series, the condition \xk/k\ -> 0 implies |jc| < 1. Conversely, when
M < 1,
<k\x\k~+0 (k~+oc).
For the second series
_ |jt|*lp|~or(V*!) = tpikordpx-ordp(k\)
we use (3.1) for the p-adic valuation of factorials. The exponent is
Since Sp(k) — 1 when k = pj (j > 0) is a power of p, we have
i-L^O <=> k(ordp(x) TT/^00'
and this happens precisely when ordp(;c) —- > 0, namely when
ordp(x) > -, |*I < \p\l^'l)
p-\
as asserted.
252 5. Differentiation
By analogy with the classical case, we shall write
xk
ex = exp(*) = £ T7
for the sums of these series whenever they converge. Strictly speaking, we should
mention the dependence on the prime p and, for example, write logp(l + x) and
expp(x).
Comments. (1) In the p-adic domain, the exponential function is not an entire
function: The convergence of the exponential series is limited by the radius rp:
ri — \ and - < rp < 1 (p odd prime),
which we have already encountered as a limitation for the increments in the mean
value theorem (3.2) and (3.4). A heuristic explanation for this apparent
coincidence is furnished by the Taylor series, when expressed in terms of the differential
operator D = d/dx. Quite formally, we have
f(x +h) = J2 h DJ(X) = exp(/>D)(/)(*).
On polynomial functions, or more generally on restricted power series, we have
seen that ||D|| = 1, so that the series for exp(/iD) converges for \h\ < rp (as
we have just seen). However, observe that the first form (3.2) of the mean value
theorem holds even up to | h \ < rp. In the classical case, the exponential is an entire
function, and there is no limitation for the size of the increments in the mean value
theorem.
(2) Since |jc| < rp < 1 is required for convergence of the exponential, there
is no number e = exp(l) defined in Qp. For p > 3, however, rp > \/p = |pl»
and exp(p) is well-defined by the series: One could select a definition of a number
e = ep as a pth root of exp(p). Similarly, when p = 2 one could define e as fourth
root of exp(4). However, there is no canonical choice for these roots.
(3) The series defining the functions log and exp have rational coefficients.
Hence for each complete extension L of Qp,
xeB<l(L)==>log(l + x)eL,
x e B<rp(L) => e* = exp(jc) e L.
4.2. Properties of the Exponential and Logarithm
Proposition I. For \x\ < rp we have
| log(l + x)\ - |x|, | exp(x)| = 1, |1 - exp(x)| - \x\.
4. The Exponential and Logarithm 253
Proof. For k > 1 we have Sp(k) > 1 and hence ordp(kl) <(k- l)/(p - 1). We
infer
\k\ > \kl\ > \p\—y = rp
k-\
1*7*1 < \x*/k\\ < (Ixl/r^r1 - Ijc| < |x| < 1
for k > 2 and 0 < |jc | < rp. Hence the absolute values of the terms in the series
xk
1+* + X!t7 = exp(x),
*>2 K'
k>2
are strictly smaller than the first ones. By the ultrametric character of the absolute
value, the strongest (we underline it!) wins:
exp(x) = i + x + ^2''' =^ I exP(*)l = 1>
*>2
exp(jc) — 1 = x_ + ^2' *' =* I exP(*) — 1 f = fjcJ,
*>2
log(l + jc) = x + Yl' ■' =* I lo^! + *)I = M
tfui<v ■
Corollary. The only zero of log(l + x) in the ball \x\ < rp is x = 0. ■
In fact, we shall prove a stronger result:
x h-> log(l + x) is injective in the ball B<rp.
Proposition 2. For two indeterminate s X and Y, we have the following formal
identities:
exp(X + Y) = exp(X) - exp(K),
logexp(X) = X,
explog(l -\~X)= 1-f-X.
**oof. The first identity is easily obtained if we observe that the product of two
Monomials X1/i\ and YJ/j\ is
XlYi _ (i + j\ X'Yi
254 5. Differentiation
Grouping the terms with i + j = n leads to a sum {X + Y)n/n!. Let us turn to the
second identity. In the series log(l + X) = 5^n>1 anXn we would like to substitute
X = eY - 1 = b\Y + fc2F2 H (*i = 1)- We have to expand the following
expression and group the powers of Y:
£>„ (biY + W2 + •--)"= J2C"Y"-
n>l n>l
Here are the first coefficients:
c\ = a\bu c2 = a\b2 + a2b\, c3 = a\b3 + a2 - 2bib2 + a3b{.
More generally, we see that
cn = a\bn + a2(- - •) H h «n-i(- - •) + anb\.
For 2 < y < w — 1 the coefficient of 0/ is a polynomial in ^,..., bn-\ with integral
coefficients (of total degree j). The problem is to evaluate the polynomials cn at
the rational values
flu = , bn = — [n > 1).
« «!
The result of this computation is known: Identical computations are classically
made for the substitution of the real-valued power series x = ey — 1 in the real-
valued log(l + x) (convergent for |jc| < 1). But it is established in any calculus
course that the result is log(ey) = y. Hence all evaluations of the polynomials c„
vanish for n > 2, and the expected formula is proved. The third identity is treated
similarly. ■
Remark. The preceding proof is surprising: It relies on real analysis for a purely
formal result that is applied to p-adic series. It was our purpose to deal with the
exponential and logarithm funaion in an elementary way — before treating power
series systematically — and thus we had to give an ad hoc proof for this inversion.
But a more systematic treatment of formal power series will give us an opportunity
to present an independent proof of this property with no reference to real analysis
(VI.1).
Proposition 3. For \x\ <rp and \y\ < rp we have
exp(x + y) = exp(x) - exp(.y),
logexp(x) =x,
explog(l +jc) = 1 -h jc.
Proof. Observe first that if an and bn -► 0, then the family (anbm)njm>o is sum"
mable. In particular, its sum is independent of the way terms are grouped before
summing. Hence the first identity holds as soon as the variables x and y are in the
domain of convergence of the p-adic exponential
exp(x) - exp(y) = exp(x + y\
4. The Exponential and Logarithm 255
Let us check the second identity: We have to show that it is legitimate to substitute
a value jc e Cp, \x\ < rp in the formal identity
X = logex=log(l+e(X)),
where
i.
The substitution in the sum can be made by addition of two contributions:
(_ir-i
E
/w
-e(xr
+
."<# " Jx=jc L^>7V JX=jc
In the first finite sum, the substitution can obviously be made in each term
according to
(2 \n
x+h+"j =e(x)" (|jc|<r")-
Since \e(x)\ = \x\ < rp < 1, we have
e(jcf -* log(l + e(x)) = log ex (N -> oo).
The proof of the second identity will be completed if we show that the second
contribution is arbitrarily small (for large N). But when |jc| < rp, each monomial
appearing in the computation of e(x) satisfies |jc* /1! j < rp (because i > 1),
and each monomial appearing in the computation of e(x)m has an absolute value
less than r™. All individual monomials appearing in the evaluation of the second
contribution ^m>N • - • have an absolute value smaller than
sup
(-D
,m—1
m
Since the power series for the logarithm converges, it is possible to choose TV large
enough to ensure that all 11/m \r™ (m > N) are arbitrarily small and that the same
holds for their sum (independently of the groupings made to compute it). Again,
the verification of the third identity is similar. ■
Corollary, (a) The exponential map defines an isometric homomorphism
exp : £<rp -^> B<rp(l) = 1 + B<rp C €£.
{b) The homomorphism log : 1 + Mp —> Cp is surjective.
Proof, (a) The fact that the exponential map is injective in its domain of definition
results from the equality loge* = x. Better still, the exponential is an isometry:
■e-'\ = \e*\\er
-II
l^-'-ll
yV
256 5. Differentiation
The inverse of the isometry
B<
\ + B<
is the restriction of the logarithm to the ball B<rp(l) C 1 + Mp. In particular, we
have
log(l +x)(l+y) = log(l +x) + log(l + y) (*, y e B<Tp).
But the power series
fix, y) = log(l + x)(l + y) - log(l + x) - log(l + y) = £ a™*"1/
n,/M>0
converges for |jc| < 1 and |y| < L. Since
ara+n/
m\n\amn =
0,
(*,y)=(0.0)
awxany l
we conclude that the logarithm is a homomorphism in its ball of convergence:
log(l + x)(l + y) = log(l +x) + log(l +y) (x, y e £<i).
(&) If x e Cp, choose a sufficiently large integer n in order to ensure that
\pnx\ < rp. Hence
pnx = logexp/?njt, * = logf
for a pn th root § e 1 + Mp of exp pnx el+ Mp (ffl.4.5). ■
1+B<
The unit ball in C5
4. The Exponential and Logarithm 257
Theorem. The logarithm defines a homomorphism log : 1 + Mp -> Cp. Its
kernel is the subgroup fip^. Its restriction to 1 + B<rp is an isometry (hence
injective).
Proof. There only remains to establish the statement concerning the kernel of the
logarithm. Let x = l+rel + Mp be in this kernel. We know that xp" -> 1
(when Ji -> oo) (III.4.5: Proposition 2). Take n large enough so that \xp" — 11 < rp.
Since xp" is still in the kernel, we now have xp" = 1 by the corollary of the first
proposition. ■
4.3. Derivative of the Exponential and Logarithm
The exponential and logarithm are strictly differentiable functions in their disk of
convergence (2.4), and
[expxl' = J] -JT = £ TTZTv = CXP*'
[iogd+X)v = ^(-d*-1^. = Ec-n*-1
-i^-i
*>1 * M 1+X
Proposition. Let t e Mp. Then, the derivative of
x^(l+tf : Zp^- Cp
at the origin is log(l +1).
Proof. By definition, we have a Mahler expansion
*>o W
'p)->
since t e Mp c Cp. We deduce
-S'O-grG*:!)-'
d+*)*-i=u»*(;] = >:r(: ;h4
(1+0*
^-«-»-5(Ci:0-^)i-
258 5. Differentiation
When \t\ < 1 we know that tk/k -> 0, whereas
(^ _ J) — C—!>*-'| < 1. (*_J)->>(-l)*-1Whenjc-»0 (* > 1).
This proves that
(i+tr-i
log(l+0 (*->0inZj
x
uniformly in t on any disk B<r C Cp of radius r < 1. ■
Comment. We can write
(d/dx)x=0(l + tf =\og(l-)rt),
where the derivative is the limit of difference quotients taken with respect to
increments in Zp. When log(l +t) ^ 0 and \h\ is small enough, we have
(1+0*-1
= |log(l + 0l-
This provides an improvement of the second form of the fundamental inequalities
(III.4.3), in the region rp < |r| < 1. But | log(l + 01 is arbitrarily large in this
region, since log : 1 + Mp -> Cp is surjective (4.2).
4.4. Continuation of the Exponential
It is natural to try to construct a homomorphism
extending the exponential defined above by a series expansion. If such an extension
exists, when xeC^we can choose a high power pn of p so that pnx e B<rp (tne
exponent n depends on x) and then
fMp" = f(pnx) = exp(pnx).
In other words, f(x) has to be a pnth root of exp pnx in the algebraically closed
field Cp. This can be done in a coherent way, thus furnishing a continuation of the
exponential homomorphism.
Proposition. There is a continuous homomorphism Exp : C^ ~> 1 -f- Mp &'
tending the exponential mapping, originally defined only on the ball B<rp C Cp-
Proof. Recall that 1 + M^ is a divisible group (III.4.5), and divisible groups
are injective Z-modules (III.4.1), and hence enjoy an extension property ^
4. The Exponential and Logarithm 259
homomorphisms defined over subgroups. We can use this for the homomorphism
exp : B<rp —* 1 + B<rp C 1 + Mp,
since its target is the divisible group 1 + Mp. m
For this corollary, only the p-divisibility of 1 + Mp is used.
The usefulness of the extensions Exp is limited by the fact that none are
canonical. However, since the logarithm is defined on the image of any extension, the
composite log oExp : Cp —► Cp has a meaning. If x eCp, let us choose an integer
n such that pnx e B<rp, and consider the following equalities:
/?nlogoExp(jc) = log(Exp x)pH = log(Exp(pnx))
= log(exp(pnJc)) = pnx.
Consequently, log oExp(jc) = x. We have obtained the following result.
Corollary. The log : 1 4- Mp —► Cp is inverse to all extensions
Exp : Cp -> 1 + MP>
and any such extension is infective. ■
Let us summarize the construction in a diagram of homomorphisms (of abelian
groups):
Exp log
cp -X i+Mp -4 Cp
u u u
exp log
B<rp > 1 + B*zrp > B<rp-
Both composite arrows are identities.
4.5. Continuation of the Logarithm
Proposition. There is a unique function f : C* -> Cp having the properties
(1) / is a homomorphism: f(xy) = f{x) + f(y),
(2) The restriction of f to I + Mp = £<i(l) coincides with the logarithm
defined by its series expansion,
(3) f(p) = 0 (normalization).
Proof. Let us start with the uniqueness statement. By (III.4.2), the subgroups
P M(P) and 1 -\-Mp generate C*. Hence it is enough to see that the given conditions
imply that / vanishes on /?Q/x. This is obvious on the subgroup jz, since the field Cp
(of characteristic 0) has no additive torsion. On the other hand, if x e pQ, there will
te an equation xa = pb (with some integers a and b). Hence af{x) = bf(p) = 0
260 5. Differentiation
by the normalization condition, and f{x) = 0 as expected. For the existence
part, it is enough to define / trivially by 0 on pQfi observing that this definition
is coherent with the logarithm on the intersection p^fi Pi (1 + Mp) = ^ *.
But the subgroup ilp™ is precisely the kernel of the logarithm series (Theorem
in (4.2)). a
The preceding continuation of the log function is called the Iwasawa logarithm
and is denoted by Log.
Theorem. The Iwasawa logarithm has the following properties:
(I) It is locally analytic: In the neighborhood of any a ^ 0
(— l)*"1 ( x — a\k
Log x = Log a + Yl ~i ( ) A* ~a\ < M)-
(2) For* eZ*,
1 (i-^P-i)*
L°BXSST~^1^ 1 "
1 P k>i K
(3) For any complete subfield K ofCp, Log(A^x) C K.
(4) For every continuous automorphism oofCp,
Log(^) = (Logxf.
Proof. (1) When a ^ 0 let us simply write x = a{\ + (x — a)/a) and
Log x = Log a + log ( 1 + 1,
so that the asserted expansion follows.
(2) If x e Z*, we have xp~l = 1 (mod p) and
Log x = —l— Log (xp~l) - —?— log (1 + (xp-1 - 1)).
p-\ p-1 v
The series expansion is applicable to the last term and furnishes the announced
expression.
(3) If K D A D M and x e K*9 let us write
x = pr - K ■ u (r e Q, f e V(p), «el+ M),
so that Log x — log w . Hence we can find integers n and ra with xn = pmv, where
u 6 1 -h M, and hence nLogx = Logu. Since jc" e A^, we have v e K. Now,
the coefficients of the series defining the logarithm are rational and K is complete:
log v e K, and finally Log x = (log v)/n e K.
4. The Exponential and Logarithm 261
(4) Under the stated conditions, we can consider the homomorphism
/ : C*-+ Cp, x k> a'1 LogU").
We know that \x° \ = |jc| by uniqueness of extensions of absolute values first on
Q° (II.3.3) and then also on Cp by continuity. Since the coefficients of log(l + x)
are rational numbers, the second condition of the proposition is verified by /.
Hence / satisfies the three characteristic properties of the Iwasawa logarithm: It
must coincide with it. ■
Comments. (1) The product of the subgroups fAp°c and 1 + B<Tp is a direct product,
since the intersection of these subgroups is trivial. But this product is not equal
to 1 4- Mp. Indeed, the logarithm of an element in the product is in the ball
log(MP~-(l + £<rp)) = £<rp,
whereas log(l 4- Mp) = Cp. A different way of seeing this consists in observing
that
1 ^ £ = 1 + § e ^ => |5| = |p|i/*<i") > rp
(? of order pf) (II.4.4). Taking x = 1 +1 e 1 + B<rpi hence \t\ < |§|, we have
f* = l + £+f+ £f with |£f| < min(|£|, \t\).
Now
\tX-l\ = \t;\ = \p\l/^f)
has a very particular form. It is clear that we are not obtaining all elements of
1 + Mp in this form (in Mp the p-adic order is an arbitrary positive rational
number).
(2) The rationality property of the logarithm shows that for every finite extension
K of Qp the logarithm furnishes an isometric isomorphism
log: 1+ B<rp{K)^B*rp(K)-
to particular,
log : 1 + pZp -^ pZp (= Zp) (/? odd prime),
log : 1 + 4Z2 ^ 4Z2 (= Z2).
to general, with the conventional notation
K D RD P = ttR,
262 5. Differentiation
the multiplicative subgroup 1 + P still contains its torsion subgroup (p-torsion)
as a direct factor. Let m be the largest integer such that K has a pmth root of
unity: This torsion subgroup is iipm\ it is cyclic. One can show that there is an
isomorphism
1 + P ^ Z/pmZ x R.
This results from the structure theorem for finitely generated Zp-modules: The
ranks of the multiplicative Zp-modules
1 + P, 1 + B<r,(ff) c 1 + P
are the same, since the quotient is finite, cyclic of order pm (cf. A. Weil: Basic
Number Theory). These results do not extend to infinite extensions of Qp contained
inCp.
(3) Let us still consider a finite extension K of Qp in Cp. When P = B<rp(K),
both cases
lip C 1 + P and ilp <£ 1 + P
can occur. For example, for p = 3, consider as in (II.4.7) the two quadratic
extensions Kx = QKV^) and K2 = Q3(a/3). Since the field QHV-3) contains
a 6th root of unity (the ring of integers of this field is a hexagonal lattice in
the complex field) and since J^-i e Qs(a/3, V—3) = K\ - K2, we see that
/x4 C ^i ■ K2 and necessarily
M12 = At4 - A*3 C Kx - K2.
On the other hand, the order of n0) D (Kx • K2) is #(#* ) = 3/ - 1, hence of the
form 2, 8, 26,..., and the presence of a fourth root of unity implies that this order
is greater than or equal to 8. In particular, K\ ^ K2. Since Q(\/^3) = Q(ft)we
see that f3 e A^i but f3 ^ A^2- Nevertheless, quite generally,
vp~(K) = vp~nK (c (l -h Mp)n k = l + />).
If P = nR, then ijlp~(K) c 1 + B<[7r](Cp), so that the order of iip~(K) is adivisor
of the order of
^n(i + %,(Cp)),
which is known, since the absolute values |f — 1| for f e/xpoc are given by
(III.4.2).
5. The Volkenborn Integral 263
(4) Here is a diagram summing up the general situation:
log
(1) ^1 + ^4^
n n ^ n
log
flp c > 1 + B<rp > BSrp
n n n
log
fipc ^-+ 1 + Mp —* Cp
n n ^ n
The lines consist of short exact sequences, split by the choice of a section Exp
of log. Observe that the subgroup /?Q • \i is well-defined, independently from the
choice of a copy of pQ cC^.
Note. The possibility of extending the exponential to the whole of Cp had already
been shown by M.-C. Sarmant(-Durix) in her doctoral thesis. We have followed
the method of the book by W. Schikhof.
5. The Volkenborn Integral
5J. Definition via Riemarm Sums
Let K be a complete extension of Qp. We are going to define f f dx for certain
functions / : Zp -> K. Unfortunately, Corollary 3 in (IV.3.5) shows that one
cannot define nontrivial translation-invariant linear forms on C(Zp). Let us recall
this result (notation ft(x) = f(x + 1)).
Lemma. If<p : C(Zp) —> K is linear and translation-invariant, i.e.,
<p(fi) = <p(f)forallfeC(Zp),
then cp = 0. ■
Observe that we can define translation-invariant linear forms on jFlc(Zp), the
space of locally constant functions on Zp (IV.2.1). Indeed, we can construct such a
linear form with <p(l) = 1. Translation invariance imposes the same value l/p for
the characteristic functions of the cosets of pZp. More generally, this translation-
tavariant linear form should take the same value l/pn for the characteristic
functions of the cosets pnZp. These functions have sup norm 1 but
\<p(f)\ = \y\ = Pn is arbitrarily large.
264 5. Differentiation
This shows that this linear form is not continuous. Equivalently, we can define the
p-adic "volume" of the balls B<\^(J) in Zp to be
™(fl<ip«i0")) = ^Qp-
The next construction works for dijferentiable functions — at least (Sf)'(0) should
exist — and is not translation-invariant. It is convenient to deal with strictly dif-
ferentiable functions / e Sl(Zp).
Let us start with the expressions
A E /U>= H fU)mU + pnZP)
o< j < pn o< j < pn
representing Riemann sums for /. The integral of / over Zp will be defined as the
limit (n -> oo) of these sums, when it exists. The indefinite sum F of a function
/ has been defined in (IV. 1.5) in order to have VF = f (F(0) = 0):
F = Sf=l±f: F(k) = J^ fU).
Hence we have
1 Y- ,,„ F(p") - F(0)
P 0<j<pn
(since F(0) = 0), and we see that the limit exists if F is dijferentiable at the origin.
When the function / has Mahler coefficients c„, we know that the coefficients
of Sf are simply shifted, and the differentiability of Sf at the origin is equivalent
to the requirement
|c„_i/«| -> 0 (ji^co)
(Theorem 1 in (1.5)). This is the case if / e Sl(Zp).
Definition. The Volkenborn integral of a function f e S}(Zp) is by definition
[ f(x)dx=\im\ J^ /tf) = (S/)'tf>).
If / = c is a constant, then fz f(x) dx = c. Here is a main property of this
integral.
Proposition 1. (a) For f e Sl(Zp) we have
[ f{x)dx\<p\\f\U.
Jz
5. The Volkenbom Integral 265
(b) Iffn -> f in S\ namely ||/„ - fh -> 0, then
[ fn(x)dx^ [ f(x)dx.
JTD JZD
Proof. By definition,
/,
fMdx
= l(S/)'(0)| < \\Sfh = sup(||*S/||, |5/(0)|),
so that (a) follows from Corollary 2 in (1.5):
IIS/Hi <p||/||i.
(b) is a consequence of (a). m
Recall that we use the notation V/ for a discrete gradient of a function /:
V/(x) = /(* + l)-/(x).
Proposition 2. For f e S1(ZP) we have
[ Vf{x)dx = f'{0).
JzP
Proof. By definition,
[ v/(x)dx = (sv/yco) = (/ - /(0)/(0) - /'(0),
Jzp
since SV/ = / _ /(0) (Proposition 2 of (IV. 1.5)). ■
5.2. Computation via Mahler Series
The indefinite sum of a binomial function Q is the next one S(n) = (n+1)- This
observation makes it easy to compute the Volkenbom integral of a function / e S1
°f which the Mahler expansion is known.
Proposition. Let 5Zjt>o c* (*) ^e tne Mahler series of a strictly dijferentiable
function f e Sl. Then
[ f(x)dx = J2
(-!)*<;*/(*+D-
266 5. Differentiation
Proof. Since fn — zlk<n
c*Q tends to f in ||. || i, we can simply integrate term
by term:
\k + l) k+l\ k J
C y(0)=iim-i-f'-,)=-i-('-|)=<=ii
V^-h 1/ x-±ok+\\ k ) k+\\k) £+1
Now,
and
implies
(one can also apply directly Theorem 1 in (1.5) to the function Sf). ■
Example. Let us fix t e Mp c Cp, namely \t\ < 1, and consider the function
f = ft defined by
/(x) = (l + r)J
Then
[ (l + t)xdx = Yt-^==-log(l±t) (=lfbrr = 0).
Jz„ t£ k +J f
5.3. Integrals and Shift
A few more formulas for the Volkenbom integral will be useful. Recall that the
translation operators xx have been defined in (IV.5.1) by
r*/(0 = /(* + *>■
In particular, for x = rx = E (unit translation), xf = /i. Let us also denote by D
the differentiation operator, V the finite difference operator, and S the indefinite
sum. Obviously, D commutes with translations and consequently also with V =
r-id.
Proposition 1. Let Pq : f h> /(0)1 be the projection ofSl(Zp) onto constants.
Then the following relations hold:
5. The Volkenborn Integral 267
(a) Sx = rS- Po-
(b) DS commutes with all translations xx.
(c) SD = DS- P0DS.
Proof. By definition, for integers n > 1,
S(rf)(n)= £ xf(j)= J2 fU + D
0<i</2
which proves Sx = xS — Po (by density of the integers n > 1 in Zp and continuity
of the functions in question). On the other hand, differentiation of the function
Sxf = xSf - /(0) leads to DSxf = DxSf = xDSf. Moreover, recall that
VSf = / but SVf = / - /(0) (IV.1.5). In other words,
VS = id, SV = id - P0.
We infer
SD = SDVS = SVDS = DS - P0DS.
The proposition is proved. ■
Proposition 2. Ler / e S2(Zp). 7fo#i
(fl) f Txf(t)dt = {SfY{x).
(*>$(/')(*) = [ f(x + t)dt- [ f(t)dt.
Proof. Start with the definition fz f{t)dt = (S/)'(0). Hence
f /(r + l)dr = f xf(t)dt = (Sxfy(P) = DSxf(0)
Jzp Jzp
= xDSf(0) = DSf(l) = (Sf)'(\y
*he first formula (a) for a positive integer x = n follows by iteration, and for any
x ^Zpby continuity and density (alternatively, one can do the same calculations
W]tii xx in place of r). Recall now Proposition 2 of (5.1),
[ V/(r)rfr = //(0)>
JzP
^d use a translation
P Vf(t+x)dt = f'{x).
JzD
268 5. Differentiation
From this, it is obvious that S(f')(n) = /'(0) H h f\n — 1) can be expressed
as a telescoping sum
S{f'){n) = [ f(t + n)dt- [ f(t)dt (\<ne N),
and by continuity and density of the positive integers n in Zp,
S{f'Kx)= [ f{t + x)dt- [ f(f)dt (xeZp).
JZp JZp
Writing /' = g, we can choose any primitive G e Sl(Zp) of g and write
S(g)(x)= [ G(x + t)dt- [ G(t)dt.
JzP JzP
Of course, two different primitives of a function g may differ by any function h
having h! = 0.
Proposition 3. Let f e S2(ZP) c S\ZP) and define F{x) - fz f(x + t)dt.
Then F e Sl(Zp) and
F\x)= [ f\x + t)dt.
Proof. By Proposition 2 of (1.3),
/ e S\ZP) => /'e S\ZP),
so that
G(x)= f f'(x + t)dt
Jzp
defines a function G e Sl(Zp). Moreover, by Proposition 2 (a),
f f(x + t)dt = (Sf')Xx) = (OSOf)(x),
JzP
which proves G = OSOf. Now by the first proposition SO = OS - PoDS and
G = 0{OS - P0OS)f = OOSf = (5/)" = F\
because F = (5/)'. ■
Proposition 4. Let o denote the involution (1.1.2) x i~> —1 — x of Zp. Then
[ (foa)dx= [ fdx.
JzD JzD
5. The Volkenborn Integral 269
Proof. We have seen that
Sf{h)
f fdx = (Sf)XO)=\im^.
Jz h-^o h
Let us take h = —pn {n -» oo). Hence
(S/)(0) = hm = hm .
n->oo —pn n->co pn
But by the Corollary 4 in (IV.3.5),
-SA-pn) = S(foo)(pn)9
whence the result (S/)'(0) = (S(f o o))f (0). ■
Corollary If f is an odd function, then
b/d——-
Proof. Quite generally, using Proposition 2 in (5.1) and Proposition 4, we have
/'(0) = / {f{x + 1) - /(*)) dx = / (/(-*) - /(*)) rfx.
■fa, Jzp
Now, if / is odd, we obtain the announced result
/'(0)= [ -2f{x)dx.
JzP ■
5-4. Relation to Bernoulli Numbers
In (5.2), we have proved
f (l + tfdx = -log{l + t)
JzP l
f°r \t\ < 1, t e Cp. Let us now choose |r| < rp and define s = log(l + t), so that
t = es - 1, \s\ = \t\ <rp.
The preceding formula now reads
JzD
esxdx= S
es - 1
270 5. Differentiation
Classical Definition. The Bernoulli numbers are the rational numbers bk
defined by the following generating function:
— = £**-•
e'-l +-± k\
k>0
Here are the first few values:
b0 = 1, bx = -\, b2 = \, b3 = 0, b4 = -^, b5 = 0,
and,
^6 = 4^ ^8 = -^, ^10 = 66, ^12 = ~2^-
Since we can also write etx = $^>0 tkxk/k\ with a convergence in 5!(ZP), we
can integrate term by term (Proposition 1 in (5.1))
and identify the coefficients
bk = I xkdx.
JzP
Observe that by definition bk e Q, and these integrals are independent of the prime
p used to compute them! Also, \bk\ < pllJC^Ih = p (still by Proposition 1 in (5.1)),
namely | pbk | < 1, i.e., pbk e Zp Pi Q. In (5.5) we shall give a more precise result.
Proposition. The Volkenborn integral of a restricted series f = Yln>oan%n
exists and can be computed term by term:
I f(x)dx = ^2/anbn.
Here, the bn are the Bernoulli numbers, and using fz f(x) dx = —/'(0)/2 for
the odd functions f(x) = x2k+l (Corollary at the end of (5.3)), we obtain
*>! = -{, b2k+l=0 (*>1).
Classical Definition. The Bernoulli polynomials Bk are defined by the
following generating function:
V- r, , Jk teXt
> Bk(x)— = .
*>o K- e l
5. The Volkenborn Integral 271
I hope that no confusion will arise between the Bernoulli and the Bell
polynomials (also denoted by Bn in (IV.6.3)): The context should always explicitly specify
which ones are under consideration!
Obviously, bk = Bk(0). Conversely, the definition
_ ^p jcV tk
leads to an explicit expression of the Bernoulli polynomials (with Bernoulli
numbers as coefficients):
*(*) = !.! £ b^
j+k=n J mK m
= E (%-»'■
Thus Bn is a monic polynomial of degree n (equal to its index). This expansion
is symbolically written "Bn(x) = (b + *)":" the binomial formula leads to the
correct expression, provided that we interpret fe* as the fcth Bernoulli number bk.
Here are a few values:
floto = 1, Bx{x) = x-\, B2(x) = x2-x + \.
Returning to the Volkenborn integral, we have
Identification of the coefficients leads to the p-adic expression of the Bernoulli
Polynomials
Bk(x)= f {x+yfdy.
The formula (5.3)
Jz»
Vf(x + y)dy = f(x)
for / = j* leads to
f (x + y+lfdy- f (x + y)kdy =
Jzp Jzp
kx
.k-l
272 5. Differentiation
= (h + 1)B„(jc)+ £ (" .'- ')fi*W-
k<ji~
namely
Bk{x+1)-Bk(x) = kxk-1.
In particular, Bk{\) = 5^(0) for k >2, and these polynomials may be extended by
1-periodicity on R. We obtain the continuous periodic functions x \-± Bk(x — [x])
(/: > 2) on the real line (as usual, [x] denotes the integral part of a real number*
so that 0 <x — [x] < 1).
On the other hand, we can expand (y + x + 1)" = 51 (*)(> + *)* ^^ hence
rewrite
(n + l)x" = (xn+l)' = f J2 (" +k \x + y?dy
fc<„-i V K /
This gives a recurrence relation for the computation of these monic polynomials:
In particular, for x = 0 and w > 1,
Another relation for the Bernoulli polynomials is easily obtained from the fact that
the integrals of / and of / o a are the same (Proposition 4 in (5.3)):
Bk{\-x)= f (y+l-xfdy
-L
{-\-y+\-xYdy
= (-D* f (y + x)kdy = (-l)kBk(x).
5.5. Sums of Powers
The above formula VBk(x) = kxk~\ with SV = id - P0, leads to Bk(x) -h-
kS(xk~l). Replacing k by k -f- 1, we obtain
*) = Mh*w (*>0).
5. The Volkenborn Integral 273
This gives an explicit formula for the sums of powers:
1 ^ A-hl\ .. xk+l
E(7)
*',= fTT.k J 6-"' + *+>
bkX+L\j-^bM-ii+^-
[.-liK'->T + i
Here are a few explicit expressions for these sums of powers:
Sk{n) = £ ** = S{xk)\x=n (k > 1),
i<i<«
Si(n) = \nin - 1),
S2in) = \nin - l)(2n - 1),
S*{n) = \n\n - l)2,
S*in) — \n5 - ±n4 + |n3 - ±n,
S5in) = In6 - ^5 + ±n4 - ±n2.
(Observe that for k = 0, Six0) = Biix)—bi = x gives a sum of powers J2i<<n*° =
n, which is correct if the summation is extended over the indices 0 < i < w and
*° is the constant 1, including 0° = 1.) In the Archimedean theory, the main term
is xk+l /[k -f- 1): It gives the primitive of xk, namely the area below the graph of
t h> tk between the values 0 and x. Here the main term will turn out to be bkX.
Proposition. When p is an odd prime, the sums ofkth powers satisfy
Skip) = pbk imodpkZp) (*>1),
while
Ski2) = l~2bk (mod kZ2) ik > 1).
Proof. We have pbo = p9 pb\ = —p/2, which are both in Zp (even if p = 2).
We already know (5.4) that pbk e Zp ik > 0) (this also follows by induction, as
the next argument shows). Since
v / k \ Dj Dk+1
Sk(p) = pbk\VSj-i)bk^7 + TTi
z£0-J
pj-2 . , pk
pbt + pk > ( . )pbk+i-j——- + pk
JU-l) ' *(*+!)■
274 5. Differentiation
we have to show that
^ /k-l\ Pj~2 Pk
V ( \pbk+x.~ + —- eZp.
But the pbk+\-j are in Zp for j > 2, and for p > 3
, .,. nx . .. J-Sp(j) ^ J ~ 1 . ,
ordp jO - 1) < ordp j! = ^— < < j - 1
p-1 /?- 1
implies ^^ e Zp. For p = 2,
ord2 y0" - 1) = max(ord2y, ord2(j - 1)) < y - 1
with equality for j = 2. This explains the loss of one power of 2. ■
Corollary. For any prime p, b2n € QHp~lZp {n > 1). ■
Remarks. (1) For p = 2 and o<2<2/: > 3, the corresponding Bernoulli number is
zero: The congruence 5jt(2) = 1 = 0 = 2bk holds mod &Z2, not mod 2£Z2. For
even k = 2n > 2, the same congruence forces &2n to have an even denominator.
(2) For p = 3 and even /: = 2n > 2, the congruence S2w(3) = 3&2n (mod 3Z3)
leads to 3b2n = 1 + 22n = 1 + 4n (mod 3Z3), and 3 appears in the denominator
of bin- By the preceding remark, the factor 6 appears in the denominator of all b2n
{n > 1).
(3) The property pb-m e Zp (all primes p) means that the denominator of bin *s
a product of distinct primes (each prime occurring at most once). We have a more
precise result.
Theorem (Clausen-von Staudt). The denominator of the Bernoulli number
bin is the product of the primes I such that I — 1 divides 2n. More precisely<,
bm = - 5Z 7 + m2n (m2n e Z*-
l prime:l~l\2n
Proof. Let us start with the congruence pb2n = Sinip) (mod pZp). Now, it is
easy to compute a sum J2o<j<P Jk m°d P- This is a sum over the field Fp. Put
Sk = (Sk(p) mod p)e¥p.
For each 0 ^ u eFp we have
sk = ^ xk = ]£^(*w)* = uk ^ *k ~ "****'
jcgFp jteFp xeFp
whence
(1 - uk)Sk = 0.
5. The Volkenborn Integral 275
If k is not a multiple of p — 1, we can choose «eFpx such that uk ^ 1 (because
Fx is cyclic), and in this case we see that sk = 0, namely Sk(p) == 0 (mod p).
On the other hand, if p — 1 \ k, then
jr€Fp O^eFp
This information can be gathered together in the following form:
\ £ prime: t-\\k^2n /
Letting the prime p vary (an exception!), we obtain
b*+ e ;hQn n zp=z
t prime:l—\\k p prime ■
5.6. Bernoulli Polynomials as an Appell System
In (5.3) we have proved
/'(*)= [ Vf(y + x)dy.
In the case of Bernoulli polynomials, this gives
B'kW= [ VBk(y + x)dy= [ k(y + xf~x dy = kBk_y(x).
JzP JzP
Hence (Bk)k>G is an Appell system of polynomials. In particular, it satisfies the
modified binomial identity for Sheffer systems (IV.6.1). We can derive it
immediately in our context:
Bn{x + y) = I (t + x + y)ndt
Jzp 0<*<n W
{t+x)kyn-kdt
e (;W
0<*<n
i n /, i jl iv
In umbral notation, we can write symbolically
B»(* + y) = "(»(*)+30V
which generalizes (5.4) Bn(y) = "(b + v)n," since £n(0) = fc„. Let us give the
relation between this system and composition operators. Let U be the operator on
276 5. Differentiation
K[X] defined by
U(j>Xx)= [ p(y + x)dy.
It is obvious that this operator U commutes with the unit translation E = Ti = t,
hence (IV.5.3) with all translations:
rU(p)(x) = U{p)(x + 1) = [ p(y + x + l)rfy = U{xp){x).
Jzp
By definition £/(**) = Bk, and the system of Bernoulli polynomials is a Sheffer
sequence (IV.6.1). Moreover, as we have seen in (IV.6.2), V = eD — 1. We deduce
V{Up){x)= [ (pO> + x + l)-p(y + x))d/=Vw.
Jzp
(eD - \)Up = Dp, Up = ~^—p.
eu — 1
This is the expression of the composition operator U as a formal power series in
the derivation D (IV.5.3).
Exercises for Chapter 5
A. Classical reminder. Let /, g, h : R -> R denote the functions defined by
fix):
x2sin(l/x)ifx^0,
0 if x = 0,
andg = /+x/2,/z = /-x2.
(a) Prove that / is differentiable at every point with /'(0) = 0, but / is not strictly
differentiable at the origin / £ Sl(0) and /' is not continuous.
(b) Prove that g is differentiable at every point with g'(0) = \, but there is no
neighborhood of the origin in which g is increasing.
(c) Prove that h is differentiable at every point with /z'(0) = 0 and there are infinitely
many points in every neighborhhood of the origin at which h has a relative
maximum.
B. Classical reminder (continued). Let / be a real-valued function defined in the
neighborhood of a point a e R. Assume that f e Sl {a) and f'(a) > 0. Show that there is a
neighborhood V of a such that the restriction of / to V is an increasing function and
in particular is injective.
1. Discuss the continuity and differentiability at the origin of the following functions on
e»<;> e/(;>
„>o v1/ „>o \P /
z„.
Exercises for Chapter 5 277
2. Prove that the function Sp introduced in (V.3.1) — sum of digits in base p — satisfies
Sp(m + n) = Sp(m) + Sp(n) - (p - l)oidpf ^ * J,
5p(m - w) = 5p(m) - Sp(n) + (/? - l)oidprj.
3. Let / : Zp -> Qp be defined by
/(*) =
P"if|xl = |p«| = ^,
0 ifx = 0.
Then / is locally constant outside the origin, and lim^^o \f(x)/x I = 1- By refining the
preceding definition, construct a function g that is locally constant outside the origin,
also differentiable at the origin with g'(0) = 1.
4. Check that | log(l + x)| < rp when \x\ = rp. But show that | log(l + x)| is variable on
the sphere \x\ — rp,
(a) For. which values of x e Cp do the following series converge?
r2«+l
-*-ir + - = D-^
3! £8 & + W
cosx = i _x +... = Y{-xT^—.
2! 4* (2n)!
n>0
(fc) In the disk of convergence, prove that
2 2
sin x 4- cos x — 1,
sinx cos y 4- cosx sin y = sin(x + y),
cos x cos j — sin x sin j = cos(x + y).
Compute the derivative of the functions sin and cos.
(c) Choose a square root i of — 1 in Cp and prove that
cosx + i sinx = e1* (i e Cp, i1 = —1).
(<0 Check the estimates (give their domain of validity)
|sinx| = |x|, |cosx| = l, |cosx —1| = ?
6- Prove that when t e Mp, x \-+ (1 + t)x is differentiable at the origin of Cp: To compute
the limit of differential quotients for x -> 0 (in Cp not only in Zp), use the expression
(1 + t)x = exp(x log(l + /)) valid for small |x|.
7- The Chebyshev polynomials (of the first kind) can be defined by the classical formulas
Tn(cos6) = cosaz0 (n > 0). Observe that Tmn{x) — Tm{Tn{x)). When p is an odd
prime, prove that
Tp(x) = xp (mod p)
278 5. Differentiation
and with the mean value theorem, show that
Tnp(x) = Tn{xp) (mod pnZp(x))
(what can you say about the case p — 2?).
8. Let us say that a polynomial f(x)e Zp[x] is an nth pseudo-power when f\x) e
nZp[x].
(a) Show that the following polynomials are nth pseudo-powers: xn, f(x)n (/ any
polynomial), Tn (Chebyshev polynomial of the first kind; cf. previous exercise).
(b) Using the mean value theorem, prove that if / is an nth pseudo-power, then
a =ee b (mod pZp) => f{a) ^ f{b) (mod pnZp).
(c) Suppose {fn)n>0 is a sequence of polynomials with deg /„ = n and satisfying the
congruences
fPn(x) = fn(xp) (mod pnZp).
Show that fn is an rath pseudo-power. Deduce that for m e N, a e Zp, the sequence
fmpv has a limit for v -> oo.
9. Define a sequence of polynomials inductively by the conditions
/ Pn(
Jo
PO = I. Pn = primitive of p„_i such that / p„(xMx=0 <n>l).
The first one is p\ (x) = x — |.
(a) Prove that pn(x)= Bn(x)/n\, where #„(x) denotes the nth Bernoulli polynomial.
(6) Prove that pn(l) — pn(0) (n ^ 1) and compute the Fourier series expansions of
the 1-periodic functions /„ extending Pn\[0,i]>
Jhtimx
For even n ~2k >2 there is absolute convergence, and
2 12
10. For any prime /?, prove the following congruence for the Bernoulli numbers:
2n(bpn — £«) = 0 (mod pnZp) (n > I).
(///«/. Use the congruence 7P" = 7" (mod pnZp) (exercise 8), hence a similar
congruence for the sums of powers Spn{p) and Sp{p% and conclude by Proposition m
(5.5).)
11. For each wz > 1, show that the numerator of
2+- + -+-- + -
2 3 n
is divisible by 2™ when n is large enough.
Exercises for Chapter 5 279
(a) Check the preceding assertion experimentally for a few values of n > 2.
(b) Prove the general statement by consideration of the logarithm 1 + M2 -> C2 and
the expansion of log(l — 2) = log(— 1) = 0.
12. Show that all continuous homomorphisms / : Z* ~> Q* have the following form:
/(Cm) = f'V (f € Mp-i, «el + pZp)
for some v € Z/(/? — 1)Z andx € Zp.
13. Prove that an infinite product ]l«>o(1 + fl«)' wnere all a„ ^ —I, converges for any
sequence (an)n>o converging to 0 in Cp.
(Hint. Use )log(l + an)\ — \an\ if \an\ is small.)
14. Let / be an ordered set, (E|)ie/ a family of sets (or groups, rings,...), and let <pij :
Ej -> Ei be maps (resp. homomorphisms,...) given for i < j e /, subject to the
transitivity conditions
<Pij = <Pu o <Pij : Ej ~+ Ei-+ Et (i < I < 7).
Assume that / contains a countable cofinal sequence S : in < i\ < ■ ■ - and consider
the projective system (Ein,<pin+uln)n>o with projective limit lim^ Ex. Show that if T
is another countable cofinal sequence in / and lim 7 E, is similarly defined, there is a
canonical isomorphism lim s Et ~ lim j Ei (use the universal property of projective
limits). Provided that / has a countable cofinal subset, we may define lim Et by choosing
such a sequence 5 and putting lim Ei := lim 5 Ei. For example, let A be the maximal
subring of a complete ultrametric field K and consider the ideals
lr = £<r(/0 C A (0 < r < 1).
Establish the following isomorphism:
A{X] -^ lim (A/Ir)[X]
(limit when r \ 0).
6
Analytic Functions and Elements
A powerful method for defining functions is provided by power series (we have
seen two examples in Chapter V: exp and log). This method is here developed
systematically, and we come back to a more thorough study of formal power
series. As is classically known, uniform limits of polynomials in a complex disk
lead to analytic functions.
Another class of special functions is supplied by rational functions, namely
quotients of polynomials: The simplest being the linear fractional transformations.
We also study them in this chapter, especially since their uniform limits in the
p-adic domain lead to the "analytic elements" in the sense of Krasner. Indeed, in
ultrametric analysis, the sole consideration of balls is not sufficient and in particular
not adapted to analytic continuation.
In this chapter the field K will still denote a complete extension of Qp in Cp (or in
£2p) often with dense valuation. The results that also require K to be algebraically
closed will be simply formulated for the field Cp (they are also valid for Qp)-
1- Power Series
7.7. Formal Power Series
Formal power series have already appeared repeatedly (with integral coefficients
in (1.4.8), with coefficients in a field in (IV5), (V.2)). We now study them more
systematically.
Let A ^ {0} be a commutative ring with a unit element 1. The formal power
series ring A[[X]] consists of sequences (an)n>o of elements of A, with addition
1. Power Series 281
and multiplication respectively defined by
(fln)n>0 "h (bn)n>0 = fan "h bn)n>0,
Instead of the sequence notation (an)n>o we shall prefer to use the notation / —
f(X) = 5Zn>ofl«^n ^or a formal power series. The formal power series ring
A[[X]] contains the polynomial ring A[X], and since 1 e A, we have Xn G
A[[X]] (n > 0).
Let us show that this formal power series ring constitutes a completion of the
polynomial ring.
Definition. Lef /(X) = X!n>o fl«^" ^e a nonzero power series. Its order is the
integer
o) = co(f) = min |neN:fl„^ 0}.
This order is the index of the first nonzero coefficient of /(X). We shall also
adopt the convention co(0) = oo with the usual rules
oo>n, oo-hn = n-hoo = oo (n > 0).
The following relation is then obvious:
<*>(/ + g)> min(G>(/), <w(g)),
with an equal sign if the orders are different. Moreover,
w(Xnf) = n + a>{f)
shows that
{/(X) : co(f) >n} = XnA[[X]]
ls the principal ideal generated by X" in the formal power series ring. Since
A[[X]]/X"A[[X]] = A[X]/(X"),
we also have
A[[X]] = fim A[[X]]/XnA[[X]] = lim A[X]/(Xn)
(with obvious identifications), and the ring A[[X]] appears as a completion of the
ring A[X] for the metrizable topology admitting the ideals (X") as a fundamental
system of neighborhoods of 0.
When the ring A has no zero divisor, we have, moreover,
co(fg) = co(f) + co(gl
282 6. Analytic Functions and Elements
Taking/ = g we infer co(f2) = 2co(f) and co(fn) = nco(f) (n > 0) by induction.
In particular, we see that if A is an integral domain, so is the formal power series
ring A[[X]]. If we iterate the construction, A[[X]][[y]] = A[[X, Y]] is also an
integral domain. We have obtained the following result.
Lemma. Let A be an integral domain andn a positive integer. Then the formal
power series ring A\[X\,..., Xn]] is also an integral domain. ■
Definition. The formal derivation D of the ring A[[X]] is the additive map
defined by
D (J2anXn J = J2nanX"-1 = J2nanX"-\
It satisfies
D{fg) = D(f)g + fD(g) (/, g e A[[X]]).
Since ker D D A, we see in particular that
D(af) - aD(f) (a e A, f e A[[X]]),
namely, the derivation D is A-linear. Since
co(D(f))>co(f)-l,
it is also continuous for the previously defined topology.
If we iterate this derivation D we obtain
Dk{Xn) = n(n - 1) • • • (n - k + l)Xn~*,
and since the product of/: consecutive integers is divisible by kU we can define
±Dk:Xn^ (n)xn~k
k\ \k)
even when the ring A does not contain inverses of the integral multiples of 1 - Hence
we define an A-linear map
^Dk : A[[X]] -> A[[X]]
correspondingly. In spite of the fact that a formal power series does not define a
function, we also use the notation /(0) for the constant coefficient ao of /. Then
if /(X) = £^o^*n,wehave
ak = ~Dk (f)(0) (*>0).
1. Power Series 283
7.2. Convergent Power Series
Since we are assuming that the field K is complete, an ultrametric series
converges when its general term tends to 0. If r > 0 denotes a real number such that
\an \rn -± 0, then J2n>o fl" *" converges (at least) for |jc | < r, and we get a function
Bsr(0) -+ K.
Definition. The radius of convergence of a power series f = 5Zn>o a"Xn
having coefficients in the field K is the extended real number 0 < r/ < oo
defined by
rf = sup{r >0: \an\rn -± 0}.
Alternatively, we can consider the values of r > 0 for which (\an \rn) is bounded:
sup{r > 0 : \an\rn -+ 0} < sup{r > 0 : (\an\rn) bounded),
and conversely,
(\an\rn) bounded => \an\sn -► 0 (s < r)
proves the other inequality, so that
rf = sup{r > 0 : Qan\rn) bounded}.
It is possible to compute this radius of convergence as in the classical complex
case by means of Hadamard''s formula.
Proposition 1. The radius of convergence of f = ^2n>oan^n ™
1 1
lim„>0 \an\l/n limsup^oo K|1/w '
Proof. Define rj by the Hadamard formula. If |jc| > r/ (this can happen only if
*7 < oo!), we have
lim sup Mia*!17* = |jc| - lim sup \ak\l/k = \x\ ■ — > 1.
»-*°° k->n "^°° k->n rf
Hence the decreasing sequence supjt>n |x | \ak \l/k is greater than 1, and for infinitely
many values of k > 0 we have \ak \ \x\k > 1, namely, the general term akxk of the
series does not tend to zero: The series ^2akXk diverges. Conversely, if |jc| < rj
(this can happen only if rf > 0!) we can choose |jc| < r < rj, and from
lim supr\ak\l/k =r- lim sup\ak\l/k < 1
we infer that for some large /V
supr\ak\}/k < 1.
k>N
284 6. Analytic Functions and Elements
Hence |fl* \rk < 1 for all k > N and
\akxk\ = \ak\rk(^\ <^\0 (k -* oo).
This shows that the general term of the series £ 0*** tends to zero, and the series
converges. ■
The letter x will here be used for a variable element of B<r/, while the capital
X denotes the indeterminate. When rj = 0, the power series converges only for
x = 0. Hence we shall mainly be interested in power series / for which r/ > 0.
Definition. A convergent power series is a formal power series f with r/ > 0.
Comments. (1) Let f{X) e K[[X]] be a convergent power series. If K' C £lp is
a complete extension of K, then / can be evaluated at any point x e B<r/(K').
The convergent power series f(X) defines in this way a continuous function (still
denoted by /)
f : B^iK')-+K'
because it is a uniform limit of continuous polynomial functions
fy : x b+ ^T anxn.
Usually, we shall simply write the condition |jc| < r/, assuming implicitly that the
element x is taken in K, Cp, or even Qp.
(2) If rf > 1, / is a restricted series, and by Theorem 1 in (V.2.4), it is strictly
differentiable, with a derivative given by numerical evaluation of the formal
derivative: f(x) = (Df)(x) (|jc| < 1). A similar result holds for any convergent power
series: If |jc0| < rf, then the restricted power series g(X) = f(xoX) has the
preceding property, and we conclude that
/'(*) = (/>/)(*) (\x\<rf).
(3) Observe that a radius of convergence rf > 0 does not necessarily belong to
\KX\. If 0 < rf $ |KX|, then the sphere |jc| = rf is empty: Bsrf = B<r/. These
subtleties disappear when we take x in the universal field £lp, since \QP\ = R>o*
Either the series converges at all points of the sphere {x e Qp : |jc| = /"/} or it
diverges at all points of this sphere.
Examples. (1) The radius of convergence of the series J2n>0 Xn is r/ = 1. This
series diverges at all points of the sphere |jc| = 1, since its general term does not
tend to 0.
1. Power Series 285
(2) More generally, any series / = 5Zn>o anXn € ZP[[X]] has a radius of
convergence r/ > 1. Interesting examples are supplied by the expansions
fa(X) = T(a)xneZp[[X]]
for fixed a e Zp. We also denote by (1 -h X)a the formal power series fa(X).
(3) When a series 5Zn>o fl«^" converges on the sphere |jc| = 1, it is a restricted
series (V.2) and rj > 1. Here is an example with r/ = 1. Consider 5Zn>o PnXp",
which obviously converges when |x\ = 1. Since \pn\l/ff = \p\n/p" -> 1, we have
rf = 1 by Hadamard's formula.
(4) The radius of convergence of a series can be 1 even when the coefficients
are unbounded or when \an\ -> oo. The series X^>o Xp"/pn illustrates this
possibility. As in the previous example rj = 1, since
\l/pn\l/P" = \p\-n/p" ^ 1.
This series converges only if |jc| < 1: It obviously diverges if |jc| = 1.
Proposition 2. Let f and g be two convergent power series. Their product
fg (computed formally) is a convergent power series, and more precisely, the
radius of convergence of fg is greater than or equal to min(r/, rg). Moreover,
the numerical evaluation of the power series fg can be made according to the
usual rule
(fg)(x) = f(x)g(x) Qx | < min(rf, rg)).
Proof. All statements are consequences of (V.2.2). ■
Corollary 1. Let r > 0. The set of power series f = ^anXn such that
\an\rn -> Ois a ring, and for each x e Bsr the evaluation map f h> f(x) is a
homomorphism of this ring into the base field K. ■
Corollary 2. For any polynomial /, the radius of convergence of the composite
f ° g is > rg and
(f o g)(x) = f(g(x)) (\x\<rg).
Proof. If |jc| < rg, taking / = g in the preceding proposition, we obtain g2(x) =
S(x)2 and by induction gn(x) = g(x)n (n > 0). Taking linear combinations of
these equalities, we deduce
ifog)(x) = f(g(x)) (\x\<rg)
for
any polynomial f. ■
286 6. Analytic Functions and Elements
The possibility of evaluating a composition fog according to the same rule
will be established for general power series in (1.5).
Proposition 3. The radius of convergence of f = 5Zn>0 anXn and ofits
derivative Df — 5Zn>i rcflnX"~] are the same: r/ = rDf.
Proof. Let us prove this proposition when the field is either an extension of Qp or
an extension of R with the normalized absolute value. We know that
±<\n\<n (neN)
and also
n±l/n -► 1 (n -* oo).
This proves
which concludes the proof. ■
Although / and Df always have the same radius of convergence, their behaviors
on the sphere |jc| = r/ may differ. For example, the radius of convergence of the
series / = $^n>0 xP" is r/ = 1. This series diverges on the unit sphere, but the
derivative Df = 5Zn>o Pn*p"~l converges at all points of the unit sphere.
Example. The series
log(l-hX) = J](-ir-1X7n,
1
= £(-i)n~1xn~1 = £(-i)nx"
0+x> »>1
have the same radius of convergence, since the second one is the derivative of the
first. Obviously, the radius of convergence of the second one is r = 1; hence the
radius of convergence of the logarithmic series is also 1 (compare with (V.4.1))-
Direct inspection (V.4.1) shows that the series log(l -h X) diverges when \x\ = h
while the series f(X) = expX diverges on the sphere |jc| =rp.
1.3. Formal Substitutions
In this section we study the composition of power series /(X), g(X) e K[[X]\-
In order to be able to substitute X = g(Y) in the power series f(X), it is essential
to assume that the order of g(X) is positive. This assumption is represented by any
of the following equivalent notations:
co(g) > 1, s(0) = 0, g(X) e XK[[X]].
1. Power Series 287
Then co{gn) > n, and if f(X) = £n>ofl"X'\ then
figtY)) = Sflnfe(^))" = X>^
is well-defined, since the family (an(g(Y))n)n>0 *s summable: The determination of
any coefficient cn involves the computation ofat most a finite number of am (g(Y))m
(m < n) and their coefficients of index at most n in each of them. We thus define
the composite power series by
(fog)(Y) = J^cnYneK[[Y]].
The substitution X = g(Y) furnishes a homomorphism
f(X) h> {fogXY) = f(g(Y)) : K[[X]] -+ K[[Y]]
sending 1 to 1 and continuous for the metrizable topology having the ideals
X*tf[[X]] as fundamental neighborhoods of 0, since
co(f)>k=>co(fog)>k.
For a fixed power series g of positive order, the identity of formal power series
(/1/2) og = (/io g)(f2 o g)
is easily verified. Hence f2og = (/ o g)2, and by induction
fnog=(fog)n (n>l).
Observe that the exponents are relative to multiplication and not to composition.
Iteration of composition is represented by
£°(2) =g°g, g°{n) = gogo---og.
» v '
n factors
Also distinguish the multiplication identity / = 1 (constant formal power series)
and the composition identity g = X = id:
foX = f, Xog = g {feK[[X]l geXK[[X}]).
For example Xo(n) o / = /, but Xn o / = /" (n > 0).
Proposition. Let g and h be two formal power series with positive order. Then
for any formal power series f we have
(fog)oh = fo{goh).
Proof. Both sides are well-defined. They are equal when f(X) = Xn, since fog =
£" in this case (the observation made just before the proposition is relevant). Hence
288 6. Analytic Functions and Elements
the statement of the proposition is true by linearity for any polynomial /. Finally,
in the general case let f(X) = ^2n>0cinXn. Then
(fog)oh = (j2<*ngn)oh = J2a-(Snoh)
\H>0 / H>0
Theorem 1. Let f(X) = J2n>oan^n be a formal power series. The following
properties are equivalent:
(i) 3ge K[[X]] with £(0) = 0 and (f o g)(X) = X.
(ii) ao = /(0) = Oandar = /'(0) ^ 0.
When they are satisfied, there is a unique formal power series g as required by
(i), and this formal power series also satisfies (g o f)(X) = X.
Proof. (0 =» (ii) If g(X) = £m^ bmXm, then the identity (/ o g)(X) = X can
be written more explicitly as
^«(Xr=fl0 + fl,^X + X2(-) = X.
In particular, ao = 0 and a\b\ = 1; hence ax ^ 0.
(ii) =» (0 The equality (/ o g)(X) = X requires that axbx = 1 and that the
coefficient of Xn in axg(X) + - - • + ang(X)n vanishes (for n > 2) (indeed, the
coefficient of Xn in amg(X)m, whenever m > n, vanishes). This coefficient of Xn
is determined by an expression
a\bn + Pn(a2, ... ,an\bu ..., bn-i)
with known polynomials Pn having integral coefficients (not that it matters, but
these polynomials are linear in the first variables ax; cf. (V.4.2)). The hypothesis
ax ^ 0 e K makes it possible to choose iteratively the coefficients bn according to
bn = -a~lPn(a2,...,ari;bu...,bri-l) (n > 2).
These choices furnish the required inverse formal power series g.
Finally, if / satisfies (ii) and g is chosen as in (j), then bo = 0 and b\ =
\/a\ ^ 0, so that we may apply (i) to g and choose a formal power series h with
(g o h)(X) = X. The associativity of composition shows that
HX) = (/o£) o fc(X) = / o (g^h)(X) - /(X).
id id
This proves g o f(X) = go h(X) = X.
m
1. Power Series 289
We still need a formula for the formal derivative of a composition. The identity
D(fg) = (Df)g + fDg (fgeK[[X]])
is well-known and easy to check. In particular, if / = g, we see that D(g2) —
2g Dg. By induction
D(gn) = ngn~lDg (geKUX]]) (n > 1)
and by linearity
D(f o g)(Y) = Df(X) Dg(Y) = Df(g(Y)) Dg(Y)
for all polynomials / e K[X].
Theorem 2 (Chain Rule). Let f and g be two formal power series with
g(0) = 0. Then the formal derivative of f o g is given by
D(f o g)(Y) = Df(X) Dg(Y) = Df(g(Y)) Dg(Y).
Proof. Fix the power series g and let / vary in K[[X]]. Then
co(f) > k => co(f og)>k => co(D(f og))>k-l
as well as
co(f) >k=> co{Df) >k-l=>co \Df(g(Y)) Dg(Y)] > * - 1.
The identity D(f o g)(Y) = Df(g(Y)) Dg{Y), valid on the dense subspace of
polynomials / e K[X], extends by continuity to / e /^[[X]]. ■
Application. Let us come back to the formal power series
of order 0 and
log(l + X) = X - \X2 ± ■ ■ ■ = Y.«>i ^f^Xn
of order 1. Their formal derivatives are respectively
D(ex) = 0+ 1 + X + iX2 + • • ■ = ex,
D(iog(i + x)) = i - x + x2 t • • • = Vc-iy-'x"-' = —l—.
For the composition, let us introduce here the formal power series of order 1
e(X) = ex-l = £„>i ±Xn, De(X) = D(ex) = ex.
290 6. Analytic Functions and Elements
The composite
log(ex) = log(l + e{X)) = X + J2 C*X*
is well-defined, and its formal derivative is
1
D(log(ex))
l+e(X)
i + E^q**-1.
Comparison of these two expansions gives
0 = kckeQ, ck = 0 (k> 2),
and this proves
log(ex) = log(l+KX)) = X.
The formal power series e(X) is the inverse for composition of log(l 4- X): By the
last assertion of Theorem 1 we also have e{X) o log(l + X)= X, namely
expologd-hX)- l=X,
or equivalently
1.4. The Growth Modulus
Let / be a nonzero convergent power series with coefficients in the field K. For
|jc| < rj we have f(x) = X!n>o an*n and hence
\f{x)\ < maxl^l.
Although the sphere \x\ = r is not compact, / is bounded on this sphere:
l/(*)l<|ajrm (W=r),
for some m > 0. Let us say that a monomial |flmjc'"| is dominant on a sphere
\x\ = r(< rf) when
Wn\rn < \am\rm for all n ± m.
In this case, this monomial is responsible for the absolute value of /,
\f(x)\ = \amxm\ = \an\rjn (\x\ = r),
which is constant on the sphere. When r is small enough, there is always a dominant
monomial: If m = co(f) is the order of /, we have |/(jc)| = \amxm\ for all
sufficiently small values of |jc|.
1. Power Series 291
Definitions. (1) The growth modulus of a convergent power series f is defined
by
Mrf = Mr(f) = max \a„ \r" (0 < r < rf)
so that r \-> Mrf is a positive increasing real function on the interval
flU,) CR.
(2) We say that r e [0, rj) is a regular radius for f if the equality Mrf =
\an\rn holds for one index n = n(r) > 0 only The monomial \an\rn oranxn is
called the dominant monomial for that radius.
(3) When there are {at least) two distinct indices i ^= j such that Mrf —
\a( \rl = \aj |rJ, we say thatr is a critical radius and the monomials |fl,- \rl — Mrf
are called competing monomials.
By definition
l/(*)l<Mr(/) if|x| = r<r/f
and this inequality is an equality \ f(x)\ = Mr(f) for all regular radii r. If a0 ^ 0,
then r = 0 is regular and \f(x)\ — |a0| for small |jc|. The positive critical radii
satisfy rl~j = \aj/at\ e |^x|: They are roots of absolute values of elements of
K. A critical radius of a power series with coefficients in K is the absolute value
of an algebraic element (over K).
When the coefficients^ e K are given, it is easy to sketch the curves r h> \an\rn
{n > 0) and their upper bound Mrf on the given interval. This upper bound is a
continuous convex curve. Let us show that it is continuously differentiable except
at a discrete set of points of the interval [0, rf).
Mrf( = |f(x)|)
»i|r
r= x
The growth modulus: r h^ Mrf
292 6. Analytic Functions and Elements
Classical Lemma. Let cn > 0 and 0 < R < oo be such that for every r < #
cnrn —>► 0 (<zs n —> oo). 77ien
r i-> M(r) = sup cnrn = max cnrn
is a continuous convex function on the interval I = [0, R) that is smooth except
on a discrete subset A = {rr < r" < rm < ■ - •} c /. Between two consecutive
values of A, M coincides with a single monomial cmrm.
Proof. Let 0 < r < R. Since cnrn -> 0 (n -> oo), there is an integer m > 0 with
cmrm = max cnrn = M(r).
If N > m and 0 < 5 < r, then
Qvr < cmr ==> — • r < 1 ==> — ■ .y < i ==> c^s < cms .
Hence only finitely many monomials, namely those for which N < m, can compete
with cmsm for s < r. The critical radii s < r are among the finite set of solutions
of
s^1 = — (0 < i < j < m).
The set A is either finite—possibly empty — or consists of an increasing sequence
converging to R. ■
This proves that a nonzero convergent power series / has only finitely many
critical radii smaller than any given value r < jy and the set of regular radii of /
v$> dense m [0, rf).
In the following commutative diagram, we denote by E the union of the critical
spheres in the open convergence ball of / and by [0, /y )reg the subset consisting
of regular values.
B<r/ D B<r/ — £ —> K*
ii.i ii.i ii.
M f
[ar/) D [0,/y)reg M R>0
Examples. (1) The power series J2n>0xn = 1/(1 - jc) and I^o*"/"1 = *
have no critical radius. This is obvious for the first one, and follows from the prooi
of Proposition 1 (V.4.2) for the second one (this can also be seen as a consequence
of the fact that the exponential has no zero (2.2)). On the other hand, if the set
{|fl„|r"} is unbounded on the real line, then there exists a sequence of critical radii
H / rf (exercise).
1. Power Series 293
(2) If / is a restricted power series, then My(f) is the Gauss norm of /. This
suggests that the maps / \-+ Mr(f) are norms on a suitable subspace of K[[X]]
depending on the value r > 0. This is indeed the case.
Proposition 1. When \ K x | is dense in R>o and f e K{X], the Gauss norm of
f and the sup norm of the function defined by f on the unit ball AofK coincide.
In other words (cf. (V.2.1)), the canonical homomorphism
K{X}->C(A;K)
is an isometric embedding.
Proof. The Gauss norm of / is Mi /, and the inequality
sup|/(Jt)|<A/,/
|x|<I
holds in general. With our assumption, we can choose a sequence xn e K with
\xn\ regular and |jc„| / 1; hence we have
i
Mxf = supMr f < sup|/(x)|. ■
r/\ xgA
Proposition 2. When r > 0 is fixed, f h> Mr(f) is an ultrametric norm on
the subspace consisting of formal power series f(X) = ^2anXn such that
\an\rn -► 0(n —* oo). This norm is multiplicative, i.e., Mr(fg) = Mr(f)Mr(g)
when f and g belong to this subspace.
Proof. If / ^ 0, then one an at least is nonzero, and Mr{f) > \an\rn > 0, since
r > 0. Hence Mr is a norm on the subspace considered. Moreover, the equality
Mrifg) = Mr(f)Mr(g)
is true if r is a regular radius for /, g, and fg, since it is the common value (V.2.2)
\fg(x)\ = \f{x)\\g{x)l (\x\ =r,xe Qp).
The general result follows by density of regular values and continuity of the maps
r m> Mr(f), r m> Mr(g), r \-+ Mr(fg). m
In the classical case, a complex function with an infinite radius of convergence
Js an entire function. The only entire functions that are bounded on C are the
constants. This is the theorem of Liouville. There is an analogous result in p-adic
analysis.
294 6. Analytic Functions and Elements
Theorem. Let the power series f e K [ [ X] ] have infinite radius of convergence
If the function \f\ is bounded on K and \K*\ is dense, then f is a constant
More generally, if\f(x)\ < C\x\N for some C > 0, N e N, andallx e Kwith
\x\ > c, then f is a polynomial of degree less than or equal to N.
Proof. It will suffice to prove the second, more general, statement. Write f(x) =
^2 anXn as usual. We have
\an\rn<Mrf = \f(x)\lxl=r<CrN,
provided that r > c is a regular radius of /. By the lemma and since | K x | is dense
in R>0, this happens at least for a sequence of values r, = \xj | —> oo, Xj e K,
KI<Crf-".
Letting j —> oo, we get an — 0 for all n > N. This proves that / is a polynomial
of degree at most N, as claimed. ■
There are many entire functions that are bounded on Qp, just as there are many
entire functions bounded on R (e.g., polynomials in sinjc and cos jc).
1.5. Substitution of Convergent Power Series
Let f(X) = £n>ofl"A:'\ g(X) = £m2l bmXm be two convergent power series
with g(0) = 0. The formal power series (/ o g){X) = 5Z*>o c^k w^turn out t0
be convergent, too, and we intend to prove the validity of the numerical evaluation
if o g){x) = f{g{x)\
^2,ckxk =^an (llm>ibmxm)
*>0 n>0
when |jc| is suitably small.
In order to be able to substitute the value X = g(x) in the formal power series
f(X), it is necessary to assume that this is small: \g(x)\ < rf will do. But even
if g(x) ~ 0, namely, x on a critical sphere of g, \x\ might be too big to allow
substitution in fog. Recall that critical spheres occur when several monomials
are of competing size. The circumstance g(x) = 0 does not prevent a few individual
monomials to be large, and thus have an influence after rearrangement of these
terms. On the other hand, the power series fog converges in a ball, and it would
be unreasonable to expect to be able to take advantage of the single fact that "#(*)
small" i.e., x close to a root of g, and hence x on a critical sphere of g, while
sup |g| on this critical sphere is Mr{g) (r — \x\). This explains the reason for the
hypotheses made in the following theorem.
Theorem. Let f and g be two convergent power series With £(0) = 0. //
\x\ < rg and M\x\(g) < r^, thenrjog > |jc| and the numerical evaluation of the
1. Power Series 295
composite fog can be made according to
if o g){x) = f(g(x)).
Proof. Assume that x e K (or Qp) satisfies the assumptions and define r = \x\.
Then recall that if f(X) = J^n^0an^n and£(F) = £^, bnYn, the formal power
series (/ o g)(Y) = Ylk>ockYk *s obtained by grouping equal powers in the
expansion of ]Cn>ofl«£(^)" (tms *s a double series). Define the polynomials
fN(X)= J2 a«xn-
The substitution
(fNOg)(x) = fN(g(x))
is valid if |jc| < rg by Corollary 2 of Proposition in (1.2). Let y = g(x). Since
lvl = IgOOl < Mr(g) < rf, we have /w(y) -> f(y), and here is a diagram
summing up the situation:
fN(g{x)) —> /(*(*)) (/V^OO)
|| <— polynomial case for fN (1.2)
(/jvo*)(x) -* (fog)(x) (N-+OQ).
Introduce
(fNog)(Y) = J^c'k(N)Yk,
((f-fN)og)(Y) = J^4(N)Yk,
k>N
so that the coefficients c* of / o g are
Recall that
(/ o g)(Y) - (fN o g)(Y) = ^ ang{Yf = ]T 4(N)Yk
is obtained by grouping the monomials having the same degree. Any monomial
of g(Y)n is a sum of products of n monomials of g(Y). When we evaluate it at a
point y with M\y\g = p, the ultrametnc inequality shows that its absolute value is
less than or equal to pn. This is where we use the assumption Mr(g) < rf in its
full force: Choose y e Cp with
(a) \x\ —r < |y| < rg, so that g(y) is well-defined,
(k) lg(y)| < M\y\g = p < ry, so that /(g(y)) is well-defined
296 6. Analytic Functions and Elements
(this is possible by continuity of t \-^ Mtg and Mr(g) < rf). Our previous
observation gives
k*(A0/l < sup \an\pn -> 0 (N -> oo)
because p <rf. This shows that the sequence (ckyk)k>o lies in the closure of the set
of sequences (c'k(N)yk)k>o (N > 1). But for each N, the sequence c'k(N)yk -> 0
(k -> oo) and the space c0 of sequences tending to 0 is complete (IV.4.1). This
proves
ckyk -* 0, rfog > |y| > \x\,
and also
(/o£)(y)-(/*os)(y)->0. ■
Example. Take K = Qp and consider the formal power series
fix) = J2X" = j-rj> g(Y) = y-yp-
Takearootf e \ip-\. Then g(£) = 0, so that f(gtf)) = /(0) = 1 is well-defined.
But rfog = 1, and the power series of / o g is not convergent on the unit sphere,
so that (/ o g)(r) is not defined. Here for r = |f | = 1, Mr(g) — 1 is not less than
rf = 1, and the substitution is not allowed (cf. exercises for the case p = 2). This
example also shows that for fields K having a discrete valuation, the condition on
balls g(B<r(K)) C B<irf is not sufficient to allow substitution: If (1 < r < />)>
y eZp = £<r(0; Qp), we have y = y* (mod pZp) hence \y - yp\ < Ipl < l
and thus g(B<r(Qp)) = ^(B<i(Qp)) C £<i- But although / converges in the
open unit ball, we cannot find a power series representing the composite / o g in
the ball £<i, since the rational function
1
1 - y + yp
has poles at the roots of 1 — y + yp = 0. These poles are located on the unit sphere
of a finite extension of Qp, and no power series can represent this rational function
on the sphere r = 1 (cf. exercises).
Another quite interesting example where the composition (/ o g)(x) is well-
defined but different from f(g(x)) will appear in (VII.2.4).
Application. As proved in (1.3), the formal power series f(X) = log(l + X) and
g(X) = e(X) = exp X — 1 are inverses of each other. By (1.5),
\og{ex) = x {\x\ <rp\
1. Power Series 297
since M]x\(e(-)) = \x\ and \x\ = r < rp < rXog = 1. Similarly,
explog(l +x) = 1 +x (\x\ < rp),
since Mjjc|(log(l -f- •)) = 1*1 for \x\ = r < rp, and rp is the radius of convergence
of the exponential. This is a second, independent, proof of the fact that exp and
log are inverse isometries in the open ball B<rp(K) (Proposition 2 in (V.4.2)).
1.6. The Valuation Polygon and its Dual
The study of
Afr = Afr(/) = sup|fl„|r"
«>o
is best made using logarithms. We shall use Greek letters for these logarithms:
p =logr <pf = logrr,
an =logK|,
/xp = log Mr = sup (np + an).
H>0
It is convenient to choose the log to the base p in order to have log p = 1 and
an = log \an\ = -oidp(an) = -vn.
The function /xp is a convex function as the sup envelope of affine linear functions.
It is apiecewise linear function, since the critical radii (and their logarithms) occur
on a discrete subset. Its opposite
-/xp = inf (vn - np)
H>0
is a concave function. When |jc| = r = pp is a regular radius, we have
-/xp = ordp/(x).
Definition. The function
p h-> hp := inf(vn — np) (—oo < p < Pf),
77 >0
or its graph, is the valuation polygon of the power series f.
{a) Let atXl be the dominant monomial between two consecutive critical radii,
say 'r <r < r'. Then
hp = -lip = inf(vn - np) = v, - ip
298 6. Analytic Functions and Elements
is affine linear in the corresponding interval
fp = log r < p < p' = log r'.
This gives a side of the valuation polygon. The valuation polygon, or the graph of
p m> hp := inf^o^H ~ ^p), is the boundary of the convex intersection of lower
half-planes determined by the lines of equations
A* : p^ vn- np.
The slope of A* is — n, and this line passes through the point (0, vn). The segment
of A* above the interval ['p, p'] is a portion of the boundary — a side — of this
convex region.
(b) If the dominant monomial just beyond the critical radius r' is cijXJ, then
i < j are the extreme indices for competition of the monomials
(rV-| = k/fljl=p-,4+,'J,
p' = logr' = ^i.
j-i
(c) From the definition hp = \nfn>p(vn — np) (-co < p < pj) we infer
successively
hp <vn-np (p, n >0),
hp+np < vn (p, n > 0),
sup(hp + np) <vn (n> 0).
p
The function
n »->► sup(fcp -h np)
p
is a piecewise linear convex function whose graph gives the boundary of the convex
intersection of upper half-planes containing all points Pn = («, vn). The line of
equation
Ap : nh+ hp-\-np
has slope p and passes through the point (0, hp). This gives a method for computing
fcp = —/xp for a fixed value of p. The value vn — np is geometrically the height
above the origin of a straight line of slope p going through the point Pn = («, v«)-
One can draw the graph of the function n »->► ordpfln of an integer variable
consisting of the points Pn — and look for the lowest line of slope p going through
these points. The height above the origin of this lowest line gives the value hp-
Letting now the slope p vary, this construction furnishes the desired convex hull
of the points Pn (or of the graph of n m> vn = ordp an).
1. Power Series 299
vn=ordpan
critical slope
01 i j n
The Newton polygon: convex envelope of the Pn
Definition. The function
n h> sup(vn + np) (-oo < p < pf),
p
or its graph, is the Newton polygon of the power series f.
The Newton polygon is the boundary of the sup convex envelope of the points
pn = (n, vn) (n > 0).
A few conventions are useful at this point. When a coefficient an vanishes, its
valuation vn = oo, and the corresponding point Pn is at infinity above all other
ones. For example, the Newton polygon has a first vertical side at m = ord(/),
least integer m with am ^ 0. If / is a polynomial, it also has a last vertical side at
n = deg (/) (since all P„'s are at oo when n > deg /).
The two polygons constructed are duals of each other. The sides of one
correspond to vertices of the other. For example, a side of the Newton polygon
corresponds precisely to a slope of a lowest contact line going through two distinct
points Pn. This situation occurs when two monomials have competing maximal
absolute values, namely when this slope p is the logarithm of a critical radius: The
valuation polygon has a vertex, and the graph of r h^ Mrf exhibits an angle at
the corresponding value of the radius r. More formally, from
hp = inf (y„ - np)
/i>0
(P < Pf)
We infer
hp <v„- np (p < pf, n> 0)
with equality for at least one index n. Equivalently,
hp + np < vn (p < pf9 n> 0),
sup (hp -f rcp) < vn (m > 0).
p
300 6. Analytic Functions and Elements
Duality of the Newton polygon and the valuation polygon
The affine lines
Ap :t h* hp+tp,
which are below all Pn, have a sup that is the Newton polygon. Dually, the affine
lines
A*:p^ -(np + ctn) = vn-np
have an inf that is the valuation polygon.
This notion of duality is developed in convexity theory. It has numerous
applications:
differential geometry (contact transformations),
variational calculus, classical mechanics, ....
Algorithm. Here is an efficient procedure to find the critical radii of a power
series f(X) = Y^n>oan^n- Let v« := or^pan anC* plot the points Pn = (h> vn'm
Determine the convex envelope of this set of points (and of Poo — (0, oo)). The
vertices of this convex envelope correspond to dominant monomials, those
responsible for \f\ = Mr between two critical radii, and endpoints of sides correspond
to competing monomials (responsible for a critical radius) with extremal indices.
1. Power Series 301
If pt and Pj (i < 7) are two endpoints of a side of the Newton polygon, the slope
p' of this side
P' = ^ = l0g/
J -*
corresponds to the critical radius r' for which the two monomials atX1 and a3X3
are the extreme competing monomials 10,-Kr')1 = \cij\(r')J,
(r'y-i = \ai/aj\=pv'-v', r'= p'.
For p / p', the point Px is the only contact point of the line Ap of slope p defining
the Newton polygon
hp = Vi - ip (p / p'\
For p \ p', the point Pj plays a similar role:
hP = vj - JP (P \ P')«
Example 1. Consider an Eisenstein polynomial (II.4.2)
f(X) - Xn + an^Xn-x + - + floe Z[X],
where p \ at (0 < j < n) and ao is not divisible by p2. These assumptions mean
that
ordptfo = 1, ordp a,- > 1 (1 < i < n),
so that the Newton polygon of / can be drawn (see the figure).
°rd p(an)
critical slope : p = - 1/n
critical radius : r = pp = | p |1/n
The Newton polygon of an Eisenstein polynomial
302 6. Analytic Functions and Elements
Example 2. Let us treat the case of the power series
f(X) = lpg(l + X) = £„>, ^X».
We have
ordp an = 0 if 1 < n < p, ord^ a^ = — 1,
and
— 1 < ord^ g„ < 0 if p < n < p2,
ordD(l/n)
slope : -l/(p3 - p2)
The Newton polygon of the logarithm
The vertices of the Newton polygon are the points
Pl =(1.0), Pp = (p,~l), Pp2 = (p2,~2), Ppi=(p\-3),
The successive slopes of the sides are
-1
1
P2~ P
p3- p2
p-\
They correspond to critical radii
p~~pz* < p p2~p < p p1-p2 < ..
and we recognize the sequence
(^0).
(-► 1),
rP = \p\*-1 <r'=rxplp <r\
_ r\/P2
p p
(->!)•
Between two consecutive critical radii, the absolute value of the logarithm
coincides with the absolute value of the dominant monomial. We already know that
|log(l+x)| = |x| (0<\x\<rp),
1. Power Series 303
— the isometry domain of log (inverted by exp) — and see further that
\xP\
P
rp < |log(l+Jc)| =
= p\x\p < prp (rp < \x\ <r'p),
where r' = rl/p is the next critical radius. Quite generally,
pj~lrp < |log(l+x)|
PJ'MpJ <PJrp
for
rj"" <\x\<rW
Here we see how [ log(l + x)\ increases: We already knew by (V.4.4) that it can
be arbitrarily large, since log : 1 -\~Mp -> Cp is surjective. On the other hand, the
zeros of log(l -\~x) can occur only when \x | is equal to a critical radius. This gives
an independent proof of (II.4.4) for the estimates of |f — 11 when f e iip™.
L7. Laurent Series
Let us show how the preceding considerations extend to Laurent series. Let
oo
/ = J2 a"x" = E a"x"
neZ -oo
be such a series with coefficients in the field K. Thus we consider this series as a
sum of two formal power series
/ - /-+/+ = J2 a"x"+E a"x"
n<0
n>0
with /+ e K[[X]] as before, and /~ = £„<0 anXn ^[[X^1]] has zero
constant term. Convergence requires |jc| < r"j = r/+ = 1/lim^^oo \an\x/n for the first
°ne and similarly \x~x \ < 1/ lim^oo \a-n \x/n for the second one. Let us define
rf := limn^oo \a_n
|l/n
and let us assume rf < ft, so that we have a common open annulus of convergence
rf < \x | < rt and hence a continuous (strictly differentiable) function — still
denoted by / — in this annulus of K (or in any complete extension of K).
The absolute value of this function / is bounded on a sphere \x\ = r (where
rf <r < rp,
\f(x)\< sup \an\rn (\x\=r),
—00</1<OC
and is even constant on this sphere, provided that a single monomial dominates
aU the others. In this case we say that it is a regular radius. We again define the
304 6. Analytic Functions and Elements
growth modulus of /,
MTS = Mr(f) := sup \an\rn (rj<r< r+),
—oo<n<oo
as a positive real function on the interval (rj, ri) C R. A critical radius r is a
value rj<r<rt such that for two monomials (at least)
Mrf = Iflilr' = IflylH (-00 <i < j < 00).
The critical radii make up a discrete subset of the interval (rj, ri), and regular
radii are dense in this interval.
The growth modulus r h^ Mrf of a Laurent series is a convex function but
is not necessarily increasing. Taking the log to the base p, define p = logr,
Mp/ = log Mr/. Then
py-+hp = iLpf (logr~ <c p < logrj)
is a concave piecewise linear function (inf envelope of affine linear ones). It is the
valuation polygon of the Laurent series /. All these facts are established exactly
as in the case of power series.
Laurent series can also be multiplied in a common annulus of convergence. Let
us indeed start with the case of Laurent polynomials. If
p = Y2a»X"< 1 = J2b"x" e K[X, X~ll
finite finite
then their product is the polynomial pq = J] cnXn having coefficients
Cn= Y2 <*kbi3 knl < sup \akbi\ (n e Z).
k+l=n k+l=n
With the Gauss norms of p and q (sup norms on the coefficients) we have
\cn\ < \\PW-\M (neZ)
and consequently
\\pq\\<\\p\\'\\q\\.
The product operation is (uniformly) continuous: It extends continuously to the
completion K{X, X~1} with the same inequality. This completion consists 01
Laurent series Yl-oo<n<ooa"Xn> wnere \an\ -> 0 for both limits n -> 00 and
n —> — 00: These are called restricted Laurent series.
In particular, the powers of a restricted Laurent series are again restricted Laurent
series, and if f e K{X, X~1}, thenxmfn e K{X, X'1} for all m eZandneN-
More generally, if /is a convergent Laurent series and rf < \a\ < rt,theng(X) ^
f(aX) is in K {X, X~l}, and the same results are established for convergent Laurent
series instead of restricted ones.
2. Zeros of Power Series 305
2. Zeros of Power Series
2.1. Finiteness of Zeros on Spheres
Let K be a complete extension of Q^ (in Cp or £2p)y
K D AD M, A/M = k : residue field.
Select a nonzero convergent power series f(X) = ^2,n>^anXn e K[[X]]: rj > 0.
If f(a) = 0 for some a e K*, \a\ < r/, then r = \a\ is a critical radius: Indeed,
|/(a)|=0<Afr/:=sup|flfI|r,,^0
(cf. (1.4): r = 0 is critical precisely when ao = f(0) = 0).
We have already obtained an illustration of this fact in the study (1.6) of
|log(l +x)| for |;c | < 1: the zeros of log occur on the critical spheres, centered at
1, containing pth-power roots of unity (V.4.2).
Proposition. Let f e A{X] be a restricted power series. Let a e A. Then there
is a formal power series g such that
f(X) = f(a)-)r(X-a)g{X).
Moreover, g e A{X} and rg > r/.
Proof. Replace / by MX) = f(aX), and hence fx e A{X] (if \a\ < 1 we even
haver^ = rf/\a\ > 1). Hence we only have to consider the typical case a = l.We
write / = Y.n^anXn (an e A, \an\ -* 0), and we have to find g = Y,n^bnXn
with
/(X) = /(1) + (X-1)*(X).
Comparing coefficients, we find the conditions
oq = /(l) - bo, an = bn-X -bn (w > 1),
or b0 = /(i) — a0 = J]i>oG<"' bn = K-i ~ an. By induction we see that
Hence |&„| -> 0 and g e A{X}, as desired. If rj = 1, we are done. If r/ > 1, take
any r > 1, r < r/, so that 1^,-lr1 -> 0. Hence there is a constant c > 0 such that
Ic-lr' <c, k-| <c/r' (i >0).
Hence \b„\ < sup£->n \at\ < csup£>n 1/r1 = c/rn+1,
l*nkn<c/r (n>0).
306 6. Analytic Functions and Elements
Since the sequence (\bn\rn)n>o is bounded, rg > r. Letting r increase to r*, we
see that rg > sup1<r<r r — r/ (compare with Theorem 1 in (V.2.4)). a
Theorem (Strassman). A nonzero restricted power series f e A{X] has only
finitely many zeros in A.
Proof. (1) Zeros on the unit sphere. Assume / — ^2n>0anXn ^ 0 and define
/x := min{n : \an\ = sup |<zr-|} < v := sup{n : \an\ = sup |<zr-|},
i i
so that /x < v. If /x = v, then / has no zero on the unit sphere. We are going to
show more precisely that / has at most v — /x zeros (counting multiplicities) on
the unit sphere. Suppose v > 1 and f(a) = 0 for some ggAx, namely \a\ = 1.
Write
f = (X~a)g, geA{X}.
By the definition of the extreme indices /x and v, when we reduce the coefficients
modM,
f(X) = (X-a)g(X)ek[X],
we find that
deg / = 1 + deg g, co(f) = co(g),
since a ^ 0 (co denotes the order as in (1.1): first index of a nonzero coefficient),
v = 1 + vg, /x = jj.g.
This proves that
vg — fig = (v — /x) — 1 < v — /x.
But any zero b ^ a of f Is also a zero of g:
0 = /(£>) = (£> - a)«(6) =» g(b) = 0.
For example, if v = /x + 1, we arrive at vg = /x5, so that g cannot vanish on A •
In this case, / has only one zero in Ax, and v — /x = 1. If v > /x + 1» we can
repeat the procedure for g. In this way, we see that after at most v — /x steps, the
last function h e A{X] obtained will satisfy vh — /xft, hence will not vanish on
the unit sphere A x. This process leads to a factorization
f = P h, P polynomial, h e A{X},
and h does not vanish on Ax.
(2) Zeros in M. If f(a) = 0 for some a e M, namely \a\ < 1, consider
fa(X)= f(aX), for which rfa = rf/\a\ > rf > 1. By the first step, fa has a finite
2. Zeros of Power Series 307
number of zeros on the unit sphere: / has a finite number of zeros on the critical
sphere of radius r = \a\. Since / has only finitely many critical radii r <c 1, the
conclusion follows. ■
The proof has shown more precisely that the number of zeros of / in A (counting
multiplicities) is bounded by the telescoping sum of differences v — /x of exponents
of critical monomials (corresponding to the critical radii less than 1). Hence we
have obtained the following result.
Corollary. Let f = 5Zn>oG«^n e ^[(THl be a nonzero convergent power
series and assume that r < r/ is a critical radius of f. Let also
fx = min{n : \an\rn = Mrf] < v = max{n : \an\rn = Mrf]
be the extreme indices of the monomials of maximal absolute value. Then,
counting multiplicities,
f has at most v — /x zeros in Sr(K),
f has at most v zeros in the closed ball B<r(K),
f has at most fx zeros in the open ball B<r(K). ■
Remark. With the previous notation we have
*, x- f \a^ iorr-s <s < r,
Msf = {
I \av\sv forr<s<r-M
for small enough e > 0. Taking logarithms (to the base p),
/x<j — ordpa^ for a / p,
va — ox&pav for a \ p,
and v — /x appears as a difference of slopes of the valuation polygon at the
corresponding vertex.
2.2. Existence of Zeros
We keep the same notation as in the preceding section.
Theorem 1. Let K be a complete and algebraically closed extension of Qp and
f = ^2anXn e K[[X]] a nonzero convergent power series. If f has a critical
radius r < rj> then f has a zero on the critical sphere of radius r in K. More
precisely, if p. < v are the extreme indices for which \an\rn = Mrf, then f has
exactly v — /x zeros {counting multiplicities) on the critical sphere \x\ = r of
K: There is a polynomial P e K[X] of degree v — /x and a convergent power
308 6. Analytic Functions and Elements
series g e K[[X]] with
f = P • g, rg > rf, g does not vanish on Sr(K).
Proof. The result is trivial if r = 0, so we assume r > 0 from now on. Recall that
la^r*1 = \av\rv, rv~^ = \a^/av\ e \K*\. Since K is algebraically closed, there
is an element a e K with \a\ = r. Replace / by fa(X) — f(aX) having r = 1
as critical radius. This converts / into a series having a radius of convergence
rja = r//\a\ > 1, and in particular, fa e K{X}. We can similarly replace / by
the multiple f/av and assume |flM| = kM = Mr/ = 1 (and av = 1). To sum up,
it is sufficient to study the normalized situation
r = 1 < r/ is a critical radius of f e A{X] C K{X],
\all\ = \av\ = Mrf = l. \an\<\ (n>0),
\an | < 1 for n < /x and also for n > v.
We are going to show that (counting multiplicities)
/ has precisely v — /x zeros on the critical sphere Sr{K).
For |jc| t£ 1 close to 1, the absolute value of f(x) is given by
||fl jc"| = |*r if |jc| / 1 say 1 - £ < |jc| < 1,
(*)
Ifl^l = |jcr if |jc| \ 1 say 1 < |x| < 1+e-
(The first estimate is valid when |x| is larger than the largest critical radius less
than 1, and similarly, the second one is valid when |x| is smaller than the smallest
critical radius greater than 1.)
First step: Truncation. For any index r > 0 define the polynomial Px = ^n<x anx
(of degree < t) and the remainder gT = £„>r anXn. We have / = PT 4- gr> an(*
if r > v, then
Mxgz = max|an| < 1 = Mxf = MXPZ.
By continuity of the functions r y-+ MrgT and r^Mr/we infer
!>„*"
< MrgT < Mr/ (|x| = r close to 1).
If r is regular for both gT and / (which is the case if r ^ 1 is close to 1), we have
\gAx)\ = MrgT < Mrf = \f(x)\ (\x\ = r ± 1, r close to 1).
Consequently, if r > v, then
2. Zeros of Power Series 309
for the same values of \x\ = r. Choose t > v, so that aT ^ 0: deg PT = r. Since
K is algebraically closed, we can factorize this polynomial:
Pr(X) = Or [](*-£)-
More precisely, consider the partition of these roots into three subsets,
A = Ar
Ar = a;
A = Ar
roots § with |£| < 1,
roots § with |£| > 1,
roots § with |£| = 1.
Here is a table of the absolute values \x — § |, depending on § and |x| close to 1:
\x\/\
1*1 = 1
|x|\l
A : |?| < 1
1*1
1
\x\
A : |§| = 1
1
l*-fl
\x\
A' : m > 1
1*1
1*1
151
In the middle column, we see that when |x| crosses the value 1, then \x — § |
(f G A) varies from 1 to |x|. The number of roots f e A — taking multiplicities
into account — is responsible for the variation of growth of | PT |. We have
\Pr{x)\ = \at\Y\\x-$\Y\\x-S\Y\\x-S\
AAA'
(the factors of this product are repeated as many times as the respective
multiplicities require). Considering separately the cases |x| < 1 and |x| > 1, we have
\Pr(x)\
l«rll*|#A- 1 -nA'l*l if \x\=r/h
Klk|#A.|x|#A-nA'l?l ^ l*l=r\l
(1)
(where multiplicities are taken into account in the exponents—the same notational
abuse is made below). Recall (*)
|/(x)| = Mrf
lajljcr = |*r for\x\=r/>l,
\av\\x\v = \x\v for|x|=r\l.
Comparing the first lines of (1) and (2), we infer
M=#A, |or| f] 151 = 1-
(2)
310 6. Analytic Functions and Elements
Observe that if r > v, then \aT\ < 1, so that N is not empty! Comparing now the
second lines, we get
8 := #A = v - /x
independently from the index of truncation r (recall that this takes into account
the multiplicities of the roots |f | = 1 occurring in PT and is thus greater than or
equal to the cardinality of this set of roots). Since \aT\ Y\A> l£| = 1, we can now
write
\Pr(x)\=Y\\x-S\ for |x| = l, (3)
A
namely, the absolute value of PT(x) on the critical (unit) sphere is the product of
the distances of x to the roots f e A.
Second step: Convergence. Let us compare two successive truncations: if Pz ^ /,
then there is r' > r with
PAx) = J^ anxn = PT(x) + az>xx\ az> ± 0.
By the first step, the roots of the polynomial PT> on the unit sphere constitute a set
A' having the same number of elements (counting multiplicities) 8 = v — \jl as A,
and
|/V(*)l = ni*-*'I far |x| = l.
A'
In particular, if we take a root § e A of Pz, we have
n i* - ?\=i^«)i=i^'«> - p-«)i=i^r'i=i^'i-
A'
Hence for one root £' 6 A' at least, we have
ir-fi<iM1/5.
When / is not a polynomial, we can consider the infinite sequence of successive
truncations of /, which are polynomials of degrees equal to their index
T < T\ = Xf < T2 = t" <
Their sets of roots on the unit sphere
A0 = A, Aj = A', A2, ... C {x e K : |x| = 1}
have the same cardinality 8. Let us choose and fix a root f = f0 e Ao- We have
seen that we can choose a root %\ e A\ such that
I*l-f0l<|flrj1/a
2. Zeros of Power Series 311
and then a root £2 e A2 such that
Ife-fil^KI1'8, etc.
Since |a„| -» 0, this construction furnishes a Cauchy sequence (£,),->o on the unit
sphere of the complete field K. Let us call f „> its limit. By construction,
i/(&»)i =
£ «*i
< max \at \ -> 0,
/&») = /(lim &,) = lim /(&,) = 0.
This proves the existence of a root a = foo of / in the unit sphere of K. Writing
/ = (X — a)g, if v — /x > 1, we can repeat the construction of a root of g.
Eventually, we arrive at the precise statement of the theorem. ■
For example, if a convergent power series / e CP[[X]] has no zero in the open
ball B<zrf of Cp, then it has no critical radius. This is the case for f(X) = ex, as
was mentioned in (1.4) (before the Liouville theorem).
Corollary. Let f e K[[X]] be a convergent power series having no zero in
some closed ball \x\ < r ( < ry) of Ka. Then \/f is given by a convergent
power series with r\/f >r. If f has no zero in an open ball \x\ < r' ( < r/) of
Ka, then \/f is given by a convergent power series with r\/f > r'.
Proof. Let / = J^n>0anXn. since flo = f(0) ^ 0, we may replace / by f/f(0)
and assume a0 = 1. Define g = 5^n>1 anXn, so that / = 1 + g, rg — rf. The
formal power series \jf e 1 H- X K[[X]] is obtained by formal substitution (1.3)
7 = TT7°*(X)-
since co(g) > 1. For the estimate of the radius of convergence of this power series,
we may replace K by Ka, and hence assume that K is algebraically closed. By the
theorem, / has no critical radius less than or equal to r, and if / = ^Zn>o^nXn,
then |«o | > |fl„|rnforalln > 1. This shows that Mrg = maxM>! \an\rn < |a0| = 1-
Numerical evaluation of the above composition (1.5) is valid when
\x\ <zrs = rf, M\x\g < r1/(i+n = 1,
which is the case for |jc| <r, since
Mlxlg sMrg <z 1.
The same reference (1.5) also proves that r\/f > r. The second statement is
obtained by letting r / r'. ■
312 6. Analytic Functions and Elements
When K is not algebraically closed, we can still give the following factorization
result (in the following statement, /x and v have the same meaning as before).
Theorem 2. Let K be a complete extension of Qp in Qp and f e K[[X]\ a
nonzero convergent power series.
(a) If f(a) = 0 for some a e Qp, \a\ < ry, then a is algebraic over K.
{b)Ifr=\ is a critical radius off, then there is a factorization
f = cP - Q - g\, c e Kx, P, Q e A[X] monic polynomials,
P of degree v - jx, \P(0)\ = 1, Q of degree /z, Q = X** (mod M),
gi e 1 -\- XM{X} (rg > r/) has no zero in the closed unit ball
of Ka. These conditions characterize uniquely this factorization.
Proof, (a) If / has a zero a on the sphere \x\ = r in Cp (or £2p), then r is a critical
radius, and the preceding theorem shows that / has v — /z roots in Qp (counting
multiplicities). If o is a tf-automorphism of £2p, it is continuous and isometric
(III.3.2):
/(a") = /(«)"= 0, \a°\ = \a\=r.
Hence a has a finite number of conjugates contained in the finite set of roots of
/ on the sphere |x| = r of £2p. By Galois theory, this proves that a is algebraic
over K. The same argument shows that the product P = Y[* (X — £) e Ka[X]
extended over all roots of / having absolute value r (all multiplicities counted)
has coefficients fixed by all tf-automorphisms of Ka and hence coefficients in K.
This is a monic polynomial P e K[X] of degree v — /z.
(b) Define P = Y]^(X - f) e K[X], the product over all roots of / having
absolute value r = 1 (taking into account multiplicities). Hence P is a monic
polynomial of degree v — /x, and P(0) = ±n? *s a un^- Let similarly Q =
Y\^(X - f) e Ka[X] be the product corresponding to the roots of / in the open
unit ball |£| < 1, i.e., f e M. Then 2 is a monic polynomial of degree /x having
its coefficients in M except for the leading one. As before. Galois theory shows
that Q e K[X]. Now, / = PQg with a convergent power series g, rg > *7 > 1>
having no zero in the closed unit ball of Ka, hence no critical radius. If g ~
Hn^ohX*, we have \b0\ > \btxl\ for all i > 1, |jc| < 1, since there is no critical
radius in the unit ball. Hence \bo\ > |£,|, and taking c = bo ^ 0, we see that
(£,- —> 0 since rg > 1)
/ = cPG^i. *i = 1 + £>< W 6 1 + XM{X}.
i>0
For uniqueness, observe that in a factorization / = cP • Q - g\ with g\ e
1 +XM{X} (rg > r^), hence g] having no zero in the closed unit ball of K >
the polynomial cP Q is a multiple of the product of the linear factors
corresponding to the roots of / in the closed unit ball (counting multiplicities). If the degree
of PQ is v, this monic polynomial is the product Y\^(X ~ H) e Ka[X] extended
over all roots |f | < 1. If P = \\ (X — i-) is a product extended over a subset ot
2. Zeros of Power Series 313
roots, the condition \P(0)\ = | J~[? £| = 1 implies that the factors correspond to
roots |£1 = 1- The degree of P being v — /x by assumption, this product contains all
the linear factors of / corresponding to the roots on the unit sphere. Consequently,
Q is the product of the linear factors of / corresponding to the roots in the open
unit ball. ■
Remarks. (1) Under the assumptions of (b). if / has its coefficients in A, and
not all in M, then the constant c is a unit, and by reduction mod M, the equality
/ = cPQg\ leads to / = cPQg\ = cPX11, since g[ = 1. Since P is a monic
polynomial of degree v — \i with constant term P(0) e Ax, P(0) ^ 0, we
recognize the significance of /x and v as the extreme indices of monomials of
maximal absolute value for |jc| = 1. In the normalized form av = 1; hence c = 1.
(2) This theorem is a version of the Weierstrass preparation theorem, which was
initially proved for rings of germs of holomorphic functions in several complex
variables. It has now several formulations in purely algebraic terms.
2.3. Entire Functions
Definition. An entire function is a function f given by a formal power series
f e K[[X]] having infinite radius of convergence: rj = oo.
Before studying the entire functions more closely, let us prove the following
elementary result.
Lemma. For any sequence (an)n>o *n a complete ultrametric field K with
fln-> i, the products pn := Y\n<zN an converge to a limit denoted by Y\n>o a" =
YY^Lo an- More generally, if(an)n>o is a sequence of K-valuedfunctions defined
on some set S, and ifan —> 1 uniformly on 5, the partial products pw converge
uniformly to \\n^ an.
Proof. By assumption \an\ = 1 for large n. Hence the partial products remain
bounded, say \pN\ < C (TV > 0). By definition,
Pn+i - Pn — (0jv - 1)Pn,
\Pn+\ - Pn\ < C\aN - 1| -* 0 (TV -> oo).
This proves that the sequence of partial products p^ is a Cauchy sequence. It
converges in the complete field K. The second statement follows immediately
from the first one. ■
The exponential is an example of a function with no zero:
ex - e~x = e° = 1 => ex # 0
(this is true for the complex exponential and for the p-adic exponential: Only the
homomorphism property is used!). Although it is an example of an entire function
314 6. Analytic Functions and Elements
in complex analysis, the finite radius of convergence of the p-adic exponential
prevents this function from being entire in this context. In fact, any entire function
having no zero in complex analysis is of the form f(z) = es^ for some entire
function g, but the only entire functions in p-adic analysis having no zero in an
algebraically closed field are the constants. This leads to an easy determination
of entire functions in /?-adic analysis. The main results are contained in the next
statement.
Theorem. Let f e A^[[X]] be a formal power series with r/ = oo.
(a) If f does not vanish in Ka, then f is a nonzero constant
(b) If f has only finitely many zeros in Ka, then it is a polynomial
(c)lf0^f€ Cp[[X]], the following conditions are equivalent:
(i) f has infinitely many zeros.
(ii) f has a sequence of critical radii —> oo.
(hi) The growth of \f\ is not bounded by a polynomial in \x |,
(iv) / is given by a convergent infinite product
f(x) = Cxm • \\(\ — x/t)tne product taken over nonzero
roots off, counting multiplicities, andm = ordo/.
Proof, (a) If / does not vanish, then ao = /(0) ^ 0 and \f(x)\ = \a0\ whenever
|jc| is smaller than the first critical radius. Since / does not vanish, there is no
critical radius; hence all an = 0 for n > 1. This proves that / = a0 is constant
(b) After division of / by the monic polynomial having the same roots as /,
we are brought back to the first case.
(c) The equivalence (i) <=$ (ii) is a consequence of the finiteness of zeros on
each critical sphere. The equivalence (ii) <==> (Hi) is Liouville's theorem (1.4).
Finally, (iv) => (i) is clear, and we now show (iY) => (iv). By assumption / ^ 0,
and if its order is m > 0, we can write
f(X) = J2 a»X" = CxmV + Yl "«*">
with C = am (andaj, = am+n/am). Without loss of generality we may now assume
that / is given by an expansion f(x) = 1 + Yln>\ anx"- In mis case I/Ml ^ *
for small jc, namely for |jc| < r0 (ro denoting the first critical radius of /)- Just
beyond this critical radius, we shall have |/(x)| = \x\N if there are precisely N
zeros (counting multiplicities) of / on the critical sphere \x \ = no- Let us write
f(x) = po{x).Mx), po(x)= Yl (l~j)-
The same procedure can obviously be iterated on each successive critical sphere
and furnishes a factorization
/(*)=P«to ■/«+!(*), Pn(x)= Yl l1"^)-
l£l<r«./(£)=0 ^ */
2. Zeros of Power Series 315
This construction makes it obvious that for fixed x, the terms (1 — x/§) of the
product tend to 1 and this convergence is uniform in x in a ball B<r (provided
that R < oo). This is the infinite product representation of /. It also shows that
for each given sequence (£■) with |§j1 -* oo there is an entire function having the
£'s as zeros (with correct multiplicities, and no other root): The corresponding
infinite product converges uniformly on all bounded sets (its general term tends
to 1 uniformly on bounded sets). Observe finally that if an infinite product of the
form j~[0 — XI%Y nas trie same zeros (^ 0) as a power series /, the quotient
has no further zero x ^ 0. This function has no positive critical radius and can
only be a monomial cmxm, m being the multiplicity of the zero at the origin. This
concludes the proof of the theorem. ■
2.4. Rolle's Theorem
Rolle's theorem for differentiable functions of a real variable is valid for scalar
functions only. Here it is:
If f : [a, b] —> R (a < b) is continuous and differentiable on
the open interval (a, b), then there exists a < c < b with
f (c) = .
b — a
The mean value theorem with an intermediate point follows from it. The preceding
equality can be written f(b) — f(a) = (b — a)f\c) or with a = t, b = t + h,
c = t + 6h, as a limited expansion of the first order:
f(t + h) = f0) + h.f'(t + 6h) (O<0< 1).
We give here the p-adic versions. Let us start with an easy observation.
Proposition. Let f e K[[X]] be a convergent power series. For f e K with
If I < r/, there is a unique convergent f% e K[[X]] with f(x) = f%(x — f)
for small \x\. Moreover, f% has the same radius of convergence as f, and the
preceding equality holds for \x\ < rj.
If / = X!n>o anXn, this means that we can expand around §,
YJ^xn=Y^an(M){x~Hf,
with no gain (no loss either) in convergence: |x| < r/ <=> \x — f | < r/.
Proof. For a polynomial / e A[X], this is the Taylor formula for the expansion
around the point £ e A:
n<d n<d n<d
316 6. Analytic Functions and Elements
with a polynomial ft e A[X]. This proves that sup„ 1^(5)1 < 1 when supn |^| <
1, so that f *-* ft diminishes the Gauss norms. The same is true for the converse
isomorphism. We conclude that
/ h-> ft : K[X] -► K[X] is isometric.
This isometry has a unique isometric extension to the completion K{X}: We still
denote it by / h-» ft. Now, if / e ^[[X]] has ry > 0, we may apply the preceding
result to any g = f(aX) where a e Ka, \a\ < r/, since g e Ka{X} in this case.
This shows that the radius of convergence of ft is greater than or equal to rf,
but as before, the inverse isometry proves the converse inequality and nothing is
gained. ■
Theorem. Let f e C^[[X]] have convergence radius rf > 1. Then
(p) if f has two distinct zeros a ^ b in B<\ satisfying \a —b\ <rp,
then fr has a zero in B<Y;
(b) if f has two distinct zeros a^bin B<x satisfying \a — b\ < rp,
then f has a zero in B<x.
Proof. By the preceding proposition, we can replace / by its expansion centered
at the point b. Thus, we may assume a ^ b = 0, \a\ < rp (resp. \a\ < rp):
f(X) = £B21 anXn and ax ^ 0 (otherwise, f'(0) = 0, and we are done). We can
also assume that \a\ = rc is the smallest positive critical radius. Hence there is an
integer n > 1 such that
whence
\a\\rc = \a>nVnc,
= rn~x <rnp~] (resp. < rnp'1).
If v = ordpM, say n = pvm, m prime to p, we have
n - 1 pvm - 1 pv - 1 v 1
- = - ->- r=pv-1 + --- + p + l>v
p~l p~\ p~\
(with equality only for m — 1 and v = I: n = p). Hence
<r»p-l = \p\ft<\p\v = \n\9
so
that \a\\ < |na„| and |«i| = (nfl^r"1""1 for some r' < 1. Recalling that r/' ^
r/ > 1. we see that the power series /' admits a critical radius r' < 1. By (2.2) /
has a zero in the closed unit ball. In the case (b), \ax | < \nan | proves r' < h and
the zero is in the open unit ball. *
2. Zeros of Power Series 317
Example. Let / = xp — px and choose a root p1/^-1* e Cp. The zeros of / are
0 and fjLp-\ - pl^p~l\ Hence two distinct roots are at a distance rp. The zeros of
f are also the zeros of f/p = xp~] — 1, i.e., the elements of \jlp~\ on the unit
sphere.
Corollary. Let f e Cp[[X]] withr/ > 1. For each pair of points a, b e Ap
such that \a — b\ < rp, there is a point f e Ap such that
nb)-f(fl) = {b-a)f'{$).
If a, b e Mp and \a — b\ < rp, there is a point f e M^ such that
f(b)-f{a) = (b-a)f(£).
Proof. As in the classical case, consider the function
/(*) /(*) f{b)
a x b
1 1 1
4>(x) =
which vanishes at x = a and x = £>. Its derivative
4>Xx) =
f(fi) fix) f{b)
a 1 b
1 0 1
vanishes in A^ (resp. Mp).
2.5. The Maximum Principle
The preceding theory concerning critical radii — and particularly the existence of
zeros on critical spheres — has important consequences for the study of power
series.
Proposition. Let r < rf be a critical radius of f e CP[[X]]. Then \f\ takes
all values between 0 and Mrf in \Cp\. More precisely, for each y e Cp with
\y\ < Mrf, there is a solution x eCp of the equation fix) = y with \x\ = r.
If \y\ = Mrf, the same equation also has a root of absolute value r, provided
that \y-f{0)\= Mrf.
Proof. Consider the formal power series
fiX) -y = ia0-y) + J2a»Xn (*° = f(0))'
If \y\ < Mrf, then f ~y has the same dominant monomials as /, and r is
still a critical radius of / — y: This function vanishes on the corresponding sphere.
If \y\ = Mrf, the assumption made ensures that the formal power series
318 6. Analytic Functions and Elements
/ — y = (ao — y) + 5Zn>i anXn also admits the critical radius r and hence
vanishes on the corresponding critical sphere of Cp. This proves that
Mrf = sup \f(x)\ = sup \f(x)\ = max|/(x)|
\x\<r \x\<r lxl=r
when r is critical. ■
Corollary. Let r < rj. Then
Mrf = SUp \f(x)\ = SUp \f(x)\.
\x\<r \x\<r
Moreover, ifre\C*\isa rational power of p, then
Mrf = max|/(x)| = max|/(*)|.
I*l<r \x\=r
Proof. For every r <: r/,
l/(*)l < AV < Mr/ (1*1 <r)
implies that
sup |/(x)| < sup |/(x)| < Mr/.
|x|<r |*|<r
Conversely, we can find a sequence (xn) in C^ such that:
\xn | = r„ is regular for all n and rn / r.
Hence
l/(*)l = MrJ /Mrf
implies that
SUP |/(X)| > SUp |/(*B)| = Mrf.
|x|<r n
Finally, if r is regular, |/(x)| = Mrf is constant on the sphere \x\ = r, while if r
is critical, Mrf = max^i^ |/(x)| follows from the proposition. ■
2.6. Extension to Laurent Series
Instead of Taylor series, we can work with convergent Laurent series
oo
f = J2anX"eK[[X,X-'}]
—OO
as in (1.7). Existence of zeros on critical spheres rj < |x| = r < r^ is ensured,
provided that the field K is algebraically closed (typically, if K = C^).
2. Zeros of Power Series 319
For example, let r be a critical radius of /. As in (2.5), an equation f(x) = y e
C will have a solution x e Cp with |x| = r, provided that
either |v| < Mrf or \y\ — Mrf and \y - a0\ = Mrf.
This shows that
sup \f(x)\ = max \f(x)\ = Mrf (r~ < r < r+).
|x|<r l*l=r J J
In the case of a Laurent series /, the function r y-> Mr f is not necessarily
increasing, but it is always a convex function on the interval (rj, r^). A consequence of
this observation is the maximum principle for annuli:
If r~ <n <r2<rj (r, e \KX\)9 then
SUPri5|x|<r2 l/MI =maXr,<|x|<r2 |/(*)l = max (Mn / , Mnf).
We have also seen that
p = logr h* /xp/ = log Mrf
is a convex function on the interval (log rj, log r^) (cf. (1.7)). Let us show that it
is — as in complex analysis — a formal consequence of the maximum principle
for all functions xm fn(meZ, n e N) (given by convergent Laurent series by
(1.7)) in annuli rJ < r\ < r2 < rj.
Hadamard's Three-Circle Theorem. Assume that f e K[[X,X~1]] is a
convergent Laurent series, so that f is also a function defined on an annu-
lus r~ < |x| < r+ of K. Then
p = logr h> iipf = log Mrf — log max \f(x)\
\x\=r
is a convex function.
Proof. Let (r~ < r\ < r < r2 < r+), M, — MrJ, so that
Mrf < max(Mi,M2)
by the maximum principle. Apply this inequality to xm fn (m e Z, n > 0),
rmMnr < max (if MJ1, rfA^),
and taking nth roots
rm/"Mr < max (r?,nMu r2lnM2\
If K = Cp, we can choose the rational number a = m/n such that r"M\ = r"M2
(if K — £lpor C, we can take a sequence of rational numbers m*/rc* converging
to the real root a of r\Mx = r%M2). With this choice for a (using continuity if
320 6. Analytic Functions and Elements
K — Qp or C), we can write
raMr < r?M, = r$M2 = (r^Mtf - (r^M^f
if s + t — 1. Since p ~ log r is a convex combination of the pt = log r,, we can
choose s > 0, t > 0 with ^ -h r = 1 and r = r{- rl2. With this choice ra = r"s • r™,
and the obtained inequality simplifies into
Mr < MJ - M[.
With /Xp = logMr and /x,- = logMj (/ — 1,2), we now get the announced
convexity property
VP < sfii + r/x2 (p = *a + rp2)- ■
Definition. Let f be a Laurent series with r 7 = 0. We say that the origin is an
isolated singularity. Three cases can occur:
(1) f is a Taylor series (an = Ofor all n < 0):
The origin is a removable singularity.
(2) / has finitely many coefficients an ^ Ofor n < 0:
The origin is a pole.
(3) / has infinitely many coefficients an ^ Ofor n < 0:
The origin is an essential singularity.
If the origin is a pole of /, its order is the smallest integer m > 0 such that
xm f is a Taylor series (has a removable singularity at the origin). In the case of an
essential singularity, the analogue of a classical result of Picard is valid.
Proposition. Let f have an essential singularity {at the origin). Then there are
infinitely many critical radii r,- \ 0, and for each e > 0, y e Cp, the equation
f(x) = y has infinitely many solutions 0 < \x\ < e. ■
Proposition. Let f be a Laurent series with r7 = 0 and rt = oo. Then
(a) iff has no zero in C*, f is a single monomial;
(b) if f has only finitely many zeros in C*,
then f is a polynomial in X and X~l;
(c) / is given by a Weierstrass product
nx) = cxm • nte|>,(i - */f y* ■ riifKid - $/xy*
extended over the roots f of f = 0,
v^ denoting the multiplicity of the root f. ■
Example: Theta Functions. Choose an element q e C* with 0 < \q\ < *•
Consider the product
&i(X)=Y\(i-q"x)Y[(i-^),
3. Rational Functions 321
which converges in the annulus 0 < |jc| = r < oo. Obviously,
Q
If we define more generally 0fl(X) = S\(a~lX) (a € C*), then we have
®a(q~lX)=-— ©fl(X).
aq
Products of such theta functions satisfy functional equations of the form
S(q-lX) = C(-X)d - 0(X).
These functions are used for the construction of the Tate elliptic curves.
3. Rational Functions
Functions defined by convergent power series expansions are defined in a ball.
Unfortunately, as we have seen in (2.4), it is impossible to obtain an analytic
extension of such a function by looking at the expansions at different points of
the ball of convergence: The radius of convergence does not change, so the ball of
convergence is the same. Any point of a ball is a center of the ball, and there is no
way of defining "points near the edge."
On the other hand, we like to consider rational functions (quotients of
polynomial functions) as analytic functions outside their set of poles (zeros of their
denominators). These functions can be expanded in power series in each ball
containing none of their poles. More generally, uniform limits of rational functions
will play a role similar to the analytic functions in complex analysis: They are the
"analytic elements" introduced by Krasner.
3.1. Linear Fractional Transformations
A linear fractional transformation is a rational function
ax+b
x y—> f{x) = —
ex + d
where ad—bc^ 0. The coefficients are taken from a field K, and a linear fractional
transformation defines a map
/ : tfU{oc} -> K\J{oo).
The space K U {oc} = Pl(K) is the projective line over K: Its elements are the
homogeneous lines in K2, represented by quotients
x
— = [x : y] = class of pairs proportional to (jc, y).
y
322 6. Analytic Functions and Elements
When c = 0 (a and d ^ 0), we get an affine linear map
a b
-x + - = a'x + b' (a' ^ 0).
a a
When c ^ 0,
cx-\rd c \_ \ c /x + d/cj
Typical examples of linear fractional transformations are
(a) translations x \-+ x + b,
(b) dilatations (or homotheties) x \-^ ax,
(c) inversion x \-> l/x.
The preceding formula shows that these particular linear fractional transformations
generate the group of all linear fractional transformations.
A good description of linear fractional transformations is supplied by 2 x 2
matrices: To each such invertible matrix we associate the linear fractional
transformation having for coefficients the entries of the matrix
(-)
ax + b
cx + d
Composition of linear fractional transformations corresponds to matrix
multiplication: The above correspondence is a homomorphism from the group G\i{K) of
invertible 2x2 matrices with entries in K to the group of automorphisms of the
projective line P*(A^) = K U {oo}. The kernel of this homomorphism consists of
the nonzero multiples of the identity matrix I2 (scalar matrices) L J = a ■ h
(a ^ 0), namely the center of G\2(K). Hence there is an isomorphism
PGh(K) = Gl2{K)/(K*h) ^ Aut(P\K)).
Here are representative matrices for the three types of linear fractional
transformations listed above:
(a) The matrix i J produces the translation x j-^ x + b.
(b) The matrix i 1J produces the dilatation (or homothety) x \-^ ax.
(c) The matrix ( n) produces the inversion x \-^ l/x.
Proposition. Let K be an ultrametric field. Then the image of a ball of K under
a linear fractional transformation is either a ball or the complement of a ball
3. Rational Functions 323
Proof. Affine linear transformations send an open (resp. dressed) ball to an open
(resp. dressed) ball. The formula
/(*) =
ax + b
cx + d
K-
+ (b - ad/c)
_L_1
x+d/c]
(when c^O) shows that a linear fractional transformation that is not an affine
linear map is nevertheless composed of such transformations and of an inversion.
It is thus sufficient to prove the statement for the inversion. Consider, for example,
the ball B<r{a) and its image B' by inversion. If the origin belongs to B<r(a\
then this ball coincides with B<r = B<r(0) and its inverse is the set defined by
|jc| > 1/r, i.e., the complement of the ball BsX/r(a). Otherwise, \a\ > r, and for
x e B<r(a),
so that
\x-a\< r, |*| = \a + (x - a)\ = \a\,
\a-x\
1 1
jc a
=
a — x \
xa
This proves that the image of the ball B<r(a) under inversion is contained in the
ball B<r/\a\2(\/a). Since the same argument shows that the image under inversion
of the second ball must be contained in the first one, we conclude that the inversion
is a bijection between these balls. A completely similar proof holds for closed balls
instead of open ones (alternatively, one can use the relation B<r(a) = P\s>r B<s (a)
between closed and open balls.) Hence we have
f{B<r{a)) = B<rna]2(\/a) if \a\ > r,
f(BSr(a)) = B<rM2(\/a) if \a\ > r,
f({x-\x~ci\=r}) = {y:\y-l/a\ = r/\a\2} if \a\>r. ■
The image of a ball is called a generalized ball: A complement of a ball of K is
identified to a ball of Pl(K) containing the point at infinity; note, however, that in
general two such generalized balls satisfy no inclusion relation. The analogy with
the classical complex case is striking. Indeed, recall that in C a linear fractional
transformation preserves the family of generalized circles (circles and straight
lines) and the family of generalized disks (disks, half planes, and complements of
disks).
3.2, Rational Functions
Let us review a few elementary algebraic facts concerning rational functions having
coefficients in an algebraically closed field K. Let / e K(x) be a rational function
and write / = g/h with two relatively prime polynomials g and h, h being
monic. We say that / is regular at the point a e K if h(a) ^ 0. In this case,
f(a) = g{a)/h{a) is well-defined, and the numerator of the function / — f(a)
324 6. Analytic Functions and Elements
vanishes, and hence is divisible by x — a. This shows that if / is regular at a, we
can write
/ = /(«) + (* — a)f(i)(x), /(i) rational, regular at a.
Iterating this construction for f^, we obtain a second-order limited expansion of
/. By induction, we see that for any integer m > 1,
/ = a0 + aiix -*) + ■■• + am^(x - a)"1'1 + (x - a)m/(m),
where /(m) is rational, regular at the point a.
We say that / ^ 0 has a /?o/e of order m > 0 if the denominator /z has a zero
of order m at the point a (hence g(a) ^ 0, since g is prime to h). In this case, we
write h(x) = (jc - a)mM*X hx(a)^ 0; hence
f = ^ = a /i = — regular at a.
J (x-a)mhi (x-a)mJ J hi &
If we write the above expansion for /i, we obtain
/=; —(av + a.ix - a) + - • • +am-i{x - a)m~l +{* ~ a)m f)
{x — a)m v
a0 Cl\ &m-\ 7 ( 1 \ 7
= (x-ar + (x-ar-l+'"+^Z + f = Pa\7^)+f
with a polynomial Pa of degree m and zero constant term. The rational function
\x—aj (x — a)m x—a
is the principal part of / at the pole a. It is uniquely characterized by the properties
{ Pa is a polynomial with zero constant term,
I f — Pa[ I is regular at a .
I \x-aj
The order of / ^ 0 at the point a e K is by definition
ordfl/ = ordflg — ordah e Z.
This integer is positive if / (is regular and) vanishes at a. negative = — m if / has
a pole of order mate.
Consider the finite set {a,-} = {a e K : h(a) — 0} of poles of / = g/h and
the respective principal parts Pt (-—^ j of /. Then / — £f Pt ( ~^- j is a rational
function that is regular everywhere: It is a polynomial, and we have obtained the
decomposition
3. Rational Functions 325
One way to obtain this decomposition is to start with the Euclidean division
algorithm for polynomials,
g = Pooh + gu deg gi < deg g.
Then
h h
If deg g > deg ft, then deg Pco = degg — deg/z; otherwise, Poo = 0. The well-
known partial fractions expansion for the first term leads to
*~2> (*-«,)■
The particular rational functions
1
(x-aY
, xn (a e K, m > 0, n > 0)
generate the K-vector space K(x). Since they are also independent, they make a
vector space basis of K(x).
If AT is a valued field, we have
With the
previous
1 \x
1 \x
notation
*
hh
0
oo
(1*1
Qx
-> oo),
l-*fl).
-»0|jc|-»oo
we see that
\f(x)\ -> 0 when |jc| -> oo <=* /^(jc) = 0,
|/(jc)| is bounded when \x\ -> oo <£=> /oo(*) is constant,
|/(jc)| -» oo when |jc| -» oo <£=> deg PooU) > 0.
Let us now specialize to K = Cp, algebraically closed and complete. We can use
the binomial series expansion: For a ^ 0, m > 1,
(jc -a)m (-aY» \ a)
= E<-'r"(7)^ (IxKKI)
326 6. Analytic Functions and Elements
(a Taylor series),
-1 (i - a-ym
xm\ xJ
(a Laurent series). In particular, in any region r\ <.\x\ <.ri containing no pole of
/, we can choose the first type of expansion for the principal parts corresponding
to poles |of,-1 > ri and the second type for the principal parts corresponding to
poles \a.i\ < r\. We obtain Laurent series expansions (1.7) (and (2.6)).
Proposition. Let f e Cp(x) be a nonzero rational function and {af} its set of
poles. Then f admits three types of Laurent series expansions:
(fl) E-m<«<oo anxn (0 < 1*1 < min{|or/| : at ^ 0}),
(*) E-oo<«<oo««x" (max{K| : \a(\ < r] < \x\ < min{|a,-| : |a,-| > r}),
(c) E-oo<«<tv ««x" (1*1 > maxflaj}).
Proof. In the first case (a), if / has a pole at the origin, then m is its order. Let us
consider only the case (b). If r > 0 is fixed, group the principal parts corresponding
to poles in the closed ball |jc | < r. For each individual monomial in these principal
parts, a multiple of some 1 /(x — c^)m, choose the Laurent expansion that converges
for |;t| > |a/|. Any linear combination of these expansions converges at least for
|jc| > max{|c*/| : |a?,-| < r}. Group similarly the principal parts corresponding to
the poles |o?,-| > r, and choose the Taylor series for the corresponding monomials
]/(jc — al)m: Their linear combination converges at least for |jc| < min{|or£| :
|of,-| > r}. Adding these two contributions, we get a Laurent series as announced
Observe that since a Laurent series defines a continuous sum in its open annulus
of convergence, this region cannot contain a pole of the sum, whence the precise
estimate for the radii limiting its region of convergence. ■
3.3. The Growth Modulus for Rational Functions
Let us say that a radius r > 0 is regular for a rational function / = g/h eCp(x)
(g and h relatively prime polynomials) when it is regular for both g and h, hence
when g and h do not vanish on the sphere \x\ = rofCp. Hence, when \x\ = r is
regular for / = g/h%
M je
\g(x)\ = Mrg, \h(x)\ = Mrh, and |/(x)| = --~-
Mrh
Lemma. Let f = g/h e Cp(x) and define Mrf := Mrg/Mrh for real r > 0.
This expression is well-defined independently from the particular representation
of f as a fraction g/h, and r i-^ Mrf is continuous on R>o- For each regular
1
(x - a)1
3. Rational Functions 327
r > 0, |/Ml = Mrf on the sphere \x\ = r. In each region where f has a
Laurent series expansion, Mrf coincides with the growth modulus as defined
in (2.6).
Proof. If / = g/h = gi/hu then gh\ = g\h, \g(x)h\{x)\ - \g\{x)h(x)\ implies
Mrg ■ Mrh\ = Mrgl - Mrh
first for the regular values r (of g, h, g\, and h\, i.e., of the product ghg\h\),
and hence also for all values r > 0 by continuity. This proves that MrgjMrh —
Mrg\ /Mrh i. The other assertions of the lemma are obvious. ■
Considering the previous results, we shall simply denote by Mrf = Mrg/Mrh
the growth modulus of a rational function f = g/h.
i
p=3
0
Mr((x-1)/(X-P))
;\ i/r
IV
1/p 1
*■
pole zero
Growth modulus for a linear fractional transformation
Observe that for a nonzero rational function /, Mr f > 0 for r > 0. When a radius
r > 0 is not regular for the rational function f = g/h, we say that it is a critical
radius: f has some poles and/or zeros on the sphere |jc| — r. Denote as before by
{di} the poles of / and introduce its set {/?,-} of zeros.
Theorem. Let f = g/h e Cp(x) be a rational function. Then we have:
(a) If f is regular at the origin, then r h> Mrf is
convex increasing on the interval 0 < r < min{|c^-|}.
(b) If f has no pole in the region r\ < \x\ < r2, then
r h> Mrf is convex in the interval r\ < r < r2.
(c) Ifdegg < deg/z, thenr h-> Mrf is decreasing for r > max{|or/|}.
(d) \f(x)\ = Mrf = cr***-**hfor \x\ = r > maxfla.-l, |^|}.
328 6. Analytic Functions and Elements
Proof, (a) and (b) follow from the lemma and (1.4). For (c) and (d), observe
that for any polynomial P of degree d, MrP = \a<i\rd when r is bigger than the
absolute value of all roots of P. Apply this to P — h to obtain (c), and to both g
and h to obtain (d). a
Example. Consider the rational function
/(*) = —X— e Cp(x),
1 — xz
which has a simple zero at the origin and two poles on the unit sphere. For |x| < 1
we have |1 — x2\ = 1, while |1 — x2\ = |jc|2 for \x\ > 1. Hence
l/(*)l
f |x| if |x| < 1, [r if r < 1,
Mrf=\
l/\x\ if |x|> 1, [rl if r> 1.
Proposition 1. Let f = g/h e Cp(x) be a rational function and let Sr be the
sphere {x : \x\ = r] of radius r > 0. Then:
(a) Iff has no pole on 5r, then \f(x)\ < Mrf {x G Sr\
(b) Iff has no zero on 5r, then \f(x)\ > Mrf (x e Sr).
Proof, (a) If a critical sphere Sr (r > 0) contains no pole of /, its denominator does
not vanish on this sphere and r is a regular value for the denominator: \h(x)\ = Mrh
is constant on 5r,
i f, M ^(X)I ^ M*% ha f /i i x
^(x)l = ~UT - ~^TZ = Mrf (|x| = r>"
Mr/z Mr/z
(fc) Replace / by 1//. ■
If / has a zero f$ e Sr, then by continuity, |/| takes arbitrarily small values in
a neighborhood B<e(/3) of f$. Such a neighborhood is contained in the sphere Sr
as soon as £ < \f$\. Hence |/| takes arbitrarily small values on the sphere 5r. The
same holds for 1 // if / has a pole a e Sr. If / has both zeros and poles in 5r, this
shows that |/| takes arbitrarily small and large values on this sphere. This will be
made more precise in the next propositions.
Proposition 2. Let f = g/h e Cp(x), Sr as before and consider an open ball
D — B<r(a) of maximal radius in the sphere Sr (hence \a\ = r). If f has no
pole in D, then
Mrf = sup \f(x)\:=\\f\\D.
Proof. For s > r = \a\, the spheres Ss and Ss(a) = [x : \x — a\ = s} coincide.
Hence Msf = Msaf (growth modulus with respect to the center a): This is
3. Rational Functions 329
obvious for regular values of s and by continuity also for all values s > r. This
proves
Mrf = MrMf.
Since / is regular in the ball D, its growth modulus Mtaf (with respect to the
center a) is an increasing function of t < r. By the maximum principle (2.5) for
balls,
M,,fl/= sup \f(x)\ (t <r),
\x-a\±t
and by continuity of the function t h> Mta /,
Mr,af = sup MtMf = sup sup \f(x)\
= sup |/(x)| = ||/||D. ■
|jc— c|<r
Observe that since we work with the field Cp, having an infinite residue field,
a sphere Sr of positive radius is a disjoint union of infinitely many open balls of
maximal radius r, so that it is always possible to choose a ball D = B<r(a) without
pole of / as in Proposition 2.
Proposition 3. Let f = g/h € Cp(x) be a rational function, Sr as before.
Then:
(a) Iff has no pole on Sr, then Mrf — sup^j^ \f(x)\.
(b) Iff has no zero on Sr, then Mrf = inf|x|=r |/(*)|.
(c) Iff has both zeros and poles on Sr, then
\f{x)\ assumes all values of \Cp\ onx e 5r.
Proof. Observe that if / has no pole and no zero in Sr, then r is regular and
\f(x)\ — Mrf is constant on 5r. Now (a) follows from Propositions 1 and 2. For
(b), replace /by 1// and apply the previous result.
(c) Choose a pole a e Sr and a zero p e Sr with
\a - fi\ = tmnfa - h\ := &
(minimum taken over the zeros f}} and poles at in Sr). Then
M5Mf = MSff (s>8\
since the spheres of radius s > 8 and centers a (resp. /3) coincide. By continuity,
Ms,af = MSmpf := M.
Now, for each y e Cp, \y\ < M,f — y has a critical radius r < 8 and f{x) = y
has a solution x e B^8(P) C 5r. Similarly, for each v e Cp, \y\ > M, f{x) = y
330 6. Analytic Functions and Elements
has a solution x e B<$(u) c Sr (consider l/f). Finally, as in (2.6), / also assumes
some values v = f(x) where \y\ = M. I
3.4. Rational Mittag-Leffler Decompositions
Recall that we denote the complement of a set B c Cp by Bc = Cp — B.
Proposition 1. Let 0 ^ / = g/h e Cp(x). Assume degg < deg/z and that
f has all its poles in a ball B = B<a for some a > 0. Then for any subset D
disjoint from B, \\f\\D < WIWb^ = Maf. If D = B<a(a) is a maximal open
ball in the sphere \x\ = cr, then
U\\D = \\f\\Bc=M0f.
Proof. Since / has all its poles o?,- in B, we have ap = max,- |o?,-| < cr and
!/(*)!< MUI/ (|x|> crp).
On the other hand, since degg < deg/z, the growth modulus Mrf decreases for
r > o>,
\f(x)\<Maf (|*|>cr),
and for Z) c #c we have
ll/lb<ll/ll^<Afff/-
Taking a sequence^ e #c with regular rn = |jc„ | \ cr, so that |/(jc„)| = Mrnf /
M0f, we see that ||/||^ > sup„ |/(x„)| = Mff/. Finally, if D = fl<cr(a) is a
maximal open ball in the sphere |jc| = cr, then Proposition 2 of (3.3) shows that
II/IId = M?/> since / has no pole on \x\ = a. ■
Observation. The last step of the preceding proof, ||/||d = Maf, requires only
that we find a sequence {xn) in D with regular rn = \xn — a\ / a, so that
l/(*„)| - Afrjl.fl/ -> MCMf = M0f
ll/lb > sup!/(*„)! > liml/UJI == M,/.
This will be essential for generalizations (cf. Proposition 2 in (4.2)).
If f = g/h e Cp(x) is a rational function and cr > 0, we can group the principal
parts corresponding to the poles a, of / in the open ball B = B<tT:
/*w = £ p, (——) ■
3. Rational Functions 331
We can apply Proposition 1 to fo, since this function has all its poles in B. The
growth modulus Mrfo is decreasing for r > o:
WfB\\D = ^p\Mx)\<MafB (Den
xeD
Proposition 2. Let f e Cp(x) be a rational function and let fB be the sum
of the principal parts of f corresponding to its poles in B = B<a. If D is a
maximal open ball B<a(a) in the sphere \x\ = a and D contains no pole off,
then
WfB\\D = MafB<Maf=\\f\\D.
Proof. We may assume fo^O and let us introduce /0 := / — fB e Cp(x),
which is regular in B (but may have poles in the sphere |jc| = o). Hence
r h> Mr/0 is increasing (may be constant) for r <a.
On the other hand, MrfB decreases (strictly) beyond
op := max{|a| : a is a pole of f in B] < o.
There is at most one crossing point of Mr fo and Mr fB in the interval {pp,o). Hence
Mrf0 ^z MrfB with at most one exception r e (ap, ex). For all regular values (all
except finitely many), these Mr represent absolute values of the corresponding
functions where in a sum, the strongest wins:
MrfB < max(Mr/*, Mr/0) = Mr(fo + /0) = Mrf {ap{fo) <r / a).
Taking an increasing sequence of regular values rn f a, we conclude that
MafB<M0f
Finally, by Proposition 2 of (3.3) we have M„fB = Wfo ||d, Maf = \\f\\D. ■
Let us go one step further and group the poles of / in a finite number of balls.
Theorem. Let B, = B<a,(ai) (1 <i<£)be a finite set of disjoint open balls
in the closed ball B<r and define D = B<r ~ \J1<l<t Bt. Let f e Cp(x) be
a rational function regular in D, f ~ fBi the sum of the principal parts of f
corresponding to its poles in Bi (1 < i < I). In the canonical decomposition
/ = /o+X>= ^,/i,
l<i<£ 0<i<i
where /0 is regular in B<r, we have
imiD=max H/.llo.
332 6. Analytic Functions and Elements
Proof. As in (3.3) (Proposition 2) we can select open balls Dt of maximal radius
ot on the spheres \x — a{\ = oi (1 < i < £) and containing no pole of /.
Mittag-Leffler decomposition of a rational function
By Proposition 1, the inclusions /),■ C D C Bf lead to the same sup norms
\\fi\\Dt^\\fl\\D = \\fi\\Bf{=M^QJi).
By Proposition 2, we have \\fi\\o, < 11/ II d, (i > 1), hence
UMd = WMd, < 11/11A < II/IId 0<i< I).
Now the competitivity principle (II. 1.2) in / — /o — X!i<i<£ ft =® shows that we
have
||/||D = max ||/||z>. ■
0<i<£
Note that for the regular part /o of / in B<r we have
||/o||d = sup |/0(jc)| = max |/0(jc)| = sup |/0(jc)|
\x\<r l*l=r \x-b\<r
for any \b\ = r. On the other hand, if deg g < deg h, f = g/h -» 0 (\x\ -+ oo),
we find that /o — 0, so that we also have
II/IId= sup \\Mo
for the unbounded domain D = Cp — Lij<,-<^ Bt.
Let us introduce the following notation for any subset D of Cp:
R(D): ring of rational functions having no pole in D,
R0(D) C R(D): subring defined by |/(*)| -> 0 when |jc| -> oo.
3. Rational Functions 333
For any open ball B and any D disjoint from B, we can look at
R(D UB)-^> R(D) -> R0(Bcy
The first map is the restriction / h> f\D (injective as soon as D is infinite), while
the second is / h> fB (principal part of / in B). This second map is surjective
and admits a section fB »-> fB\D also given by restriction. If a rational function is
regular onDUB and on Bc, i.e., regular everywhere, then it is a polynomial. If
moreover it tends to 0 (when \x\ -» oo), then it is 0:
R(DUB)DRo(Bc) = {0}.
Hence the preceding sequence is a short exact sequence: It splits
/ = /o + fB <* (/o, /*) : R(D) = R{D U B) 0 R0(BC).
More generally, with the notation and assumptions of the theorem,
0 —* R{B<r) -^ R(D) —> 0 R0(Bf) —> 0
is a split short exact sequence (the case £ — 1 is as in the previous example). The
section of / h> (fi)i<i<e furnishing the splitting is (fi)i<i<i h> Ei<i<* /iId-
Indeed, the difference / — £1<l<£ .//Id extends to the regular /0 e R(B<r). All
these maps are linear and contracting; hence
R(D)^R(B<r)(B 0 R0(Bf)
Ki<£
is an isomorphism of the normed space /?(£>) with a direct sum of normed spaces
overCp(rV.4.1).
3.5. Rational Motzkin Factorizations
It is easy to give a product decomposition for rational functions quite similar to
the sum decomposition given in the preceding section. Let B C Cp be an open
ball and / = g/h e Cp(x) a rational function having all its zeros and poles in B:
S = {aeCp: g(a)h(a) = 0}cB.
Hence f = c Yia€s(x ~~ fl)Mc G^a G ^> c g Cp- Choose fc in £ and write
a€S
Observe that
\x-a\
\x-b\
= 1 ixiB).
334 6. Analytic Functions and Elements
More precisely,
x —a b — a \b — a\ t n
7 = 1 + 7. 7 <1 btB),
x — b x — b \x — b\
\x-a « »»—/^_^v^
ix-* »„<'*■«»■ Pil^j "' <L
This gives a factorization
/(x) = c(*-&r/i(*).
where
m = (number of zeros - number of poles) of / in B,
and
\\h-\\\Bc < 1, h(x)-> 1(|*|-* oo).
If we take another center b in B, we shall have
/(x) = C(x - b)mh(x)
with
\x-b)
In particular, we see that c and m are independent of the choice of center of B (but
h depends on this choice). The integer m is called index of f relative to the ball
B. Asymptotically,
f(x)^c(x-b)m (M-*oo).
We can formulate a more general result when the zeros are located in a finite union
of balls. Let r > 0 and let
Bt = B<ai(bl)cB^ (1 <i<£)
be a finite set of disjoint open balls contained in B<r (0 < 07 < r, \bi\ < r)-
Consider the domain
& = B<r- U Bt.
l<i<£
The next three propositions concern rational functions / that are units in the ring
R(D): Neither / nor \/f has poles in D, i.e., / has neither zero nor pole in D.
Proposition 1. Any f e /?(/))* can be uniquely factorized as
f ~ f° ' ll & (Motzkin factorization),
l<i<£
3. Rational Functions 335
where f0 e #(£<r)x and for 1 < i < £
f = (x- b.T'hi e R(B<r, \\ht - 1 Use < 1, hiix) -> 1 (|x| -> oo).
Proof. The only possibility consists in collecting the zeros and poles of / in Bt
and defining f as the product of the corresponding factors {x - aYa (a 6 Bi,
\La e Z positive for zeros and negative for poles of /). With /0 = // Y\\<i<t fi
all requirements are satisfied. ■
As before, the difference mi between the number of zeros and poles of / in Bt
(taking multiplicities into account) is the index of f with respect to Bt. With any
choice of center bi of Bi, the Motzkin factor ft of / relative to Bt satisfies
_* ill <i.
Proposition 2. Assume \\f — \\\d < 1- Then f has as many zeros as poles in
each ball Bi C Dc.
Proof. The assumption implies \f(x) — 1| < 1 for all x e D, hence \f(x)\ = 1
is constant in D. Consider a ball B; and consider the growth modulus centered
at bi e Bi. Without loss of generality, we may assume i = 1, b\ = 0, since b{
is also a center of the ball B<r. Since D contains a maximal open ball D\ of the
sphere |jc| = ox := ex having no pole of / — 1 (in fact infinitely many such balls),
Proposition 2 of (3.3) shows that
Mo(f-D=\\f-l\\D1<\\f-l\\D<l-
\\hi-nD =
fj
(x-biY
Motzkin factorization of a rational function
336 6. Analytic Functions and Elements
By continuity of the growth modulus, we have Mt (/ — 1) < 1 for all t close to a.
For regular values of t < o close to a we have
\f(x) - 1| = Mt(f - 1) < 1 => \f(x)\ = 1 (|x| = t).
Hence Mt f = 1 for t / o. But our study of Laurent series has shown (3.3) that
for f < cr close to cr, Mtf = tm, where m is the difference between the number
of zeros and poles of / in Z?i. Hence m = 0, as asserted. ■
Under the assumptions of the preceding proposition, if / has some zeros in Bi,
it also has some poles in this ball, and we can look at the principal part P{-f of /
in Bi (3.4).
Proposition 3. If \\f — \\\D < 1, then the principal part Pif of f relative to
the ball Bi and the Motzkin factor f = ht defined in Proposition 1 are related
by
l|/'i/llfl = ll/f-l||fl (l<i<*).
Proof. Let S denote the set of zeros and poles of /, and let / = /o - rii<<<< f> ^
the Motzkin factorization of /. By Proposition 2, we have
Hence cot = f — 1 is a rational function that is regular outside Bt and tends to 0
when \x\ —> oo: cot is a sum of principal parts of poles in £f. Moreover,
I|g>«IId< INIIat < 1-
Similarly, /0 = c(l -f~ coo), and replacing / by f/c (hence /0 by f0/c) we may
assume c = 1. Let us compare the additive and multiplicative decompositions
/=«>/+ T, Pif= n o+^)=(i+a>l-)- n(i+^>
l<i<£ 0<j<£ j^i
* * -'
1+^r regular in B,
with || ^ \\D < 1. Hence
/ = (l + Tfc) + a*(l + ifc),
and the principal part Pt f of / relative to the ball Bt is also the principal part ot
Pif = Pi(o>g + a*ifc) = a>, + /,/((w^I). W
By the rational Mittag-Leffler theorem (3.4),
II^/(^^,)IId< II*>«iMId < W&Ad,
3. Rational Functions 337
and in (*) the first term is dominant:
\\Pif\\D=\M\D = \\fi-l\\D. ■
3.6. Multiplicative Norms on K(X)
When r > 0 is fixed, the growth modulus Mr(f) (1.4) defines an absolute value, in
other words a multiplicative norm, on the field K(X) of rational functions having
coefficients in an extension K of Qp. Other norms of the same type are obtained
if we consider a e K and consider the growth modulus centered at the point a:
For a rational function / regular at a, having a power series expansion
/(*) = ^2an(a)(x - d)n (\x - a\ < rf),
Mrtfl(/):= sup \an(a)\rn (r < rf).
When \K\ is dense in R>o, the maximum principle (2.5) shows that
Mr.af= sup |/(x)|= sup |/(x)| (f e K[X]);
\x—a\<r \x—a\<r
hence MrMf = \\/\\b is the sup norm on the ball B = B<r(a) when / is a
polynomial. Since a multiplicative norm on K{X) is completely determined by its
values on K[X], we deduce that the following properties are equivalent:
(0 Mrfi = Mry, (ii) B<r(a) = B^r(b); (Hi) \a - b\ < r.
When the field K is algebraically closed, a multiplicative norm is completely
determined by its values on linear functions. As we have seen, for a linear function
X-f = (0-?) + (X-«)
we have
Afrifl(X-f) = supdf-fl|fr) (r>0).
When f varies in A^ we have Mra(X — f) > r and hence
inf Mr.fl(X-f)>r.
In fact, this inequality is an equality: Take f e K with |f — # | < r.
Proposition. Lef A" &e an algebraically closed, spherically complete extension
°fQp- Then any absolute value \J/ on the field of rational functions K(X) that
extends the absolute value of K is of the form Mra for some a e K and r > 0.
Proof. By (II. 1.6), the absolute value ty is ultrametric. We are looking for a ball
# = B<r(a) leading to
nx~Z) = MrAX-Z)=\\X-Z\\B (feAT).
338 6. Analytic Functions and Elements
Let us consider
r = inf f{X - f).
(1) If this is a minimum, take a e K such that \^(X — a) = r. Then
r < \HX - £) < sup(*(X - a), tfr(* - £))■
v v , v v ,
=l*-fll
If If — a\ ^ r, this is an equality
*(X - f) = sup(r, |f - a|) - Mr,fl(X - f).
If |f — a | = r, the preceding inequality gives
r < f(X - f) < sup(r, r) = r;
hence \^(X — f) = r = sup(r, |f - a\) = Mr?fl(X — f). This proves \^ = Mr>fl.
(2) In general, take a sequence
\HX-a„) = r„\r (n>0).
As we have seen,
r < \HX " £) < sup(^(X-^), ^-f)) = Mr„jfln(X - f ).
Hence
r < *(* - f) < liminf Mrn,an(X - f) (f e tf),
n-»oo
\J/ < liminf Afril,flB.
If \I/(X — f) > r, then \^(X — f) > r„ for all large n, and by (1),
r < V(X - f) = sup(r„, |f -«„|) = Mrn.fl„(X - f) (n > N)
proves that
^(X-f) = liminf Mrn,fln(X-f).
n—>oc
If thereisaf e ^f with^(X-f) — r, we are brought back to the first case already
treated.
(3) Let us study liminf„^.oo MrnCln(X — f). Consider the inequality
r„+i = \^(X - an+i) < sup(r„, |a„+i - an\) = Mr^Qn(X - an+i).
If rn ^ \an+\ ~ an\, then r„+1 = sup(r„, \an+i - an\) > rn. Since we suppose
on the contrary rn+\ < rn, there is a competition r„ = \an+\ — an\. Define the
sequence of balls Bn = B<rn(an) (n > 0). We have just proved that
Bn+l cBn (n> 0).
4. Analytic Elements 339
Since the field K is spherically complete by assumption,
B = p| Bn £ 0,
and any element a in this intersection is a possible center of the ball B = B<r(a):
liminfMr„,fln(X-f)= lim \\X - f Ik = IIX - f II* = Mr,fl(X -f).
This concludes the proof. ■
4. Analytic Elements
Since analytic continuation cannot be achieved by means of Taylor expansions in
p-adic analysis (cf. (1.2)), another procedure has to be devised. It was Krasner's
idea to mimic the Runge theorem of complex analysis: A holomorphic function
/ defined in a domain D of the complex plane C can be uniformly approximated
by means of rational functions. More precisely, for each compact subset C C D,
choose A = teJiG/ with one point in each connected component of the
complement of C in the Riemann sphere. Then / can be uniformly approximated on C
by rational functions having all their poles in the set A.
We shall adopt this point of view here, and we start by a discussion of the
domains of Cp in which the idea of Krasner can be carried out. For simplicity, we
shall limit ourselves to bounded analytic elements.
4.1. Enveloping Balls and Infraconnected Sets
Let D be any nonempty subset of the field Cp. Its diameter is defined by
8 = 8(D) = sup \x — y\ = sup \x — a\ < oo (a e D).
x,ytD xeD
The closed ball
BD := B<8(a) (a e D)
is called the enveloping ball of D (if D is unbounded — i.e., 8 = oo — we take
BD = Cp). It is the intersection of all closed balls containing D and hence the
smallest closed ball containing D. When D is closed and bounded,
flGCp-D^r = d(a, D) = inf \a - x\ > 0,
y xeD
and the open ball B<r(a) is a maximal open ball in the complement of D. Each
maximal open ball of Bo — D is called a hole of D. The preceding observations
show that any closed bounded subset D has a representation
D = BD - \J Bt, where BD = Du\jBt
i i
with (possibly infinitely many) holes B{ — B<ri(at).
340 6. Analytic Functions and Elements
Examples. (1) Let D = £<i be the open unit ball. Then BD = B<\, and the holes
of D are all the open balls £<i(tf) contained in the unit sphere (\a\ = 1).
(2) If 0 < r g \CP\, we have B<r(a) = B<r(a), and this set coincides with its
enveloping ball (it has no hole).
We shall be interested in a special class of closed bounded subsets.
Definition. A subset D C Cp is calledinfraconnected if its diameter S is positive
and for each a e D
{\x — a\ : x e D] is dense in [0, S] c R>o-
In other words, D is infraconnected when for all pairs of distinct points a ^
b e D, all annuli
[x e Cp : r\ < \x — a\ < r2] (0 < rx <r2 < \b — a\)
meet D. In particular, if D is infraconnected, it has infinitely many elements.
Infraconnected sets
Lemma. Let D be an infraconnected set. Then for each c e Cp,
Ic = [\x - c\ : x e D]
is dense in an interval of R.
Proof. (1) If c = a e D, by definition {\x — a\ : x g D] is dense in the interval
[0, S], where 8 is the diameter of D.
4. Analytic Elements 341
(2) If c g D is at a distance r < 8 of D, we have to prove
/c = {|jc — c\ : x e D] is dense in the interval [r, 5].
There is nothing to prove if r = 8. Otherwise, choose a e D with \c — a\ < 5.
Since 8 = supbeD \a — b\, we can choose b e D with |c - a\ < |tf — b\ < 5.
Hence
r < \c - a\ < \a - b\ = |c - fc| < 5
(make a picture !). The annuli of inner radius r\ > \a — c| having center a or
c coincide. If we select any outer radius r-i > r\, r2 < |c — b\ = \c — a\, the
corresponding annulus meets Z), since this subset is infraconnected.
(3) Finally, if D is bounded and c e Cp is not in the enveloping ball of D, D is
contained in the sphere \x — c\ = d(c, D) > 8 centered at c and Ic is an interval
reduced to a point. ■
Examples. (1) Let 0 < r\ < r2 < oo. Then the annulus r\ < \x — a\ < r2 is
infraconnected. But the complement of this annulus is not infraconnected. The
complement of a sphere in a ball is infraconnected. For example, the subset |jc| ^
l/p of the unit ball B<i is infraconnected.
(2) The compact subset Zp C Cp is not infraconnected.
(3) Let (Bi)i<i<i be a finite family of disjoint open balls contained in B<\. Then
D = B<!- ]J Bt
is infraconnected, and the holes of D are the open balls Bf (a more general class
of examples will be given below).
Proposition. Let (fir-)/>o be a sequence of disjoint open balls contained in the
closed unit ball B<\. If the sequence of radii rt tends to 0, then
D = B<l-~]jBi
is an infraconnected set. Its enveloping ball is Bo — B<\.
Proof. Let us recall the following fact (systematically used in the proof):
Any nonempty sphere of radius 0 < p < 1 is a union of infinitely
many open balls of equal radii < p.
Let us order the radii of the balls Bt in strictly decreasing order
r' > r" > r'" > ■ - • \ 0.
342 6. Analytic Functions and Elements
By assumption, there are only finitely many balls Bj of any given radius in this
sequence. There exists an open ball B' C Z?<i of radius rf and disjoint from
the finite number of balls Bn having radius greater than or equal to r'. In this
ball Bf we can find an open ball B" of radius r" disjoint from the finite number
of balls Bn having radius greater than or equal to r" (and < r'), etc. We can
construct a sequence of clopen balls B' D B" D Bm D - - - having radii (diameter)
approaching 0. Since the field Cp is complete, the intersection B' n B" Pt B'" - • • is
a common point and is not in the union of the sequence (Z?,)i>o- This construction
shows that Z?<i — \J Bt is nonempty and has infinitely many points. The proof
that it is infraconnected follows from the observation that at each step of the
above construction, we can choose the ball Z?(,) in any given nonempty sphere of
prescribed radius > r(,). ■
4.2. Analytic Elements
As in (3.4), let R(D) denote the ring of rational functions having no pole in D:
R(D) = {/ = g/h : g, h e Cp[X], h having no zero in D}.
Definition. Let D be a closed subset of Cp. A function f : D -* Cp is an
analytic element if it is a uniform limit of a sequence of rational functions
fn e R(D).
The analytic elements on D make up a vector space H(D), which is a uniform
completion of R(D). However, note that in general an / e R(D) can be an
unbounded function on D, so that R(D) is not a metric space. Let us start with the
important case where it is a metric space (in (4.3) we shall show how to treat the
other case).
Proposition 1. When D C Cp is a closed and bounded subset, each f e R(D)
is bounded on D, and H(D) is the closure ofR(D) in the Banach algebra Cb(P)
for the sup norm.
Proof. Recall (3.2): The functions
x", — (n>0, a<£D, m> 1)
(x — a)m
constitute a basis of the vector space R(D). When D is bounded, the functions x
(n > 0) are bounded on D. Moreover, when D is closed and a g D, the distance
infVGo |jc — a\ is positive, so that the functions l/(x — a)m (m > 1) are also
bounded on D. This proves that all rational functions having no pole on D define
bounded continuous functions D —► Cp, and the same is true for the analytic
elements (IV.2.1). Since the closure of a subalgebra of Cb(D) is also a subalgebra,
the statement follows. •
4. Analytic Elements 343
Corollary. The product of two analytic elements on a closed and bounded set
D is an analytic element ■
We can now generalize Proposition 1 in (3.4) for infraconnected sets.
Proposition 2. Let D C Cp be a closed, bounded, and infraconnected set.
Assume 0 e BD and let 0 < d(0, D) < r < S(D). Then
Mrf<\\f\\D (feR(D)).
If the sphere \x\ = r meets D, we have, more precisely,
Mrf<\\f\\srnD<\\f\\D (feR(D)).
Proof. Let o — d(0, D), 8 = 8(D), so that [\x\ : x e D] is dense in the interval
[a, 8]. If / e R(D), let us show that there exists a sequence xn e D with
l/(*n)l = MMf -* Mrf (n -* oo),
so that
II/IId > sup 1/(^)1 > lim 1/(^)1 = Mrf
First case: D does not meet the sphere Sr = {\x\ = r}. Since D is infraconnected
and r e {\x\ : x e D], we can find points xn e D with |jc„| -> r monotonically.
All except finitely many (that we may discard) are regular, and we have finished
in this case.
Second case: There is a point a e D n Sr (observe that in this case D c Sr may
well happen for r = 5 !). We have MJjfl/ = Msf for s > r, since the spheres
of radius s and respective centers a or 0 coincide. By continuity we also have
Mraf = Mrf. By density of the values \x — a\ (x e D infraconnected) in the
interval [0, 8] we can find a sequence xn e D such that \xn — a\ = rn is regular
for / (with respect to the center a), rn / r: \xn\ — \a\ = r. Hence xn e Sr H D,
\f(xn)\ -> MAfl/ = Mr/,
WfUno > sup 1/(^)1 >Afr/.
The proof is complete. ■
Corollary. Let c e Bd where D is a closed bounded infraconnected set, and r
in the interval Jc := closure of{\x — c\ : x e D}. Then the growth modulus Mrc
(centered at the point c) has a continuous extension to H(D). More precisely,
f h> Mrcf is a contracting map
\Mrxf - Mr_cg\ < MrAf - g) < 11/ ~ g\\n (/, g e R(D)).
344 6. Analytic Functions and Elements
Proof. Take a sequence (jc„) as in the proof of the above proposition (working for
both / and g) and let n —► oo in the inequality
I l/(*„)l - \g{xn)\ I < \f(xn) - g(xn)\ < ||/ - g\\D. m
Definition. Let D be closed, bounded, and infraconnected:
ce BD, o = d(c, D)<r < 8(D).
The growth modulus MKC is defined on H(D) by continuous extension of
f h> MrtCf : R(D) -* R>0-
Forfixed r and c, the growth modulus is aseminorm on H(D). Beware of the fact
that for complicated infraconnected sets D there can be nonzero analytic elements
f on D with Mrxf = 0: This can happen only when r is an extremity of the
interval [a, 8]. For this more specialized topic involving a discussion of T-filters,
we refer to the recent book by A. Escassut.
43. Back to the Tate Algebra
A power series f(x) = JZ«>o fl«x" w^tn 0 < r^ < oo does not necessarily define
an analytic element on B<rf. If it does, this sum is bounded on the closed and
bounded ball B<rf (Proposition 1 in (4.2)). In the typical case rf = 1, if the \an\
are unbounded, there are infinitely many critical radii less than 1, and the sum
is unbounded: It is not an analytic element on B<x = Mp. When the \an\ are
bounded, both cases can happen. The series JZ«>o x" ^ */0 ~~ x) nas bounded
\an\ (=1) and is an analytic element on Mp (indeed a rational function with a
single pole at I ^ Mp). It is more difficult to give an example of a power series
with bounded coefficients that is not an analytic element on Mp (a criterion will
be given in (4.6)). When \an\ —> 0, the sum / is a uniform limit of polynomials
(partial sums) on Ap, and we get an analytic element on the closed unit ball. This
simple observation shows that a convergent power series defines analytic elements
on all balls B<r (r < rf).
Theorem. The space H(AP) of analytic elements on the closed unit ball
coincides with the Tate algebra Cp{x} with its norm: for f = JZ«>o #«*">
ll/H = sup \f(x)\ = sup \an\.
Ul<i «>o
Proof. When \a\ > 1 the series expansion (3.2)
{x-a)m (-af1 V a) ^ \ n Jam+n
4. Analytic Elements 345
converges for |jc| < \a\ and a fortiori for |jc| < 1: Its coefficients tend to 0:
I an+m \n }\~ \a\n+m
This shows that the space of rational functions without a pole on Ap is a subspace of
the Banach algebra Cp{x}. This subspace contains the dense subspace consisting
of the polynomials
Cp[x] C R(Ap) C Cp{x},
and the closure is H(AP) — Cp{x}. m
To be able to speak of analytic elements on the complement of a ball (which is
unbounded) we now approach the case of unbounded domains D, and hence R(D)
is not a metric space. Let us introduce the vector subspaces
Rb(D) := [f e R(D): f bounded on Z)},
consisting of the rational functions f = g/h, deg g < deg h, having no pole in D,
Ro(D) := {/ e R(D) : f -* 0 (|*| -► oo)} c Rb(D)
consisting of the rational functions f = g/h, deg g < deg h, having no pole in D.
The Euclidean division algorithm shows more precisely that
tf(D) = *6(D)©Cp[*]
= Ro(D)@Cp ©jcCp[x].
A fundamental system of neighborhoods of an fo in R(D) is given by
Wo) = {/ €= R(D) : sup \f(x) - Mx)\ < £} {e > 0).
xeD
In particular, if /o is bounded, then VE(fo) c Rb(D), namely:
vE(fo)nxCpix] = {0}.
This proves that the topology induced by uniform convergence on xCp[x] is the
discrete one:
/?(£>)= Rb(D) © xCp[x] .
normed space uniformly discrete
By completion we get
H(D) = Hb(D) © xCp[x} -
Banach space un|formly discrete
346 6. Analytic Functions and Elements
We can also write
H(D) = H0(D) © Cp @xCp[x]
and group the last two factors
H(D)=H0(D)eCp[x],
but the uniform structure on the last factor is not the discrete one.
When D is unbounded, we shall only use bounded analytic elements and thus
work in the Banach algebra Hb(D) = H0(D) © Cp. We note that H0(D) is a
(maximal) ideal in this algebra with quotient Hb(D)/Ho(D) = Cp (a field).
Let us now treat explicitly the case of the complement of open balls.
Proposition. The bounded analytic elements on Cp — Mp = {\x\ > 1} are the
formal restricted power series in 1/x:
Hb({\x\ > 1}) = Cp{l/x} D H0({\x\ > 1}) = x'lCp{l/x}.
Proof. The inversion x \-+ y = 1/x transforms bounded rational functions having
no pole in \x\ > 1 into rational functions having no pole in \y\ < 1,
^({k|>l}) = /?({|y|<l}),
and the completion is
Hb({\x\>l}) = H({\y\<l}) = Cp{y}
by the preceding theorem. ■
The comments made prior to the proposition prove that the analytic elements
on {\x\ > 1} are given by Laurent series having only finitely many nonzero terms
anxn with n > 0:
H({\x\ > 1}) = Hb({\x\ > 1}) @xCp[x] = Cp{l/x}@xCp[xl
More generally, if B = B<r(a) is an open ball, then
H(BC) = Hb(Bc) © (x - a)Cp[x - a],
and
Hb(Bc)cCp[[(x~ay]]]
is the subspace consisting of the formal power series
f(x) = ^ an/(x - fl)n such that \an\/rn -» 0 (n -» ex)).
4. Analytic Elements 347
Similarly,
H0(BC) C(x- aylCp[[(x - a)~1]]
is the subspace consisting of the formal power series
f(x) = ^2an/(x - af such that \an\/rn -> 0 (n -> oo).
n>l
4A. The Amice-Fresnel Theorem
Let B = 1 + Mp C Cp be the open ball of radius 1 and center 1. We are going to
give a useful description of the space Ho(Bc) of generalized principal parts relative
to the hole B: These are the analytic elements in the complement of B that tend
to zero at infinity. By (4.3) we know that these analytic elements / e Ho(Bc) are
given by restricted power series in l/(x — 1) with zero constant term:
/w=E*-(,_1lrfi (i*-i-*o)
and
\\f\\B< = Mhlf = sup \Xm\.
Let us expand each term (x — l)-™"1 according to the binomial formula
f*o V " /
Recall the elementary identity (-1)"(-™-1) = (-iy(~^_l) e.g., if m > n, then
(-iK~""1)=(~iy''
m
t (—m — 1) • • • (—m — n) m{m — 1) ■-■(« + 1)
n! (n + 1)-- -(m — l)»i
(n + 1)(« + 2) - • • w(w + 1) - - ■ (m + n)
-<-K"V')
ra!
(similar computations are valid when n > m). Grouping terms, we see that the
coefficient of jc" in f{x) is
Define #? : Zp —► C^ by the uniformly convergent series
p(x) = -
m>0
348 6. Analytic Functions and Elements
(this is a Mahler series in y = — x— 1 (IV.2.3)). The inequality \\<p\\zp < ma^lA^is
obvious, and conversely, since Xm is a linear combination with integral coefficients
of (p(— 1),.. .,(p(—m — 1), we also have
max|Am| < \\(p\\Zp
(recall Theorem 1 in (IV.2.4) for Mahler series). This proves that
Mz,=SUp|A„|==||/l!lI-
By definition, (p is a continuous extension ofn\->an to Zp. Reversing the
operations, we have proved the following result of Y. Amice and J. Fresnel.
Theorem. Let f = JZ«>o fl«x" e Cp[[jr]] be a convergent power series with
Tf >\ and denote by B the open ball 1 + Mp. Then the following properties
are equivalent:
(i) The sequence n\~± an has a continuous extension <p : Zp —> Cp.
(ii) / is the restriction of an analytic element of Ho(Bc).
4.5. The p-adic Mittag-Leffler Theorem
For a simple region D = Bsr — Iii<,-<£ ^ where the Bi = B<0t(ai) are disjoint
open balls in Bsr, namely holes of D (notations and assumptions of (3.4)), the
rational Mittag-Leffler theorem leads by completion to a simple decomposition of
analytic elements in D:
H(D) -^> H{Bsr) © 0 H0(Bf).
Explicitly, this means that each / e H(D) can be uniquely written as
/ = /o + £/' (/o e H(Bzr)\ ft € H0(Bf), 1 < i < £),
i
namely with generalized principal parts f off, regular outside B{?, or equivalently,
having all their singularities in the hole Bi of D. If we choose a center^ in the hole
Bi, such a generalized principal part f e H0(Bf) is given by a Laurent expansion
(Corollary in (4.3))
Let us turn to a closed, bounded, and infraconnected domain Z) C Cp.
Proposition. Lef D be closed, bounded, and infraconnected, f e R(D). Let
also B = B<a be a hole of D. If fs denotes the sum of the principal parts
4. Analytic Elements 349
attached to the poles of f in B, and fo = f — fB, then
WfB\\D<M0fB<M0f<\\f\\D,
||/l!D = max(||/ll||£„||/olb).
Proof. If / e R(D) is a rational function without a pole in D, B = B<a is a hole
of D, and fB is the sum of the principal parts of / corresponding to its poles in B,
then we have / = fB + f0 with a regular /0 € /?(Z) U 5), /B(x) -* 0 (x -* oo),
and
W/bWd < M0fB by (3.4) Proposition 1,
Ma fB < Maf by (3.4) Proposition 2,
Maf <U\\d by (4.2) Proposition 2.
This proves \\fB \\d < II/IId> and the competitivity principle (II. 1.2) in
/-/n-/o = 0
leads to
11/11^= maxdl/Blb, ll/olb). ■
This means that we have an isomorphism
R(D) -^> R(D U ff) ® /?0(^c)
of normed spaces.
Let now (Z?,-)i€/ be the family of holes of Z), so that BD = Z) U JJ7 Z?,- is the
enveloping ball of D. We also have a split short exact sequence
0 ^ tf(tfa) -^ /?(D) -* 0 R0(Bf) -* 0
with linear contracting maps of normed spaces. The surjective map is / i-> (/i)/Gy,
where fi denotes the sum of the principal parts of / at its poles in Bji When
/ € R(D) is given, finitely many fi are nonzero. The map
is a splitting: / — J^. /i |/) is the restriction of a rational /0 € R(BD) and /(jc) =
/o(jc) + X!i /i(^) (■* e £*)• The central term is the normed direct sum of the
extreme ones, and
RlD)^+R{BD)®Q)R0(Bf)
i
is an isometry of normed spaces. By completion, we obtain the following general
result.
350 6. Analytic Functions and Elements
Theorem- Let D be a closed, bounded, infraconnected set, (Z?,)IG/ its family of
holes. Then there is a Banach direct sum decomposition
H(D) -^ //(Bo)0.G///o(Bf)-
Each f G H(D) can be uniquely expressed as a sum
/=/o + J]/i-I II/IId = max(||/oll, sup \\f§ ||),
iel
where
fo is an analytic element on the enveloping ball of D,
fi are analytic elements on B\ with f(x) —► 0 (x —► oo),
WfiW = WMsf = WMd ^ 0(i ^ oc). m
In particular, in the summable family (/j)ie7 of generalized principal parts of
an analytic element / e H(D), at most countably many f are nonzero (IV.4.1).
4.6. The Christol-Robba Theorem
A formal power series / = JZ«>o a"xn e Cp[[jc]] having bounded coefficients
converges at least for |jc| < I: rj > 1 but it does not always define an analytic
element on the open ball Mp. Let us determine the space H(MP) of analytic
elements on this ball.
Since H(MP) c H(B<r) for all r < 1, it follows that any / e H(MP) is given
by a power series JZ«>o fl«x" sucri mat \an\fn "^0(n-> oo) for all r < 1, hence
by a power series having a radius of convergence ry > 1. On the other hand, Mp
is closed and bounded; hence H(Mp) c C&(MP) (Proposition 1 in (4.2)).
Lemma. The subspace o/Cp[[jc]] consisting of the convergent power series f
having a radius of convergence rf > 1 and a bounded sum in Mp coincides
with the space l°° of formal power series having bounded coefficients: The
map
(««)«>o »-> £>„*" : £°° -* Cp[[x}} n Cb{Mp)
is an isometric isomorphism.
Proof. If (an)n>o is a bounded sequence,
l/WI
n>0
<sup\an\ (\x\ < 1),
4. Analytic Elements 351
hence ||/|| < sup„>0 \an\ with the sup norm of / on Mp. Conversely, assume that
the power series Yln>o a^xn converges for \x\ < 1 and has a bounded sum in this
ball. For a regular \x\ = r < 1 we have
Kr"|<Mr/ = |/(x)|<||/|| (/!>0).
Taking a sequence of regular r / 1, we infer \an\ < ||/||(« > 0), and consequently
sup„>0KI < 11/11- ■
The preceding proof works for any field K having a dense valuation: Compare
with (V.2.1), where the residue field k was assumed to be infinite.
The lemma shows that we have an isometric embedding
/ h> (a„)n>o : H(Mp) ->l°°= l°°(Cp).
The following theorem characterizes the image: It gives a criterion for a formal
power series with bounded coefficients to define an analytic element / e H(Mp).
Theorem (Christol-Robba). Let f = J2n>o fl«x" e Cp[[x]] be a formal
power series with bounded coefficients. Define pv = pv(pv — 1) (v > 1). Then
f defines an analytic element on Mp precisely when the following condition
(CR) holds:
For each e > 0 there exist v and N > 0
such that \an+Pv — an\ < e (n > N).
Proof. The proof is based on the Mittag-Lefrier theorem (4.5) for the bounded and
closed infraconnected set
D = Mp = Bsl - ]J B^tf) <ZBD = Bsl = Ap.
The condition is necessary. Let us write the Mittag-Lefrler decomposition of the
space of analytic elements H(Mp) on the open unit ball as
H(MP) = H(AP) © 0f //o((< + Mp)c)
with a sum parametrized by f e fi(p). The space H(AP) is the Tate algebra with
normal basis (x')f>o, and //o((f + Mp)c) has normal basis l/(x - f )m+1 (m > 0).
This proves that the family of functions
*"> ix-\r+i (n-°- *e^ m±°)
constitutes a normal basis of //(Mp). Let us show that each basis element satisfies
the condition given in the theorem. This is obvious for the powers xn (n > 0). On
352 6. Analytic Functions and Elements
the other hand, the rational function
1
(x -7r
/(*>=/- »V+I «€^'W^°)
(having a pole at the point f ^ Mp) can be expanded according to the binomial
formula
1 x-^ (~m —\\ „ „
by the elementary identity on binomial coefficients recalled in (4.4). We have
obtained
to V m J
Let us estimate the difference between two coefficients as in the condition (CR):
-"[("T'V-rr)]-
Now, since since pv is a multiple of pv — 1 and f pv_1 = 1, we have f ~Pv = 1
for v large enough (depending on f). On the other hand, jc h> (^) is uniformly
continuous on Zp, so that, uniformly in n,
( v I — ( ) is small if pv is small in Zp,
V m 7 V m J
which is the case for large u, since pv is a multiple of pv:
^[(■'■ri^-r.-")]-
dm-r.-)]- -v.
Finally, the conclusion will be reached as soon as we observe that (CR)
characterizes a closed subspace of £°°. Let (an) be a sequence in the closure of the space
satisfying (CR). If e > 0 is given, we can first find a sequence (an) satisfying (CR)
4. Analytic Elements 353
and with \an — an | < e {n > 0). Then
l«*+pv -fi«l < max(|fl„+/7v -an+Pv|, \an+Pv -an\, \an -an\)
< max(£, \an+Pv -an\).
This is less than or equal to e when n and v are large enough, by assumption on
the sequence (an). Hence (an) still satisfies (CR).
The condition is sufficient. Fix a positive integer N and consider the rational
function
8nM = > anxn + ^
n<AT
having all its poles in the set of roots of unity (on the unit sphere). We have
fix) - gN(x) = > anxn f .
~. 1 — xpv
The numerator is
(l~x*)(f(x)-gN(x))
= Yla»xn-xPvY.a»xn- H a»x"
n>N n>N N<n<N+pv
We have obtained
/to ~ 8N(X) =
1 -jc^
and since 11 — jc^ | = 1 for \x\ < 1, we have
l/to ~ 8nM\ =
^(aM+Pu-^K+^
< sup \an+Pv -an\ (\x\< 1).
With the postulated condition, ||/ — g^\\ < s. This proves that / is a uniform
limit on |*| < 1 of rational functions gx having no pole in this ball. ■
Examples. Here are three power series with bounded coefficients (hence a
bounded sum in |jc| < 1) that do not define an analytic element on Mp:
• X]n>0 x^ (follows from the Christol-Robba condition),
• exp7rx (\n\ = rp) (exercise),
• (1 + x)l/m (m > 1 not multiple of p) (book by A. Escassut).
354 6. Analytic Functions and Elements
Analytic elements
Formal
power series
power series
converging in
W<1
power series
bounded in
M<1
analytic
elements in
W<1
analytic
elements in
W<1
polynomials
Cp[[x]]
H(Mp)
H(Ap) = Cp{x]
CPM
Sequences
(««)«>o
limsupK|1/n < 1
(r/ > 1)
(«n)«>0
bounded
sequence
Christol-Robba
condition
(4.6)
(n -> oo)
an ^ 0 for
finitely many rc's
n>0
i™
CO
CTO
4. 7. Analyticity of Mahler Series
Theorem. Let f :ZP -> Cp bea continuous function with Mahler series
fM
-£•©■
77ie« / is the restriction of an analytic element f e Cp{x] iff \ck/kl\ -* 0.
Proof. Consider the triangular change of basis of the space of polynomials given
by
w* = £(-i)':-nP;l*n (*>o)
where the coefficients are positive integers: Stirling numbers of the first kind.
Conversely,
"■-EB*-*.
fc<n
where the coefficients are positive integers: Stirling numbers of the second kind.
Hence if / = J2nanxn — !C* ^* (*)* IS a polynomial, we have sup„ \an\ =
sup^ \bk\, and this isometry (an) h> (b*) extends to an isometric embedding of the
4. Analytic Elements 355
completion
Cp{x] —+ C(Zp-Cp),
^anxn h> y^M*)* = Ylck(iX
n>0 fc>0 fc>0 ^ /
where Zfy = Ck/kl. The assertion follows. ■
Corollary. //Y/ze Mahler coefficients cu o/7z continuous function f satisfy \ck \ <
crk (k > ko) for some c > 0 ara/r < rp, f/zen / is the restriction of an analytic
element f on the closed unit ball Ap of Cp.
Proof. Under the assumption, \ck/kl\ < crk/\k\\. Since the general term of the
exponential series ex tends to 0 when \x\ = r < rp, we have rk/\kl\ -> 0. The
conclusion follows by the Theorem in (4.3). ■
Example. Choose and fix an element t e Cp with \t\ < rp. According to the
preceding corollary, the Mahler series
-S'G)
fix)
is the restriction of an analytic element / e H(AP) = Cp{x}. I claim that the
analytic element in question is
f(x) = f**1+*\
Proof. The assumption \t \ < rp indeed implies (Proposition 1 in (V.4.2))
|log(l + r)| = U| <rp,
so that the series expansion
(*log(l+0)n
/iw = E
n>o n-
converges for |jc| < 1. Now, for integers m > 0,
(mlog(l+?))n
Mm) = £
n>o n-
= emlog(l+0 = ^ogd+O^ = (, + tyn = /(||f)
(we have used the identities ex+y = eKey; hence emx = (ex)m ande,og(1+,) = l+t).
By continuity and density,
/,(*) = f{x) = fix) ixeZp).
356 6. Analytic Functions and Elements
Since / — f\ is given by a power series expansion, it vanishes identically in its
convergence disk (a nonzero power series expansion has a discrete sequence of
critical radii with finitely many zeros on each critical sphere). ■
Since
fix) = £V(l) = (1 + tf (x e Zp),
k>0 W
for smaller values of x e Cp. Recall that
it is also clear that for larger values of t e Mp c Cp we can still define (1 + tf
:es of x e Cp. Recall th
pi~1rp<\]og(l+t)\
PJ
= pJ\t\PJ <p'rp
for
r^i-1<\t\<r^
(cf. (1.6)). Hence
l*|<r^=>|Iog(l+OI<pV
and it is enough to assume \x\ < 1/pi to have a convergent series expansion for
(l+t)x = ex}f*il+,).
Still, for |f | < rplp and \x — n\ < IfpK we can define
(1 + tf = (1 + tf - (1 + tf~n = (1 + tf • e(x-n)l0^l+t\
For these values of the parameter t e Cp, the function x i~> (1 + tf is a locally
analytic function defined in the neighborhood
VJ = U b<\/p>W = 'Lp + B\ip>
aeZp
of Zp in Cp.
Remark. The identity
e*los(1+f) = (l+f)*
leads to
4. Analytic Elements 357
hence to
Identifying the coefficients of xn we get the well-known classical identity
[iog(i + or
Hl k>n
= £(-1/-
tk
where the coefficients are again the Stirling numbers of the first kind.
4.8. The Motzkin Theorem
Let D be a closed, bounded, and infraconnected set, / e //(D)x an invertible
analytic element on D. Assume that B = B<a(a) is a hole of D (maximal open
ball in the complement of D). A Motzkin factorization of / relative to the hole B
is a product decomposition / = gfo), where
(l)seff(DUB)x,
(2) fw = (x - a)mh (meZ,he H(BC)* ),
h(x) -> 1 (jc -> oo), and \\h — \\\Bc < 1.
Remarks. (1) We have seen in (4.3) that an analytic element on Bc admits a
convergent Laurent series expansion. If it is not zero, we can write it as
£ aj(x — a)j = am{x — a)m - h(x).
j<tn
Here h(x) = 1 + b\/(x — a) + - • - {amb\ = <zm_i,...) is invertible if it does
not vanish on Bc, i.e., if it has no critical radius greater than or equal to a. Since
h(x) — 1 -* 0 (x -* oo), Mrh (r > o) decreases, Mrh \ 0 (r -* oo), and
II* - llluc = MrAh - 1) = max |^|/W < 1.
(2) When \\f — 1||D < 1, then / is invertible. In fact, since D is closed and
bounded, H(D) is a closed subalgebra (Banach subalgebra) of Q>(D) (4.2). Hence
the geometric series expansion
1 -— = Ya-fT
f l-(l-/)
«>0
converges (in norm) in //(£>). More generally, if / e H(D) and ||/ — /(«)|| <
\f(a)\ for some a e D, then f/f(a) (hence also /) is invertible.
(3) The existence of a Motzkin factorization (with respect to a hole Z? = B<0)
requires Mc / > 0: The growth modulus function is multiplicative:
Mof = MGg.MG(fw)>0.
This condition is also sufficient, as we are going to prove.
358 6. Analytic Functions and Elements
Theorem. Let D be a closed, bounded, and infraconnected set, B = B^ a
hole of D. Then each f G H(D) satisfying \\f — 1\\d < 1 admits a unique
Motzkin factorization (with index m = 0)
f = g-h, \\h~l\\Bc < 1, h(x)^ l(x^oo).
Proof. Let (fn)n>o be a sequence of R(D) converging uniformly to /. Since
\\fn -l\\D< max(||/„ - f\\D, ||/ - 1||D) < 1
for large n, we can disregard the first few values of n and assume \\fn — 1 \\D < 1
for all n. By the rational Motzkin factorization result (3.5) we can write
fn=gn- K (hn = (fn)(B) ~+ 1)-
Now set P(f) = /#, the principal part of a rational function / with respect
to the hole B (as in the Mittag-Leffler decomposition) and Q(f) = fa) the
Motzkin factor of / relative to the same hole. Obviously, P(f) = P(f — 1) and
Q(fn/fm) = Qfn/Qfm (the Motzkin factorization of rational functions is simply
obtained by gathering the linear terms corresponding to the zeros and poles in B).
The norm estimate in (3.5) for the function fn/fm leads to
\\Q(fn/fm)- lib = \\hn/hm - l\\D = \\ P (fn /fm)\\ D-
By the proposition in (4.5),
\\K/hm - l\\D = \\P(fJfm)\\D = \\P(fn/fm ~ DIlD
Hence we have
\\hn/hm -\\\D = \\hn/hm - \\\BC = Ma
-<M"(^)
and by multiplicativity of the growth modulus
Ma(hn - hm) Ma(fn - fm)
Mahm ~~ Mafm
But since \fm - 1| < 1, we have \fm\ = 1 (on D) and MGfm = 1. The same is
true for hm and /. We have obtained
II An ~ Am II* - Ma(hn - hm) < Ma(fn ~ fm) < ||/„ ~ frnWa, (*>
which proves the uniform convergence of the singular (Motzkin) factors hn to
h e H(BC). Since \\h - \\\Bc < 1, we even have h g H(Bc)*. Moreover, we have
J n Jm
I Jm
(hn-hm\
Exercises for Chapter 6 359
a uniform convergence
— -► - e H(BC) (n -* oo),
hn h
which implies a convergence
fn f
hn h
in H(D). The maximum principle on the ball B and Proposition 2 in (4.2) give
llgii ~ Smlls < Ma(gn - gw) < \\gn - gm\\D,
hence
llgn - gmWDUB < \\gn ~ gmWo "> 0
when m,n -* oo, so that g„ -> g e H(D U #). Since
gn Jn J g
uniformly (first on Z), but also on D U B by the maximum principle), the function
g is a unit of H(D U Z?) and does not vanish in DU B.
Let us prove uniqueness. If f = gxh\ = #2^2 are two decompositions, then
£1. *1 = 1
is a Motzkin factorization of / = 1, and it is sufficient to prove the uniqueness in
this simple case. Assume that gh = 1 is a Motzkin factorization and choose
gn^g (gneR(DUBr), hn~+h (hneR(Bc)x).
Then the inequality (*) for the rational functions hn and 1 gives
IIAn - 111^ < WgnK " 1 lb "► 0 (#1 -► OO).
This proves hn —► 1 (n —> oo), /z = lim„_>oc /zn = 1, and g = 1. ■
Exercises for Chapter 6
1. (a) Show that ^[X][[y]] ^ tf[[K]][X] (consider £-<y ^'" W).
(Z?) Give a description of the fraction field K((X)) of A'HX]] using Laurent series of
order > — oo.
2. Using the definition of the product of formal power series prove the identity
D(fg) = gDf + fDg age K[[X]])
for the formal derivative of a product.
360 6. Analytic Functions and Elements
3. (a) Let /, g e K[[X]] be two convergent power series with g(0) = 0. Assume that
the value group \KX\ is dense in R>o- Prove that the numerical evaluation result
(/ ° g)(x) = f(g(x)) is valid in the ball |jc| < r of K9 provided that r < rg and
g(Bsr)cB<zrr
(b) Show that the radius of convergence of a composite fog satisfies
r/og > min (rg, sup{r : Mrg < r/})
and Mr(f og) = MMrg(fl
4. Let f(X) e K[[X]] be a convergent power series and fix a e K, \a\ < /y. Let
X = a + Y and /(X) = f(a + Y) = fa(Y) (this substitution g(Y) = a H- K is not a
substitution of formal power series of the type considered in the theory, since co(g) = 0).
Show that the following double series is summable when X is replaced by an element
x e AT, |*| < r/:
Reorder its terms to obtain another proof of the proposition in (VI. 2.4):
f(a) + Df(a)Y + ^D2f(a)Y2 + - - -.
Deduce that the radius of convergence of g is at least equal to r/. Interchanging the roles
of/ and g, conclude that r/ — rg. [By contrast to the classical case, it is impossible to
obtain an analytic continuation of / using a Taylor series centered at a different point.]
5. Let p be an odd prime. There is a sequence (an)n>o in Cp with
2^r<kl<jr («>0).
What is the radius of convergence of the power series / = J]«>o anXn?Show that the
sphere \x\ — r/ in Cp is empty (the corresponding closed arid open balls coincide).
What can one say of the convergence of / on the sphere \x\ = r/ in Qp?
6. Take K = Qp and consider the formal power series
f(X) = Txn = T^—i g(Y) = Y-YP.
Find the power series representing the composite /og, which is the rational function
1
I - X + XP
when p = 2. In this case, the two roots f,??ofl — X + X2 = 0are easily determined.
Give the power series expansion of (1 — X + X2)~] explicitly using the partial fraction
decomposition
1 a b
1 - X + X2 ~ X-f + X-n
The coefficients of the corresponding power series are periodic mod 6.
(Hint. Note that f and r] are 6th roots of unity.)
Exercises for Chapter 6 361
7. Show that
sin*
tan* = (|*| < rp\
cos*
*3
arctan* = * - — + — =f ■ - - (M < 1),
are inverse functions for |* | < rp. L. van Hamme has suggested the following extension.
Choose i e Cp with i2 = — 1 and use the Iwasawa logarithm to define
arctan* = --Log (* ^ ±i).
2i 1 — i*
Using
zi (arctan* — arctan a) = Log : Log-
? 1 + ia 1 — ia
prove that if a point a e Cp is selected, then arctan * is given by a series expansion
valid in the ball
|* — a\ < min(|a — i\, \a +1|).
Prove that
arctan 1 = 0, lim arctan* = 0.
Ul^oo
8. (a) Let f(X) = ^2n>oanXn be a convergent power series and assume that the set
{|««k/} is unbounded. Prove that there exists an infinite sequence of critical radii
n / rf.
(b) Let (an)n>o be a sequence in Cp with
1*01 < |ail < ■■• < \an\ < -■■ < I.
Prove that the formal power series f(X) = X^>o fl« ^w defines a bounded function
in |*| < r/ with infinitely many critical radii converging to r/.
9. Prove that for any ultrameiric field A', 1 + XK[[X]] is a multiplicative group, and for
r > 0,
U>o J
is a multiplicative subgroup of 1 + XK[[X]].
(Hint. For r = I, the subgroup Gi is simply 1 + XA[[X]]; use dilatations to get the
general statement.)
10. Prove the Liouville theorem in the case K has a discrete valuation but an infinite residue
field.
11. (a) Show that r > 0 is a regular radius for f\, f2,..., /„ iff it is regular for fifz-'-fn-
(b) Let /, g e K[[XJ] be two convergent power series. Assume f(xn) = g(xn) for a
convergent sequence xn -> *oo(*n 7^*00 for all n > 0), where |*oo I < rnin(/yf rg).
Show that / = g.
(c) Formulate and prove a statement analogous to (b) for Laurent series.
362 6. Analytic Functions and Elements
12. Let K = Qpiftp^)- This is a totally ramified extension of Qp, and |Kx | is dense (the
residue field of K is ¥p). Give an example of a polynomial / for which ||/||Gauss =
suPjc<i I/Ml *s not a maximum.
13. Let / denote the Taylor series at the origin of
X4 + (p2-l)X2-p2
1-pX
What is the canonical factorization / = cPQgi (Theorem 2 in VI.2.2) of this formal
power series? Draw the Newton polygon of /.
14. Show that the formal power series Y^n>i PnlXn defines an entire function. What is the
location of its zeros? Give the form of an infinite product that represents this function.
15. Give the Laurent expansions of the rational function
x
(x - l)(x - p)
valid in the region \x\ > 1. Same question for the region 1/p < |jt| < 1.
16. Let f(X) = Y^n>oanXn be a formal power series with coefficients \an\ < 1. Consider
the map g : Mp — {0} -> Cp - Ap defined by y = g(x) = f(x) + 1 /x. Prove that g
is bijective.
(Hint. To show that g is surjective, proceed as follows. For given y with |y| > 1
we are looking for a solution x of* = j~2(l + xf(x)). Show that the sequence
defined inductively by xq = 0, xn+\ — y_1(l + xnf(xn)) is a Cauchy sequence with
\xn\— \/\y\ < 1, and hence it converges in the open unit ball Mp.)
17. What are the critical radii of the polynomial
xn+\ +xn + ... + x2 + \_x 4- 1 (n > 1) ?
How many roots are there on each critical sphere?
18. Let / : R>o -> R>o be a C2-function and define log /(r) = <p(p), where p = log/"-
Show that <p may be convex even when / is not convex (consider /(r) = yfr).
(a) Consider the functions fa(r) = ra (a e R) and discuss the convexity of fa and
the corresponding <pa.
(b) Prove r2f"(r)/f(r) = <p"(p) + <p'(p)(<p\p) - 1) and deduce that if <p is convex and
ipf does not take values in (0, 1), then / is convex.
19. What is the Newton polygon of the polynomial
px3 - (Ap1 + l)x2 + (4p3 + 4p)* - 4p2 ?
(a) Compute the absolute value of the zeros of this polynomial.
(b) Factorize the polynomial and compare with the result obtained in (a).
20. (a) Show that the logarithm log : 1 + B<Tp -* B<Tp is surjective. More precisely,
show that for each \y\ — rp there are exactly p preimages jc,- with |jc,-| = rp and
log(l +xt) = y (but if \y\ < rp, there are only p — 1 preimages).
(b) Draw the valuation polygon of the formal power series of log(l + X).
Exercises for Chapter 6 363
21. What are the Newton polygons of the Chebyshev polynomials 73, T& T$ (any prime p)
(cf. exercises for Chapter V). Same question for T],Tp, Tpi,..., Tpn.
22. Let f — g/h e Cp(x) be a rational function. Assume that / has a zero and a pole on
a sphere |jt| = r. Show that |/| assumes all values (in p® = |C* |) on this sphere.
23. Generalize the mean value theorem (as given in (2.4)) to the case of a parametrized
curve / h-> (/(?), g(t), h(t)) by considering the determinant
<&(t) =
f(a) /(f) f(b)
g&) g(0 gib)
h(a) h(t) h(b)
which vanishes for the two values t —a and t — b.
24. (a) Draw the graph of the growth modulus Mrf of the following rational functions:
-x (l-xY 1 -xn
Tx% \TTx) ' i + xw
Can one guess the location of the zeros and poles of a rational function by the sole
observation of the graph of Mr fl Sketch the graph of the growth modulus Mr^ \ /,
centered at the point 1, for the same functions,
(b) Draw the graph of the growth modulus Mrf of
X \-pX
\-X\- p2X'
25. Give the principal parts Pf of the rational functions
x2+x + l 2jc2-;c-1
f
2x2
at the origin. Take a > 1 and consider a region D := {a < |*| < r}. Compare IIP/Hd
and||/-l||i>.
26. With the notation of (3.5), let / = cYlaGs^x ~ a)tla- Prove that the principal part
Pi(f'/f) with respect to some ball £,- is f[jfu where fi is the Motzkin factor of /
relative to fl,. If ||/-1|Id < 1, use (3.4) and prove ||/'||D = max0<,<£ |U'|D-
27. Fix r > 0 and choose c e £lp — Cp. Then
/ h+ MrAf)
is a multiplicative norm on Cp(X). lfS= dist (c, Cp), show that
inf Mrc(X -a) = 8.
In particular, the general inequality infaec Mr^c(X ~a) > r can be a strict inequality.
28. Show that the union of two infraconnected subsets having a nonempty intersection is
infraconnected.
364 6. Analytic Functions and Elements
29. The subsets of Cp
£<1A(0) n{\x-a\>e: for all a e Qp]
are infraconnected. What are their enveloping balls? What are their holes? Conclude
that Cp — Qp is a union of an increasing sequence of bounded infraconnected sets, each
of them having finitely many holes.
30. (a) Let B be a dressed ball and D a closed and bounded subset of Cp so that
\\/\\b = II/IId for all polynomials /.
Prove that D C B and B = Bd is the enveloping ball of D.
(b) Let D be closed and bounded. Prove \\f\\BD = \\/\\d for / e H(BD).
{Hint. Choose g e R(BD) such that \\f - g\\B[) < ||/||D.)
31. Let D = B<r — LIi<f<£ Bi and B C D any nonempty open ball. Show that the
restriction map H(D) -* H(B) is injective.
(Hint. Use the Mittag-Leffler decomposition to show that each / e H(D) is described
by a power series expansion in B and hence has isolated zeros if nonzero.)
32. The simple domains of the form D — B<r — \\i<i<Li B( can be patched together. For
example, if the hole B\ of D has radius r\, show that any D\ = B<n — Llxy^, Bj
has a nonempty intersection with D and the union DU D\ is again a simple domain of
the same form.
33. Choose a sequence rn / 1 (rn < 1) and let 5W = {|;c| = rw}. Show that
D = B<} -\JSncCp
n
is infraconnected. Moreover, if B c D is any nonempty open open ball, show that the
restriction map H(D) -> H(B) is injective.
(Motzkin calls analytic a set Z) having this property. If D' = D U ^<i(l), it can be
shown that the restriction //(/)') -> H(B<i(l) is «of injective; hence D' is not an
analytic set; this is again the phenomenon of 7"-filters.)
34. Find domains of uniform convergence for the following sequences of rational functions:
xn xn x2n
l-xn' l-x2"' \-xn'
35. Let 0 < e < 1 and D = {x e Cp : |jc| <£ [1 - s, 1 + £]}. Consider the sequence of
functions
fnW = —^- e R(D).
I — xn
This sequence converges in H(D): The limit / ^ 0 is a zero divisor (a nontrivial
idempotent). Conclude that for any D e Cp having at least two points, and which is
not infraconnected, H(D) is not an integral domain.
(Hint. Choose a ^ b e D and an annulus {0 < r\ < \x — a\ < r2 < \b — a\] that
does not meet D and use a sequence similar to the one above. It can be shown that
infraconnectedness is also a sufficient condition for H(D) to be an integral domain.)
Exercises for Chapter 6 365
36. Show that the series X^>o Pn/(x ~~ n) converges for x e Cp —Zp and that it defines
an analytic element on the subsets Dn — {x e Cp : \x — a\ > \/n Va e Zp}.
37. Show that the series
£
converges for x £ {0} U p and defines an analytic element in the complement De of
any (finite) union of balls of the form Un>o B<e(pn) (£ > 0).
38. Show that the exponential is not an analytic element on its convergence ball.
(Hint. Let n be a root of Xp~l + p — 0, so that the convergence ball of f(X) — eIZX
is 1. Then use the Christol-Robba criterion (VI.4.6).)
39. Let pv = pv(pv — 1) and <pv(x) = 1 — xPv. Check the following assertions:
(a) <pv(x) -> 0 (v -> oo) for any |x| = 1.
(« l^vWI = l,if |jc| < 1, Mi(<pv)= 1.
(c) sup|x|=i te>v(x)| = 1.
40. (a) Let £> C Cp be closed, bounded, and infraconnected. Assume / e H(D) and
11/ - 1 ||d < 1. Prove that log / e H(D).
(b) If / e R(D)X, ||/ - 1||d < 1, and / = []i fi is the Motzkin factorization of
/ (with respect to a finite family of open balls Bj as in (3.5)). show that log / =
5Zi 1°8 / *s tne Mittag-Leffler decomposition of log /.
41. Let 0 < r < 1 and (an)n>\ a sequence on the unit sphere with \an — am \ ~ 1 whenever
n t£ m. Define
£0 = £<r(0), Bn = B<r(an\ D = Cp-Un>0Bn.
Choose a sequence \n -> 0 and consider the rational functions
gnM=l + J*-.
Show that ||g„ — l||fic < \Xn\/r -+ 0. Conclude that Y\n>i Sn converges uniformly
on D, x2 n„>i g/i € //(/))*, but the sequence /v = *2 Ili<n<7v £« is not uniformly
convergent on D.
(Hint. Observe that /#+ \ — fa is not bounded on D due to the presence of the unbounded
factor x2.)
\x-p" xj
7
Special Functions, Congruences
The applications given in this chapter concern congruences.
They rely on the first two sections of the preceding chapter (convergence
of power series, growth modulus, critical radii). The more technical notion of
analytic element developed in the last two sections of Chapter VI is not used
here.
1. The Gamma Function Tp
The special functions of classical analysis are defined by a variety of methods:
series expansions, differential equations, parametric integrals, functional equations,
etc. We have seen in (V.4) that the power series method is well adapted to the
definition of the exponential and logarithm in a suitable ball of Cp.
Here is another method adapted to p-adic analysis. Let / be a classical function
defined on some interval [a, oo) c R with rational values f(n) e Q on the integers
n > a, we may look for a continuous function Zp -> Cp extending n i~> f(n).
By the density of Z f! [a, oo) in Zp, there is at most one such interpolation. Of
course, this possibility requires arithmetic properties of the sequence of values
f(n) and the method works only in particular cases. A suitable modification of
the function n i~> n! will lead to an analogue of the classical gamma function.
Another successful example of this method (not treated here) is the Riemann zeta
function, using its values at the negative integers.
To simplify our considerations, we assume first that the prime p is odd: p > 3-
The case p = 2 is treated later in (1.7).
1. The Gamma Function Tp 367
/./. Definition
The function n h> n\ cannot be extended by continuity on Zp. Indeed, let us look
for a continuous function
/ : Zp -+ Qp
satisfying /(«) = nf(n — 1) for all integers n > 1. By continuity and density the
same relation will hold for all n e. Zp. Iterating it, we get
/(„) = n(n - lXw - 2) • - • pmf(pm - 1)
for all integers n > pm, where pm is a fixed power of p. Since / is continuous
on the compact space Zp, it is bounded and there is a constant C > 0 such that
l/C*)l < C (x e Zp). The preceding factorization also shows that
\f(n)\ < \p\m • C
for all integers n > pm. But these integers make up a dense subset of Zp; hence
ll/lloo<lpf -C.
Since the integer m > 1 is arbitrary, the only possibility is 11/1100 = 0. (The single
consideration of the case m = 1 is sufficient: Taking C = ||/||oo we get ||/||oo £
IpI ll/lloo, (1 —l/p)ll/lloo £ 0.) The only continuous function / on Zp satisfying
the functional equation f(n) = nf(n — 1) for all integers n > 1 is / = 0.
The trouble obviously comes from the multiples of p in the factorial n\: Let us
omit them and consider a restricted factorial n\*
n!*:= ft J-
The key to the construction of the /?-adic gamma function lies in a generalization
of the classical Wilson congruence
(p — 1)! = —1 mod p.
Proposition. Let a and v > 1 be two integers. Then
Y[ j = -l (mod//).
a<j<a+pv, p\j
Proof. The integers a < j < a + pv make up a complete set of representatives
of the quotient Z/pvZ. Those that are not multiples of p represent the invertible
elements, namely the elements of the unit group G = (Z//?UZ)*. Grouping each
element g e G with its inverse g_1 we obtain compensations in the product except
wheng = g_1. But
368 7. Special Functions, Congruences
In the ring Z/pvZ we can write
g2 = 1 <=► g2 - 1 = 0 <=^ (g - l)(g + 1) = 0.
The elements in question are g = ±1, or both g — 1 and g + 1 are zero divisors.
The second case corresponds to
p divides both g — 1 and g + 1.
Obviously, this second case can happen only when p divides 2 = (g +1) — (g — 1),
i.e., p = 2. Since we are considering only the case p odd, this does not occur, and
the proposition is proved. ■
The proposition implies that the products
/(*):= (-1)" Y\ j (w>2)
satisfy f{a) = f(a 4- pv) (mod pv). More generally, they also satisfy
f{a) = f{a+mpv) (mod pv) (m e N).
The function a i-> f(a): N — {0,1} -► Z is uniformly continuous for the p-adic
topology, hence has a unique continuous extension 7jp ->7jp.
Definition. The Morita p-adic gamma function is the continuous function
1 p - ^p ~* ^p
r/zar extends
f(n):=(-l)n ]1 ^* («^2)'
Observe that by construction, this p-adic gamma function takes its values in the
clopen subset Z£ of Zp.
Since its definition depends on the prime p, this function is denoted by Vp. (But
as with the functions log and exp, we might simply denote it by T when the prime
p is fixed and there is no risk of confusion.)
1, Tp(3) = -2,
fn\ if n odd, n < p ~ 1,
—n\ if n even, n < p — 1.
7.2. Basic Properties
We have
rp(2) -
Vp(n 4-1) =
1. The Gamma Function Tp 369
From the definition it also follows that Tp{n) e Z* is given by
_ (~l)n+lnl _ (-l)n+ln\
when the integer n is greater than or equal to 2. Still, by its definition, we have
; not multiple of p,
> multiple of /?,
and by continuity, more generally,
rp(w + l)
I— nTp(ri) ifrcisi
—Tp{n) if n is ]
I-xTp(x) if x e Z*,
-rp(x) ifxePZp.
It is convenient to introduce a function hp:
\ -X if
hp(x) =
-x ifxeZ* (|*| = 1),
1 if x e PZP (\x\ < 1),
in order to be able to write the functional equation
rp(x + 1) = hp(x) - rp(x) (x e Zp).
This functional equation can be used backwards to compute the values Tp(l) and
Fp(0) from Tp(2) = 1. In particular, we check that rp(0) = 1. This normalization
also follows by continuity: By the proposition in (1.1) (with a = 0),
hence Tp{pn) -> 1 as n -> oo.
Theorem. For an odd prime p, the p-adic gamma function Tp : Zp -> Qp is
continuous. Its image is contained in Z*. Moreover:
(i) rp(0)=if rp(i) = -i,rl,(2) = if
Tp(n + 1) = (~ir+1«! (1<«< p).
(2) |r„(jc)| = L
(3) \rp(x) - rp(y)\ < |* - h irp(x) -1| < m.
(4) Tp(x + 1) = hp{x)Tp{x).
(5)rp(x)-rp(\-x) = (-\)R^,
where R(x) e {1, 2,..., p], R(x) = x (mod p\
As we shall see in (1.7), the property (3) has to be suitably modified for p = 2.
The exponent in (5) is also different in this case.
370 7. Special Functions, Congruences
Proof. (3) follows from r(a + mpv) = F(a) (mod pv) (aeZ, m eN) by
continuity.
There only remains to prove (5). Put f(x) = rp(x) ■ Tp{\ — x). We have
/(* + 1) = hp(x)rp(x) - rp(-x),
and since rp(—x) = rp(l — x)/hp(—x),
f(x + 1) = e(x) - /(*), £(*) = fcpW/M-4
Now,
(-1 if|*| = i,
[+1 if|jc| < 1.
Take for x an integer « and iterate,
f{n + 1) = £(w) - /(w) = • • • = (-1)#/(1),
with an exponent # equal to the number of integers j < n prime to /?. Since the
number of integers jf < « divisible by /? is [«//?], this exponent # is « — [n/p].
Hence
/(„ +1) = rpin +1) - rp(-w) = (-iy-t«/ri. rp(i)rp(0) = c—iy^,"1"I"/ri-
^ -y - V
"I
To find the formula given in (5), let us take x = m = n + 1 (integer), whence
1 — wi = — h and
rp(m) - rp(i - aw) - rp(w +1) • rPi-n) = (-ly^1-^.
With the expansion of the integer n in base /?,
« = wo + wi/H = n0 + p —
we infer
Since we assume p odd — hence p—\ even — this proves that n ~- [n/p] has the
same parity as no:
(_iyH-Mn/pl _ / jyio+i
Since m = w+1 = «o+l (mod p) and «0+l is in the correct range {1, 2,..., p;»
we have /?(ra) = w0 + 1. and the formula is proved for integral values x = rn of
the variable. By density and continuity, it remains true for all x e Zp. ■
1. The Gamma Function Tp 371
Comment The classical T-function satisfies the Legendre relation
iwa - z)
SIU7TZ
which implies for z = \
ra)2-7r, rd) = jz.
Hence we can say that in Qp, an analogue of the number jt could be taken as
Yp(\f = (-\fP+l)l2. In particular, if/? = 1 (mod 4), rp(\) = V^T is a
canonical square root of — 1 in Qp. This canonical imaginary unit can be identified easily.
In the case p = 1 (mod 4), the Wilson congruence
(p-l)!=^^-LYi2 = -l (modp)
shows that ( £—■ j! mod p is a square root of — 1. Since (p + l)/2 = \ (mod p),
the point (3) of the above theorem gives
M|), r,(£±i) = ,-,)<— (ti)i = - (ti), ,„,„, „>.
7.3. 77ie Gauss Multiplication Formula
The classical gamma function satisfies the identity
]1 r (z + -) = (2n)lm-1)/2mll-2mz)/2 ■ r(mz) (« > 2).
which is the Gauss multiplication formula. It is remarkable that rp satisfies a
similar relation.
Proposition. Let m > 1 be an integer prime to p. Then
0 rp(X + m)=£m' m'~R{mX) • (^P~2)5(mX) • r^diuc),
0<y<m
where
0</<m v '
0<y<m
i?(y)€ {l,...,p}, i?(y) = y mod p,
s(y) = € Zp.
372 7. Special Functions, Congruences
Proof. Let
0< / <:m ^ '
0<y<m
G(x) = Gm(x) = f(x)/rp(mx).
We have to compute the Gaussian factor G(x). Start with
C(jr + 1/",)=rv^M),n„r'(' + ^)
■r-f^ n r^'/»»
hp(mx)rp{mx) Tp{x) ^j<m
hP(x)
G(x).
hp(mx)
Consider the locally constant function
k(x) =
hp(x) _ j -x/(-mx) = \/m if |*| = 1,
hp(mx) [-l/(-l) = l if|jc| < 1
(since (m, /?) = 1). This multiplier is useful to compute the successive values
G(1/ih) = X(0)-G(0),
G(2/#n) = k(0)k(l/m) - G(0), ...
GO"/m) = f[ X(i/ih) • G(0).
0<i<7
Since (m, p) = 1, we have
f] k(i/m) = (l/m?
0<i<y
with an exponent
# = #{i prime to p, 0 < f < j]
= 7-1
M-
Let us find a convenient form for this exponent. Start with the p-adic expansion
of the integer j — 1:
j = 0'-Do+l+/>
=:*(/>
1. The Gamma Function Tp 373
where R(j) = j (mod p) is in the correct range {1,..., /?}. This proves
y-.-[^]-«n-.+(,-.)[^i].
and hence
with
CKl-c/"
*0')
rzzii - j - ru)
i p J- p •
an expression that admits a continuous extension to Z
p
We have proved
R(x) -x
s(x) = U 6 Zp).
0<i'<7
and
ml~RU\mP-lyOK G(0),
G(x) = ml-R^x\mP-lf^^ . G(0)
for integral jc = y/iw (y = mx > 0 multiple of aw). By continuity, the last formula
will also hold for all x eZp. This proves the expected formula
f[ rpl* + iW = £- • ™l-R{mx\rnp-lf{mx)rp(mx)
0<j<m
withem = G(0). ■
Finally, let us observe that em = G(0) is always a fourth root of unity.
Lemma. We have £* = 1. In fact £„ = 1 except when p = 1 (mod 4) and
m is even, in which case e^ ~ — 1.
Proof. When m is odd,
^ = iyi)...ry^)
since rp(0) = 1. Now we can group pairs
r„ I - I r„ ['—±-) = ±1
\mJ \ m J
374 7. Special Functions, Congruences
(by the analogue of the Legendre relation). In this case £m = ± 1. Assume now m
£m = il p\2'' £m == ^ p\2' '
even. The same grouping leaves the middle term Tp(^) solitary:
As we have seen in (1.2), Tp{\)2 = —1 when p = 1 (mod 4), so that em is a
square root of — 1 and
£m = G(0)= f] rp(i)
0<y'<m
is a root of unity of order 4. ■
1.4. The Mahler Expansion
If / is a continuous function on Zp, it can be represented by a Mahler series
ft*)=y>*(Tl A*=(v*/xo).
We have shown (Comment 2 in (IV. 1.1)) that these coefficients are linked to the
values of / by the identity of formal power series
vk yn
Proposition. Let Tp(x + 1) = J2k>o ak (*) be the Mahler series ofTp. Then its
coefficients satisfy the following identity:
^ .+1 xk \-xP ( x*\
2J-1? "krr = -: exp (x + — .
U k] l~x V pJ
Proof. Let us compute e~x<p(x), where <p(x) = ^n>0Fp(n + l)*"/**!. For
this purpose, we make a partial summation over the cosets mod p:
O^P^ («*> + ■/)-'
Here, we can use
(-1)"+1«!
Tp(n + 1):
for « = mp + j, [n/p] = m and get
rp(mp +j +1) = (-ir^+l • (T+-)!-
m\pm
1. The Gamma Function Tp 375
We obtain
xmp+j
*<*)= £ D-1*
,wp+y+i:
0<j<p m>0
W2!/7'
!nm
Finally,
= _exp(^) £(-l)V
expV p / i-(-*)"
whence the desired formula.
7.5. 77ze Power Series Expansion of log Tp
We shall use the following formula (V.5.3) for the Volkenborn integral:
S(f')(x) = f [f(x + jO ~ /(>>)] rfy (* e Zp)
with the function
/(*)
xLogje — x if |jc| = 1,
0 if|Jc|<l,
Log, ifW = l
J [0 if|jc| < 1
= LogfcpU).
Here, Log denotes the Iwasawa logarithm (V.4.5): It vanishes on roots of unity, so
that Log (—x) = Log jc. This implies that the function / is odd, so that fz /(f) dt=0
(Corollary of Proposition 4 in (V.5.3)), a fact that we are going to use presently. On
the other hand hp still denotes the function occurring in the functional equation
rp(x + 1) = hp{x)Tp{x)\
hence
VLog rp(jc) = Log rp(x + 1) - Log Tp(x) = Log hp(x).
376 7. Special Functions, Congruences
Since SVf = f - /(0) and Log Tp(0) = Log 1 = 0, we infer
Logrp(x) = SLoghp(x).
The above formula for the Volkenborn integral with /' = Log hp is now
Logrp(x) = S(Loghp)(x) = S(f')(x)= [ f{x + y)dy
= / [(*4y)Log(x4y)-(*4y)]dy.
For the computations, we come back to
Log (x 4 y) = Log y + Log (1 4 x/y),
and Log(l 4- x/y) = log(l 4 */y) is given by the series expansion if \x/y\ < 1
(e.g., x e pZp and \y\ = 1). Since (1.2) \rp(x) — 1| < |jc|, we also have
[Log r„(p*)| = |rp(/«c) - 1| < \px\,
since |/?Jt| < rp.
Theorem. For x e. pZp, we have
Log r,(x) = A0 x - £ \ x2-1,
^ 2ra(2ra 4 1)
where
k0= \ Logtdt, km = / f~2,ndf (in > 1).
./z* Jz*
TTie radius of convergence of the power series is 1, and this provides a
continuation of Log rp in the open unit ball Mp G Cp.
Proof. The preliminary considerations already prove that
Logiy*)
(x 4 y)Logy - x - y 4 [x 4 y)^-!)""1-^
rfy.
-L
which is equal to
x Logydy+ I (yLog y - y) dy 4 / (-* 4 (x 4 y) V) ' ) ^
Jz* Jz* Jz* v 7
=A0 —0. previous comment
1. The Gamma Function Tp 377
Here is the elementary computation for the series appearing under the integral
sign:
or+v^-ir'-^ = 5>ir-'^ + 5>ir'-
«>i
«y
«>l
71+1
«-l
«yJ
(-ly1-1*"-1-1
^ «r £r (w + Dy
«>1
(-1) I )
1 JC
7I>1
w(w + 1) yn
Now, for odd n > 1, the function equal to 0 on pZp and to 1/y" on Z* is odd
with a vanishing derivative at the origin. This shows that /zx y~n dy = 0 for odd
w > 1 (Corollary at the end of (V.5.3)). There remain only the even terms
Logrp(x) = k0x-^2
r'kn+l
-[ 2m(2m + 1)
with A.Q = /Zx Log r df, and km = fz* t lm dt (m > 1) as asserted. We can
deduce an estimate of these coefficients. If /„ denotes the function equal to 0 on pZp
and to x -2b on Z *, we have || /„ || i = 1 (cf. (V. 1.5)), as follows from
\*fn(x,y)\ =
x—2n v~2n
x-y
v2n 2n
yln _ y2n
(1*1 = \y\ = U.
x-y
x2ny2n(x - y)
ly2n-1 + ---+x2w-1|<l (|*| = |y|=l).
This proves \kn\ <p (Proposition 1 in (V.5.1)) and pkn e Zp (n > 1). The
isometric property of the logarithm on 1 + pZp makes it easy to prove that the norm
||. || i (V. 1.5) of the function equal to 0 on pZp and to Log on units is 1: This proves
Uol < p also. But we can prove a more precise result directly (cf. below). We have
seen (Proposition 3 in (VI. 1.2)) that the radius of convergence of a power series
/ is the same as the one for its derivative /' and hence also for /". Let us apply
it to
f(x) = Ipx - }
„ 2/1+1
^2«(2«+l)
378 7. Special Functions, Congruences
and
f"(x) = -Y^Kx2n-1.
n>l
Since the coefficients kn are bounded, we infer ry» > 1. Finally, jy» < 1 comes
from the fact that | kn \ -/> 0: We show this in the next lemma. ■
Recall that the Bernoulli numbers (V.5.4) are given by the Volkenborn integral
h=L
xkdx (*>0).
zp
Lemma. For n > 1 we have kn ~ b2n (mod Zp). Moreover, |A.o| < 1 and
\kn\ = p for all integers n > 1, such that 2n is a multiple of p — 1.
Proof. (1) We have
r'„(0)
k0 = (iogrp)'(x)u=0 = ^ = r;(0),
and since we have seen in (1.2) that T p(pn) ~ 1 (mod pn) we infer
rp(Pn)-\ rp(pn)-rp(0)
Pn /771
whence
r;(o)=iimW^i^)ezp
y «-»oo pn *
and |Ao| < 1.
(2) The units of the ring Z/pmZ are represented by the integers 0 < j; < pm
that are prime to p. The involution u i-> w"1 on these units shows
i<j<pm, pM i<y<pm, pt/
Dividing by /?m and letting ra -> oo, we obtain by definition (V.5.1) (adapted to a
function vanishing outside Zp
K= I r2ndt= [ t2ndt (modZp).
Jz*p Jz*
(3) Start with
f t2ndt= f t^dt- f t^dt
JZp Jzp JpZp
1. The Gamma Function Tp 379
Let us compute explicitly the second integral:
[ t2ndt = lim — Y j2*
I lim _L. y m2n 2
P Jzc
l<m<pn-x
t2ndt = pln~xbln.
(Observe that this computation proves that in the Volkenborn integral over pZp
we could have replaced formally t by ps with d(ps) = \p\ds = (\/p)ds\) We
have obtained
f t2ndt= f t2ndt-p2n~l f s2nds
J Lin J Lin J LiD
Zip *£ip
= (1 - p2"-1^ = b2n (mod Zp) (n > 1).
The last assertion of the lemma follows now from the Clausen-von Staudt theorem
(V.5.5). ■
The lemma and hence also the theorem are completely proved. Let us summarize
two formulas that follow immediately from the theorem (and its proof).
Corollary. We have
KM
r
^—=/ Log(x + r)rfr, (Logrp)"(*)= / ——dt.
Proof. Everything follows from the previous proof and Proposition 3 of (V.5.3)
(justification of derivation under the integral sign). Observe that the expansion
1 1 1 1^ i"
= = - > (-1)" —
* + / / (l+jc/f) / ;£- t"
can be integrated termwise:
f —!—dt = y(-l)"x"[ r^dt.
Jz*px + t £> Jz«p
Since the integrals of odd functions (with zero derivative at the origin vanish by
the corollary of Proposition 4 in (V.5.3)), there remain only the even powers of t
(corresponding to odd powers of x):
[ ^—dt = -yimx2m~\ km= [ r2mdt.
h*,x + t 4t Jz;
This confirms our previous expression for the coefficients of Log Tp. ■
380 7. Special Functions, Congruences
1.6. The Kazandzidis Congruences
We have already given in (V.3.3) some congruences for the binomial coefficients
©=(:)
(mod pnZp).
It turns out that these congruences hold modulo higher powers of p.
Theorem (Kazandzidis). For all primes p > 5 we have
©"(*) (modP3"*("-*>(")Zp)-
For p = 3 the same congruence holds only mod 32nk(n — fc)(")Z3 (namely one
power of 3 fewer).
The form of these congruences suggests that we should prove (when p > 5)
0/0-
1 (mod p3nk(n - k)Zp).
It is clear that the left-hand side is a p-adic unit, and L. van Hamme had already
observed that it can be expressed in terms of Tp (or in terms of a p-adic beta
function) as follows:
(pn\ l(n\ TP{pn)
The Kazandzidis congruence states that this unit belongs to a multiplicative
subgroup 1 + prZp C Z* with aprecisely determined integer r > 0. The preceding unit
can be studied by means of the logarithm: We have indeed proved | log £ | = |£ — 11
if |£ — I| < rp. On the other hand, we also have \Tp(x) — 1| < |jc|, proving, for
example,
Vp(px) el + pZp (xe Zp).
Since we are assuming p > 3 we have \p\ < rp, and the isometric property of
the logarithm is valid for £ = Tp(px), resp. £ = Tp(py) and £ = Tp(px + py)
(x,y eZp). Hence
Tp(px + py)
rp(px)rp(py)
log
rp(px + py)
rp(px)rp{py)
(x9 y e Zp).
Let us introduce the restricted power series (all its coefficients are in pZp as we
shall see)
f(x) = log rp(px) = k0px - ^ 2n(2n"+1)P2"+lx2n+1-
1. The Gamma Function Yp 381
This is an odd function (this is also a consequence of the Legendre relation, which
implies Tp(px) • Tp(—px) = 1). We now have
log
Tp{px + py)
Yp(px)Yp{py)
= \f(* + y)-f(x)-f(y)\.
As we have seen in (V.2.2), the linear term of f(x + y) — f(x) — f(y) disappears,
and \f(x + y) — f(x) — f(y)\ < C - \xy(x + j)|, where the constant C is the
sup of the absolute value of the coefficients (of index n > 3) of /. Here, we need
to examine carefully these coefficients. The Kazandzidis congruences will follow
from the next theorem. ■
Theorem. Let f(x) = log Tp(px) (x e Zp). Then
(a) f is given by a restricted series having all its coefficients in pZpt
(b) \f(x + y) - f{x) - f(y)\ < \p3xy(x + y)\.
Proof. Let us start with
f(x) = k0px - ^T
^ 2n(2n + 1)
p2n+lx2n+l.
(a) The radius of convergence of / is p > 1 (this function is obtained by a dilatation
x \-+ px from the function considered in (1.5)) and hence the series for f(x) is a
restricted power series. We can write
f(x) = X0px- pJ^pK
„2n-l
.2/i+l
«>1
2n(2n + l)
We have seen that Xq € Zp and pXn e Zp (n > 1). It is enough to observe that
This is obvious for n = 1 and n = 2 and follows from the lemma below for n > 3.
(&) Let us repeat the expression for f(x + y) — f(x) — f(y) in the following
form:
~jhp3((x + y)3 ~*3~y3) ~ ^hp5((x+yy>5~x5-y5>>---'
The leading term in this expression of f(x + y) — f(x) — f(y) is
.2,, , _,2
^p3 - 3(*2j + *j2) = y /73*j(* + j) = ^-^p3Xy(x + j) mod Zp.
When the prime /? is greater than 3, this term is in p3xy(x + y)Zp, whereas it is
only in 32xy(x + y)Z3 when p = 3. The next term is treated similarly:
*2 S
— p°xy(x + y)(*)
t?4 *
—p xy(x + J)
1
4 • 52 ■ 6
/? xj(x + y)
382 7. Special Functions, Congruences
When p = 3 there is a factor 34 making this term smaller than the first one. When
p = 5 there is still a factor 53 of the same size as in the first term. When /? > 5 the
factor p5 makes this term strictly smaller than the first one. The subsequent terms
are treated in the next lemma. ■
Lemma. For n > 3 we have
p2n-3
< 1.
|2n(2n + l)|
Proof. Let us estimate the /?-adic order of the fraction:
2n - 3 - ordp 2n(2n + 1) > 2n - 3 - ordp (2n + 1)!
= 2n
>2n
>2n
1.7. About12
Let us show here how the Morita gamma function is defined for the prime p = 2.
Preliminary comment. Let G be a finite abelian group written additively and let
s = s(G) = J^g.
gcG
In this sum the pairs {g, — g] consisting of two distinct elements contribute 0 to
the sum, and we see that
2n+l-
P
2n+l-
p-\
In
- — -n-
Sp(2n + 1)
-1
1
-3>0.
s= ]C *-
g=-g
But g = — g is equivalent to 2g = 0, and
H = {geG:2g = 0}GG
is a subgroup of G, isomorphic to a product of cyclic groups of order 2: H is of
type (2,2,..., 2). Moreover, we have seen that s(G) = s(H). Now, the sum s(H)
is obviously invariant under any automorphism of the group H: The only case
where s(H) can be nonzero is thus
H cyclic with two elements,
in which case s(H) — 1 is the nontrivial element of this group. Equivalently, s ^ 0
precisely when the 2-Sylow subgroup of G is cyclic and not trivial.
1. The Gamma Function Tp 383
Proposition. For v >3the kernel of the homomorphism
xi->xmod4 : (Z^Z)* -+ (Z/4Z)X = {±1}
is a cyclic group C(2V~2) oforder2V~2 generated by the class of 5, and(Z/2vZ)x
is isomorphic to the direct product ofC(2v~2) and {±1}.
Proof. Since the order of (Z/2i;Z)x is 2v~l, the kernel of the homomorphism
onto (Z/4Z)X = {±1} has order 2V~2. We shall prove that this kernel contains an
element x of order 2V~2. Take x = 1 -h 4f (obviously in the kernel) and use the
fourth form of the fundamental inequality (III.4.3) (Corollary at the end of (V.3.6))
(1 + tf = 1 + nt (mod pntR)
for n=2k and p = 2. Replacing t by At (t e R) we obtain
(1 + 4tf = 1 + 2k4t (mod 2 - 2* • 4R),
(1 + 4f)2* = 1 + 2k+2t (mod 2k+3R).
The element 1 -i-4f has order 2V~2 precisely (and is a generator of the kernel) when
(l+4f)2V"V 1 (mod 2^):
(1 + 4tf~3 = 1 + 21,~If ^ 1 (mod 2^/?).
As appears now, this will be the case exactly when t is odd, \t\ = 1. This proves
that the class of an integer x = 1 -h At is a generator of C(2^~2) precisely when
x ^ 1 (mod 8) and jc = 5 = 1 -h 4 is an eligible candidate! ■
Corollary The product of all units of Z/2VZ is
l(v = l), -l(v = 2), l(v>3).
Proof. This follows from the preliminary observation, since
(Z/2Z)X ^ {1}, (Z/4Z)X = {±1},
whereas if v > 3, then
(Z/2^Z)X = {±1} x C(2V~2)
is a product of two nontrivial cyclic groups. ■
Now let us consider the following sequence:
/(1)=1, f(n)= f] j (n>2).
1<7<«, >odd
Hence f(2) = 1 and
/(2n + 1) = /(2n) (n>l).
384 7. Special Functions, Congruences
Since f(n) is odd for all n > 1, we infer (for the 2-adic absolute value!)
|/(n)| = l, \f(m)-f(n)\<\2\ = ± (nfni>l).
For n — 2V we have
/(!) = !, /(2)=1, /(4) = 3 = -l (mod 4)
and then
mv)= n ^"s+1 (mod2V) (^>3)-
1<7<2v,j" odd
As in (1.1) we infer
|/(n + 2v)-/(n)| =/(«)( f] j - 1 ] < |2V| (v > 3)
and more generally
\f(m) - f(n)\ <\m-n\ (m, n>\, \m - n\ < |).
This proves that the function / is uniformly continuous, and hence has a unique
extension to Z2 -> Z£ = 1 4- 2Z2, which we still denote by /:
l/(*)-/O0l<l*-j| i\x-y\<\).
Since f(2v) == 1 mod 2V (v > 3) we deduce /(0) == 1.
Lemma. We have \f(x 4- 4) - f(x)\ = \ (x e Z2).
Proof. Since the image of / is contained in 1+ 2Z2 of diameter \, we have quite
generally \f(x) — f(y)\ < \. The relation
f(2n + 2) = (2n + l)/(2n) = 2n/(2n) + f(2n)
shows that
|/(2n + 2) - /(2n)| = |2n/(2n)| = |2n| ( < |2| = |).
Similarly,
/(2n + 4) - /(2n) = f(2n) - 1 - 3 - /(2n) ~ ~2f(2n) (mod 4),
|/(2n + 4) - /(2n)| = |2/(2n)| = |2| ( = §).
Since we also have
/((2n + 1) + 4) - f(2n + 1) = /(2n + 4) - /(2n),
we may conclude that |/(jc + 4) - /(jc)| = |2| = | (jc G Z2). ■
2. The Artin-Hasse Exponential 385
In order to have
iyo) = i, rp(i) = -if rp(2) = i
for all primes (including p = 2), we decide to change the sign of f(n) when n is
odd. Thus we define
r2(n) = (-!)"/(«) = (-!)" [7 j (n>2).
l<j<n,j odd
The formula (Definition (1.1))
rp(n) = (-l)" n j (n>2)
l<7<n,pfj
holds now for all primes p. By definition, we have
If(x) if x e 2Z2,
-/(*) if*el+2Z2.
Consequently, when x and y are in the same coset mod 2,
r2(*) - r2(y) = ±(/(x) - /(>)),
and this shows that
|r2(*) - r2(y)\ = \f(x) - f(y)\ (x = y (mod 2)),
so that the inequalities obtained for / are still valid for T2.
Observe that we have
T2(jc + 1) = h2(x)Y2(x\
where
\-x ifjtel+2Z2 (|x| = l),
h2(x) = {
[-1 ifxe2Z2 (|jc| < 1),
in complete similarity with the odd-prime case (1.2).
2. The Artin-Hasse Exponential
The exponential series has a radius of convergence rp < 1 because its coefficients
an = 1 jn! have increasing powers of p in the denominator. It turns out that the
Artin-Hasse power series
exp (x + \xP + jrxP2 + --•)= J2a»xn
H>0
386 7. Special Functions, Congruences
has /7-integral coefficients: an e Q DZp. As a consequence, this power series has
a radius of convergence equal to 1. Dwork has used this power series for the
construction of pth roots of unity in Cp (similar to the construction of nth roots of
unity in C). The Dieudonne-Dwork criterion explains the integrality property of
the Artin-Hasse power series, and Hazewinkel has found a deep generalization of
this phenomenon. We shall present only the initial aspects of these theories.
2.7. Definition and Basic Properties
Let us start by reviewing a couple of elementary formulas concerning the Mobius
function. Recall that for an integer n > 1 this function is defined by (i(l) = 1 and
fi(n) = 0 if n is divisible by a square k2 > 1,
&(piP2 - - - Pm) = (— l)m if the pi are distinct primes.
Lemma. We have
£>(</) = 0, £|jx(<0l=2* (n>\),
d\n d\n
where k is the number of distinct prime divisors of n.
Proof. In fact, if n = p^1 • • - p^k and d \ n is a divisor with fi(d) ^ 0, then d is a
product of a subset of primes /?,■, and quite explicitly,
J2 V(d) = 1 + J2 KPi) + J2 ViPiPj) + - - •
d\n i 1,7
_,_t+Q_...+(_Wi
= e (-iyf) - a - d« = o.
o<i<* ^'
Similarly,
J] \lL(d)\ = 1+1] lM(A)l + £ MPiPj)\ + ''' = (1 + D* = 2"- ■
Proposition. We have identities of formal power series
log(l -xn) = x,
^ n
«>i
and for each prime p
y -*iog(i-/)„ + I/ + i/ + ....
T^. /I p pz
2. The Artin-Hasse Exponential 387
Proof. Recall that
log(l ~-t) = log = V — -
1 — t *—' W2
m>l
Hence
E-— w-*»> = £*»>£ —
~ n *-i *-~? nm
«>1 n>l m>l
= £ -jj- E ^ =*
N>1 n\N
by the first identity of the lemma. Similarly,
> log(l - xn) = > jx(n) >
«>l,ptn «>l,p|« m>l
= £^ E *-)•
The conditions rc | N and rc prime to p amount to n | Np~v, where v = ordpN
(also denoted by pv \\ N). The corresponding sum vanishes (still by the first
identity of the lemma) except if Np~v = 1, namely N = pv (v > 0):
Corollary. We have formal power series identities:
exp(jc) = P[(l-jcBrM(")/w.
exp^ + ^p + ^^+---) = ]~[ (l-jc"rM(n)/". ■
Definition. The Artin-Hasse exponential is the formal power series defined by
Ep(x) = exp (x + ±jc* + ^jc^2 + •-■) = !+* + •--.
Since log and exp are inverse power series for composition (VI. 1), we have
1 1 2
log EJx) = x + -xp + —xp + • - -,
P P2
and by the corollary,
£„(*)= II (I-jcT*00'"
n>l, p|«
is an identity of formal power series.
388 7. Special Functions, Congruences
2.2. Integrality of the A rtin-Hasse exponential
The power series exPfp = 1 + xp//?+•- • converges at least for |jc| < rp, since
\xp/p\ < |jc| for |jc| < rp. Consider the product of the two power series
xp xp
C* =\+x + --- + — + ■ • - and e* /p = 1 + — + .--.
p\ p
Its first coefficients are
xp-\
GH)
expx exp — = 1+H h — + xp I —- + - 1 H
/? (/?- 1)! -1 -
The coefficient of xp is
A miracle happens: The numerator is divisible by p — Wilson's theorem — so
that the whole fraction is in Zp. More is true: All the coefficients in the product
XP Xp2 r-r XpJ yr-,XpJ
expx - exp — • exp —— - • - = I exp —r- = exp > —r
P P2 )>o PJ Jio PJ
are p-integral, hence in Zp. As a consequence, this power series converges for
\x\ < 1-
The radius of convergence of the power series
1 1 2
h(x) = x + -xp + — xp +-.-
P P2
is the same as for its derivative (Proposition 3 in (VI. 1.2)):
h'(x)= l+xp~l +xp2~l + ..-,
namely />, = r/,' = 1. The critical radii and the growth modulus of h are the same
as for the logarithm log(l + x): Both series have the same dominant monomials.
In particular, Ep(x) = exph(x) is well-defined, it converges at least for \x\ < />
and
\\ogEp(x)\ = \h(x)\ = \x\ (|x|<#».
(But |^| is unbounded in the open unit ball Mp C Cp.) This proves that Ep(x) =
exp h(x) is well-defined in the ball \x\ < rp.
Theorem. The coefficients of the Artin-Hasse power series Ep are p-integral
rational numbers, so that Ep(x) 6l+ jcZp[[jc]]. Moreover, the radius of
convergence of this power series is r£p = 1, and
\Ep(x)\ = l, \Ep(x)-l\ = \x\ (|*| <1).
2. The Artin-Hasse Exponential 389
Proof. When p does not divide n, —fi(n)/n is equal to 0 or to ± 1 jn: The binomial
series expansion of (1 — xnytfn)/n j^ its coefficients in Zp, and hence converges
for|jc| < 1 (at least). The infinite product has coefficients inZptoo.lt also converges
in the ball |jc| < 1 by the lemma in (VI.2.3). This proves Ep(x) e 14-*Zp[[jc]], and
in particular r = rEp > 1. Let us show that this radius of convergence is precisely
1. For this purpose, let us prove the identity of formal power series
Ep(xp) = Ep{x)Ep{rx) • • - Ep(Sp-lx)
where r is a primitive pth root of unity: r ^ 1 = rP. The exponent in the product
is indeed
(i + c + --- + cp"> + p(^ + ^2 + ■■■),
whence the identity. Now, each power series Ep(rlx) has the same radius of
convergence r = rEp, while the radius of convergence of Ep(xp) is r1/p. By
Proposition 2 in (VI. 1.2), we obtain
rl/p>min(r,r,...,r) = r,
namely r >rp. This proves r < I.1 Now let
£p(x) -!=* + £ a*** (fl„ € Zp).
Hence we have
\anxn\ < |jc|" < l^l2 < 1*1 (M<1, «>2),
and \Ep(x) — 1| = |jc| (|jc| < 1) since the strongest wins. ■
As we have already observed, the coefficient of xp in the expansion of
ex+xP/p _ f m fP/p
is p-integral. Let us show that this product furnishes a transition between exp and
Ep, with an intermediate radius of convergence (a quantitative way of saying that it
has fewer powers of p in the denominators of its coefficients than the exponential).
Proposition. The radius of convergence of the power series f(x) — e****^ is
r, - r(2p-l)/p2
1J ~" ' p »
hence rp < rf < 1. We have
|exp(x + ix*)| = l (\x\<rf).
or r = oo. but look at exercise 9.
390 7. Special Functions, Congruences
Proof. (1) As formal power series, we have
J>2 PJ
and conversely,
«***'/'= £,(x).«p(-g^.
To prove that this product converges beyond |jc| = rp and get an estimate of its
radius of convergence, it is sufficient to show that the radius of convergence of its
second factor is greater than rp (Proposition 2 in (VI. 1.2)). To get an estimate of
the radius of convergence of
exp("S£)
we use (VI. 1.5). First, let us recall that
Mr V ^— = -L Up1 for 0 < r < r1'^ = r"
J£p> \P2\ ~ ~ P
(the dominant monomial of the log series in the interval rp < r < rp is xp /p2:
Since the preceding monomials are absent in £->2 xpJ /pJ', the first one is
dominant up to rp. The numerical substitution of g(x) = X!;>2 xpl IP* m /(*) = exP x
is allowed when
\x\ < rg = 1 and MMg < rf = rp.
The second condition is
\Ap2I\p2\ < Ipl^, \*\p2 < lp|2+^ = lp|^ = r2f-\
namely
Since
we see that
|jc| < rf-Wr1.
L<l(2-l.) = ^l<iE<l>
p p V p/ r P ~
rp < rf-»"* < I,
and the numerical evaluation is valid in the region considered above. The radius of
convergence of the composite is at least rp p~ ^p > rp. In its ball of convergence,
2. The Artin-Hasse Exponential 391
all factors in
gt+xP/p = £ (JC) .
fl-(^)
have absolute value equal to 1, hence \ex+xP/p\ = 1 (|jc| < rp2p l)lp ).
(2) To simplify the notation, let
p = radius of convergence of ■
F
pi — radius of convergence
P3 = radius of convergence
of exp (i-j ,
°feXP(§^)'
Since \Ep(x) — 1| = \x\ for \x\ < 1, we have p > 1. More precisely, \/Ep{x) =
Ep(—x) if p is odd, proves that p = 1 in this case. Now, let us write
exp
This shows that
(4Y^^m
p2 >min(p,r/,p3)
(Proposition 2 in (VL1.2)), and since p2 < P3 < 1 £ P> we infer that p2 > 0-- ■
2.3. r/i^ Dieudonne-Dwork Criterion
Another proof of the p-integrality of the coefficients of the Artin-Hasse power
series will now be given.
Let k be a field of characteristic p. The identity xp = x in k characterizes its
prime field ¥p. In the polynomial ring k[x], the identity f(x)p = f(xp)
characterizes polynomials / having coefficients in the prime field. For a polynomial /
with integral coefficients, the congruence f(x)p = f(xp) (mod p) means that
f(x)p - f{xp) € PZ[x],
and it should therefore be written more precisely as f(x)p = f(xp) (mod pZ[X]).
For polynomials / with rational coefficients, it turns out that the same congruence
characterizes the integrality of its coefficients. This principle also holds for power
series. The extent to which the operations
first raising x to the power p and then applying /,
first computing f(x) and then raising to the pth power
lead to similar results, is a measure of the integrality of the coefficients of /. A
precise formulation of this principle can now be given.
392 7. Special Functions, Congruences
Theorem (Dieudonne-Dwork). Let f(x) e 1 -f- *Qp[[x]] be a formal power
series. Then the following conditions are equivalent:
(i) The coefficients of f are in Zp.
(H) f{x)P/f(xP) e 1 + pxZp[[x]l
Proof. (0 => (ii) If f(x) e 1 + xZp[[x]]3 then f(x)? ~ f(xp) (mod p). Both
series belong to 1+ *Zp[[jc]], and f(xp) el+ *Zp[[jc]] is invertible, so that (ii)
follows.
(ii) =» (i) Let us write f(x) — ^2i>0 aix1 (ao = 1, a, e Qp) and assume
./w = funfi+pj; v7j (*> e zp).
w
We have fl0 = 1 an^ fli=^6Zp. Let us assume by induction that a{ e Zp for
i < rc and let us compare the coefficients of xn in both members of (*). The
coefficient of xn in the left-hand side is the same as in
\i<n J i<n
The nonwritten monomials are products aixan - ■ -aipxll+l2+'"+lp having at least
two distinct indices ij. It is enough to determine them mod Zp, and for this reason,
all monomials not containing an will play no explicit role, since — by the induction
assumption — they have coefficients in Zp. The only monomials containing an
that are of interest for us have a single factor anxn and all other factors ao = 1 (all
other monomials containing an lead to powers xm, m > n). Hence we find that the
coefficient of xn in the left-hand side of (*) is
ai + Pan + terms in pZp.
if ip—n
With the convention an/p = 0 when n is not divisible by p (i.e., n/p not an integer),
we may write this coefficient as
an/p + Pan + termS in P^p-
The right-hand side of (*) is
and the coefficient of xn in this expression is
an/p + terms in pZp.
Since «//? < «, the induction hypothesis shows that an/p e Zp, and hence a%/p =
flM/p (mod pZp). By comparison we infer pan e pZp and an e Zp. ■
2. The Artin-Hasse Exponential 393
Application. Consider, for example, the Artin-Hasse power series Ep. As formal
power series we have the following identities:
(p§£)=exp('
Ep(x)p = exp ( p ^ — J - exp ( px + xp + + 1 = epxEp(xp).
Hence
Ep(x)p
EJxP)
= epx e 1 + pxZp[[x]],
as we are just going to show. In other words, the p-integrality of the coefficients
of the Artin-Hasse power series follows from the Dieudonne-Dwork criterion and
the following observation.
Proposition. We have
epx e 1 + PxZp[[x]]
and even
epx e 1 + pxZp{x] (p an odd prime).
Proof. For n > 1 we have
pn n-Sp(n)
n\ p — 1
n~\ p-2 1
> n = n -\ > 1,
p-1 p-l p-\
hence the first result. For p = 2 there remains only ord2 (2n/n!) > 1 with equality
precisely when S2(n) = 1, namely when n = 2V is a power of 2. For p > 3 we
see that
, Pn P~2
ordp — > n -> oo {n —> oo),
«! P — 1
and the second result follows. ■
2.4. The Dwork Exponential
The roots of the equation x+xp/p = 0 are 0, as well as the roots of xp~l + p = 0.
All the roots n of xp~l + p = 0 have the same absolute value |jt| = rp. Since the
radius of convergence of exp(x -h xp/p) is greater than r^, we may evaluate this
power series on any such root n. But crude substitution of n in x 4- *p/p gives 0,
and e° — 1 is not the correct result for ex+xP,p | ^! In fact, the condition given in
394 7. Special Functions, Congruences
(VI. 1.5) for numerical substitution is not satisfied, since
M|7r|
\x + y) = rp = rexp-
In the classical, complex case, all roots of unity are special values of the
exponential. It turns out that pth roots of unity can also be constructed analytically by
means of a generalized exponential. Recall that if 1 ^ £ e fip, then |£ — 11 = rp
(II.4.4).
Proposition (Dwork). Choose a root it of the equation xp~l + p = 0 and let
rn denote the result of the substitution ex+xP/p\ _ . Then rn e \ip is the pth
root of unity such that
rn = 1 -f- 7r (mod n2).
Proof. (1) We have
ex+x?/P = j + x _j_ x2^ ^ _ j + x ^mod x2^ ^ indeterminate).
Let us show that we also have
ex+xP/p\x^ = l+* (modTr2).
The sup norm of the function ex+xPlp on its ball of convergence is 1. hence the
coefficients an of its power series expansion ex+xP/p = J]«>oGnJC" satisfy
2p-In
\an\r/ <1 (n>0)
(Lemma in (VI.4.6)). Hence
\a»nn\<rp p .
The exponent of rp is
np2—2np+
p2
-(^)
We want to show that \an7tn\ <z\n\ (rc>2). This is certainly the case when
(Zj-fn > 1. When p > 5, we have (^fn > %n > 1 for all n > 2 and
we are done. When p = 3, we have (£-^-)2rc = |rc > 1 for all n > 3. We have
to estimate <z3. But the coefficients an of exp(jc -h |jc3) are the same as those of
the Artin-Hasse power series for n < 8, hence are 3-integers, and the conclusion
follows. When p = 2, we have (—)2rc = \n > 1 for all n > 5. We have to
estimate the coefficients an for n < 4. But (exercise)
1+7T +7T2+|7r3+^7r4+---,
^x+a2/2
x=tt=-2
2. The Artin-Hasse Exponential 395
and since \tt2 = 1 (is a 2-integer!), ^jr4 = 0 (mod n2) as we desired to show.
(2) As formal power series, we have
,X+XP/P\P _ „P(X+XP/P) _ nX+xP _ nx . **>
(ex+xP/p)
e^ . tr/=zer = eH ■ e
In detail, let ip denote the polynomial ip{x) = jcp. Then (p o exp(jc) = exp px as
formal power series, and hence with h{x) = x -f- xp/p
(exp(x + xp/p))p = ip o exp(jc + xp/p) = <p o (exp oh)(x)
— ((p o exp) o h(x) = exp(/?/z(jc))
= epx+xP = ^ • <?P
(since p is a polynomial, no condition on the order of exp oh is required in Corollary
2 of Proposition 2 (Vl.1.2)). Since |jrp| = |/?jt| = \p\rp < rp, the numerical
evaluation of both exponentials is obtained by substitution (VI. 1.5):
# = (ex+xP/p\x=ji)P = epn - enP = e^ • e"^ = 1. ■
Let us renormalize the situation. Choose a root n of jcp-1 + /? = 0; hence
\n| = rp. Substitute jc = jtj, so that eJTy converges whenever \y\ < 1. The same
substitution in exp(jc + xp/p) leads to a power series
ttj H j = exp (jtj — Try*7) = expjr(j — yp)
converging at least for
\ny\<r?r-W, \y\ < ^p-D/p2-..
The exponent of rp is
2p-l _ 2 = 2p - 1 - p2 = (p - I)2
P2 P2 P2
The power series expjr(j — yp) converges at least for
\y\ < \P\-{p-mp2) = pip-wtf\
hence its radius of convergence is greater than or equal to p^p~l)/(p ) > 1.
Definition. When tt is a root ofxp~l + p = 0 in Qa, the Dwork series is the
formal power series
EAx) = exp(7rU - xp)) e Qp(tc)[[x]].
The radius of convergence of the Dwork series is p^P~l^^P > > 1.
396 7. Special Functions, Congruences
Hence En=^ f og, where f(x) = ex and g(x) — n(x —xp) has order 1 (sufficient
to enable substitution). We shall be interested in the special values taken by this
power series when its exponent vanishes:
xp — x = 0 <£=> x=Oorxe fip-i•
As we have seen,
£„(!)= l+7r (modjr2)
is a generator of [ip.
Theorem (Dwork). Let n be a root ofxp~l + p = 0. Then K = Qp(jt) is a
Galois extension of Qp. It is totally and tamely ramified of degree p — 1, aaw/
£ = Qp(p<p). More precisely:
(a) The field K contains a unique pth root of unity £,n e \ip such that
£„. == 1 +7T (mod tt2).
(b) The series En(x) has a radius of convergence p^p~l^p > 1.
(c) For every a e Qp with ap =awe have
Ejr(a) e [Lp, En(a) =l+an (mod tt2),
so that En(l) = £r-
Proof. Nearly everything has already been proved. Observe that Xp~l + p is an
Eisenstein polynomial relative to the prime p and hence is irreducible over Qp
(II.4.2). If tt and jx' are two roots of this polynomial, then {jt' jir)p~x = 1, hence
tt'/tt e fMp-i C Qp. Thus the splitting field of Xp~l + p over Qp is obtained
by adding a single root tt of this polynomial to Qp. This proves that # is totally
ramified of degree p—l over Qp and hence tamely ramified. The uniqueness of a
pth root of unity $n = \+tt (mod tt2) follows from the simple observation that the
distance between pth roots of unity is rp (Example 2 in (II.4.2), and also (II.4.4)):
Two distinct pth roots of unity are not congruent mod tt2. The other statements of
the theorem follow easily from previous observations. ■
Comments (1)If 1 ^ £ e Cp isarootofunityoforder/?,wehaveseenin(II.4.4)
that f = f — 1 is a root of
and hence |f | = /^ = Ipl1^-"1*. We are now considering roots tt of the simpler
equation
xp~l + p = 0.
2. The Artin-Hasse Exponential 397
Since jr and f have the same absolute value, f = tzu for some u with \u\ = 1:
f — 1 = 7Cu, f = l+jrwe/xpc Cp.
]fap~l = 1, say a = k (mod /?) with 1 < A: < /? (namely A; = ao is the first digit
in the /?-adic expansion of a), then both En{a) and £^(1)* are /?th roots of unity
congruent to 1 -h kit (mod jr2), and the theorem implies
En(a) = £*(!)*.
(2) The Dwork power series is a kind of exponential map: En (0) = 1 and
E
V<p-\ C {a : ap — a] —4 [ip
(3) Let / > 1 and El(x) = exp jt(jc - xpf), so that En(x) = Eln(x). Then
Ef(x) = expn(x-xpf)
2 /—1 /
= expjr(jc — xp) expn(xp — xp )- - -expjr(jcp — xp )
= En(x)EAxp)-'En(xpf~1)
converges at least when each factor converges. The most restrictive condition is
given by the last one: Convergence of EJT(xp ) occurs if
I*"'"| < p^'P2, \x\ < pO-')/(PV-').
With q = pf, we see that the radius of convergence of El is pCp-0/(P9) > \
f
rf
e*
rp = Ipl^l
f+*
rP
Ep (Artin-Hasse)
expEy>o ~T
1
El (Dwork)
€?<*-**) (q = pf)
P M
Radii of convergence of some exponential series
(listed in increasing order)
2.5. Gauss Sums
Sums of roots of unity play an important role in number theory. Let us show how
they can be used to prove that any quadratic extension of the rational field Q is
398 7. Special Functions, Congruences
contained in a cyclotomic one, i.e., in an extension generated by roots of unity. It
is enough to show that the quadratic extensions Q(*/p) (p prime) are contained
in a cyclotomic extension of Q.
Let us choose a root of unity £ of prime order p > 3 in an algebraically closed
field K of characteristic 0. For example, take K = C and r = e2jTi/p. Then the
sum of roots of unity
0<-V«rn W
0<v<p
is the simplest example of a Gauss sum: Here—as in (1.6.6) — (-1 = ±1 denotes
the quadratic residue symbol of Legendre.
Proposition 1. For an odd prime p, we have S2 = ±/?.
Proof. The square of the sum Sp is
For fixed \i ^ 0, v\i goes through all nonzero classes mod p, and we can replace
v by vfji in the double sum:
We consider separately the terms with v = p — 1:
?(tV-»-»®
and for v ^ p — 1
Recall that
=0 because v+1^0
2. The Artin-Hasse Exponential 399
Hence
<-"-»(t)-£©
-^.-(tO-eG)
= P(~1/P)-
The announced formula is proved. ■
Corollary 1. For a prime p > 3, r/ie complex absolute value ofSp is
\SP\c = <Jp. ■
Corollary 2. For a pn/ne p > 3, r/te quadratic extension Q(y/~p) is contained
in the cyclotomic field Q(£, %/^T). ■
Observe that if /? = 2, we have (1 + V11!)2 = 2-v/::T, so that a/2 e QCv^T)
and the quadratic extension Q(a/2) is also contained in a cyclotomic one.
Comment. A theorem of Kronecker asserts that any Galois extension of the
rational field Q with abelian Galois group is contained in a cyclotomic one. This is
a deeper theorem, which has been widely generalized, and belongs now to class
field theory.
The general form of Gauss sums in a field K containing a pth root of unity £ is
obtained as follows. The map u h-> f", Fp —> Kx is a group homomorphism:
/■v+m = /■" . /■**
The map v h> (-), F* -> Kx is a group homomorphism:
(tK)®-
extended by (-J = 0. Replace Z/pZ = Fp by a finite field F9 (where q = pf \s
a power of /?) and let more generally ty and / be two group homomorphisms
^ : Fq -» K* and / : F* -> Kx extended by /(0) = 0.
According to tradition, we shall say that ^ is an additive character of F9 and / a
multiplicative character of F^. By definition, the Gauss sum attached to this pair
400 7. Special Functions, Congruences
of characters is the sum
In the next section we give the /?-adic absolute value of Gauss sums. Now, let us
show how to determine all additive characters of a finite field.
Proposition 2. Let G be a group and K afield. Any set of distinct
homomorphisms G -> Kx is linearly independent in the K-vector space of functions
G-» K.
Proof. Since linear independence of any family is a property of its finite subsets,
it is enough to prove that all finite sets of distinct homomorphisms are linearly
independent. We argue by induction on the number of homomorphisms Vi. Since
homomorphisms are nonzero maps, the independence assertion is true for one ho-
momorphism. Assume that n — 1 distinct homomorphisms are always independent
and consider n distinct homomorphisms fa (1 < * 5 «)- Starting from a linear
dependence relation
QfiW^) + - + ^fcW = 0 (x eG, a, e K),
we multiply it by the value fa(a) (for some a e G):
ax fa(a) fa (x) + - - - + an fa (a)fa(x) = 0 (x e G).
On the other hand, we may replace x by ax in the first equality, and since fa (ax) =
fa(a)fa(x), we obtain
ctifa(a)fa(x)H hanfa(a)fa(x) = 0 (x e G).
If we subtract the two relations obtained, the first term disappears, and we get a
shorter relation:
^i(fa fa) - ^i(a))fa + • • ■ + <xn(fa(a) - fa(a))fa = 0.
By the induction assumption, all the coefficients of this new relation vanish. If we
choose a e G such that fa(a) — fa(a) ^ 0 (this is possible since fa # tyn\
we see that an = 0. Using the induction assumption again, we get o?/ = 0
(1 < i < n). ■
Proposition 3. Let F be a finite field and r : F -> Kx a nontrivial additive
character. Then any other additive character \jf has the form i/(x) — r(ax) for
some a e F.
Proof. The identity
z(a(x -+- y)) = x(ax + ay) = r(ax)r(ay)
2. The Artin-Hasse Exponential 401
shows that for any a e F, ra(x) := x{ax) defines an additive character. Now,
a h> Ta is a homomorphism
*a+b(x) = *(0z + b)x) = r(ax + bx) = r(ax)r(bx) = ra(x)r^(x).
It is injective, since t is a nontrivial character:
ra(x) =1 (xeF)=4fl=0.
The additive characters (ra)aGF constitute a basis of the F- vector space of functions
F ~-> Kx. Any additive character must be in this family by Proposition 1. ■
As a consequence, we observe that the Gauss sums G(tJs9 x) can be computed
easily from G(t, /):
GW, X) = G(ra, x) = JT r(ax)x(x).
xeF*
If we assume a ^ 0 and replace x by a~lx in the sum, then we obtain
G«r,x)= 51 Tfefl-1*)/^-1*) = £ TMx(fl"I)xM = x(fl",)C(T,x).
2.6. 77ze Gross-Koblitz Formula
Let us choose a primitive pth root of unity fp in Cp and let AT = Qp(fp). Then
|fp — 11 = rp, and $p — 1 is a generator of the maximal ideal P of R C AT. As we
have seen in (2.4), there is a generator n of P uniquely characterized by
n*"1 =-p, 7r=fp-l (mod«p-l)2).
Conversely, if we choose a root 7r e Cp of ttp~1 = —p, the field K = Qp(tt) is
a Galois extension (2.3) of Qp, it contains all roots of unity of order /?, and the
Dwork series furnishes En(\) = £p, the unique root of unity of order p satisfying
t;p == 1 + n (mod 7r2).
Since an additive character of the field ¥p is uniquely determined by its value
\^(1) G /Xp, we choose the nontrivial additive character
We can now consider Gauss sums of the form
C(X,^)=X>(*)C (X(0) = 0),
where x is a multiplicative character of ¥p with values in K. More precisely, the
values of x are roots of unity having order dividing p — 1 (and 0):
G(x, VO € Q(/Xp, Mp_,) = Q(/Xp(p_,)).
402 7. Special Functions, Congruences
We shall consider Gauss sums of the form
£ a>(x)-aHx)=Ylo>(xrati (a>(0) = 0),
0^xgFp xeFp
where oj(jc) e p-p-i denotes the unique root of unity in K having reduction x in
the residue field R/P of K ((IL4.3) and (III.4.4)). Here, the integer a only counts
mod p — 1: It is better to take a e —j^-Z/Z and set
Ga = ~YJ^)~{P~l)aKn (0>(P) = 0).
*eFp
A reason for the choice of sign is that we now have Go = 1:
0<f</7 0<f</7
It is remarkable that these Gauss sums are linked to the Morita p-adic gamma
function: When a = a/(p — 1) (0 < a < /? — 1) we have explicitly
«■-* "r'(^r)-
This is a particular case of the Gross-Koblitz formula. Since the values of Tp are
units of Zp, the preceding formula gives the exact order of Ga, and
\Ga\ = \7r\a=rap = \p\^.
Conversely, this case of the Gross-Koblitz formula shows that
J E Q(7T, flp(p-i)),
and this is an algebraic value, since ttp~~1 = — p.
There is a more general formula Let a E Z(p) = Q D Zp be a rational number
with denominator N prime to p and choose a sufficiently high power q = p^ of
p so that the extension Fg of degree / of its prime field contains a root of unity of
order N. We shall work in the tamely ramified extension
K = Qp(7T,llq-l)CCp
having ramification index e = p ~- I, residue degree /, and hence degree n = ef
over Qp. Considering a e ^Z/Z C —^-Z/Z, we choose a representation
0 < (or) = —— < 1
q-\
of a and write the p-adic expansion of the numerator:
a =a0+aip-\ M/-] pf~l <q - 1 < q = pf.
<
3. The Hazewinkel Theorem and Honda Congruences 403
Let Sp(a) — y£20<j<faj denote the sum of digits of a as in (V.3.1) and introduce
the integers a^ having the p-adic expansions obtained by cyclic permutations
from the expansion of a = a(0):
a(l) = af-i +a0p + •-- + af-2pf~\
a(2) = «/-2 +af-\p H \-af^3pf~\
aif l) = ax + a2p + a3p2 -\ \-a0pf l.
On the other hand, if the nontrivial additive character rfr of the prime field Fp is
chosen as before, the composite of ^ with the trace
Tr: Fg -^ Fpy x h> x + xp + • • • + xpf~l
is a nontrivial additive character of Fq (the trace is nontrivial, since the extension
Fq/Fp is separable: All extensions of finite fields are separable). Then we have
the following general formula.
Theorem (Gross-Koblitz). Let 0 < a = < 1. The value of the Gauss sum
Ga is explicitly given by
Ga = - £ co(xraMTr(x)) = *Ma) fl Tp (~r) ' -
O^xgF^ 0<j<f W V'
3. The Hazewinkel Theorem and Honda Congruences
3.1. Additive Version of the Dieudonne-Dwork Quotient
The power series
does not have coefficients in Zp (powers of p appear in the denominators).
However, its exponential — the Artin-Hasse power series—has ^-integral coefficients.
This phenomenon will now be studied more closely. Observe that
so that
f(xP) = xp + \xP2 + jtxP* +
f{x) =x
p
has integral coefficients! Let us introduce the operator
Hpf(x) = f(x)-^—t
404 7. Special Functions, Congruences
on formal power series. We have an identity of formal power series:
exppHpf(x) = exp(pftx) - f(xp))
= (exp/(*))"
expf(xP) '
The expression Hpf is an additive version of the Dieudonne-Dwork quotient
<p(x)p/<p(xp) (2.3), and we shall formulate criteria for ^-integrality of some formal
power series in terms of Hp>
Proposition. Let f denote the formal power series f(x) = log(l + x). Then
H2f(x) = J2 (*" ~ *2nV" e Z(2)[W].
n odd
Hpf(x) = ]T (-l)"-1*"//? e Z(p)[[jc]] (/? Art odd prime).
Hence for all primes p, Hp(\og(l + x)) has p-integral coefficients.
Proof. We have
iog(i+^) = V(-irl-)
1 1 x^"
P P^x n
If the prime p is odd, we have (—1)" = (—l)pn, and the announced result follows
in this case. When p — 2, let us write explicitly
xn
iog(i+*)=]£ — y; —,
« odd « even
and
« odd n=2m even
EX % ■> X
AA n AA ™
n odd m odd
As announced, all coefficients are in Z(2).
aa n ^ 2w
n odd n=2m even
3.2. 77ze Hazewinkel Maps
Let us consider the following setting: Either
A = Z^M C B = Q[t]
3. The Hazewinkel Theorem and Honda Congruences 405
or
A = Zp[t] CB = Qp[t],
and a is the Q-linear map (resp. Qp-linear map)
a : B -» B, a(t) = £«,■*' ■"* a^P) = ^2aifPi
extended to
ff* : B[[x]] -> £[[*]], £ fl|-(r)x' »-> X] ^('V,
letting o act on the coefficients only. Note that (a*/)(jt") = <?*(/(*")), so that we
may unambiguously write this term a*f(xn).
Definition. Any map Hp : B[[x]] -» £[[*]] of the form
f»Hpf = nx) - I x>:/(^'),
vvfrere /cN* = {l,2,...}wfl subset of indices, will be called a Hazewinkel
map.
In the next three propositions, Hp denotes a Hazewinkel map.
Proposition 1. Let f = Y,m>\ fm*"1 e B[[x]], so that /(0) = 0. Then
Hpf e A[[x]] =^mfmeA (m > 1).
Proof. The coefficients of Hpf = ]£ nmXm € A[[x]] are given by
hm= fm 22 V1 fm/p' € A
with the convention fm/p, = 0 if i > ordp(jn), namely if m/pl is not an integer.
This series of identities starts with hm = fm e A when (m, p) = 1. We proceed
by induction on the order v of m, the case v = 0 having just been treated. When
p | m, we have
Pi
so that
™fm = ~ V" ^'/ih/p" = T] P' la I ~T fm/pl I (m°d ™A)
Gv4 by induction
and hence mfm e A as expected.
406 7. Special Functions, Congruences
Remark. When / = J2 fm*"1 e B[[x]] and Hpf e A[[jc]], we can write fm =:
am/m with coefficients am e A, and the formal power series / is a logarithmic
series
/w=X>m*m=X;
WJ>I
m
Proposition 2. Let g = Ylm>i SmXm and h = J]m>i ^^m be two formal
power series with zero constant term. Then
Hp(g oh) = Hp(gXh) + - J] X) (CT,«m) • (>* W'm - KKx'W) ■
P I m>l
Proof. By definition,
Hp(g oh) = goh--Y °Ug o h)(xP'),
Pi
while
tfpteX*) = g(h) - - J] ^(^').
The first terms are the same and cancel by subtraction. Using the obvious relation
<(goh) = ai(g)oGl(h)
and the expansion g = J2m>i SmXm we get the announced result. ■
In the special case / = {1} (a single term in the index set /),
Hpf = f(x)--o.f(xn,
P
and we recover the additive version (3.1) of the Dieudonne-Dwork quotient in
the case of constant coefficients (cf. generalization in (3.3) below). The following
conditions are equivalent:
(0 fm - i\/p)ofm/p e A.
(ii) am - oam/P e mA.
(iii) am = oam/p (mod mA).
(iv) am(t) = am/p(tp) (mod pvA[t]) (v = ordp m).
Definition. A sequence («m)m>i in A = Z(p)[f ] is a p-Honda sequence when it
satisfies the following Honda congruences:
am(t) = am/p(tp) (mod mZ(p)[t]) when p \ m.
3. The Hazewinkel Theorem and Honda Congruences 407
In particular, a sequence (am)m>\ in Z(p) is a p-Honda sequence when
am == am/p (mod wiZ(p)) when p | m.
The paragraph preceding the definition proves the following result.
Proposition 3. For a formal power series f(x) = ]Cm>i cimxm/m e £[[*]]
we have the equivalences
(i) Hpf — f(x) o*f(xp) has its coefficients in A C B,
P
(ii) («m)m>i is a p-Honda sequence in A.
Proposition 4. Let A be a ring, I an ideal of A containing a prime p, and x
and y two elements of A satisfying x = y (mod Ir)for some integer r > 1.
Then
pv \m ==> xm = ym (mod Ir+V) (v e N).
Proof. (1) Let us write x = y + z with z e /r. Hence
xp = (y + zf = yp + z/>(- - ■) + zp
with
zp(- -) e z/ C V • / = /r+1
and
This establishes the case m = /? (u = 1) of the lemma.
(2) The case m = pv is treated by induction on u, the basic step vh> u-f 1
being analogous to the first case already treated. Hence
xpl = ypV (mod I)r+V (t > 0).
(3) Finally, if we raise a congruence to the power I = m/pv, it is preserved: If
x' = y' (mod If, say x' = y' + zf with z' e Is, then
(jCy = (/)f+z/(-.-) €(// + /'. ■
This proposition shows that the sequence ((1 -f- -*)"%>] is a /?-Honda sequence
for any prime p. Let us state it explicitly.
Corollary. Let m > 1 be an integer divisible by p.Ifv > 1 denotes its p-adic
order, then
(1 + x)m = (i -f-*T/p (mod pvZ[x]).
408 7. Special Functions, Congruences
Proof. Observe that (1 + x)p ~ 1 -\-xp (mod p) and apply the proposition. ■
3.3. The Hazewinkel Theorem
The particular form of the Hazewinkel theorem that we are going to state and prove
has been specifically studied by various authors:
Barsky, Cartier, van Hamme, Honda,..., Zuber
(neither exhaustive nor chronological... but in alphabetical order!). It has many
applications. Let us first give the Dieudonne-Dwork theorem (2.3) in a more general
form.
Theorem (Dieudonne-Dwork). Letf(x)e l-\-xQp[t][[x]] be a formal power
series. Then the following conditions are equivalent:
(i) The coefficients of f are in Zp[t].
(ii) f(x)P/a*f(xn e 1 + P*Zp[t][[x]].
Proof. As in (2.3): Only observe at the end of the implication (ii) => (i) that the
coefficient of xn in the left-hand side is now
an/P(t)p + pan(t) + terms in pZp[t]
and in the right-hand side
c*an/p(t) = anlp(tp) = an/p(t)p (mod pZp[t]).
The conclusion follows. ■
If we know a priori that the coefficients of / are rational, namely
f(x)el+xQ[t][[x]l
we get equivalent statements:
(i) The coefficients of f are in Z(p)[f],
(ii) f(x)P/a*f(xP) e I -i-PxZip)[t][[x]]
simply since Zp D Q = Z(p). Let us come back to the notation of (3.2):
either A = Z{p)[t] cB = Q[t] or A = Zp[t] c B = Qp[t]
and
o : B -> £, a(t) = J^a/r1" «-> a(tp) = ^«^p'
3. The Hazewinkel Theorem and Honda Congruences 409
is extended to
a* : B[[x]] -> B[[x]], ]£ a^x1 h* ]£ fl,-(rlV
*>0 i>0
by letting a act on the coefficients only.
Theorem. For a formal power series f(x) = J2m>i amXm/m e B[[x]] we
have equivalent statements:
(i) Hpf = f(x) o*f(xp) has its coefficients in A C B.
P
(ii) <p = ef has coefficients in A.
Proof. (i)=>(ii) Assume that f(x) = ]£m>1 fmxm = J2m>\ amxm/m satisfies
(i). By Proposition 3 in (3.2), (am) is a/?-Honda sequence. Then Hp(f) has
coefficients in A and by the proposition in (2.3), exp pHp(f) has /^-integral coefficients
exp pHp(f) = (exp f)p/expo* f(xp) = (exp f)p/o* exp fix?)
= <p{x)p/oMxp) e 1 + pxZ(p)[t][[x]].
By the general form of the Dieudonne-Dwork criterion,
<p(x) = exp(f) e l+*A[[x]]
has ^-integral coefficients.
The proof of the converse (ii) => (i) is based on Proposition 4 in (3.2). Assume
that tp = exp(/) has /^-integral coefficients. Write
f(x) = logexp(/(*)) = log(l + (e*x) - 1)) = g(h(x)),
namely / = g o h with g(x) = log(l ■+• x) and h(x) = ef(x) — 1. Proposition 2 in
(3.2) can be applied to this composition:
Hp(f) = Hp(g oh) = Hp(g)(h) + - 5ZS °lZ™ (hMP'm - (<Kxpl)T) •
By (3.1) Hp(g) has ^-integral coefficients and by assumption, h has ^-integral
coefficients. There only remains to consider the second term, where g has constant
coefficients (independent of t)
°lgm =gm =±l/m.
Now, for all formal power series h e A[[x]] having ^-integral coefficients, we
have
h(x)pI == alh(xp') (mod p).
410 7. Special Functions, Congruences
As Proposition 4 of (3.2) shows, this congruence is improved when raised to a
power m:
h(xfm = oih(xp')'" (mod p)l'+1 (v = ordpm).
This proves
h(x)p'm -alh(xp')m <e pmA[[x]l
and the p-integrality of the remaining sum follows. ■
3.4. Applications to Classical Sequences
Proposition (Beukers). Let M be a d x d matrix with integer coefficients.
Define an — Tr(Mn). Then for any prime py (an)n>\ is a p-Honda sequence
dm = am/p (mod mZ(p)) if p \ m.
Proof. We have to prove that e* has coefficients in A = Z(p) where f(x) =
5Zw>i cimxm /m. This logarithmic generating function is easily evaluated:
/(*) = £(TrM™)^=Tr(£^)
= Tr (-Iog(l - Mx)).
Hence
exp f(x) = expTr Iog(l - Ma)-1 = detexplog ((1 - Ma)'1)
= det(l - Ma)"1 = l/det(I - Mx)
has its coefficients in A. ■
Corollary 1. The Lucas sequence
£o = 2, £, = !, *n+i = £„ + 4-i (n>l),
lv a p-Honda sequence for any prime p.
Proof. Let M = I n e M2(Z). The characteristic polynomial of M is
a2 — a — 1, hence M2 — M — I = 0 (Hamilton-Cayley). We deduce
M"+2 = Mn+X + M" (« > 0).
Since Tr I2 = 2, Tr M — 1, this proves that f „ = Tr M" is the Lucas sequence. ■
Corollary 2. The Perrin sequence
a{) = 3, «! = 0, fl2 = 2, «„+2 = a„ + fln-i (« > 0*
is a p-Honda sequence for any prime p.
3. The Hazewinkel Theorem and Honda Congruences 411
(° l l\
Proof. Let M = | 1 0 0 e M^(Z). The characteristic polynomial of M is
—x3 + x + 1, hence M3 — M — I =0 (Hamilton-Cayley). We deduce
Mn+3 = M"+1 + M" (n > 0).
Since Tr/3 = 3, TrM = 0, and TrM2 = 2, this proves that an = TrM" is the
Perrin sequence. ■
3.5. Applications to Legendre Polynomials
Let (Pn)n>o denote the sequence of Legendre polynomials. This sequence can be
defined by its generating function
where R2 = 1 — 2xt + x2. Recall that these polynomials Pn{t) e Q[f ] satisfy
deg/>„=«, P„(l)=l (#i>0).
They can be computed according to the Rodrigues formula
This formula shows that the coefficients of Pn are rational numbers with
denominators powers of 2. More precisely, the coefficients of Pn belong to (l/2n)Z. They
are ^-integral for all odd primes p.
The following generating functions are well known (they can be checked by
differentiation with respect to x):
> /V-iW— =log— —-,
£i m l~f
xm 2
Y Pm(t)— = lOg ■
Hence
\m>l /
'fe'-^H
2
exp
?* + /?'
412 7. Special Functions, Congruences
and for each odd prime p, we get two p-Honda sequences. Explicitly,
p\m=* Pm~i(t) = P{m/P)~i(tp) (mod mZ(p)[t]),
p\m=^ Pm(t) = Pm/P(tp) (mod mZ(p)[t]).
For example, to check that
write
x-t + R , jc-1 1 -jc /, „ 1-r \I/2
-i + I^(-iW^)
so that the denominator 1 — t disappears: All coefficients are in Z[~, t] c Z(p)[f].
The integrality verification for the other generating function is similar and therefore
left as an exercise. ■
The change of variable t = 1 + 2t clears the powers of 2 in the denominators,
and congruences mod 2 (or mod 4) can also be established.
3.6. Applications to Appell Systems of Polynomials
Let (An(f))„>0 be an Appell family (IV.6.1) in Zp[t]: A'n = nAn^ (n > 1). The
following result generalizes the corollary of Proposition 4 in (3.2).
Theorem (Zuber). For an Appell family (An(t))n>o in Zp\t\ the following
conditions are equivalent:
(i) There exists a eZp such that
An(a) = An/P(a) (mod nZp) (n > 1, /? | /?).
(h) There exists a eZp such that (An(a))n>\ is a Honda sequence
An(a) = An/P(ap) (mod nZp) (n>l, p\n).
(Hi) (An)n>i is a Honda sequence of polynomials
An(t) = An/P(tn (mod nZp[t]) (n > 1, p | n).
Proof, (i) «£> (ii) by the p-adic mean value theorem (V.3.2),
\ap-a\<\p\<rp (aeZp).
3. The Hazewinkel Theorem and Honda Congruences 413
Hence
\An/p{ap)-An/p(a)\<\p\-¥\<\n\9
\P\
and the equivalence follows.
(iii) r=> (ii) is obvious.
(i) r=> (iii) This uses (3.3): It is enough to show that
exp]T An(t)— = J2 Q&y^ ^0 = 1) (*)
n>\ n k>0
has p-integral coefficients, namely qk(t) e Zp[t].
(1) Let us compute the partial derivative d/dt of the defining equation (*):
or equivalently (using A'n = /2A„_i),
«>1 m>0 £>0
This gives
(2) Let us compute the partial derivative d/dx of the defining equation (*):
53 i^*"-* • exp J] • • • = J] a^**-1
or equivalently,
«>1 m>0 £>]
This gives
n+m=k,n>\
and
(3) Comparing (/) with (//),
q'k = ^oflfc-i + (* ~ Oflfc-i =(k + AQ- l)qk-\ (* > 1).
414 7. Special Functions, Congruences
Iteration leads to
qZ = (k + A0-W + A0- 2)qk-2, - • •,
q™ = (k + A0 - 1 )(* + A0 - 2) - - • (k + A0 - €)^_£ (1 < £ < *).
Now, the Taylor formula is
n<;<t \ J /
Qkit)
Since A0 eZp, all binomial coefficients are in Zp. Moreover, by assumption, all
qn(a)eZp, since (A„(a))n>] is a p-Honda sequence. We conclude that the
polynomials quit) have p-integral coefficients. ■
Exercises for Chapter 7
1. For /, g e Q[x], prove that
f = g (mod nZp[x]) <=» f = g (mod pvZ{p)[x])
(cf. Exercise 29 in Chapter I).
2. Let p be an odd prime. Show that the closure of the set of pairs (n> n !*) in Zp xZp is
the union of two graphs.
{Hint. Consider the graphs of ±Tp.)
3. Find the limit lim„_>oo Fp(pn). More precisely, can you evaluate
lim (!>(/?") -l)//>"?
4. Prove the congruence
(1 + 4fj2* = 1 + 2k+2t (mod 2*+3) (f e Z)
by induction on the integer k > 0.
5. More on the gamma function Vo-
(a) Check the formula
(~iy(2[/z/2])! , _.
(2^/2][/z/2]l)
(b) Prove T2(/z + l)r2(-/z) = (~l)1+[Crt+1)/2J (n > 1).
(c) Let m > 1 be an odd integer. Prove ]~[o<A:</n ^lik/m) = ±1.
6. For any prime p and 0 < a < p, show that
/z h-> (—IV
/?"/z!
Exercises for Chapter 7 415
has a continuous extension to Zp given by
x i-> (-\f+lrp(a + px + 1) e Z; C Q^.
Prove the following generalization. Let q = pf (/ > 1 j be a fixed power of /?, and
0 < a < q. Show that
(-p)^"'ml
admits a continuous extension Zp -^ Q^ given by
x^(-i/^(-/7)ord^: Y\ r#,([fl//>I"] + //-I"x + i).
0</</
(//i/zf. Write a telescopic product with no = a -\- qm, tif —m
(a-\-qm)\ n$\ n\\ /z/-l-
_ («o + p«i)! («i + ptn)\ (a/_i h- /wz)!
= ±/>",r#,(flo+/>ni+i)-/^2rp(ii,+i)-.-pmrp(ii/-,+i).
Observe that when the prime p is odd, q — 1 is even and (— l)^1*™ — _|_i: Hence this
sign is relevant only if p — 2, in which case it is (m) = (— l)m.)
7. With TTp~l = —p, prove
enx e \+7txZp[tt][[x\\.
7tn n n — 5n(rt) Sd(h) 1
{Hint. For n > l.ord^ — = p—± = -£-f > - )
n\ p — 1 /? — 1 /? - 1 /? — 1
8. Compute the first coefficients of the Artin-Hasse exponential Ep for p = 2, 3 (and 5).
In particular, show that
£2(X) = 1 + X + X2 + f X3 -f §X4 + X5(- • -),
£3(X) - 1 + X + X2 + iX3 + 35 X4 + *5(-' ■)-
Compute the first coefficients of the Dwork exponential for p = 2. 3 (and 5).
9. Here is another proof of the fact that the radius of convergence r of the Artin-Hasse
exponential Ep is smaller than or equal to 1.
(a) Show that if this radius r were greater than 1, then the unit sphere would be a critical
sphere of Ep. and Ep would have a zero a ^ 1 on this sphere.
(b) Use the identity
Epix)Ep^x) - ■ Ep{^-]x) = Ep(xP)
(where £ is a primitive /rth root of unity: £ ^ 1 = £p) to show that Ep would have
infinitely many zeros on the unit sphere, thus contradicting (Vl.2.1).
416 7. Special Functions, Congruences
(c) When p ^ 2, give another proof of r£ < 1 based on the identity
E,p(x)/Ep(x) = J2xPJ~1-
(Hint. Use Propositions 2 and 3 in (VI. 1.2), as well as \/Ep(x) = Ep(—x) to show
thatr1/jEp = r£p.)
10. When tt^"1 = -p and Try = x, then we have e"<y-yp) = ex+xP/p. Find the
corresponding general expression for £7r0,~~-v*) (# = p/f y > l) in terms of x = 7ry.
11. Prove the following relations for the coefficients of Dwork's exponential en^x~xq^ =
«A„ = 7TA„_i (1 < n < q), nAn = 7r(An-i - qAn-.q) (n > #).
(ffrm. Differentiate the above generating function.)
12. For 0 < a = a/(p - 1) < 1 let Ga denote the Gauss sum - J]?-"0£*(£)- Use **«
Gross-Koblitz formula to prove GaG]_a = rtp.
13. Let x be a nontrivial multiplicative character F* —> Cx, and consider the Gauss sum
G(X) = J]x(v)fv
ueF£
(where £ ^ 1 is a pth root of unity in C). Show that the complex absolute value of this
Gauss sum is
|G(X)lc = VV-
(Hint. Prove G(x)G(x) = p exactly as in the proof of Proposition 1 in (VII.2.5). There,
the Legendre symbol was a multiplicative character x such that x ^ 1 = X2- But here,
X2 may be nontrivial.)
14. Let x : Fx -► Cx be a nontrivial, complex-valued, multiplicative charaaer of F^. As
in the previous exercise, we consider the Gauss sums G(x)'- |G(x)lc = y/P- Show that
the only case when G(x) = £^/p for some root of unity e, happens when x = (t)-
(Hint. Let co : Fx —*c Cx denote an injective homomorphism, considered as a
complex-valued, multiplicative character of ¥p (analogous to the Teichmiiller
character (III.4.4)). Show that any nontrivial multiplicative character x : Fx -> Cx can be
written uniquely x = <*>~a (1 < a < p — 2). If G(co~a)2/p is a root of unity, use the
Gross-Koblitz formula to show that a = ^j--)
15. Check the following formulas by differentiation with respect to x
En xm , x-t + R
Pm-\(t)— = log—- ,
m \—t
2
Y Pm(t)— = log
,. m \-tx-\-R
Exercises for Chapter 7 417
16. Let (Bn)n>o denote the sequence of the Bernoulli polynomials. If p is an odd prime,
prove the following congruences:
Bpn(t) = Bn(tp) (modnZp) (n > 1).
For p = 2, prove that a single power of 2 is lost, i.e.,
2B2n{t) = 2Bn{t2) (mod nZ2) {n > 1).
(Hint. Use (Chapter V, Exercise 10) and (VII.3.6) for the Appell sequence fin(t) =
2pBn(t) € Zp[r].)
Specific References for the Text
1.2
A. Cuoco: Visualizing the p-adic integers, Amer. Math. Monthly 98 (1991), pp. 355-364.
A. Robert. Euclidean Models of p-adic Spaces, Proc. of the 4th Int. Conf. (Nijmegen),
M. Dekker (1997), pp. 349-361.
1.3.1
N. Bourbaki: Topologie Generale, Hermann (Paris 1974), Chap. IX §3.
1.6.8
F.K. Schmidt: Mehrfach Perfekte Korper, Mathematische Annalen 108 (1933), pp. 1-25
(cf.Th.V,p.7).
I Exercise 1
These numbers were apparently already considered by Gergonne and Lucas in the nineteenth
century and called congruent numbers:
R. Cucuuere: Jeux Maihematiques, in Pour la Science (Juin 1986), pp. 10-15.
II.1.2
N. Bourbaki: Topologie Generale, Hermann (Paris 1974), Chap. IX §3.
II.4.6
A. Robert: A Good Basis for Computing with Complex Numbers. Elemente Math. 49 (1994),
pp. 111-117.
It was pointed out to me by H. Bninotte (Diisseldorf) that the surjectivity proof given in
this article has a gap. One can deduce it from
I. Katai: Number systems in imaginary quadratic fields, Ann. Univ. Sci. Budapest, Sect.
Comp. 1994. pp. 91-93, MR#95k: 11134.
II.A
N. Bourbaki: Algebre commutative, Hermann (Paris 1964), Chap. VI §9.
A. Weil: Basic Number Theory, Springer Verlag (3d ed. 1974), Chap. 1.
420 Specific References for the Text
IIL1.6
M. Krasner: Nombre des extensions d'un degre donne d'un corps p-adique, in: Les
tendances geometriques en algebre et theorie des nombres, Colloques Internat. du CNRS
N° 143 (1966), pp. 143-169.
IIL2.4
For spherical completeness, cf. Chapter 4 of A.C.M. van Roou: Non-Archimedean
Functional Analysis
IIL2.5
B. Diarra: Ultraproduits ultrametriques de corps values, Ann. Sc. de FUniv. de Clermont
D, Serie Math. 22 (1984), pp. 1-37.
IIL3.3
S. Lang: Algebra, Addison-Wesley (2nd ed. 1984), p. 412 and 420, or (3d ed. corrected
1994), p. 347 and p. 482.
Ill Exercise 7
Trees have always played an important role in p-adic analysis. They were revived recently
by Berkovitch, Escassut, Mainetti, and others in the context of circular filters, from
which our version for balls is derived (Exercise 2 in Chapter II).
IV.1.3
J.E Adams: On the groups J(X)-III, Topology, vol. 3 (1965), pp. 193-222.
IV.1.5
L. van Hamme: Three generalizations of Mahler's expansion ..., in p-adic Analysis, Proc.
Trento 1989, ed. by F Baldassari et al., Springer Verlag (1990), Lect. Notes #1454,
pp. 356-361.
I V.4.6 For compact operators and compactoids
J.-P. Serre: Endomorphismes completement continus des espaces de Banach p-adiques,
IHES n°12 (1962), pp. 69-85.
A. CM. van Roou: Non-Archimedean Functional Analysis p. 133.
IV.5.5
L. van Hamme: Continuous operators which commute with translations..., Proc. of the 1st
Int. Conf. (Laredo), M. Dekker (1992), pp. 75-88.
G.-C. Rota, D. Kahaner, A. Odlyzko: Finite Operator Calculus, J. of Math. Anal, and
AppL, vol. 42, N° 3, (1973) pp. 684-760.
S.M. Roman, G.-C. Rota: The Umbral Calculus, Adv. in Math., vol. 27, nb. 2 (1978) pp. 95-
188.
\3
A. Robert: A note on the numerators of the Bernoullinumbers, Expositiones Math. 9 (1991)
pp. 189-191.
V.4.5
M.-C. Sarmant(-Durix): Prolongementde lafonction exponentielie en dehors de son cercle
de convergence, C.R. Acad. Sc. Paris (A), 269 (1969), pp. 123-125.
VI.1.4
For an example of entire function, bounded on Qp see the book by W. Schikhof, Ultrametric
Calculus, p. 126.
Specific References for the Text 421
VI.2.2
S. Lang: Algebra, Addison-Wesley 2nd ed. p. 215, Th. 11.2 (or 3d ed. p. 208, Th. 9.2).
VL2.3 Part (c) is already in
L. Schnirelmann: Sur les fonctions dans les corps normes ..., Bull. Acad. Sci. USSR
Math., vol. 5/6 (1938) pp. 487-497.
VI.2.6
D. Husemoller: Elliptic Curves, Springer-Verlag, GTM 111 (1987) ISBN: 0-387-96371-5
(3-540-96371-5 Berlin),
J.H. Silverman: Advanced Topics in The Arithmetic of Elliptic Curves, Springer-Verlag,
GTM 151 ISBN: 0-387-94325-0 (3-540-94325-0 Berlin).
VL3
For real/complex analysis, my favorite is
W. Rudin: Real and Complex Analysis, 2nd.ed., McGraw-Hill, New-York (1974) ISBN:
0-07-054233-3.
[Runge p. 288; Mittag-Leffler p. 291; Hadamard three-circle p. 281; Picard for essential
singularities p. 227.]
VL3.6
B. Guennebaud: Surune notion de spectre pour les algebresnormees ultrametriques (These,
Univ. Poitiers 1973).
G. Garandel: Les semi-normes multiplicatives sur les algebres d'elements analytiques
Indag. Math. 37 n° 4 (1975) pp. 327-341.
VL4.7
M. Abramowitz, I. Stegun: Handbook of Mathematical Functions, Dover (1972) ISBN:
486-61272-4 (Chap. 24 p. 824).
VI.4.8
E. Motzkin: La decomposition d'un element analytique en facteurs singuliers.
Ann. Inst. Fourier 27, fasc.l (1977), pp. 67-82.
VIL1.6
A. Robert, M. Zuber: The Kazandzidis Supercongruences. A simple Proof and an
Application. Rend. Sem. Mat. Univ. Padova, vol. 94 (1995), pp. 235-243.
VH.2.5
B.Gross, N. Koblitz: Gauss sums and the p-adic T-function, Annals of Math. 109 (1979),
pp. 569-581.
R.F. Coleman: The Gross-Koblitz formula, pp. 21-52 in Galois representations and
arithmetic algebraic geometry, Papers from a Symposium held at Kyoto Univ. (1985) and
Tokyo Univ. (1986), ed. by Y. Ihara, Adv. Studies in Pure Maths. 12, North-Holland
(1987), ISBN 0-444-70315-2.
S. Lang: Cyclotomic Fields, Springer-Verlag, GTM 59 (1978),
- . - : Cyclotomic Fields II, GTM 69 (1980).
VII.2.6
A simple proof of the Gross-Koblitz formula appears in A.M. Roben: The Gross-Koblitz
Formula Revisited, Rend. Sem. Mat. Univ. Padova (to appear in 2001).
VII.3.2
Proposition 4 appears repeatedly in various forms
N. Bourbaki: Algebre commutative, Chap. VI, lemme I, p. 157 and Chap. IX, lemme 1, p. 2.
422 Specific References for the Text
vn.3.3
M. Hazewinkel: Formal Groups and Applications, Academic Press, New-York (1978),
573 p. (contains over 500 references!).
VIL3.4
L. van Hamme: The p-adic Moment Problem, pp. 151-163 in p-adic Functional Analysis
(Santiago-Chile 1992), Editorial Univ. de Santiago.
VIL3.5
T. Honda: Two Congruence Properties ofLegendre Polynomials, Osaka Journal of Math.,
vol. 13 (1976), pp. 131-133.
A. Robert: Polynomes de Legendre mod p, in Univ. Blaise Pascal, Sem. d'analyse 1990-91,
t. 6 (pp. 19.01-19.11).
- . - : Polynomes de Legendre mod 4,
C.R. Acad. Sc. Paris, t. 316, Serie I (1993), pp. 1235-1240.
VII.3.6
M. Zuber: Proprietes de congruence de certaines families de polynomes, C.R. Acad. Sc.
Paris, t. 315, Serie I (1992), pp. 869-872.
Bibliography
General
Y. Amice: Les nombres p-adiques, PUF, Collection Sup. "Le mathematicien" 14 (Paris
3975).
G. Bachmann: Introduction to p-adic numbers and valuation theory. Academic Press,
New York (3964).
A. Escassut: Analytic Elements in p-adic Analysis, World Scientific (1995) ISBN: 983-
02-2234-3.
F. Gouvea: p-adic Numbers: An Introduction, Springer-Verlag, Universitext (1993) ISBN :
0-387-56844-1 (3-540-56844-1 Berlin).
N. Koblitz: p-adic Numbers, p-adic Analysis and Zeta-Functions, Springer-Verlag, GTM
58 (1977, 3984) ISBN: 0-387-90274-0 (3-540-90274-0 Berlin).
K. Mahler: p-adic Numbers and their Functions, Cambridge University Press, Cambridge
tract 76 (2nd ed. 3981).
A. Moxxa: Analyse non-archimedienne. Springer-Verlag, New York (1970).
A.C.M. van Roou: Non-Archimedean Functional Analysis. Marcel Dekker, Pure and Appl.
Math. 51(1978).
W.H. Schikhof: Ultramciric Calculus, An Introduction to p-adic Analysis, Cambridge
Studies in Adv. Math. 4, Cambridge Univ. Press (1984). ISBN: 0-523-24234-7.
J-P. StRRE: Coursd'arithmetique,PUF, Collection Sup. "Le mathematicien'" 2 (Paris 1970),
Tnglish translation: A Course in Arithmetic. Springer-Verlag (1973).
424 Bibliography
Advanced, More Specialized
S. Bosch, U. Guntzer, R. Remmert: Non-Archimedean Analysis, A Systematic Approach to
Rigid Analytic Geometry, Springer-Verlag, Grundlehren Nr. 261 (1984).
B. Dwork: Lectures on p-adic Differential Equations, Springer-Verlag, Grundlehren Nr. 253
(1982).
B. Dwork, G. Gerotto, F.J. Sullivan: An Introduction to G-functions, Ann. of Math. Studies
133, Princeton Univ. Press (1994), ISBN: 0-691-03681-0.
J. Fresnel, M. van der Put: Geometrie Analytique Rigide et Applications, Birkhauser,
Progress in Math. 18 (1981).
L. Gerritzen, M. van der Put: Schottky Groups and Mumford Curves, Springer-Verlag,
L.N. in Math. 817 (1980).
Proceedings of Congresses in p-adic Functional Analysis
J. Bayod et al. (editors): p-adic Functional Analysis, Proc. of the 1st Int. Conf. (1990,
Laredo, Spain) Lect. Notes in pure and appl. math. 137, Marcel Dekker (1992).
N. de Grande-de Kimpe et al. (editors): p-adic Functional Analysis, Proc. of the 2nd Int.
Conf. (1992, Santiago. Chile) Editorial Universidad de Santiago (1994).
A. Escassut et al. (editors): p-adic Functional Analysis* Proc. of the 3d Int. Conf. (June
1994, Clermont-Ferrand, France) Annales Math. Blaise Pascal, vol. 2, N°l (1995).
W.H. Schikhofet al. (editors): p-adic Functional Analysis, Proc. of the 4th Int. Conf. (June
1996, Nijmegen, The Netherlands) Lect. Notes in pure and appl. math. 192, Marcel
Dekker (1997) ISBN: 0-8247-0038-4.
J. Kakol et al. (editors): p-adic Functional Analysis. Proc. of the 5th Int. Conf. (June 1998,
Poznan, Poland) Lect. Notes in pure and appl. math. 207, Marcel Dekker (1999) ISBN:
0-8247-8254^2.
A.K. Katsaras et al. (editors): p-adic Functional Analysis, Proc. of the 6th Int. Conf. (July
2000, loannina, Greece) (to appear).
Tables
Field
Q2
Qp
p odd prime
Units
ZJ = 1+2Z2
Z^Dl+pZ,
index p-1
Squares
1+8Z2
index 4
inZJ
index 2
inz;
Roots of unity
M2 = {±1}
/V-i
Number of
quadratic
extensions
7
3
Field 3B<i D #<i Residue field Nonzero |. |
Properties
QpZP3 pip Fp pz
KD RD P = jtR Vq(q-=Pf) Mz =/?*
QapD Aa D Ma ka = ¥ap = Fpoo p^
CpDXpD Mp
QPD AQD Mq
*", = *>
uncountable
R>
locally compact
ef — dimQp K < 00
locally compact
algebraically closed
not locally compact
algebraically closed
complete
algebraically closed
spherically complete
426 Tables
Umbral calculus
Delta
operator
(IV5)
D = d/dx
(IV5.5)
Basic sequence
of polynomials
(IV.5.2)
r I
umbral
operator
(Pn)n >0
j __ p„{x+ny)\
translation principle
Related
sequences
(IV.6.1)
Appell sequences
Dpn = npn^x
Sheffer sequences
8sn —nsn-i
Binomial identity: pn(x + y) = "(p(x) + /?(>))","
Appell sequences: pn(x -\-y) = "(p(x) + y)n,"
Sheffer sequences: s„{x + y) = " (s(x) + /?(y))V
Analytic elements
Formal
power series
power series
converging in
1*1 < r
power series
bounded in
W<1
analytic
elements in
W<1
analytic
elements in
W<l
polynomials
Cp[[x}]
M(MP)
H(AP) = Cp{x]
Cp[x]
Sequences
Iimsup|tf„|1/'7 < 1
(rf > 1)
bounded
sequence
Christol-Robba
condition
(4.6)
(n —> oo)
an y£ 0 for
finitely many «'s
«>0
ex
CO
Tables 427
Radius of convergence of some exponential series
(listed in increasing order)
Ep (Artin-Hasse) El (Dwork)
/ e e*+f expE7>0^ **-*•> (* =p/)
Zp-1
Tf rp = \p\7=* rpp 1 p p*
Basic Principles of Ultrametric
Analysis in an Abelian Group
(1) The strongest wins
M > \y\ => \x + y\ = \x\.
(2) Equilibrium: All triangles are isosceles (or equilateral)
a + b + c = 0,\c\ < \b\ => \a\ = \b\.
(3) Competitivity
f/zere w i ^ y a-wc/z//ior |az| = |fl7-1 = max \ak\.
(4) A dream realized
(tf/?)/2>o ^ fl Cauchy sequence <=> d(an, an+\) —► 0.
(5) Another dream come true (in a complete group)
2_\ >nan conver8es ^^ fl/2 —> 0.
When SZ^ofln converges, Y,n>0 \an\ may diverge, but
| £fl„| < sup \an\ = max 1^1
and the infinite version of (3) is valid.
(6) Stationarity of the absolute value
an —> a =£0 => there is N with \an\ = \a\forn > N.
Conventions, Notation, Terminology
We use the abbreviations
iff "'if and only if," := "equal by definition,'* = nontrivial equality
■ is the "end of proof (or "absence of proof) sign.
In a statement: (/"), («),... always denote equivalent properties.
In the table of contents, an asterisk * before a section indicates that it will not be used later
and may be omitted in a first reading.
Set Theory
V{E) power set of E\ Set of subsets of E\ 0: Empty set.
A c B means "jc e A => jc e B" hence: A c E <f=> A e V(E).
(certain authors denote this inclusion by c).
When A <z B <z E, B — A — B\A denotes the complement of A in £,
E — A = Ac is the complement of a subset A c E.
A subset of E having only one element is a singleton set: x e E =» {jc} eV(E).
\J ■ Disjoint union symbol, partition of a set.
E1: Set of families (or functions) I -> E.
E^); Set of families I -> E having components equal to the base point
of E (the neutral element in a group C the 0 in a ring A ...)
except for finitely many indices.
Let / : E -> F,x \-+ f{x) be a map. Then
/ is infective when x ^= y ==» /(*) ^ /(>), namely / is one-to-one,
or equivalently when /(jc) = /(y) =» x — y,
/ is surjective when /(£) = F (namely / is orcto),
/ is bijective when it is one-to-one and onto.
432 Conventions, Notation, Terminology
The characteristic function of a subset A c E is the function
{1 if x e A,
0 if x <£ A.
Fundamental Sets of Numbers
N = {0, 1,2,...,«,...} C Z c Q C R C C, N* = {1. 2,...,«,...} = N>0-
When /? e {2, 3, 5, 7, 11,...} is a prime, ¥p = Z/pZ.
/? | « means /? divides n, p\n means p does not divide n,
pv \\n means that /?v is the highest power of p dividing n,
R>0 = [x 6 R : x > 0}, R>o = {x eR:x > 0}, [a, b): interval a < x < b.
Z(p) = {«/&: a e Z, & > 1, b prime to /?} C Q.
Z[l/p] = [apv :aeZ, ieZ)cQ.
When a > 0 and 5 C R, as = {as : s e S] c R>o, e.g.. pz C pQ C R>o
fjc] G Z integral part of x e R: [jc] < x < [*] + 1.
(jc) fractional part of jc € R: x = [x] + (jc).
gcd: Greatest common divisor; 1cm: Least common multiple.
&if Kronecker symbol (= 1 if i =■ j, = 0 otherwise).
Groups, Rings and Modules
Ax: Multiplicative group of units (i.e.. invertible elements) in a ring A.
A[X]: Polynomial ring in one indeterminate X and coefficients in the ring A,
a monic polynomial / is a polynomial having leading coefficient 1:
Xn + flu-i*"-1 + - - - + flo if deg / = «■
A[[X|]: Formal power series ring.
A {A'}: Restricted power series over a valued ring A
(Chapter V: Power series with coefficients -> 0) A[X] cAfflc A[[X]].
An integral domain is a commutative ring A ^ {0} having no zero divisor.
K = Frac A: Fraction field of an integral domain A. In particular,
#(X) = Frac A[X]: Rational fractions,
K((X)) = Frac A[[X]] p tf(X)): Formal Laurent series ring.
A[\/q]: Partial fraction ring corresponding to denominators in {1. q, q2,...},
where q is not a zero divisor in the ring A.
If G is an abelian group, then [g e G : gn = e for some integer n > 1}
is the torsion subgroup of G: In particular,
M(A) denotes the group of roots of unity in a commutative ring A,
li = //(Cx) = jipoc x fiiph where
pLp^: /?th-power roots of unity (/?-Sylow subgroup of ji),
ji{p)\ Roots of unity having order prime to p,
jin{A) — [x e A : xn = 1}: nth roots of unity in the ring A.
f g
A pair of homomorphisms A -> B —> C is exact when /(A) = kerg.
A short exact sequence (SES) is an exact pair with
/ injective and g surjective; hence C is a quotient of B by f(A) = A,
f s
written 0 ^ A ^ B -> C —* 0 for additive groups
(replace 0 by 1 for multiplicative groups).
Conventions, Notation, Terminology 433
Fields, Extensions
Characteristic of a field K: Either 0 or the prime p such that p ■ 1# = 0 e K,
in which case the prime field ¥p is contained in K.
For each prime /?, the group F* is cyclic; when the prime p is odd, the squares in F* make
up a subgroup of index two. kernel of the Legendre symbol (- J = ± 1.
In a field (or a ring) of characteristic p we have (x + y)p = xp + yp.
Ka: Algebraic closure of a field A'; when K — Ka is algebraically closed of characteristic
0, fin(K) is cyclic and isomorphic to Z/nZ.
Pl(K) = K U {oo} denotes the projective line over the field K.
Topology, Metric Spaces
The closure of a subset A c X (X being a topological space) is denoted by A.
A Hausdorff space is a topological space X in which for every pair of distinct points, it is
possible to find disjoint neighborhoods of these points: Equivalently, the diagonal Ax is
closed in the product X x X.
The diameter of a subset A c X with respect to a metric d is
diam(A) = S(X) — sup^. yeA d(x, y) < oo.
We say that A is bounded when diam(>4) < oo.
The distance of a point x e X to a subset A c X is */(*, A) = inffl€>i */(*, a),
d(x, A) = Q <==$> x e A.
The balls in a metric space (X, d) are denoted by
Bsr(a) = B<r(a\X) = {* e X : </(*, a) < r}: closed (dressed) ball,
B<r(a) = B<r(a\ X) = {x e X : d(x, a) < r}: open (stripped) ball.
For a ball with centers equal to the base point (the neutral element in a group, the 0 element
in a ring), the notation will be just /?<,., B<r.
The sphere of radius r > 0 and center a in the metric space (X, d) is
£,.(0) = (xeX : d(x, a) = r} = B<r(a) - B<r(a).
A metric space is separable if it has a countable dense subset.
C(X\ K): Space of continuous functions X —> AT, or simply C(X) when A' is understood;
C/>(X; AT): Subspace consisting of the bounded continuous functions (when AT is a valued
field). The sup norm of a bounded function is
ll/ll - ll/llx - sup |/(x)| (/ e Cb(X; K)).
xeX
Index
A
absolute value II. 1.3
— overQH.2.1
algebraic variety 1.6.1
Amice-Fresnel theorem VI.4.4
analytic element VI.4.2
Appell sequence IV.6.1
Artin-Hasse exponential VII.2.1
B
Baire space III. 1.4
balanced subset II.A.6
balls (stripped and dressed) II. 1.1
Banach space (ultrametric —) IV.4.1
basic system of polynomials IV.5.2
Bell (numbers and polynomials) IV.6.3
Bell-Carlitz polynomials (IV, exercise 19)
Bernoulli (numbers and polynomials) V.5.4
Beukers proposition VII.3.4
binomial identities IV.5.2
— polynomial IV. 1.1
C
Cantor set 1.2.2
carry (operations in basis p) 1.1.2
Chebyshev polynomials (V, exercise 7)
Christol-Robba theorem VI.4.6
Clausen-von Staudt theorem V.5.5
clopenset IL1.1
commutant (bicommutant) IV.5.3
composition operator IV.5.3
continuity of roots of equations III. 1.5
continuous retraction I.A.6
convexity (and duality) VI. 1.4
covering of circle I. A. 1
critical radius VI. 1.4
cyclotomic polynomial (- units) II.4.2
D
delta operator IV.5.1
diagonal (in a Cartesian product) 1.3.3
Dieudonne-Dwork criterion VII.2.2, VII.3.3
differential quotient (higher order—) V.2.4
divisible group III.4.1
dominant (monomial) VI. 1.4
dressed ball 11.1.1
duality (convexity theory) VI. 1.6
Dwork series VII.2.3
E
Eisenstein (irreducibility criterion)
11.4.2
— polynomial II.4.2
entire function VI.2.3
enveloping ball BD VI.4.1
equivalent absolute values II. 1.7
— norms 11.3.1,111.3.2
Euclidean model 1.2.5
extension of absolute values
— existence 11.3.4
— uniqueness II.3.3
436 Index
K
Fibonacci numbers (IV, exercise 20)
filters III.A. 1
finiteness (extensions of Qp of given degree)
III.1.6
formal power series 1.4.8, IV.5, VI. 1
fractal subset 1.2.3
fractional part {x) 1.5.4
fundamental inequalities III.4.3
gamma function Vp (Morita) VII. 1.1
Gaussian 2-adic numbers II.4.5
Gauss multiplication formula for Vp
VII. 1.3
— normV2.1
generalized absolute value II.2.2
— ball VI.3.1
— over Q II.2.4
— Taylor expansion IV.5.2
Gould polynomials (IV, exercise 21)
granulation, type of— V.1.2
growth modulus VI.1.4, VI.3.3
H
Haar measure II. A. 1
Hahn-Banach theorem (/?-adic) IV.4.7
Hadamard formula (radius of convergence)
VI. 1.2
— three-circle theorem VI.2.6
Hazewinkel maps VII.3.2
— theorem VII.3.3
HensePs lemma 1.6.4, II. 1.5
hexagonal field (3-adic numbers) II.4.6
Honda sequence (and congruences)
VII.3.2
homothety (= dilatation) 1.5.6, VI.3.1
I
IFS (iterated function system) 1.2.5
indecomposable compact space I.A.6
indefinite sum IV1.5
Ingleton theorem IV.4.7
index with respect to a hole VI.3.5
infinite product VI.2.3
infraconnected set VI.4.1
injective Z-module III.4.1
/^-integer 1.5.4
integral part (p-adic) [jc] 1.5.4
inverse system (— projective system) 1.4.2
involution a 1.1.2
irreducibility criterion (Eisenstein) II.4.2
isolated singularity VI.2.6
Iwasawa logarithm Log V.4.5
Kazandzidis congruences VII. 1.6
Krasner's lemma III. 1.5. III.3.2
Legendre polynomials VII.3.5
— relation for r^VII.l.2
— quadratic residue symbol 1.6.6
length of an expansion in basis p IV.3.2
— of a word 1.2.4
linear fractional transformation VI.3.1
Liouville theorem VI.1.4
Lipschitz function V.1.5
locally analytic function VI.4.7
locally constant function IV.3.1
local ring II. 1.4
Lucas sequence VII.3.4
M
Mahler series IV.2.3
— theorem IV.2.4
maximum principle VI.2.5, VI.2.6
mean value theorem V.3.2, V.3.4
metric, /?-adic —1.2.1
Mittag-Leffler theorem VI.3.4, VI.4.5
Mobius function n(n) VII.2.1
module of an automorphism II .A. 1
Monna-Fleischer theorem IV.4.5
Motzkin factorization VI.3.5, VI.4.8
multiplicative norm VI.1.4, VI.3.6
N
Newton algorithm 1.6.4
— approximation method 1.6.3
— polygon VI. 1.6
normal basis (ultrametric Banach space)
IV.4.2
O
order of composition operator IV.5.3
order of formal power series IV.5.3, VI. 1.1
/?-adic integer 1.1.1. — number 1.5.1
— metric 1.2.1
Perrin sequence VII.3.4
Picard theorem (essential singularity) VI.2.6
Pochhammer symbol (x)n IV. 1.1
principal ideal domain 1.1.6
— pan at a pole VI.3.2
projective limit (inverse system) 1.4.2
Index 437
Q
quadratic residue symbol (Legendre) 1.6.6
R
radius of convergence VI. 1.2
— (exp and log) V.4.1
ramification index II.4.1
reduction mod p 1.1.5
— of ultrametric Banach space IV.4.3
regular radius VI.1.4
representation theorem IV.4.4
residue degree II.4.1
— field II. 1.4
restricted factorial VII. 1.1
— formal power series V.2.1
Rodrigues formula VII.3.5
Rolle's theorem VL2.4
roots of unity in C 1.5.4
— inC^III.4.2
Runge theorem VI.4.2
S
saturated set 1.3.3
Schnirelman's theorem VI.2.3
self-similarity dimension 1.2.3
Sheffer polynomials, — sequences IV6.1
Sierpinsky gasket 1.2.5
solenoid Sp I.A.I
spherically complete metric space III.2.4
stereographic projection I.A.6
Stirling numbers (1 st and 2nd kind)
VI.4.7
Strassman theorem VI.2.1
strict differentiability V1.1
stripped ball II. 1.1
support (of a family) I V.4.1
—- differentiability V.l.l
T
tame ramification II.4.1
Tate homomorphism xp 1.5.4
Teichmiiller character III.4.4
topological field 1.3.7
— group 1.3.1
— ring 1.3.6
totally disconnected 1.2.1
— ramified (extension) II.4.1
transition map (inverse system) 1.4.2
translation principle IV.5.5
type of a granulation V.1.2
U
ultrafilterIII.A.2
ultrametric absolute value II. 1.3
— Banach space IV.4
— distance, — space II. 1.1
— field II. 1.3
— group II. 1.2
ultraproduct IIL2.2
uniformly equivalent metrics 1.2.1
unit (/?-) 1.5.4
universal field Qp III.2.2
universal property of inverse limits 1.4.2
unramified extension II.4.1
maximal —11.4.4,111.1.2
V
valuation of n ! V.3.1
— polygon VI. 1.6
— subringll.1.4
valued field 1.3.7
van der Put sequence IV.3.2
— theorem IV.3.3
van Hamme theorem IV.5.4
Volkenborn integral V.5.1
W
wild ramification II.4.1
Wilson congruence VII. 1.1
Z
Zuber theorem VII.3.6