Текст
                    Lecture Notes in Mathematics 1358
Edilois:
A. Dokl. Heidelberg
F. Takcns, Groningcri
B. Tcissicr, Paris


Springer Berlin Heidelberg New York Barcelona Hong Kong London, Milan Paris Singapore Tokyo
David Mumford The Red Book of Varieties and Schemes Second, Expanded Edition Tnciudes the Michigan Lectures A974) on Curves and their Jacobians Springer
Authors David В. Mumibrd Division of Applied Mathematics, Box F Brown University Providence, RI 02912, USA E-mail: David_Mumford(E?bro\vn edu Enrico Arbarelio Scuola Normale Superiore 7, Piazza dei Cavalieri 56126 Pisa, Italy E-mail: ea@bibsns.sns.it Cataloging-in-Pubiication Data applied for Die Deutsche Bibliothck - CIP-Einheitsaufnahme Mumford, David: Tlie red book of varieties and schemes : includes the Michigan lectures A974) on curves and their Jacobians / David Mumford. - 2., expanded ed. - Berlin ; Heidelberg ; New York ; Barcelona ; Hong Kong ; London ; Milan ; Paris ; Singapore ; Tokyo : Springer, 1999 (Lecture notes in mathematics ; 1358) ISBN 3-540-63293-X Mathematics Subject Classification A991): 14-01, 14A15, !4НЮ. 14Н42, 141145 ISSN 0075-8434 ISBN 3-540-63293-X Springer-Vcrlag Berlin Heidelberg New York This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or pans thereof is permitted only under the provisions of the Geiman Copyright Law of September 9, 1965. in its current version, and permission for use must always he obtained from Springer-Vcrlag. Violations are liable for prosecution under the German Copyright Law. © Springer-Verlag Berlin Heidelberg 1999 Printed in Germany The use of general descriptive' names, registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevani protective laws and regulations and therefore free for general use. Typesetting: 1. Jcbram using a Springcr-TgX macro-package SPIN: 10568814 41/3143-543210 - Printed on acid-free paper
Preface to the Second Edition At the same time that the first edition of the Springer Lecture Notes version of the "Red Book of Varieties and Schemes" was sold out, the supply of my lecture notes "Curves and their Jacobians", published by the University of Michigan Press, was also exhausted and the copyright returned to me. These two sets of notes have similar purposes. At one time, I had had ambitious plans to write a multi-volume introduction to algebraic geometry and specifically to moduli problems, this being the part of the field which I knew best. Before writing these, I gave lectures on various occasions to explain the ideas informally with the idea that these might be a first draft for parts of a longer more polished version. What I found, however, was that these first drafts, written informally, with pictures and diagrams, printed in a typewriter font and, above all, being short, were more appealing to many readers than a full-blown Grundlehrcn-style book. Therefore, Springer-Verlag and I made the plan to keep both sets of notes in print a bit longer, reprinting them together, with the original "Red Book" as part one and "Curves and their Jacobians" as part two. The "Red Book" originated in mimeographed lectures delivered at Harvard in the 60's, whose primary goal was to show that the language of schemes was fun- fundamentally geometric and expresses the intuitions of algebraic geometry clearly. 1 "Curves and their Jacobians", on the other hand, is a write-up of four lectures given at the University of Michigan in 1974, whose goal was to give a bird's eye view of curves, their moduli spaces, their jarobians and how these relate to each othci. Although the styles differ because the "Red Book" was addressed to stu- students and "Curves and their Jacobians" to faculty as well, both try to describe ideas as informally and simply as possible. Since my research now is in stochastic methods in signal processing and image analysis (which I like to call "Pattern Theory" following Grenander), I have not kept up with the literature on the1 moduli of curves. The last section of the uriginal notes entitled "Guide to the Literature and References" is quite out of date and it is impossible for me to bring it up to date, although I have added reference.s to Harris' work with me and Eisenbud on the Kodaira dimension uf the Moduli Space. 1 (Added in proof) Rereading Chapter 3, I reali' e that I particularly wished to show that Zariski's work, on which I had grown up, could be recast, with no loss of its force, in the language of schemes.
VI Preface to the Second Edition For the Schottky problem, there has been a great deal of progress and Profes- Professor Enrico Arbarello has generously contributed a new section "Supplementary Bibliography on the Schottky Problem" to give pointers to this work. Another useful reference for recent work is the book "The Moduli Space of Curves", edited by Dijkgraaf, Faber and van dcr Geer, Birkhauscr-Boston. Cambridge, Mass. May, 1999. David Mumford
Preface to the First Edition These notes originated in several classes that I taught in the mid 60's to intro- introduce graduate students to algebraic geometry. I had intended to write a book, entitled "Introduction to Algebraic Geometry", based on these courses and, as a first step, began writing class notes. The class notes first grew into the present three chapters. As there was a demand for them, the Harvard mathematics de- department typed them up and distributed them for a while (this being in the dark ages before Springer Lecture Kotos came to fill this need). They were called "In- "Introduction to Algebraic Geometry: -preliminary version of the first 3 chapter^ and were bound in red. The intent was to write a much more inclusive book, but as the years progressed, my ideas of what to include in this book changed. The book became two volumes, and eventually, with almost no overlap with these notes, the first volume appeared in 1976, entitled "Algebraic Geometry I: complex protective varieties". The present plan is to publish shortly the second volume, entitled "Algebraic Geometry II: schemes and cohomology", in collabo- collaboration with David Eisenbud and Joe Harris. David Gieseker and several others have, however, convinced me to let Springer Lecture Notes reprint the original notes, long out of print, on the grounds that they serve a quite distinct purpose. Whereas the longer book "Al- "Algebraic Geometry" is a systematic and fairly comprehensive exposition of the basic results in the field, these old notes had been intended only to explain in a quick and informal way what varieties and schemes are, and give a few key examples illustrating their simplest properties. The hope was to make the basic objects of algebraic geometry as familiar to the reader as the basic objects of differential geometry and topology: to make a variety as familiar as a manifold or a simplicial complex. This volume is a reprint of the old notes without change, except that the title has been changed to clarify their aim. The weakness of these notes is what had originally driven me to undertake the bigger project: there is no real theorem in them! I felt it was hard to convince people that algebraic geometry was a great and glorious field unless you offered them a theorem for their money, and that takes a much longer book. But for a puzzled non-algebraic geometer who wishes to find the facts needed to make sense of some algebro-gcomctric statement that they want to applv, these notes may be a convenient way to learn quickly the basic definitions. In twenty years of giving colloquium talks about algebraic geometry to audiences of mostly non-
VIII Preface to the First Edition algebraic geometers, I have learned only too well that algebraic geometry is not so easily accessible, nor are its basic definitions universally known. It may be of some interest to recall how hard it was for algebraic geome- geometers, even knowing the phenomena of the field very well, to find a satisfactory language in which to communicate to each other. At the time these notes were written, the field was just emerging from a twenty-year period in which every researcher used his own definitions and terminology, in which the "foundations" of the subject had been described in at least half a dozen different mathematical "languages". Classical style researchers wrote in the informal geometric style of the ftalian school, Weil had introduced the concept of specialization and made this the cornerstone of his language and Zariski developed a hybrid of algebra and geometry with valuations, universal domains and generic points relative to various fields к playing important roles. But there was a. general realization that not all the key phenomena could be clearly expressed and a frustration at sacrificing the suggestive geometric terminology of the previous generation. Then Grothendieck came along and turned a confused world of researchers upside down, overwhelming them with the new terminology of schemes as well as with a huge production of new and very exciting results. These notes attempted to show something that was still very controversial at that time: that schemes really were the most natural language for algebraic geometry and "that you did not need to sacrifice geometric intuition when you spoke "scheme". I think this thesis is now widely accepted within the community of algebraic geometry, and I hope that eventually schemes will take their place alongside concepts like Banach spaces and cohomology, i.e. as concepts which were once esoteric and abstruse, but became later an accepted part of the kit of the working mathematician. Grothendieck being sixty this year, it is a great pleasure to dedicate those notes to him and to send him the message that his ideas remain the framework on which subsequent generations will build. Cambridge, Mass. Feb. 21, 1988.
Table of Contents Preface to the Second Edition V Preface to the First Edition VII Table of Contents IX I. Varieties 1 §1. Some algebra, 1 §2. Irreducible algebraic sets 5 §3. Definition of a morphism 11 §4. Sheaves and affine varieties 16 §5. Definition of prevaricties and morphisins 25 §6. Products and tho Hausdorff Axiom 33 §7. Dimension 40 §8. The fibres of a morphism 48 §9. Complete varieties 54 § 10. Complex varieties 57 II. Preschemes 65 §1. Spec (R) 6G §2. The category of preschemes 77 §3. Varieties and preschemes 86 §4. Fields of definition 94 §5. Closed subprrachemes 103 §6. The functor of points of a prescheme 112 §7. Proper morphisms and finite morphisms 121 §8. Specialization 127 III. Local Properties of Schemes 137 §1. Quasi-coherent, modules 138 §2. Coherent modules 146 §3. Tangent cones 153 §4. Non-singularity and differentials 164 §5. Etale morphisms 174 §6. Unifonwing parameters 183
X Table of Contents §7. Non-singularity and the UFD property 187 §8. Normal varieties and normalization 196 §9. Zariski's Main Theorem 207 §10. Flat and smooth morphisms 214 Appendix: Curves and Their Jacobians 225 Lecture I: What is a Curve and How Explicitly Can We Describe Them? 229 Lecture II: The Moduli Space of Curves: Definition, Coordinatization, and Some Properties 243 Lecture III: How Jacobians and Thota Functions Arise 257 Lecture IV: The Torelli Theorem and the Schottky Problem 271 Survey of Work on the Schottky Problem up to 199G by Enrico Arbarello 287 References: The Red Book of Varieties and Schemes 293 Guide to the Literature and References: Curves and Their Jacobians 294 Supplementary Bibliography on the Schottky Problem by Enrico Arbarello 301
I. Varieties The basic object of study in algebraic geometry is an arbitrary prescheme. How- However, among all preschemes, the classical ones known as varieties are by far the most accessible to intuition. Moreover, in dealing with varieties one can carry over without any great difficulty the elementary methods and results of the other geometric categories, i.e., of topological spaces, differentiable manifolds or of analytic spaces. Finally, in any study of general preschemes, the varieties are bound, for many reasons which I will not discuss here, to play a unique and cen- central role. Therefore it is useful and helpful to have a basic idea of what a variety is before plunging into the general theory of preschemes. We will fix through- throughout an algebraically closed ground field к which will never vary. We shall restrict ourselves to the purely geometric operations on varieties in keeping with the aim of establishing an intuitive and geometric background: thus we will not discuss specialization, nor will we use generic points. This set-up is the one pioneered by Serre in his famous paper "Faisceaux algebriques coherents". There is no doubt that it is completely adequate for the discussion of nearly all purely geometric questions in algebraic geometry. §1. Some algebra We want to study the locus V of roots of a finite set of polynomials fi (X\,..., Xn) in kn (k being an algebraically closed field). However, the basic tool in this study is the ring of functions from V to к obtained by restricting polynomials from kn to V. And we cannot get very far without knowing some- something about the algebra of such a ring. The purpose of this section is to prove 2 basic theorems from commutative algebra that are key tools in analyzing these rings, and hence also the loci such as V. We include these results because of their geometric meaning, which will emerge gradually in this chapter (cf. §7). On the other hand, we assume known the following topics in algebra: 1) The essentials of field theory (Galois theory, separability, transcendence de- degree). 2) Localization of a ring, the behaviour of ideals in localization, the concept of a local ring. 3) Noetherian rings, and the decomposition theorem of ideals in these rings. 4) The concept of integral dependence, (cf., for example, Zariski-Samuel, vol. 1). D. Mumford: LNM 1358, pp. 1-63, 1999. © Springer-Verlag Berlin Heidelberg 1999
2 I. Varieties The first theorem is: Noether's Normalization Lemma. Let R be an integral domain, finitely generated over a field k. If R has transcendence degree n over k, then there exist elements x\,..., xn € R, algebraically independent over k, such that R is integrally dependent on the subring k[xi,... ,xn] generated by the x's. Proof (Nagata). Since R is finitely generated over k, we can write R as a quo- quotient: R = k[Y1,...,Ym]/P , for some prime ideal P. If m = n, then the images yi,...,ym of the V's in R must be algebraically independent themselves. Then P = @), and if we let Xi = ?/i, the lemma follows. If m > n, we prove the theorem by induction on m. It will suffice to find a subring S in R generated by m — 1 elements and such that R is integrally dependent on S. For, by induction, we know that S has a subring k[xi,...,xn] generated by n independent elements over which it is integrally dependent; by the transitivity of integral dependence, R is also integrally dependent on k[x\,...,xn] and the lemma is true for R. Now the m generators y\,..., ym of R cannot be algebraically independent over к since m > n. Let /B/l,---,2/m) =0 by some non-zero algebraic relation among them (i.e., / (yi,..., ym) is a non-zero polynomial in P). Let ri,... ,rm be positive integers, and let Z2 = 2/2 - y{2 , Z3=y3-yl3,...,Zm=ym- J/[m • Then /B/i,32 +y[2,...,zm +y[m) = 0 , i.e., yi,Z2,...,zm are roots of the polynomial / (Yi, Z2 + Y{2 ,...,Zm + Y{m). Each term a • YI y^ in / gives rise to various terms in this new polynomial, i=l including one monomial term „ „,bi+r2b2 + ...+rmbm a' V\ A moment's reflection will convince the reader that if we pick the n's to be large enough, and increasing rapidly enough: o«cr2 «;r3«: ...«;rm, then these new terms a • Y^l+'"+Tmbm will all have distinct degrees, and one of them will emerge as the term of highest order in this new polynomial. Therefore, / (YUZ2 +Y1r2,...,Zm + Y1r™)=b- Y? + [ terms of degree < N] , (b ф 0). This implies that the equation / {y\, z2 + y[2,. • •, zm + y[m) = 0 is an equation of integral dependence for y\ over the ring k[z2,... ,2m]. Thus y\ is integrally dependent on k[z2,.. .,zm], so y2,... ,ym are too since yi = Z{ + y7^ (i = 2,..., m). Therefore the whole ring R is integrally dependent on the subring S = к [z2,..., zm]. By induction, this proves the lemma. □
§1. Some algebra 3 The second important theorem is: Going-up theorem of Cohen-Seidenberg. Let R be a ring (commutative as always) and S С R a subring such that R is integrally dependent on S. For all prime ideals P С S, there exist prime ideals P' С R such that P' П S = P. Proof. Let M be the multiplicative system S — P. Then we may as well replace R and S by their localizations Rm and Sm with respect to M. For Sm is still a subring of Rm, and Rm is still integrally dependent on Sm- In fact, we get a diagram: 3 R RM U U S SM • i Moreover Sm is a local ring, with maximal ideal Рм = i(P) • Sm and P = ъ~1{Рм)- If P* С Rm is a prime ideal of Rm such that P* П Sm = Рм, then j~1{P*) is a prime ideal in R such that Г1^*) ИЗ = Г1 (P*nSM) = г-г(Рм) = P Therefore, it suffices to prove the theorem for Rm and Sm- Therefore we may assume that S is a local ring and P is its unique maximal ideal. In this case, for all ideals AcR, AnSCP.I claim that for all maximal ideals P' С S, P' П S equals P. Since maximal ideals are prime, this will prove the theorem. Take some maximal ideal P'. Then consider the pair of quotient rings: R R/P' U U S S/SDP1 . Since P' is maximal, R/P' is a field. If we can show that the subring S/SilP' is a field too, then S П P' must be a maximal ideal in S, so S П P' must equal P and the theorem follows. Therefore, we have reduced the question to: Lemma. Let R be a field, and S С R a subring such that R is integrally depen- dependent on S. Then S is a field.
4 I. Varieties Note that this is a special case of the theorem: For if S were not a field, it would have non-zero maximal ideals and these could not be of the form P' C\S since R has no non-zero ideals at all. Proof of lemma. Let a € S, а ф 0. Since R is a field, I/a € R. By assumption, I/a is integral over S, so it satisfies an equation I" + hXn-1 + ... + bn = 0, bi € S. But this means that Multiply this equation by an~1 and we find - = -ъг - ab2 - ... - an~16n € S. a Therefore S is a field. □ Using both of these results, we can now prove: Weak Nullstellensatz. Let к be an algebraically closed field. Then the maxi- maximal ideals in the ring к [X\,..., Xn] are the ideals (Xi - ai,X2 - a2,. ..,Xn- an), where oi,... ,an € k. Proof. Since the ideal {X\ — oi,... ,Xn — an) is the kernel of the surjection: k[Xu...,Xn] —> к it follows that к [X\,..., Xn] / {X\ — a\,..., Xn — an) = к, hence the ideal (Xi — oi,... ,Xn — an) is maximal. Conversely, let M С к [Xi,... ,Xn] be a maximal ideal. Let R = к [Xi,..., Xn] /M. R is a field since M is maximal, and R is also finitely generated over A; as a ring. Let r be the transcendence degree of R over к. The crux of the proof consists in showing that r = 0: By the normalization lemma, find a subring S С R of the form k[yi,...,yr] such that R is integral over S. Since the y^s are algebraically independent, S is a polynomial ring in r variables. By the going-up theorem - in fact, by the special case given in the lemma - S must be a field too. But a polynomial ring in r variables is a field only when r = 0. Therefore R is an algebraic extension field of к. Since к is algebraically closed, R must equal к. In other words, the subset к of k[Xi,...,Xn] goes onto к [Xi,..., Xn] /M. Therefore In particular, each variable Xi is of the form ai+rrii, with аг- € к and mi € M. Therefore, Xi — o, € M and M contains the ideal (Xi — ai,..., Xn — an). But the latter is maximal already, so M = (Xi — ai,..., Xn — an). О
§2. Irreducible algebraic sets 5 The great importance of this result is that it gives us a way to translate affine space kn into pure algebra. We have a bijection between kn, on the one hand, and the set of maximal ideals in k[Xi,...,Xn] on the other hand. This is the origin of the connection between algebra and geometry that gives rise to our whole subject. §2. Irreducible algebraic sets For the rest of this chapter к will denote a fixed algebraically closed field, known as the ground field. Definition 1. A closed algebraic subset of kn is a set consisting of all roots of a finite collection of polynomial equations: i.e., {(xi,...,xn) | /i (xi,...,xn) = ■■• = fm(xi,...,xn) = 0} . It is clear that the above set depends only on the ideal A = (Д,..., /m) generated by the /j's in [X\,... ,Xn] and not on the actual polynomials /,. Therefore, if A is any ideal in к [Xi,..., Xn], we define V(A) = {xekn\ f(x) = 0 for all / € A} . Since к [Xi,..., Xm] is a noetherian ring, the subsets of kn of the form V(A) are exactly the closed algebraic sets. On the other hand, if E is a closed algebraic set, we define = {/€ k[Xu...,Xn] \f(x)=0 for all x € £} . Clearly I(U) is an ideal such that S = V(I(U)). The key result is: Theorem 1 (Hilbert's Nullstellensatz). Proof. It is clear that у/А С I(V(A)). The problem is to show the other inclu- inclusion. Put concretely this means the following: Let A = (fi,..., /m). If g £ к [Х\,..., Xn] satisfies: {/i (oi,...,on) = ... = /TO(oi,---,On) = 0} => p(ai,...,an) = 0 then there is an integer £ and polynomials hi,..., hm such that To prove this, introduce the ideal В = A- k[Xu.. .,Xn,Xn+1} + A - g - Xn+1)
6 I. Varieties in к [X\,..., -Xn+i]. There are 2 possibilities: either В is a proper ideal, or В = к [Xi,..., -Xn+i]. In the first case, let M be a maximal ideal in к [Xi,..., Xn+i] containing B. By the weak Nullstellensatz of §1, M = (Xi —ai,...,Xn — an,Xn+1 - on+i) for some elements a; € k. Since M is the kernel of the homomorphism: к [Xi,..., Xn, Xn+i\ > к f '—> f{ai,---,an+i), В С М means that: i) Л (ab...,an) = ... = fm(ai,...,an) = 0 and ii) 1= g(a1,...,an) -an+1. But by our assumptionon g, (i) implies that g (oi,..., an) = 0, and this contra- contradicts (ii). We can only conclude that the ideal В would not have been a proper ideal. But then 1 € B. This means that there are polynomials hi,..., hm, hm+i € к [Xi,..., -Xn+i] such that: fi (X\, . . ■ , Xn) — 2 ^* + A "~ 9 (-^1> • • • j-^n) " -Xn+l) * Substituting g~x for Xn+i in this formula, we get: Clearing denominators, this gives: for some new polynomials h* € к [Xi,..., Xn], i.e., g € \/~A. П Corollary. V and I set up a bijection between the set of closed algebraic subsets of kn and the set of ideals А С к [Xi,..., Xn] such that A = \[~A. This correspondence between algebraic sets and ideals is compatible with the lattice structures: i) А с В => V(A) D V(B) ii) 27i С E2 => I (Hi) D iii) 1
§2. Irreducible algebraic sets 7 iv) V(A f\B) = V(A) U V(B) where A,B,Aa are ideals, Hi, H2 closed algebraic sets. Proof. All are obvious except possibly (iv). But by (i), V(AnB) D V(A)\JV(B). Conversely, if с $ V(A) U V(B), then there exist polynomials / € A and g € В such that /(ж) ф 0, g(x) ф 0. But then / • g € An В and (/ • g){x) ф 0, hence x(£V(Ar\B). О Definition 2. A closed algebraic set is irreducible if it is not the union of two strictly smaller closed algebraic sets. (We shall omit "closed" in referring to these sets). Recall that by the noetherian decomposition theorem, if А С к [Xi,..., Xn] is an ideal such that A = yA, then A can be written in exactly one way as an intersection of a finite set of prime ideals, none of which contains any other. And a prime ideal is not the intersection of any two strictly bigger ideals. Therefore: Proposition 2. In the bisection of the Corollary to Theorem 1, the irreducible algebraic sets correspond exactly to the prime ideals of к [Xi,..., Xn]. Moreover, every closed algebraic set H can be written in exactly one way as: H = Hi U ... U Нк where the Hi are irreducible sets and Hi <£ Hj if i ф j. Definition 3. The Hi of Proposition 2 will be called the components of H. In the early 19th century it was realized that for many reasons it was inade- inadequate and misleading to consider only the above "affine" algebraic sets. Among others, Poncelet realized that an immense simplification could be introduced in many questions by considering "projective" algebraic sets (cf. Felix Klein, Die Entwicklung der Mathematik, part I, pp. 80-82). Even to this day, there is no doubt that projective algebraic sets play a central role in algebro-geometric questions; therefore we shall define them as soon as possible. Recall that, by definition, Tn(k) is the set of (n+l)-tuples (x0,..., xn) € kn+1 such that some Xi ф 0, modulo the equivalence relation (xo,...,xn) ~ (axo,...,axn), aek*, (where k* is the multiplicative group of non-zero elements of k). Then an (n +1)- tuple (жо,Ж1,...,жп) is called a set of homogeneous coordinates for the point associated to it. Pn(fc) can be covered by n + 1 subsets Uq,Ui, ... ,Un, where Tj _ ( points represented by homogeneous 1 ^ coordinates (xo,xi,... ,xn) with жг- ф 0 J Each Ui is naturally isomorphic to kn under the map
I. Varieties kn ( \ f X0 Xi Xn\ Xi ... Л (xo,x1,...,xn) i—> I—,—,...,— I, I— omitted] . \ Xi Xi Xi J \Xi J The original motivation for introducing Tn{k) was to add to the affine space kn = Uo the extra "points at infinity" fn(k) - Uo so as to bring out into the open the mysterious things that went on at infinity. Recall that to all subvectorspaces W С kn+1 one associates the set of points P € Fn(fc) with homogeneous coordinates in W: the sets so obtained are called the linear subspaces L of Pn(&)- If W has codimension 1, then we get a hy- perplane. In particular, the points "at infinity" with respect to the affine piece Ui form the hyperplane associated to the subvectorspace Xi = 0. Moreover, by introducing a basis into W, the linear subspace L associated to W is naturally isomorphic to Pr(&), (r = diml'F — 1). The linear subspaces are the simplest examples of projective algebraic sets: Definition 4. A closed algebraic set in Fn(fc) is a set consisting of all roots of a finite collection of homogeneous polynomials /; € к [Xo,..., Xn], 1 < i < m. This makes sense because if / is homogeneous, and (xo,..., xn), (axo,..., axn) are 2 sets of homogeneous coordinates of the same point, then f(xo,...,xn)=Q «=^ / (az0, • • •, ocxn) = 0. We can now give a projective analog of the V, I correspondence used in the affine case. We shall, of course, now use only homogeneous ideals А С к [Хо,..., Xn]: i.e., ideals which, when they contain a polynomial /, also contain the homogeneous components of /. Equivalently, homogeneous ideals are the ide- ideals generated by a finite set of homogeneous polynomials. If Л is a homogeneous ideal, define ^ (ж°' • • •'Жп) are homogeneous coordinates of If U С Fn(A;) is a closed algebraic set, then define T,y\ _ { ideal generated by all homogeneous polynomials 1 ^ \ that vanish identically on U J Theorem 3. V and I set up a bijection between the set of closed algebraic sub- subsets of¥n(k), and the set of all homogeneous ideals А С к[Хо, ■.. ,Xn], such that A = \f~A except for the one ideal A = (Xo,..., Xn). Proof. It is clear that if U is a closed algebraic set, then V(I(U)) = U. Therefore, in any case, V and / set up a bijection between closed algebraic subsets of Tn(k) and those homogeneous ideals A such that:
§2. Irreducible algebraic sets 9 These ideals certainly equal their own radical. Moreover, the empty set is V ((Xo,..., Xn)), hence 1 € / (V ((Xo,..., Xn))); so (Xo, ...,Xn) does not sat- satisfy (*) and must be excluded. Finally, let A be any other homogeneous ideal which equals its own radical. Let V*(A) be the closed algebraic set correspond- corresponding to A in the affine space kn+1 with coordinates Xo,... ,Xn. Then V*(A) is invariant under the substitutions (Xo,..., Xn) —> (aX0,..., aXn) , all a € k. Therefore, either 1) V*(A) is empty, 2) V*(A) equals the origin only, or 3) V*(A) is a union of lines through the origin: i.e., it is the cone over the subset V(A) in Pn(fc). Moreover, by the affine Nullstellensatz, we know that (**) A In case 1), (**) implies that A = к [Xo,..., Xn], hence I(V(A)) - which always contains A - must equal A since there is no bigger ideal. In case 2), (**) implies that A = (Xo,... ,Xn) which we have excluded. In case 3), if / is a homoge- homogeneous polynomial, then / vanishes on V(A) if and only if / vanishes on V*(A). Therefore by (**), if / vanishes on V(A), then / € A, i.e., A D I(V(A)). Since the other inclusion is obvious, Theorem 3 is proven. □ The same lattice-theoretic identities hold as in the affine case. Moreover, we define irreducible algebraic sets exactly as in the affine case. And we obtain the analog of Proposition 2: Proposition 4. In the bijection of Theorem 3, the irreducible algebraic sets cor- correspond exactly to the homogeneous prime ideals ( (Xo,..., Xn) being accepted). Moreover, every closed algebraic set S in Tn(k) can be written in excatly one way as: S = Sx U ... U Sk , where the Si are irreducible algebraic sets and Si <]L Sj if 1ф j. Problem. Let S С Tn(k) be a closed algebraic set, and let H be the hyperplane Xo = 0. Identify Fn - H with kn in the usual way. Prove that S П (Pn - H) is a closed algebraic subset of kn and show that the ideal of S П (Fn — H) is derived from the ideal of S in a very natural way. For details on the relationship between the affine and projective set-up and on everything discussed so far, read Zariski-Samuel, vol. 2, Ch. 7, §§3,4,5 and 6.
10 I. Varieties Example A. Hypersurfaces. Let / (Xo,... ,Xn) be an irreducible homogeneous polynomial. Then the principal ideal (/) is prime, so / = 0 defines an irreducible algebraic set in Fn(fc) called a hypersurface (e.g. plane curve, surface in 3-space, etc.). Example B. The twisted cubic in Рз(&). This example is given to show the exis- existence of nontrivial examples: Start with the ideal: Ao = (xz - y2,yw - z2) С k[x,y,z,w]. V(A0) is just the intersection in Тз(к) of the 2 quadrics xz = y2 and yw = z2. Look in the affine space with coordinates X = x/w, Y = y/w, Z = z/w (the complement of w = 0). In here, V(A0) is the intersection of the ordinary cone XZ = Y2, and of the cylinder over the parabola Y = Z2. This intersection falls into 2 pieces: the line Y = Z = 0, and the twisted cubic itself. Correspond- Correspondingly, the ideal Ao is an intersection of the ideal of the line and of the twisted cubic: Ao = (j/, z) П (xz - y2, yw - z2, xw - yz) . The twisted cubic is, by definition, ^(^4). [To check that A is prime, the simplest method is to verify that A is the kernel of the homomorphism ф: k[x,y,z,w] -A k[s,t] ф(х) = s3 ф(у) = s2t ф{г) = st2 ф(т) = t3 ] . In practice, it may be difficult to tell whether a given ideal is prime or whether a given algebraic set is irreducible. It is relatively easy for principal ideals, i.e., for hypersurfaces, but harder for algebraic sets of higher codimension. A good deal of effort used to be devoted to compiling lists of all types of irreducible algebraic sets of given dimension and "degree" when these were small numbers. In Semple and Roth, Algebraic Geometry, one can find the equivalent of such lists. A study of these will give one a fair feeling for the menagerie of algebraic sets that live in P3, F4 or F5 for example. As for the general theory, it is far from definitive however.
§3. Definition of a morphism 11 §3. Definition of a morphism We will certainly want to know when 2 algebraic sets are to be considered iso- morphic. More generally, we will need to define not just the set of all algebraic sets, but the category of algebraic sets (for simplicity, in Chapter 1, we will stick to the irreducible ones). Example C. Look at a) k, the affine line, b) у = x1 in A;2, the parabola. Projecting the parabola onto the x-axis should surely be an isomorphism between these algebraic sets: X-QXIS More generally, if V С kn is an irreducible algebraic set, and if / G к [Xi,..., Xn], then the set of points: V* = {(x1,...,xn,f(x1,...,xn))\(xu...,xn)eV}ckn+l is an irreducible algebraic set. And the projection (xi,x2,... ,xn+i) i—> (xi,x2,.. -,xn) should define an isomorphism from V* to V. Example D. An irreducible conic С С Р2(&) will turn out to be isomorphic to the projective line Pi(&) under the following map: fix a point Pq £ C. Identify Pi (k) with the set of all lines through Po in the classical way. Then define a map
12 I. Varieties by letting a(d) for all lines t through Po be the second point in which t meets C, besides Po. Also, if t is the tangent line to С at Po, define a(t) to be Pq itself (since Pq is a "double" intersection of С and this tangent line). Example E. T\(k) and the twisted cubic Рз(&) will be isomorphic. For T\(k) consists of the pairs (s,t) modulo (s,t) ~ (as,at), i.e., of the set of ratios C = s/t including /3 = oo. Define a map by a(C) = the point with homogeneous coordinates (l,/3,/32,/33) and a(oo) = @,0,0,1). The image points clearly satisfy xz = y2, yw = z2 and zw = yz, so they are on the twisted cubic. The reader can readily check that a maps Pi (k) onto the twisted cubic. Example F. Let С be a cubic curve in Рг(А;) and let Po £ С For any point P € C, let t be the line through P and Po and let a(P) be the third point in which t meets C. Although this may not seem as obvious as the previous examples, a will be an automorphism of С of order 2. We shall use this example later to work out our definition in a nontrivial case. Now turn to the problem of actually defining morphisms, and hence isomor- isomorphisms, of irreducible algebraic sets. First consider the case of 2 irreducible affine algebraic sets. Definition 1. Let Sx С kni and E2 С кП2 be two irreducible algebraic sets. A map a : E\ —> £2 will be called a morphism if there exist n-2 polynomials /1,..., /П2 in the variables Xi,..., Xni such that (*) a(x) = (/i(xi,...,xni),...,/n2 (xi,...,xni)) for all points x = (xi,... ,xni) e E\.
§3. Definition of a morphism 13 Note one feature of this definition: it implies that every morphism a from E\ to E2 is the restriction of a morphism a' from kni to km. This may look odd at first, but it turns out to be reasonable - cf. §4. Note also that with this definition the map in Example С above is an isomorphism, i.e., both it and its inverse are morphisms. To analyze the definition further, suppose P1 С k[Xlt...,Xni] P2 С *[Xi,...,Xna] are the prime ideals I (Hi) and I{E2) respectively. Set Then Ri (resp. R2) is just the ring of A;-valued functions on E\ (resp. E2) obtained by restricting the ring of polynomial functions on the ambient affine space. Suppose g £ R2. Regarding g as a function on E2, the definition of morphism implies that the function g • a on E\ is in Ri - in fact Therefore a induces a /г-homomorphism: a* :R2 —> Ri. Moreover, note that a is determined by a*. This is so because the polynomials /1,..., fn2 can be recovered - up to an element of Pi - as a* (Xi),..., a* (Xn^)\ and the point a(x), for x e Ui, is determined via /i,.-.,/n2 modulo Pi by equation (*). Even more is true. Suppose you start with an arbitrary k- homomorphism Л : R2 —У Ri • Let /1 be a polynomial in к [X\,..., Xni] whose image modulo Pi equals X(Xi), for all 1 < i < ni. Then define a map by a' (xi,..., xni) = (/1 (xi,..., xni),..., /n2 (xi,..., xni)). If x = (xi,...,xni) e Hi, then actually a'(x) will be in E2'- for if g € P2, then g(a\x))=g(f1(x),...,fna(x)). But g (fu ..., /na) =g(X (Xi),..., Л (Xna)) modulo Pi = X(g) modulo Pi = 0 modulo Pi . Therefore, g(a'(x)) = 0 and a'(x) e E2. We can summarize this discussion in the following:
14 I. Varieties Definition 2. Let E С kn be an irreducible algebraic set. Then the affine ring Г(Е) is the ring of /г-valued functions on E given by polynomials in the coordinates, i.e., k[Xu...,Xn}/I(E). Proposition 1. If Ei, Ei are two irreducible algebraic sets, then the set ofmor- phisms from E\ to Ei and the set of k-homomorphisms from Г(Е2) to Г(Е\) are canonically isomorphic: hom(E1,E2) *< honifc (Г(Е2),Г(Е1)). Corollary. // E is an irreducible algebraic set, then Г{Е) is canonically iso- isomorphic to the set of morphisms from E to k. Proof. Note that Г (к) is just k[X]. Using Г(Е), we can define a subset V(A) of E for any ideal А С Г(Е) as {x e E | f(x) = 0 for all f eA}. Moreover, the Nullstellensatz for kn implies immediately the Nullstellensatz for E: {/ e Г(Е) I f(x) = 0 for all x e V(A)} = VI. Even more than Proposition 1 is true: Proposition 2. The assignment E h—► Г(Е) extends to a contravariant functor Г: Category of irreducible algebraic sets + morphisms }j Category of finitely generated integral 1 | domains over к + k-homomorphisms j which is an equivalence of categories. Proof Prop. 1 asserts that Г is a fully faithful functor. The other fact to check is that every finitely generated integral domain R over к occurs as Г(Е). But every such domain can be represented as: hence as Г(Е) where E is the locus of zeroes of /i,..., /TO in kn. D Because of the usefulness of continuity in topology and other parts of geome- geometry, another natural question is whether there is a natural topology on irreducible algebraic sets in which all morphisms are continuous. We will certainly want points of к to be closed, so their inverse images by morphisms must be closed. If we take the weakest topology satisfying this condition, we get the following:
§3. Definition of a morphism 15 Definition 3. A closed set in kn is to be a closed algebraic set V(A). By the results of §2, these define a topology in kn, called the Zariski topology. An ir- irreducible algebraic set E С кп is given the induced topology, again called the Zariski topology. It is clear that the closed sets of E are exactly the sets V(A), where A is an ideal in Г(Е). It is easy to check that all morphisms are continuous in the Zariski topology. A basis for the open sets in the Zariski topology on E is given by the open sets: Ef = {x e E | f{x) ф 0} for elements / £ ГA7). In fact, Ef = E — F((/)), hence Ef is open. And if U = E — ViA) is an arbitrary open set, then U= /ел One should notice that the Zariski topology is very weak. On к itself, for instance, it is just the topology of finite sets, the weakest T\ topology (since any ideal A in k[X] is principal - A = if) - therefore ViA) is just the finite set of roots of /). It follows that any bijection a : к —> к is continuous, so not all continuous maps are morphisms. In any case this is a very unclassical type of topological space. Definition 4. A topological space X is noetherian if its closed sets satisfy the descending chain condition (d.c.c). It is equivalent to require that all open sets be quasi-compact (= having the Heine-Borel covering property, but not necessarily T2). Now since ideals in к [X\,..., X2] satisfy the a.c.c, it follows that closed sets satisfy the d.c.c. - so the Zariski topology is noetherian. Our simple definition of morphisms for affine algebraic sets does not work for projective algebraic sets. The trouble is that it automatically implied that the morphism will extend to a morphism of the ambient affine space. There is no analogous fact in the projective case. Look at the case of Example D. Let E\ be the conic with homogeneous equation (*) xz = y2 in P2(fc). Let E2 = Pi(fc). Let Po e Ex be the point @,0,1). To every point Q £ Рг(^) — {Po}, we can associate the line PoQ, and by identifying the pencil of lines through Po with Pi (A;) we get a point of Pi (A;). In terms of coordinates, this can be expressed by the map: (a, 6, c) 1—> (a, b) as long as ia,b,c) are not homogeneous coordinates for Po, i.e., a or b is not zero. Let (s,£) be homogeneous coordinates in Pi (A;). Then the map from E\ to Ei should be defined by:
16 I. Varieties s = x Unfortunately, this is undefined at Po itself. But consider the second map defined by: s = У t = z . This is defined except at the point Pi £ E\ with coordinates A,0,0); moreover, at points on E\ not equal to Pi or P2, the ratios (x : y) and (y : z) are equal in view of the equation (*). Therefore A and В together define everywhere a map from Ei to Ei. On the other hand, it will turn out that there are no surjective morphisms at all from P2(fc) to Pi(fc) (cf. §7). Thus defining morphisms between projective sets is more subtle. We find that we must define morphisms locally and patch them together. But the problem arises: on which local places. We could use the affine algebraic sets Е-ЕПН where H С Pn(&) is a hyperplane. But in general these will not be small enough. We shall need arbitrarily small open sets in the Zariski-topology: Definition 5. A closed setin Pn(&) is to be a closed algebraic set V(A). By the results of §2, these define a topology in Pn(&), the Zariski-topology. Irreducible algebraic sets themselves are again given the induced topology. As in the affine case, a basis for the open sets is given by: where / is a homogeneous polynomial. Moreover, it is clear that the Zariski topology on fn(k) is noetherian. Problem. Check that Tn(k)x0 is homeomorphic to kn under the usual map i,.. .,xn/x0). Finally, to define morphisms locally, we will need to attach affine coordinate rings to a lot of the Zariski-open sets U and give a definition of affine morphism in terms of local properties. Clearly, we should begin by constructing the apparatus used for defining things locally. §4. Sheaves and affine varieties Definition 1. Let X be a topological space. A presheaf F on X consists of i) for all open U С X, a set F(U) ii) for all pairs of open sets Ui C^a map ("restriction") vesU2Ul : F(U2) —
§4. Sheaves and affine varieties 17 such that the following axioms are satisfied: b) IfUi CU2 CU3, then F(U3) F(U2) commutes. Definition 2. If Fi, F2 are presheaves on X, a map <p : F\ —> F2 is a collection of maps (p(U) : Fi(l7) ->• F2(U) for each open C/" such that if 17 С V, tp(V) F2(V) resv,u resy,u tp(U) F2(U) commutes. Definition 3. A presheaf F is a sheaf if for every collection {£/,} of open sets in X with U = UUi, the diagram is exact, i.e., the map JJ is injective, and its image is the set on which П and П resE/^nt/; : П F(uj) "^ П П agree.
18 I. Varieties When we pull this high-flown terminology down to earth, it says this. 1) If xi,X2 £ F(U) and for all г, resu,Ui^i = Tesu,Ui^2, then xi = xi. (That is, elements are uniquely determined by local data.) 2) If we have a collection of elements Xi £ F(Ui) such that res^^n [/,£;/ = wsUj,Uir\UjXj for all г and j then there is an x £ F(U) such that res^^x = Xi for all г. (That is, if we have local data which are compatible, they actually "patch together" to form something in F(U).) Example G. Let X and Y be topological spaces. For all open sets U С X, let F(U) be the set of continuous maps U —> Y. This is a presheaf with the restriction maps given by simply restricting maps to smaller sets; it is a sheaf because a function is continuous on UUi if and only if its restrictions to each Ui are continuous. Example H. X and Y differentiable manifolds. F(U) = differentiable maps U —>• Y. This again is a sheaf because differentiability is a local condition. Example I. X, Y topological spaces, G(U) = continuous functions U —> Y which have relatively compact image. This is a subpresheaf of the first example, but clearly need not be a sheaf. Example J. X a topological space, F(U) = the vector space of locally constant real-valued functions on U, modulo the constant functions on U. This is clearly a presheaf. But every s £ F(U) goes to zero in П F(Ui) for some covering {Ui}, while if U is not connected, F(U) ф @). Therefore it is not a sheaf. Sheaves are almost standard nowadays, and we will not develop their prop- properties in detail. Recall two important ideas: A) Stalks. Let F be a sheaf on X, x £ X. The collection of F(U), U open containing x, is an inverse system and we can form Fx = Urn F(U) , xeu called the stalk of F at x. Example. Let F(U) = continuous functions U ->R. Then Fx is the set of germs of continuous functions at x. It is Ux(zuF(U) modulo an equivalence relation: /i ~ /2 if /1 and /2 agree in a neighbourhood of x. B) Sheafification of a presheaf Let Fo be a presheaf on X. Then there is a sheaf F and a map / : Fo —> F such that if g : Fo —> F' is any map with F' a sheaf, there is a unique map h : F —>• F' such that
§4. Sheaves and affine varieties 19 F' commutes. (F is "the best possible sheaf you can get from Fo". It is easy to imagine how to get it: first identify things which have the same restrictions, and then add in all the things which can be patched together.) Thus in Example I above, if X is locally compact, the sheafification of this presheaf is the sheaf of all continuous functions in functions on X; and in example J, the sheafification of this presheaf is @). Notation. We may write F(U,F) for F(U), and call it the set of sections of F over U. F(X, F) is the set of global sections of F. In other contexts we may denote F(X) by H°(X,F) and call it the zeroth cohomology group. (In those contexts it will be a group, and there will be higher cohomology groups.) Suppose that for all U, F(U) is a group [ring, etc.] and that all the restriction maps are group [ring, etc.] homomorphisms. Then F is called a sheaf of groups [rings, etc]. In this case Fx is a group [ring, etc.], and so on. Example K. For any topological space X, let Fcont,x(U) = continuous functions £/"—>• E. Then Fcont,x(U) is a sheaf of rings. Note that if g : X —> Y is a continuous function, the operation / i—>■ f-g gives us the following maps: for every open U С Y a map Fcont,y (U) ->• Fcont}x (g~XU) such that FY(U) ^ Fxig^U) res res FY(V) Fx(g-lv) commutes for all open sets V С U. This set-up is called a morphism of the pair (X,FX) to the pair (Y,FY). Example L. Suppose that X and Y are differentiable manifolds, and that i*diff,x and .Fdiff,y are the subsheaves of Fconttx and -Fcont,y of differentiable functions. Let g : X —> Y be a continuous map. Then g is differentiable if and only if for all open sets U CY, f e Fdiff}Y(U) => f .g e Fdiff)x (g^U).
20 I. Varieties Example M. Similarly, say X, Y are complex analytic manifolds. Let F&ntx and -Fan,у be the sheaves of holomorphic functions. Then a continuous map g : X —> Y is holomorphic if and only if for all open sets U С Y, / £ Fan y(U) ==>■ f -g £ F^x^U). Thus the idea of using a "structure sheaf" to describe an object is useful in many contexts, and it will solve our problems too. Definition 4. Let X С kn be an irreducible algebraic set, R its affine coordinate ring. Since X is irreducible, I(X) is prime and R is an integral domain. Let К be its field of fractions. Recall that R has been identified with a ring of functions on X. For x £ X, let mx = {f £ R\ f(x) = 0}. This is a maximal ideal, the kernel of the homomorphism R —> к given by / •->■ f(x). Let ox = Rmx. We have then ox = {f/g \ f,g £ R,g(x) ф 0} С К. Now for U open in X, let Ox(U) = П Д. • xeu All the QX(U) are subrings of K. If V С U, then ox(U) С ox(V); if we take the inclusion as the restriction map, this defines a sheaf ox. The elements of QX(U) can be viewed as functions on U. Say F £ ox(U), and x £ U. Then F £ ox, so we can write F = f/g with g(x) ф 0. We then define F(x) = f(x)/g(x). Clearly F(x) = 0 for all x £ U implies F = 0, so we can identify ox(U) with the associated ring of functions on U. Proposition 1. Let X be an irreducible algebraic set and let R = Г(£). Let f £ R, and Xf = {x £ X | f(x) ф 0}. Then Qx(Xf) = Rf. Proof. lig/fn £ Rf, then g/fn £ ox for all x £ Xf, since by definition f(x) ф 0. Thus Rf Cox(Xf). Now suppose / £ ox(Xf) С К. Let Б = {g £ R \ g ■ F £ R}. If we can prove fn £ B, for some n, that will imply f £ Rf, and we will be through. By assumption, if x £ Xf, then F £ ox, so there exist functions g,h £ R such that F = h/g, g(x) ф 0. Then gF = h £ R, so g £ B, and В contains an element not vanishing at x. That is, V(B) С {х \ f(x) = 0}. By the Nullstellensatz, then feVB. a In particular, Corollary. Г(Х,ох) = R. Remarks. I) Assume that / £ ox{U) and that / vanishes nowhere on U. Then Proof. Obvious, since f(x) ф 0 ==> 1/f £ ox. II) The stalk of ox at x is ox.
§4. Sheaves and affine varieties 21 Proof. Since the sets Xf are a basis of the Zariski topology of X, we have lim ox{U) = lim ox{Xf) = lim Rf. Since all restriction maps in our sheaf are injective, this is just \J Rf, which /(*)#o is clearly ox. D III) The field К can also be recovered from the sheaf ox. Recall that X is irreducible, i.e., not the union of two proper closed subsets. Equivalently, the intersection of any two nonempty open sets is nonempty. But this means that we actually have an inverse system of all open sets, just like our previous in- inverse systems of open sets containing a given point x; in this way we can define a generic stalk of any sheaf F on X. In particular, it is evident that К is the generic stalk of the structure sheaf ox • IV) If h £ 2X(U) for some open U С X, then it need not be true that h = f/g with f,g (E R and g vanishing nowhere on U. For example, let X С к* be V{xw — yz), and let U = Xy U Xw. The following function h € ox{U) is not equal to f/g, g ф 0 in U: h = x/y on Xy, and h = z/w on X^2. The proposition shows that this is true however if U has the form Xg. Proposition 2. Let X С kn, Y С кт be irreducible algebraic sets, and let f : X -> Y be a continuous map. The following conditions are equivalent: i) f is a morphism ii) forallger(Y,oY),g-fer(X,ox) in) for all open U С Y, and g в Г (U,oY) =>• g • f e Г (f^U, ox) iv) for all x e X, and g £ of^ => g ■f £ ox. Proof. Trivially iii) => ii), and iv) ==> iii) by the definition of ox. i) 4=Ф ii) is essentially proved in Proposition 1, §3. We assume ii), then, and prove iv). Let g e o/(x). We write g = a/b, a,b e r(Y,oY), b(f(x)) ф 0. By ii), a- f,b- f e Г (X,ox); hence g ■ f — a- f/b • f £ q^, since we have b ■ f(x) ф 0. D This shows, among other things, that our sheaf gives us all the information we need for defining morphisms. We are ready, then, to cut loose from the ambient spaces and define: 2 The proof that this h is not of the form f/g, g ф 0 in U, requires a later result but it goes like this: Assume h = f/g. Let Z = V(y,w): then Z is a plane in X and U = X — Z. By assumption V(g) П X С Z. Since all components of V(g) П X have dimension 2 (cf. §7), and since Z is irreducible, either V(g) П X = z or V(g) П X is empty. If V(g) П X = 0, then h = f/g G QX(X) which is absurd (since x = у h and at some points of X,x = 1 and у = 0). Now let Z' = V(x,z). Then {@,0,0,0)} = Z П Z' = V(g) П Z' . In other words, g would be a polynomial function on the plane Z' that vanishes only at the origin. This is impossible too.
22 I. Varieties Definition 5. An affine variety is a topological space X plus a sheaf of A;-valued functions ox on X which is isomorphic to an irreducible algebraic subset of some kn plus the sheaf just defined. Definition 6. The affine variety (kn,okn) is An, affine n-space. Definition 5 and Proposition 2 set up the category of affine varieties in pre- precise analogy with the category of topological spaces, differentiable manifolds, and analytic spaces. There are, however, some very categorical differences between these examples. Consider the following statement: Bijective morphisms are isomorphisms. This is correct, for example, in the category of compact topological spaces, of Banach spaces, and of complex analytic manifolds. On the other hand, it is false for differentiable manifolds - consider the map: where f(x) = x3. The statement is also false in the category of affine varieties: a bijection / : Xi —> X-2 of varieties may well correspond to an isomorphism of the ring of Xi with a proper subring of the ring of X\. Here are 3 key examples to bear in mind. Example N. Let char(A;) = p ф 0. Define the morphism А1 Л A1 by f(i) = tp. This is bijective. On the ring level, this corresponds to the inclusion map in the pair of rings: k[X] ^ k[Xp]. f is not an isomorphism since these rings are not equal. Example O. Let к be any algebraically closed field. Define the morphism А1 Л A2 by f(t) = (t2,t3). The image of this morphism is the irreducible closed curve C:X3 = Y2. The morphism / from A1 to С is a bijection which corresponds to the inclusion map in the pair of rings: k[T]^k[T2,T3]. These rings are not equal, so / is not an isomorphism.
§4. Sheaves and affine varieties 23 Example P. Define A1 -A A2 by X = t2 - 1, Y = t (t2 - l). It is not hard to check that the image of this morphism is the curve D: A) Y2 = X2(X + 1) . (Simply note that one can solve for the coordinate t of the point in A1 by the equation t = Y/X. Then substitute this into X = t2 — 1.) Also, / is bijective be- between A1 and D except that both the points t = — 1 and t = 1 are mapped to the origin. Let Xi = A1 — {1}, an affine variety with coordinate ring к [T, (T — I)"] (cf. Proposition 4 below). Then / restricts to a bijection /' from Xi to D. This morphism corresponds to the inclusion in the pair of rings: к [T, (T - I)] «-* к [T2 - 1,T(T2 - 1)] . Since these rings are unequal, /' is not an isomorphism. The last topic we will take up in this section is the induced variety structure on open and closed subsets of affine varieties. Let Y be an irreducible closed subset of an affine variety (X, ox). [Irreducible, now, in the sense given by the topology on X] Define an induced sheaf ox of functions on Y as follows:
24 I. Varieties If V is open in У, Vx € V, 3 a neighbourhood U of x in X A;-valued functions and a function F € 2A-(t/) such that fiy(v) = < ';¥""^ 1 7 ОП /= restriction to U П F of F. Proposition 3. (Y,oY) is an affine variety. Proof. Say X is isomorphic to (E,os) in &n. Let R = k[Xi,... ,Xn], A = /(X) С R. Let У correspond to X' С X. Then X' is an irreducible algebraic set in kn, so we have an affine variety (E',0^,). We claim (Y,oY) is isomorphic to It suffices to show that the sheaves are equal. Since the inclusion of X' in X is a morphism, the restrictions of functions in os to X' are functions in oE,. This shows that all the functions in oY correspond to functions in oE,. Conversely, every function /' e os, (E'g), g € R, is a restriction of a function / £ Q.E {Eg) since both of these rings are quotients of Rg. Therefore all functions in os correspond to functions in oY too. D Proposition 4. Let (X,ox) be an affine variety, and let f £ Г(Х,ох). Then [Xf,oX\X j is an affine variety. [The restriction of the sheaf ох to the open set Xf is defined in the obvious way.] Proof. Say we identify X with X С kn. Let A = I(E) С k[Xx,... ,Xn]. Let /i € к [X\,..., Xn] be some element giving / as a function on X. Let В be the ideal in к [Xi,..., Xn, Xn+i] generated by A and by 1 — Д • Xn+i. We claim В is prime and, if X* = V(B) С kn+1, then (X*,^,) = (xf,oX\x\ From the definition of В we see that which is an integral domain, so В is prime. Define a morphism a : X* —> X by (xi,.. .,xn,xn+i) —>■ (xi,... ,xn). It's an injection with image Хд, since (xi,... ,xn,xn+i) e X* if and only if (xi,..., xn) € X and 1 = /i (xi,...,xn) xn+i. We leave to the reader the verifi- verification that it is a homeomorphism onto Хд. We saw above that .Г(Х*,О£.,) ~ Г(Х,ох)^. Therefore, by Proposition 2, a~l is a morphism from Хд to X*, i.e. (X*,^,) and [Xf,ox\X J are isomor- isomorphic. D What we have done to get Xf is to push the zeroes of / out to infinity. For example, suppose X = A1 and / is the coordinator X\. Then В = A - X1X2), giving a hyperbola:
§5. Definition of prevarieties and morphisms 25 Projection of the hyperbola down to the axis is an isomorphism with Xf. Not all open subsets of affine varieties are affine varieties. For instance, you cannot push the origin in A2 out to infinity, and A2 - @,0) is not an affine variety. In fact, no rational function is well-defined on A2 - @,0) but not at @,0); i.e., the intersection of the local rings o^, for all x £ A2, x ф @,0), is contained in O@)o) • Hence if we had any embedding A2 — @,0) ->• kn, the coordinate functions giving this embedding would have to extend to all of A2, and so the image of A2 — @,0) would not be closed. There is an analogous statement about complex functions: a holomorphic function on С х С — @,0) is necessarily holomorphic at @,0). §5. Definition of prevarieties and morphisms Definition 1. A topological space X plus a sheaf ox of A;-valued functions on X is a prevariety if 1) X is connected, and 2) there is a finite open covering {Ui} of X such that for all г, Wi, ох\и.) is an affine variety. Definition 2. An open subset U of X is called an open affine set if [U,oxiUj is an affine variety. Note that the open affine sets are a basis of the topology. In fact, we know by Proposition 4, §4, that this is true within each of the open affines Ui, and they cover X. Definition 3. A topological space is irreducible if it is not the union of two proper closed subsets (equivalently, the intersection of any two nonempty sets is nonempty). Proposition 1. Every prevariety X is an irreducible topological space.
26 I. Varieties Proof. Let V be open and nonempty in X. Let U\ be the union of all open affine sets meeting V, Ui the union of all those disjoint from V; then U\ U Ui — X. Suppose у (Е U\ nU?; then there are affine open sets W\, Wi containing y, such that W\ D V Ф 0, Wi П V = 0. But then W\ П V is a nonempty open set in the affine VFi, so it is dense in it; Wi П W\ is also a nonempty open set in W, so it meets W\ (IV. This shows that Wi C\V cannot be empty. This is a contradiction, hence no such у exists. Since X is connected and U\ is nonempty, U\ = X. Now let U be any other open set, and say x (E U. By the above, there is an affine open set W containing x and meeting V. Then both V П W and U П W are nonempty open sets in the affine W, so 7 П И^ П 17 Ф 0, and a fortiori unv^di. a In particular, every open set is dense. Thus prevarieties are not like differen- tiable manifolds, which can have disjoint coordinate patches; to get a prevariety, we just put things around the edges of one affine piece. Proposition 2. // X is a prevariety, then the closed sets of X satisfy the de- descending chain condition, i.e., X is a noetherian space. Proof. Let {Zi} be a sequence of closed sets, such that Z\ D Zi D Z3 D ... . Since X is covered by finitely many affines, it suffices to show that {U П Zi} stationary for each affine open U in X. The result in the affine case follows immediately from the fact that Г(и,ох) is noetherian as we noted in §3. D In particular, every variety is quasi-compact. Proposition 3. Let X be a noetherian topological space. Then every closed set Z in X can be written uniquely as an irredundant union of finitely many irre- irreducible closed sets (called the components of Z). Proof. Suppose Z is a minimal closed set for which the Proposition is false; this exists since X is noetherian. Then Z is not itself irreducible, so Z = Z\ U Z2, where Z\, Zi are smaller and hence are unions of the required type. Then so is Z. D Let X be a prevariety. Since X is an irreducible topological space, any 2 nonempty open subsets have a nonempty intersection. Therefore, all sheaves have "generic" stalks: Definition 4. The function field k(X) is the generic stalk of ox, i.e. k(X) = lim ox(U) . all non-empty open U In fact, k(X) equals the function field of each open affine set U in X, since the open subsets of U are cofinal. In particular, this shows that k(X) is really a field. The elements of k(X) are called rational functions on X, although they are, strictly speaking, only functions on open dense subsets of X.
§5. Definition of prevarieties and morphisms 27 Another type of lim over oA-(t/)'s is sometimes very useful. This is interme- intermediate between the lim that leads to ox and that which leads to k(X). Let Y С X be an irreducible closed subset of X. Then let: Qy,x = li™ ox{U) . open sets U such that UHY ф$ To express more simply the ring which you get in this way, fix one open affine U С X which meets Y. Let R be the coordinate ring of U and P = I(Y C\U) the ideal in R determined by Y. Then: oYX = lim ox(Uf) = RP . open sets Uf feR,f$P In particular, oYX is a local ring with quotient field k(X) and residue field k(Y). Proposition 4. An open subset of a prevariety is a prevariety. Proof. Let U С X be open. Since X is irreducible, U is connected. U is of course a union of affine open subsets. But since X is noetherian, U is quasi-compact and hence it is covered by finitely many affines. D Now let У be a closed irreducible subset of a prevariety X. The sheaf ox induces a sheaf o_Y on Y as follows: If V is open in Y, {, . ir ,. V a; € V, 3 a neighbourhood U of x in X Л л-valued functions , c' . ° ,rn , , I , Tr and a function F G Ovft/) such that > f on I/ ~ I 7 K / = restriction toUHV of F J Proposition 5. Г/ге pair (Y,oY) is a prevariety. Proof. This follows immediately from the definition and Proposition 3, §4. D Combining Propositions 4 and 5, we can even give a prevariety structure to every locally closed subset of a prevariety X. The set of all prevarieties so obtained are called the sub-prevarieties of X. Example Q. Pi: Take two copies U and V of A1. Let u, v be the coordinates on these 2 affine lines. Let Uq С U (resp. Vb С V) be defined by и ф 0 (resp. и ^ 0). Then r{U,ov) = k[u], so r(E/b,Otf) = к [щи-1]. Similarly r(V0,ov) = к [и,и]. Define a map ip : [To —> Vo taking the point with coordinate и = a to the point with coordinate v = ^; this gives a map ip* : к [v,v~*] —> к [и,и~г] taking v to u, v to u. ip* is an isomorphism of rings, so ip is an isomorphism of varieties
28 I. Varieties {if has an inverse since if* does). Now we patch together U and V via if, i.e., we form U U V with Uq and Vq identified via if. This has a sheaf on it, in the obvious way, and is a prevariety. The space is homeomorphic to Y\, and we call it the variety Pb Our patching can be pictured as follows: We could have patched U and V differently: v —> u, v * —> и 1 also gives an isomorphism of Uq onto Vq. But this is a silly way to patch; we are leaving out the same point each time: and the result is A1 with a point doubled: This is a prevariety, of course, but, in fact, not a variety (cf. §6). We could define all projective varieties by this kind of scissors and glue method, but there is a more intrinsic definition. Definition of Projective Varieties. Let P С к [Хо,... ,Xn] be a homoge- homogeneous prime ideal, X = V{P) С Yn{k). We want to make X (with the Zariski topology) into a prevariety. We do it by defining a function field, getting local rings, and intersecting them, just as for affine varieties. The elements of k[Xo,... ,Xn], even the homogeneous ones, do not give functions on X; but the ratio of any two having the same degree is a func- function. Since P is homogeneous, R = k[X0,.. .,Xn] /P is in a natural way oo a graded ring 0 Rn and an integral domain. We let k{X) be the zeroth n=0 graded piece of the localization of R with respect to homogeneous elements, i-e-, {f/g | f,g £ Rn for the same n).
§5. Definition of prevarieties and morphisms 29 If x € X, and g £ Rn it makes sense to say g{x) ф 0, even though g is not a function on X; for g changes by a nonzero factor as we change the homogeneous coordinates of x. Hence we can define a ring ox in k{X) as {f/g e k(X) | g(x) ф 0}. The set is clearly an ideal in the ring o^, and any element not in mx is invertible in ox. Thus ox is a local ring. We now define a sheaf ox on X by ox (U) = Pi Од., for all U С X open . x€U We can identify ox with a sheaf of A;-valued functions. For suppose x e U, and a e r(U,ox). Then a can be written f/g with /, g £ Rn, g(x) ф 0, and we define a(x) = f(x)/g(x). [That is, we lift /, g to F, G homogeneous polynomials of degree n in к [Хо,..., Xn], let ж be a set of homogeneous coordinates of x, and take F(x)(/G(x). It is clear that this value is unchanged if we take a different set of homogeneous coordinates, or if we change F and G by members of Pn, so we have a well-defined function.] We still should check that if a € Г A7,0,.) and a(x) = 0 for all x € U, then a = 0. But this also comes out of the next step, which consists in checking that (X,ox) is locally isomorphic to an affine variety. In fact, we claim that for all г, 0 < г < n, (X D fn(k)xi, restriction of ox) is an affine variety. We will check this only for Xo, since the general case goes just the same. For every homogeneous polynomial FgP, F/Xq^ can be written as a polynomial F' in the variables ^,... ^. Let P' С к [Y\,..., Yn] be the ideal generated by all these F'. We can map k[Y\,...,Yn] into a subring of k(X) by taking Yi to the function given by Xi/Xo; the kernel of this map is exactly P', so P' is prime. It is easy to see that we get an isomorphism <p : X П fn(k)xn —> X' = V(P') С kn by taking x to (I1,..., ^ J; if is actually a homeomorphism (cf. Problem at end of §3). Now for x e X D fn(k)x0, the local ring a,, is the set of all elements of k(X) having the form f/g for /, g in some Rm, g(x) ф 0. X' has the affine coordinate ring R' = к [Yi,..., Yn] /P. If k(X') is its quotient field, 2^(ж) is the set of all elements in k(X') having the form F/G for F,G e R', G(ip(x)) Ф 0. The map defined above taking R' into k(X) extends to an isomorphism k(X') —> k(X). I claim that this map takes o(fiX precisely onto o^.. First of all it clearly maps ovx into ox; and if f,g e Rm, g{x) ф 0, then f/g = f/X^jg/X^ in k{X) and f/Xfi1, g/X^ come from F,G in R' with G(<p(x)) ф 0. Thus the local rings correspond; since the sheaves were defined by intersecting local rings, they also correspond, and X D fn(k)x0 is indeed an affine variety.
30 I. Varieties Definition 5. Let X and Y be prevarieties. A map / : X —> Y is a morphism if / is continuous and, for all open sets V in Y, Proposition 6. Let f : X -> Y be any map. Let (Vi) be a collection of open affine subsets covering Y. Suppose that {Ui} is an open covering of X such that 1) f(Ui) С Vi and 2) f* maps Г (Vi,oY) into Г (Ui,ox). Then f is a morphism. Proof. We may assume the Ui are affine; for if U С Ui is affine, /* certainly maps r(Vi,oY) into Г (U,ox), and we can replace Ui by a set of affines that cover Ui. First of all, the restriction fi of / to a map from Ui to Ц, is a morphism. In fact, the homomorphism is also induced by some morphism gi : Ui —>• Vi (Proposition 1, §3). And since the functions in r(Vi,oY) separate points, a map from Ui to Vi is determined by the contravariant map from F(Ui,ox) to r(Vi,oY). Therefore fi = gi and fi is a morphism. In particular, fi is continuous and this implies immediately that / itself is continuous. It remains to check that /* always takes sections oY to sections ox. But if У С У is open, and g € Г (V,oY), then g • / is at least a section of ox on the sets f~l (V П Vi), hence on the sets f-1(V) П Ui. Since ox is a sheaf, g • f is actually a section of ox in f~l (V). D To illustrate the meaning of our definitions, it seems worthwhile to work out in detail a non-trivial example. We shall reconsider Example F, §3. Let С be the plane cubic curve defined in homogeneous coordinates by: zy2 = x (x2 - z2) . Look first at СПР2(А;J, with affine coordinates X = x/z, Y = y/z. The equation of С becomes: Y2 = X ■ (X2 - 1). For all lines I through the origin, we want to interchange the 2 points in l П С (other than the origin). Start with a point (a, b) £ C. This is joined to the origin by the line
§5. Definition of prevaxieties and morphisms 31 X = at Y = Ы. Intersecting this with the cubic, we get the equation bH2 = at(a2t2-l) or 0 = at(t - 1) (a2t + 1) . Thus the 2nd point of intersection is given by t = —I/a2. In other words, the morphism on С is to be given by: These are not polynomials, so at any rate they do not define a map from this affine piece of С into itself; this is as it should be, since as we can see from the drawing, we want the origin itself to go to the one point at infinity on the cubic. To describe the subsets on which we will get a morphism, we must throw out the various "bad" points one at a time. We need names for them: Pi = @,1,0) (the only point at oo with respect to the affine piece X, Y) P2 = @,0,1) (the origin) /V ~ / \ 'n -i\ \ other points on the X-axis . ЧГ2 = (-1>U, 1) J The morphism - call it / - should interchange Pi and P2, Q\ and Q2. Define V\ = C-{PUP2) U2 = C-iP^QuQ^ U3 = C-{P2,QUQ2} V2 = С-{P2,Q1,Q2} = С nf2(k)y. Then 1) Ui,U2, Щ is an open covering of C, 2) V\, V2 is an affine open covering of C, and 3) if / is defined set-theoretically as above, then f(Ui) С Vi, f{U2) С V2 and /A1з) С V\. Therefore, by Proposition 6, it suffices to check that and Г[Г(У210с)] С and then it follows that / is a morphism. In more down to earth terms: note that X, Y are affine coordinates in Vi and
32 I. Varieties S = x/y T = z/y are affine coordinates in V2. Then what we have to check is that /, described via one of these 2 sets of coordinates, is given by polynomials in U\, U2, and U3. In U\\ X,Y and 1/X are functions in r(Ui,oc). Thus describing the image point also by coordinates X, Y, f is given by: and these are polynomials in a, 6, I/a. In JJi'- X, Y and ^ty are functions in Г A/2,0^1). Describe the image of the point (a, 6) by coordinates S, T now: S[f(a,b)] = a/b T[f(a,b)} = -a2/b. This does not look very promising until we use the relation b2 = a- (a2 — 1) to rewrite these as: S = b/a?-l T = -ab/a2 - 1 which are polynomials in a, 6, I/a2 — 1. In £/3: S and T are functions in Г^Цз,^). Describe the image of the point S = с, Т = d by coordinates X, Y: X[f(c,d)] = -d/c Y[f(c,d)} = -d/c2. However с and d are related by d = с • (с2 — d2), so we get: X = d2-c2 Y = d(c2-d?)-c. D Problem. Generalize the result by which we covered Pn by open affine sets as follows: for all homogeneous polynomials H G к [Xo,... ,Xn] of positive degree, show that Рп(&)я 1S an an<me variety.
§6. Products and the Hausdorff Axiom 33 §6. Products and the Hausdorff Axiom We want to define the product X x Y of any two prevarieties X, Y. Now we will certainly want to have An x Am = An+m. But the product of the Zariski topologies in An and Am does not give the Zariski topology in An+m; in A1 x A1, for instance, the only closed sets in the product topology are finite unions of horizontal and vertical lines. The only reliable way to find the correct definition is to use the general category-theoretic definition of product. Definition 1. Let С be a category, X, Y objects in C. An object Z plus two morphisms X Z \ Y is a product if it has the following universal mapping property: for all objects W and morphisms W X Y there is a unique morphism t : W -> Z such that r = p-t, s = q-1, i.e., such that commutes. The induced morphism t in this situation will always be denoted (r, s). We call p and q the projections of the product onto its factors. Clearly a product, if it exists, is unique up to a unique isomorphism commuting with the projections.
34 I. Varieties The requirement of the definition can be rephrased to say hom(W, Z) -^> Ьот(И^, X) x Ьот(И^, Y) (under the obvious map induced by p and q). We shall prove that products exist in the category of prevarieties over k. Note that we have no choice for the underlying set; for if X x Y is a product of the prevarieties X and Y, X x Y as a point set must be the usual product of the point sets X and Y. To see this, let W be a simple point; this is a prevariety (A0, in fact). The maps of W to any prevariety S clearly correspond to the points of S, and by definition hom(W,X x Y) ~ hom(W,X) x hom(W,Y). Proposition 1. Let X and Y be affine varieties, with coordinate rings R and S. Then 1) there is a product prevariety X x Y. 2) X xY is affine with coordinate ring R <g>j. S. 3) a basis of the topology is given by the open sets 4) 2(Ж)У) is the localization of ox ®k Qy at the maximal ideal mx • oy + ox • my. Proof. We recall the following result from commutative algebra: let R and S be integral domains over the algebraically closed field k. Then R <g>j. S is an integral domain. [Cf. Zariski-Samuel, vol. 1, Ch. 3, §15.] Represent ХСГ as V(/i,...,/mi) . FCP as V(gi,...,gm2) . Then the set X x Y с кП1+П2 is the locus of zeroes of fj(Xi),gj(Yi) in к [Xi,..., Xni, Yi,..., Yn2\. Moreover But R <8>k S is an integral domain; hence (fj,gj) is prime, X x Y is irreducible, and R <g>j. 5 is its coordinate ring. This gives us an affine variety X xY. The next step is to prove that it is a categorical product. We have natural projections p,q:XxY —> X,Y [e.g., p(xu ... ,yn2) = (xu ... ,sni)] which are clearly morphisms. Suppose we are given morphisms f : Z -> X, s : Z -> Y. There is just one map of point sets t: Z ->• X xY such that r = p-t, s = g • t (since X x Y as a point set is the product of -X" and Y), and to verify the universal mapping property we need only check that t is always a morphism. But this is simple. Since X x Y is affine, it suffices to check that g G Г (X x Y,oXxY) =>■ p-t G T(Z,oz). NowT(X x У,оХхУ) is generated by the images of Г (X, ox) = R and Г (У, oY) = S. Both of these by composition with t
§6. Products and the Hausdorff Axiom 35 go into Г (Z,o%) since r and s are morphisms; therefore all of Г (X x Y,QXxY) goes into Г (Z,oz). We have proved 1) and 2), and 3) follows from 2). Now 0(Ж)У) is the localization of R <S>k S at the ideal of all functions vanishing at (x,y). Clearly R <S>k S С 0.x ®k Qy С О(ж у), and therefore we can get 0(Ж)У) by localizing ox <g>k oy at the ideal of all functions in it vanishing at (x,y). We claim that ideal is precisely mx -Qy + ox -niy. Evidently all these functions do vanish there. Conversely, if we take any h = £ /i ® 9% € ox®koy, with, say, /,-(ж) = a<, gi(y) = A, then we claim /i — X) aiA £ "Ъ • 2y + Qx • my. Indeed it equals D We can now "glue together" these affine products to obtain: Theorem 2. Let X and Y be prevarieties over k. Then they have a product. Proof. We start, of course, with the product set. For all open affine U С X, V С Y and all finite sets of elements fa G Г(и,ох), gi G r(V,oY) we form (U x V)sfi9i; these we take as a basis of the open sets. (They do form a basis, since (U x V)sfigi П (?/' x V')r/(e;. contains (U" x V")^/^..^/^»., where ?7" [resp. V"] is an affine contained in U П C/"' [resp. У П V'].) Note that ont/хУ this induces the topology of their true product. Let К be the quotient field of k(X) <8>д. fc(F) [which, as before, is an integral domain]. For x G X, у G Y we let О(Х)У) С iiT be the localization of ox ®k Qy at the ideal mx • oy + ox • my, and we set = П 2(»,y)' This gives us a sheaf of functions. Furthermore, it coincides on each U xV (U,V affine) with the product of the affine varieties. Clearly X x Y is connected and covered by finitely many affines, so it is a prevariety. Now suppose Z -^> X, Z -^> У are morphisms. Set-theoretically there is a unique map (r, s) : Z -> X x Y composing properly with the projections; we want to check that it is a morphism. For each U in X, V in Y affine, look at Zu,v = r~1(U) П s~1(V). These are open sets covering Z, and since being a morphism is a local property it is enough to prove (r, s) \ Zu.v is a morphism. That is, we may assume r{Z) С U, s(Z) С V. But in the last proposition we saw that then the product map Z ->• U x V is a morphism; and G x У is an open subprevariety of X x Y, so the composite map Z-^UxV^XxY is а morphism. Remarks. I. If U is any open subprevariety of X, then t/ x Y is an open subpre- subprevariety of X x Y. II. If Z is a closed subprevariety of X, then Z x Y is a closed subprevariety of X x У. [It is enough to prove (Z C\U) xV & closed subprevariety of U x V for U, V affine, and this is easily checked.]
36 I. Varieties Theorem 3. The product of two projective varieties is a projective variety. Proof. Since a closed subvariety of a projective variety is a projective variety, it is enough to show Pn x Pm is a projective variety. In fact, we can embed it as a closed subvariety of Pnm+n+m. Take homogeneous coordinates Xo,.. ■, Xn in Pn, Yo,..., Ym in Pm, and Uij (i = 0,..., n, j = 0,..., m) in P(n+1)(m+1)_1. Define a map: /.ГО» \у Ш> i Ш) . Jrn X Jrm У Jrnm+n+m by , ч / ч a point with homogeneous coordinates (xo,...,xn)x(yo,...,ym)—> TT._rqi [This makes sense; multiplying all X{ for all yj by A multiplies all Uij by A; and some Uij is nonzero.] Clearly We claim that first / is injective. Assume that for some А ф 0, Xiyj = Xx^y'j for all г and j; then we want to prove that for some fi, v ф 0, Xi = fj,x't, yj = vy'y We may by symmetry assume x0 ф 0, y0 ф 0. Then we have 0 Ф xoyo = x'oy'o, so x'o Ф^чУоФ 0- We have Xi/x0 = Xiyo/xoyo = х[уЬ/х'оу'о = х[/х'о and similarly for yj/yo- This proves what we want. Now we claim / is an isomorphism of (Pn)x; x (^m)y. onto a closed subva- subvariety of (Рп+т+пт)с/;.- We may assume i = 0,j = 0 for simplicity. On (Pn)x0 we take affine coordinates Si = Xi/Xo, i = 1,... ,n. On (Pm)y0 we take affine coordinates Tj = Yj/Y0, j = 1,... ,m. On (Pn+m+nm)[/oo we take affine coordi- coordinates Rij = Uij/Uoo, i > 1 or j > 1. In these coordinates, / takes (s,t) to the point Rij = Sitj if i, j > 1 ^iO — Si, Roj = tj . Hence the image is the locus of points satisfying Rij = RioRoj for all i,j > 1. This is certainly closed. Its affine coordinate ring is k[Rij] / (Rij — R^Roj), which is clearly isomorphic to the polynomial ring к [Rio, Roj]. Under /*, this is mapped isomorphically to к [Si,Tj] which is the affine coordinate ring of (Pn)x0 x (Pm)Fo. Hence we do have an isomorphism. Let Z = J(Pn x Pm). Since Z П (Pn+m+nm)v.. is closed for all i,j, Z is closed. / is a homeomorphism on each of these affines, so it is a homeomorphism globally, and in particular Z is irreducible. Thus Z is a projective variety. Since / is an isomorphism on each affine piece, it is an isomorphism globally and hence Z is isomorphic to Pn x Pm. D
§6. Products and the Hausdorff Axiom 37 The classical example is the embedding Pi x Pi -»• P3. Take coordinates u,v on Pi, s,t on Pi, x,y,z,w on P3. / is defined by x = us, у = vt, z = ut, w = vs. It is easy to see that the image of / is the quadric xy = zw. [Thus, over an algebraically closed field all nondegenerate quadrics (those corresponding to nondegenerate quadric forms) are isomorphic to Pi x Pi.] One can show in general that the homogeneous ideal of the image is generated by the elements UijUi'j' — Uij'Ui'j, so the image is an intersection of quadrics. We now take up the main topic of this section - the Hausdorff axiom: Definition 2. Let X be a prevariety. X is a variety if for all prevarieties Y and for all morphisms 9 {y e Y I f(y) = g(y)} is a closed subset of Y. One case of this criterion is particularly simple: let Y = X x X, f = g = p2. Also let A:X -^X xX be a morphism (idx, idx)- ^ is called the diagonal morphism. Then = {zeXxX\Pl(z)=p2(z)}. Therefore the Hausdorff axiom implies A{X) is closed. But the converse is also true: Proposition 4. Let X be a prevariety. Then X is a variety if and only if A(X) is closed in X x X. Proof. Suppose f,g : Y ->• X are given. Then induce a morphism (/, g) : Y ->• X x X. Since A{X) being closed implies the Hausdorff axiom. D Example R. Let X be: 7
38 I. Varieties i.e., 2 copies U\,U2 of A1, say with coordinates X\ and ж2, patched by the map Xi = X2 on the open sets x\ Ф 0 and x-i ф 0. Consider the isomorphisms of A1 with each of the 2 copies: О X с Call these гьг2 : A1 -)• X. Then {у G A1 | ii(y) = «2B/)} = A1 — {0} is not closed in A1, hence X is not a variety. Remarks. I. A subprevariety of a variety is a variety. A product of 2 varieties is a variety. II. An affine variety is a variety: in fact, if X is affine, Y is an arbitrary prevariety, and f,g :Y ->• X are 2 morphisms, then locus of zeroes of the functions s.f.-s.g., a\\ser(X,ox) and this set is closed. III. From II., we can check that if f,g : Y -> X are any morphisms of any prevarieties, then {y G Y \ f(y) = g(y)} is always locally closed. In fact, call this set Z. If z G Z, let У be an affine open neighbourhood of f{z) (= р(г)). Then in the affine variety and this set is closed since У is a true variety. IV. Another useful way to use the Hausdorff property is this: if / : X ->• Y is a morphism of prevarieties and Y is a variety, then the image of the morphism which is the ргарЛ of /, is closed. Moreover, if we let Гд be this image, then Г/ is even a closed subprevariety of X x Y isomorphic to X under the mutually inverse morphisms: (id,/) X
§6. Products and the Hausdorff Axiom 39 (here pi : X x Y -> X is the projection). Proposition 5. Let X be a prevariety. Assume that for all x,y G X there is an open affine U containing both x and y. Then X is a variety. Proof. Suppose f,g :Y -> X are 2 morphisms such that Z = {y G Y \ f(y) = g(y)} is not closed. Let z G Z, x = f(z), у = g(z). By assumption, there is an open affine V containing x and y. Let U = f~1(V) П g~1(V): U is an open neigh- neighbourhood of Z. If f',g' are the restrictions of f,g to morphisms from U to V, then ZnU = {yeU\f'(y) = is closed in U, since V, being affine, is a variety. Therefore z G Z C\U, and Z is closed. D Corollary. Every protective variety X is a variety. Proof. For all x,y e №n(k), there is a hyperplane not containing x or y, i.e., an element H = ЕоцХг G k[X0,...,Xn] such that x,y G Рп(&)я- But Рп(&)я is affine, hence X П Рп(&)я is open and affine in X. D Proposition 6. Let X be a variety, and let U, V be affine open subsets with coordinate rings R, S. Then U C\V is an affine open subset with coordinate ring R • S (the compositum being formed in k(X)). Proof. U x V is an affine open subset of X x X with coordinate ring R®k S. Let Z = A(X). Then Z П (U x V) is a closed subset of U x V isomorphic via A to UC\ V. Since A is an isomorphism, Zn(U xV) is irreducible, hence it is a closed subvariety of U x V. Therefore it is affine, and its coordinate ring T is a quotient of the coordinate ring of U x V. But it is also open in Z, hence Z С k(Z), i.e., T is the image of R <8>k S in k(Z). Again since Л is an isomorphism, this proves that UnV = A~1{Zn(UxV)} is an open affine subvariety of X, and that its coordinate ring is A*(R (8) S) in k(X). But A*(f®g) = f-g, hence A*(R<g>S) = R■ S. D Problem. Prove the following converse: let X be a prevariety, {Ui} an affine open covering of X. Let Ri be the coordinate ring of Ui. Then if Ui П Uj is an affine subset of X with coordinate ring Ri • Rj, X is a variety. Zariski and Chevalley have used a quite different definition of a variety which is useful for many questions, especially "birational" ones, i.e., questions dealing with various different prevarieties with a common function field. Definition 3. Let О С О' be local rings. We say O' dominates О or O' > O, if m' D m (equivalently m' ПО = m) where m, m' are the maximal ideals of O, O' respectively.
40 I. Varieties Local Criterion: Let X be a prevariety. Then X is a variety if and only if for all x, у G X such that x ф у, there is no local ring О С k{X) such that О > q_x and (9 > oy. The fact that this holds for varieties will be proven in Ch. II, §6. We omit the converse which we will not use. If one starts with this criterion, however, one can take the elegant Chevalley-Nagata definition of a variety: we identify a variety X with the set of local rings ox. For this definition, a variety is a finitely generated field extension К oik plus a collection of local subrings of К satisfying various conditions. [Notice: the topology can be recovered as follows - for each / G K, let Uf be the set of those local rings in X containing /.] Problem. Let X С Pn be defined by a homogeneous ideal P С к [Хо,..., Xn]. Let P define the affine variety X* С An+1. Show that for all г, f Subset of X* 1 ^ Ai / Subset of X 1 \ where 1^0 J" X \ where X{ ф 0 j " §7. Dimension Definition 1. Let X be a variety. dimX = tr.d.kk(X). If U is an open and nonempty set in X, dimU = dimX, since k{U) = k(X). Since к is algebraically closed, the following are equivalent: ) ii) k(X) = к iii) X is a point. Proposition 1. LetY be a proper closed subvariety ofX. ThendimY < dimX. Proof . Lemma. Let R be an integral domain over к, Р С R a prime. Then tr.d.kR > tr.d.kR/P, with equality only if P = {0} or both sides are oo. [By convention, tr.d.R is the tr.d. of the quotient field of R.J Proof. Say P ф 0, tr.d.kR = n < oo. If the statement is false, there are n elements X\,..., xn in R such that their images xi in R/P are algebraically independent. Let 0 ф p G P. Then p,Xi,...,xn cannot be algebraically inde- independent over к, so there is a polynomial P(Y,Xi,...,Xn) over к such that P (p, x\,..., xn) = 0. Since R is an integral domain, we may assume P is irre- irreducible. The polynomial P cannot equal aY, a G /г, since p ф 0. Therefore P is not even a multiple of Y. But then P @,xi,...,xn) = 0 in Л/Р is a nontrivial relation on the X\,...,xn- Now choose [/Clan open affine set with UOY ф 0. Let R be the coordinate ring of U, P the prime ideal corresponding to the closed set UdY. Then P ф 0, since U C\Y фи. к(Х) is the quotient field of Л, and /г(У) is the quotient field of R/P. Therefore the Proposition follows immediately from the lemma. D
§7. Dimension 41 In the situation of this Proposition, dimX - dim У will be called the codi- mension of X in Y. This is half of what we want so that our definition gives a good dimension function. The other half is that it does not go down too much. Theorem 2. Let X be a variety, U С X open, g G r(U,ox), Z an irreducible component of {x G U \ g{x) = 0}. Then if g фО, dim Z = dimX — 1. Proof. Take Uo С U an open affine set with Uo П Z ф 0. Let R = r(U0,ox), f = resu,uo9 £ R- Then Z П Uo corresponds to a prime P С R. Z is by hypo- hypothesis a maximal irreducible subset of the locus g = 0, so Z П t/o is a maximal irreducible subset of the locus / = 0, i.e., P is a maximal prime containing /. Thus we have translated Theorem 2 into: Algebraic Version (Krull's Principal Ideal Theorem). Let R be a finitely generated integral domain over k, f G R, P an isolated prime ideal of (/) (i.e., minimal among the prime ideals containing it). Then if f ф 0, tr.d.kR/P = tv.d.kR- 1. This is a standard result in commutative algebra (cf. for example, Zariski- Samuel, vol. 2, Ch. 7, §7). However there is a fairly straightforward and geometric proof, using only the Noether normalization lemma and based on the Norm, so I think it is worthwhile proving the algebraic version too. This proof is due to J. Tate. At this point, it is convenient to translate the results of §1 into the geometric framework that we have built up: Definition 2. Let / : X ->• Y be a morphism of affine varieties. Let R and S be the coordinate rings of X and Y, and let /* : S ->• R be the induced homeomorphism. Then / is finite if R is integrally dependent on the subring f*(S). Note that the restriction of a finite morphism from X to Y to a closed subvariety Z of X is also finite. Examples N and О in §3 are finite morphisms, but example P is not. The main properties of finite morphisms are the following: Proposition 2. Let f : X ->• Z be a finite morphism of affine varieties. A.) Then f is a closed map, i.e., maps closed sets onto closed sets. B.) For all у eY, f~l(y) is a finite set. C.) f is surjective if and only if the corresponding map f* of coordinate rings is injective. Proof. Let / be dual to the map f*:S->Rof coordinate rings. As usual, the maps V, / set up bijections between the points of X (resp. Y) and the maximal ideals m of R (resp. S). If V(A) is a closed set in X, then f(V(A)) corresponds to the set of maximal ideals /* (m), m с R a maximal ideal containing A. But by the going-up theorem, whenever R D S and R is integral over S every prime ideal P С S is of the form P' П S for some prime ideal P' С R (cf. §1). If В = f*~\A), apply the going-up theorem to S/B С R/A; therefore f(V(A))
42 I. Varieties corresponds to the set of maximal ideals m such that m D B, i.e., to V(B). Therefore, / is closed. Moreover, f(X) = Y if and only if Ker(/*) = @). B.) is equivalent to saying that for all maximal ideals m С S, there are only a finite number of maximal ideals ml С R such that m = f* {ml). Since such ml all contain f*(m) • R, it is enough to check that R/f*(m) • R contains only a finite number of maximal ideals. But R/f*(m) • R is integral over S/m, i.e., it is a finite-dimensional algebra over k, so this is clear. D Via this concept, we can state: Geometric form of Noether's normalization lemma. Let X be an affine variety of dimension n. Then there exists a finite surjective morphism ж: Proof of Theorem 2. We first reduce the proof to the case P = \/(/), i.e., geometrically, to the case Z = V((f)). To do this, look back at the way in which we related the geometric and algebraic versions of the Theorem. Note that the algebraic version is essentially identical to Theorem 2 in the case X = U is affine with coordinate ring R. Suppose we have the decomposition (/) = P П P[ П ... П Pi in R. Geometrically, if Z[ = V (P/), this means that Z, Z{,..., Z[ are the com- components of V((f)). Now we pick an affine open Uq С X such that: Uo П Z ф 0 Uo П Z[ = 0, i = l,...,t. For example, let Uo = Xg, where: деР{П...ПР1, д^Р. Then replace X by Uo, R by R(g), and in the new set-up, Vuo((f)) = Vx((f))nU0 = ZHUo is irreducible; hence in R^, y/{f) = P • R(g) is prime. Now use the normalization lemma to find a morphism as follows: X R k[Xu...,Xn] = S. Let К (resp. L) be the quotient field of R (resp. S). Then K/L is a finite algebraic extension. Let
§7. Dimension 43 /o = Then I claim /o G 5 and If we prove (*), the theorem follows. For R/P is an integral extension of S/SDP, so tr.d.kR/P = tr.d.fcS/S П P . But 5 is a UFD, so the primary decomposition of a principal ideal in S is just the product of decomposition of the generator into irreducible elements. Therefore (*) implies that /o is a unit times /q0 for some integer £ and some irreducible /oo, and that P П S = (/oo)- Hence tr.d.fcS/S П P = tv.d.kk [Xu... ,Xn] /(/oo) = n. We check first that /0 G P П S. Let be the irreducible equation satisfied by / over the field L. Then /o is a power (an)m of an. Moreover, all the a^s are symmetric functions in the conjugates of / (in some normal extension of L): therefore the сц are elements of L integrally dependent on S. Therefore ai G S. In particular, /o = a™ G S, and since — an ' \J +aiJ +... + an) - J • \an J +an alt +--- + an <hi-l) + JO , /o G P too. Finally suppose g G P П 5. Then g e P = y/(fj, hence n _ f h some integer n , 5 ~ ; " ' some Л G Л . Taking norms, we find that 9n[K:L] = NK/L(gn) = NK/L(f)-NK/L(h)e(f0) since NK/ih is an element of S, by the reasoning used before. Therefore g G a/(/o), and (*) is proven. D Definition 3. Let X be a variety, and let Z С X be a closed subset. Then Z has риге dimension r if each of its components has dimension r (similarly for pure codimension r).
44 I. Varieties The conclusion of Theorem 2 may be stated as: V((g)) has pure codimen- sion 1, for any non-zero g G Г(Х,ох). The Theorem has an obvious converse: Suppose Z is an irreducible closed subset of a variety X of codimension 1. Then for all open sets U such that Z C\U Ф 0 and for all non-zero functions f on U vanishing on Z, Z П U is a component of / = 0. In fact, if W were a component of / = 0 containing Z C\U, then we have dimX > dimW > dim ZnU = dimX - 1 hence W = Z C\U by Proposition 1. Corollary 1. Let X be a variety and Z a maximal closed irreducible subset, smaller than X itself. Then dim Z = dimX — 1. Corollary 2 (Topological characterization of dimension). Suppose is any maximal chain of closed irreducible subsets of X. Then dimX = r. Proof. Induction on dimX. Corollary 3. Let X be a variety and let Z be a component of V ((/i, •. •, fr)), where /i,..., fT G Г (X,ox). Then codim Z <r. Proof. Induction on r. Z is an irreducible subset of V ((/i,..., Л—1)M so it is contained in some component Z' of V ((/i,..., Л—i)). Then Z is a component of Z'nV((fr)), since Z'r\V((fr)) СУ((/Ь..., fr)). By induction codim Z' < r-1. If fr e 0 on Z', Z — Z'. If /r does not vanish identically on Z', then by the theorem, dim Z = dim Z' — 1, so codim Z <r. D Of course, equality need not hold in the above result: e.g., take fi = .. • = fi, r> 1. Corollary 4. Let U be an affine variety, Z a closed irreducible subset. Let r = codim Z. Then there exist f\,..., fr inR = Г(U,Оц) such that Z is a component ofV((fu...,fr)). Proof. In fact, we prove the following. Let Z\ D Z2 D ... D ZT = Z be a chain of irreducibles with codim Zi = i (by Cor. 2). Then there are f\,..., fr in R such that Zs is a component of V ((/i,..., fs)) and all components of V ((/i,...,/«)) have codim s. We prove this by induction on s. For s = 1, take f\ G I(Zi), f\ ф 0, and we have just the converse of the theorem. Now say /i,..., /s-i have been chosen. Let Zs-\ = Yi,...,Yt be the components of V ((/i,..., Л-i))- For all г, Zs 75 Yi (because of their dimensions), so I(Yi) ~fi I(ZS). Since the I(Yi) are prime, ULi ОД) 2 Hzs) [Zariski-Samuel, vol. 1, p. 215; Bourbaki, Ch. 2, p. 70]. Hence we can choose an element fs G I(ZS), 's ^ Ui=i
§7. Dimension 45 If Y is any component of V ((/i,..., fs)), then (as in the proof of Cor. 3) Y is a component of Yi П V ((/s)) for some г. Since fs is not identically zero on Y{, dim У = dimYi — 1, so codim Y = s. By the choice of fs, Zs С V ((/i,..., /s)). Being irreducible, Zs is contained in some component of V ((/i,..., fs)), and it must equal this component since it has the same dimension. . D In the theory of local rings, it is shown that one can attach to every noetherian local ring О an integer called its Krull dimension. This number is defined as either a) the length r of the longest chain of prime ideals: PogP1g...gPn = M (M the maximal ideal of O) or b) the least integer n such that there exist elements /i,..., fn G M for which (Cf. Zariski-Samuel, vol. 1, pp. 240-242; vol. 2, p. 288. Also, cf. Serre, Algebre Lo- Locale, Ch. 3B). Recall that in §5 we attached a local ring oz x to every irreducible closed subset Z of every variety X. We now have: Corollary 5. The Krull dimension ozx is dimX — dimZ. Proof. By Corollary 2, for all maximal chains of irreducible closed subvarieties: n = dim X — dim Z. But it is not hard to check that there is an order reversing isomorphism between the set of irreducible closed subvarieties Y and X contain- containing Z, and the set of prime ideals P С Qz x. Or else, use the second definition of Krull dimension: first note that if /i,..., /n G Qz x, then Mz,x = if and only if there is an open set U С X such that a) UnZ^iH, b) /ь...,/пег(г7,ох), с) UnZ = V((fu...Jn)). Then using Corollaries 3 and 4, it follows that the smallest n for which such /j's exist is dim X — dim Z. D Suppose Z С X is irreducible and of codimension 1. A natural question to ask is whether, for all у e Z, there is some neighbourhood U of у in X and some function / e Г (U,ox) such that Z П U is not just a component of / = 0, but actually equal to the locus / = 0. More generally, if Z С X is a closed subset of pure codimension r, one may ask whether, for all у G Z, there is a neighbourhood U of у and functions /b ..., fr e Г (U, ox) such that
46 I. Varieties znu = v«fu...,fr)) . This is unfortunately not always true even in the special case where Z is ir- irreducible of codimension 1. A closed set Z with this property is often referred to as a local set-theoretic complete intersection and it has many other special properties. There is one case where we can say something however: Proposition 4. Let X be an affine variety with coordinate ring R. Assume R is a UFD. Then every closed subset Z С X of pure codimension 1 equals V((f)) for some f G R. Proof. Since R is a UFD, every minimal prime ideal of R is principal [i.e., say P С R is minimal and prime. Let /eP. Since P is prime, P contains one of the prime factors /' of /. By the UFD property, (/') is also a prime ideal and since (/') С P, we must have (/') = P]. Let Z\,..., Zt be the components of Z. Then / ((Zi),..., I(Zt)) are minimal prime ideals. If I(Zi) = (fi), then Z = V((fu...,ft)) . D Proposition 5. dimX x Y = dim-X" + dimF. The proof is easy. The results and methods of this section all have projective formulations which give some global as well as some local information: Let X С Pn(fc) be a projective variety, and let I(X) С к [Xo,... ,Xn] be its ideal. Theorem 2*. /// G к [Xo,..., Xn] is homogeneous and not a constant and f $ I, then X П V((/)) is non-empty and of pure codimension 1 in X, unless Proof. All this follows from Theorem 2, except for the fact that X П V((/)) is not empty if dimX > 1. But let X* С An+1 be the cone over X, i.e., the affine variety defined by the ideal I(X) in k[X0,.. .,Xn]. By the problem in §6, we know that dimX* = dimX + 1, hence dimX > 2. Let V*((/)) be the locus / = 0 in An+1. Since therefore X* n V*((/)) Ф 0 and by Theorem 2, X* П V*((/)) has a compo- component of dimension at least 1. Therefore X* П V*((/)) contains points other than @,0,... ,0); but the affine coordinates of such points are homogeneous coordi- coordinates of points in X П V((/)). □ Corollary 3*. If fi,. ..,fr G к [Xo,... ,Xn] are homogeneous and not con- constants, then all components of X П V ((/i,... ,/r)) have codimension at most r in X. And if dim X > r, then X C\V ((/i,..., /r)) is non-empty.
§7. Dimension 47 Corollary 4*. If Y С X is a closed subvariety, (resp. Y is the empty subset), then there exist homogeneous non-constant elements /i,...,/refe [Xo,... ,Xn] where r = codimension of Y (resp. r = dim X + 1) such that Y is a component ofXnV((f1,...,fr)) (resp.XnV((fu...Jr)))=<b. Proof One can follow exactly the inductive proof of Corollary 4 above, using к [Xo,..., Xn] instead of the affine coordinate ring. In case Y = 0, the last step is slightly different. By induction, we have f\,..., fr such that xnv((fu...,fr)) is a finite set of points. Then let /r+i be the equation of any hypersurface not containing any of these points. D This implies, for instance, that every space curve, i.e., one-dimensional sub- variety of Рз(&), is a component of an intersection Hi П H2 of 2 surfaces. Proposition 4*. Every closed subset о/Рп(/г) of pure codimension 1 is a hyper- hypersurface, i.e., equals V((f)) for some homogeneous element f e к [Xo,... ,Xn]. Proof. Exactly the same as Proposition 4, using the fact that к [Xo,... ,Xn] is a UFD. D An interesting corollary of these results is the following global theorem: Proposition 6. Suppose is a morphism. Assume W = /(Pm) is closed (actually this is always true as we will see in §9). Then either W is a single point, or dim W = n. Proof. Let r = dim W and assume 1 < r < n — 1. By Cor. 4* applied to the empty subvariety Y of W, there are homogeneous non-constant elements /i > /2, • • • > fr+i ^ к [Xo,..., Xm] such that Also, by Cor. 2*, W П V((fi)) ф 0 for all i. Let Zi = f-lV((fi)). Then Zi П ... П Zr+i =0, and Z{ ф 0, 1 < % < r + 1. Note that the hypersurface V((fi)) in Pm is defined locally by the vanishing of a single function. Therefore the closed subset Zi in Pn is also defined locally by the vanishing of a single function. Therefore Zi = Pn or else Zi is of pure codimension 1, hence by Cor. 4* a hypersurface. Since r + 1 < n, the intersection of r + 1 hypersurfaces in Pn cannot be empty because of Cor. 3*. This is a contradiction, so in fact r = 0 or r = n. D
48 I. Varieties §8. The fibres of a morphism Let / : X ->• Y be a morphism of varieties. The purpose of this section is to study the family of closed subsets of X consisting of the sets f~1(y), у GF. Definition 1. A morphism / : X -> Y is dominating if its image is dense in Y, i.e., Y = f(X). Proposition 1. If f : X -> Y is any morphism, let Z = f(X). Then Z is irreducible, the restricted morphism f : X —> Z is dominating and /'* induces an injection k(Z) С к(Х) . Proof. Suppose Z = Wi U W2, where W\ and W2 are closed subsets. Then X = f-\W{) U f~1(W2). Since X is irreducible, X = f~l{W{) or f~l{W2), i.e., f(X) С Wi or W2. Therefore Z = f(X) is equal to Wx or W2: hence Z is irreducible. /* is clearly dominating and since f'(X) is dense in Z, for all nonempty open sets U С Z, f~1(U) is nonempty and open in X; therefore, we obtain a map: k(Z)= lim r(U,oz) A lim r(V,ox) = k(X) . U CZ V CX open open nonempty nonempty D This reduces the study of the fibres of an arbitrary morphism to the case of dominating morphisms. Note also that a finite morphism is dominating if and only if it is surjective. Theorem 2. Let f : X —» Y be a dominating morphism of varieties and let r = dimX — dimF. Let W С Y be a closed irreducible subset and let Z be a component of f~1(W) that dominates W. Then dim Z > dim W + r or codim (Z in X) < codim (W in Y). Proof. If U is an affine open subset of Y such that U C\W ф 0, then to prove the theorem we may as well replace Y by U, X by f~1(U), W by W П U, and Z by Z П f~l(U). Therefore we assume that Y is affine. By Cor. 4 to Th. 2, §7, if s = codim (W in Y), there are functions /1,..., fs G Г (Y,oY) such that W is a component of V ((/1,..., /s)). Let gi E Г (Х,ох) denote the function /*(/i). Then Z С V ((gi,...,g8)) and I claim Z is a component of V ((#i,... ,gs))- Suppose ZcZ'cV((gi,...,g.)) ,
§8. The fibres of a morphism 49 where Z' is a component of V ((#1,..., gs)). Then W = f(Z) С f(Z') С V((/i,...,/,)). Since W is a component of V((/i,...,/s)) and /(Z') is irreducible, it follows that W = f(Z'). Therefore, Z' С f~l(W). But Z is a component of /^(W)- Therefore Z = Z', and Z is also a component of V ((gi,...,gs)). By Cor. 3 to Th. 2, §7, this proves that codim (Z in X) < s. D Corollary. // Z is a component of f~l{y), for some у е У', then dimZ > r. Theorem 3. Let f : X ->• У бе a dominating morphism of varieties and let r = dimX — dim Y. 77ien there exists a nonempty open setU С У such that: i) Ucf(X) ii) for all irreducible closed subsets W С Y such that W П U Ф 0, and for all components Z of f~l{W) such that Z П f~l(U) Ф 0, dim Z = dim W + r or codim (Z in X) = codim (W in Y) . Proof. As in Theorem 2, we may as well replace У by a nonempty open affine subset; therefore, assume that Y is affine. Moreover, we can also reduce the proof easily to the case where X is affine. In fact, cover X by affine open sets {Xi} and let fa : Xi -> Y by the restriction of /. Let Ui С Y satisfy i) and ii) of the theorem for /j. Let U = C\Ui. Then with this U, i) and ii) are correct for / itself. Now assume X and Y are affine, and let R and S be their coordinate rings. / defines a homomorphism /• : S —> R which is an injection by Proposition 1. Let К = k(Y), the quotient field of S. Apply the normalization lemma to the К -algebra R <8>s К. Note that R <8>s К is just the localization of R with respect to the multiplicative system S*, hence it is an integral domain with the same quotient field as R, i.e., k(X). In particular, tiA.K(R®sK) = tr.d.k{Y)k(X) = tr.d.kk(X) - tr.d.kk(Y) = r . Therefore, there exists a subring: K[Yl,...,Yr]cR®sK such that R®s К is integrally dependent on К [Yi,..., Yr]. We can assume that the elements Yi are actually in the subring R: for any element of R <8>s К is the product of an element of R and a suitable "constant" in К. Now consider the 2 rings: S[Yu...,Yr]cR.
50 I. Varieties R is not necessarily integral over S [Yi,..., Yr]; however, if a G R, then a satisfies an equation: where the Pi are polynomials with coefficients in К. If g is a common denom- denominator of all the coefficients, a is integral over S(g) [Yb... ,Yr]. Applying this reasoning to a finite set of generators of R as an 5-algebra, we can find some g G S such that R^ is integral over S(g) [Yi,... ,Yr]. Define U С Y as Yg, i.e., {y G Y | g{y) ф 0}. The subring 5E) [Yi,..., Yr] in R^ defines a factorization of / restricted to f~1(U): X D ГНи) Ux Pi X D U where тг is finite and surjective. This shows first of all that U С f(X) which is (i). To show (ii), let W С Y be an irreducible closed subset that meets U, and let Z С X be a component of f~l{W) that meets f~l{U). It suffices to show that dim Z < dim W + r since the other inequality has been shown in Theorem 2. Let Zq = Z П f~l{U) and let Wo = W П U. Then 7r(Zo)c Wox Therefore dim тг (Zo) < dim (Wo x Ar) = dim W + r . The restriction тг' of тг to a map from Zo to тг (Zo) is still dominating and finite. Therefore it induces an inclusion of к (тг (Zo)) in к (Zo) such that к (Zo) is algebraic over /г (тг (Zo)). Therefore dim Z < dim тг (Zo) < dim W + r . D Corollary 1. f as above. Then there is a nonempty open set U С Y such that, for all у EY, f~l{y) is a nonempty "pure" r-dimensional set, i.e., all its components have dimension r. Theorems 2 and 3 together give a good qualitative picture of the structure of a morphism. We can work this out a bit by some simple inductions.
§8. The fibres of a morphism 51 Definition 2. Let X be a variety. A subset A of X is constructible if it is a finite union of locally-closed subsets of X. The constructible sets are easily seen to form a Boolean algebra of subsets of X; in fact, they are the smallest Boolean algebra containing all open sets. A typical constructible set which is not locally closed is Corollary 2 (Chevalley). Let f : X ->■ Y be any morphism. Then the image of f is a constructible set in Y. More generally, f maps constructible sets in X to constructible sets in Y. Proof. The second statement follows immediately from the first. To prove the first, use induction on dim Y. a. If / is not dominating, let Z = f(Y). Then f(Y) с Z and dimZ < dimF, so the result follows by induction. b. If / is dominating, let U С Y be a nonempty open set as in Theorem 3. Let Zi,..., Zt be the components of Y - U and let Wit,..., Wisi be the components of f~l{Zi). Let g{j : Wij -» Z{ be the restriction of /. Since dim Zi < dimF, gij(Wij) is constructible. Since f(X) is constructible too. Corollary 3 (Upper semi-continuity of dimension). Let f : X any morphism. For all x G X, define < \ ( /,. ry\ Z a component of e(x) = ^max(dimZ) ,_W//- ^ ... [ / {f{x)) containing ж U Y be
52 I. Varieties Then e is upper semi-continuous, i.e., for all integers n Sn(f) = {xeX\ e(x) > n} is closed. Proof. Again do an induction on dim Y. Again, we may well assume that / is dominating. Let U С Y be a set as in Theorem 3. Let r = dim X -dim Y. First of all, if n < r, then Sn(f) = X by Theorem 2, so Sn(f) is closed. Secondly, if n > r, then Sn(f) С X - /-1(k0 by Theorem 3. Let Z\,...,Zt be the components of Y — U, Wit,. ,.,WiSi the components of f~x(Zi) and gij the restriction of / to a morphism from Wij to Zi. If Sn(gij) is the subset of Wij defined for the morphism g^, just as Sn(f) is for /, then Sn(gij) is closed by the induction hypothesis. But if n > r, then it is easy to check that so Sn(f) is closed too. □ Definition 3. A morphism / : X ->■ Y is birational if it is dominating and the induced map Г : HY) -> k(X) is an isomorphism. Theorem 4. If f : X -+Y is a birational morphism, then there is a nonempty open setUcY such that f restricts to an isomorphism from f~1(U) to U. Proof. We may as well assume that Y is affine with coordinate ring S. Let U С X be any nonempty open affine set, with coordinate ring R. Let W = f(X — U). Since all components of X—U are of lower dimension than X, also all components of W are of lower dimension than Y. Therefore W is a proper closed subset of Y. Pick g € S such that g = 0 on W, but g ф 0. Then it follows that f-\Yg)cU . If g' = f * g is the induced element in R, then in fact so by replacing Y by У5 and X by £/«,/, we have reduced the proof of the theorem to the case where Y and X are affine. Now assume that R and S are the coordinate rings of X and Y. Then / defines the homomorphism k(X) R r ( С с
§8. The fibres of a morphism 53 Let rci,..., xn be a set of generators of i?, and write Xi = yi/g, j/i,..., yn, g e S. Then /* localizes to an isomorphism from S(g) to Щд). Therefore Yg satisfies the requirements of the theorem. □ The theory developed in this section cries out for examples. Theorem 3 and its corollaries are illustrated in the following: Example S. A2 -A A2 defined by: f(x,y) = (xy,y) . i) The image of / is the union of @,0) and (A2) = A2 - {points where у = 0} . This set is not locally closed. ii) / is birational, and if U = A2,, then /-1 (U) = A2, and the restriction of / to a map from f~1(U) to U is an isomorphism. iii) On the other hand, /-1(@,0)) is the whole line of points (x,0). iv) S0(f) = A2, Si(f) = {(x,0)}, S2(f) = 0 (notation as in Corollary 3, Theo- Theorem 3). To illustrate Theorem 4, look again at: Examples 0, P bis. In example O, §4, we defined a finite birational morphism / : A1 -> С , where С is the affine plane curve X3 = Y2. If U = С - {@,0)}, then f'^U) = A1 - {@)}, and f~l{U) -^> U. On the ring level: k[T] 2&[T2,T3] , but fcp^T-1] = к [Т2 ,T3 ,T~2] . In example P, we defined a finite birational morphism / : A1 —> D and then considered its restriction /M1 -{1}—>D to get a bijection. If U = D - {@,0)}, then f'-l{U) = A1 - {A), (-1)} and
54 I. Varieties §9. Complete varieties An affine variety can be embedded in a projective variety, by a birational in- inclusion. Can a projective variety be embedded birationally in anything even bigger? The answer is no; there is a type of variety, called complete, which in our algebraic theory plays the same role as compact spaces do in the theory of topological spaces. These are "maximal" and projective varieties turn out to be complete. Recall the main result of classical elimination theory (which we will reprove later): Given r polynomials, with coefficients in k: fr(x0,...,xn; yi,...,ym) , all of which are homogeneous in the variables xo, • ■ •, xn, there is a second set of polynomials (with coefficients in к): 9\ (уь-..,Ут) such that for all m-tuples (ai,...,am) in k, gi(a,\,.. .,am) = 0, all i if and only if there is a non-zero (n + l)-tuple (bo,...,bn) in к such that fi (bo,... ,ftm; ai,...,am) = 0, all i. (Cf. van der Waerden, §80). In our lan- language, the equations /i = ... = fr = 0 define a closed subset IcPnxAm . Let p2 be the projection of Fn x Am onto Am. The conclusion asserted is that is a closed subset of Am; in fact that In other words, the theorem is: P2 : Fn x Am —> Am is a closed map, i.e., it maps closed sets onto closed sets, (modulo the fact that every closed subset of Fn x Am is described by a set of equations /i,..., fr as above). This property easily implies the apparently stronger property -p2 : Fn xX ->■ X is closed, for all varieties X. This motivates: Definition 1. A variety X is complete if for all varieties Y, the projection mor- phism P2 : X x Y —> Y is a closed map.
§9. Complete varieties 55 The analogous property in the category of topological spaces characterizes compact spaces X, at least as long as X is a reasonable space - say completely regular or with a countable basis of open sets. This definition is very nice from a category-theoretic point of view. It gives the elementary properties of complete- completeness very easily: i) Let / : X ->■ Y be a morphism, with X complete, then f(X) is closed in Y and is complete. ii) If X and Y are complete, then X x Y is complete, iii) If X is complete and Y С X is a closed subvariety, then Y is complete, iv) An affine variety X is complete only if dimX = 0, i.e., X consists in a single point. [In fact, (iv) follows from (i) by embedding the affine variety X in its closure X in a suitable projective space; and noting that X — Y = X(l (hyperplane at oo) is non-empty by Theorem 2*, §7]. It is harder to prove the main theorem of elimination theory: Theorem 1. Fn is complete. Proof (Grothendieck). We must show that for all varieties Y, P2 '• Fn x Y ->■ Y is closed. The problem is clearly local on У, so we can assume that Y is affine. Let R = r(Y,oY). Note that Fn x Y is covered by affine open sets U{ = (fn)x. x Y, whose [x x 1 X?"»• • • > ir • N°w suppose Z is a closed subset of fnxY. The first problem is to describe Z by a homogeneous ideal in the graded ring S = R [Xo,..., Xn] over R. Let Sm be the graded piece of degree m. Let Am С Sm be the vector space of homogeneous polynomials / (Xo,..., Xm), of degree m, coefficients in i?, such that for all г, Then A = EAm is a homogeneous ideal in S. Lemma. For all i and all g G I(ZC\Ui), there is a polynomial f (E Am for some m such that / Y Y \ 9 = f[ ^,...,^7- • Proof. If m is large enough, X™ • g is a homogeneous polynomial /' G Sm- To check whether /' G Am, look at the functions 3 3 Л3 gj is clearly zero on Z П Ui П Uj. And even if it is not zero on Z П Uj, %Ц- • gj is zero there. Therefore / = X{ • f G Am+i and this / does the trick. □
56 I. Varieties Now suppose у &Y —p2(Z). Let M = I(y) be the corresponding maximal ideal. Then ZDUi and (Рп)^ x {y} are disjoint closed subsets of Ui. Therefore I{Z П Ui) + M • R\ —,..., |A In particular where a* G /(Z П С/<), m^ G M, and Pij G i? fe,..., ^-1. If we multiply this equation through by a high power of Xi and use the lemma, it follows that xi = ai + 2_^ miJ' 9и ' j where a\ G Лту, 9ij €. Sn- (N may as well be chosen large enough to work for all i). In other words, X[* G An + M • Sn, for all i. Taking iV even larger, it follows that all monomials in the Xi of degree iV are in An + M • Sn, i.e., (*) SN = AN + M • SN . Now by the lemma of Nakayama applied to Sn/An there is an element / G R-M such that f • Sn С An- In fact: Nakayama's Lemma. Let M be a finitely generated R-module, and let А С R be an ideal such that M = A-M . Then there is an element f G 1 + A which annihilates M. Proof. Let mi,..., mn be generators of M as an i?-module. By assumption, 3=1 for suitable elements a,ij G A. But then: rrij = 0. Solving these linear equations directly, it follows that det (Sij — a,ij) • nik = 0, all к.. But if / = det (S^ - aij), then 1-/6A □ Now if / • Sn С AN, then f • X? e AN, hence / G I(Z П Ui). This shows that / = 0 at all points of p2(Z), i.e., p2(Z) П Yf = 0. This proves that the complement ofp2(Z) contains a neighbourhood of every point in it, hence it is open. □
§10. Complex varieties 57 Putting the theorem and remark iii) together, it follows that every projective variety is complete. For some years people were not sure whether or not all complete varieties might not actually be projective varieties. In the next section, we will see that even if a complete variety is not projective, it can still be dominated by a projective variety with the same function field. Thus the problem is a "birational" one, i.e., concerned with the comparison of the collection of all varieties with a common function field. An example of a non-projective complete variety was first found by Nagata. Theorem 1 can be proven also by valuation-theoretic methods, invented by Chevalley. This method is based on the: Valuative Criterion. A variety X is complete if and only if for all valuation rings R С k(X) containing к and with quotient field f(X), R D ox for some xeX. §10. Complex varieties Suppose that our algebraically closed ground field к is given a topology making it into a topological field. The most interesting case of this is when к = С, the complex numbers. However, we can make at least the first definition in complete generality. Namely, I claim that when A; is a topological field, then there is a unique way to endow all varieties X over к with a new topology, which we will call the strong topology, such that the following properties hold: i) the strong topology is stronger than the Zariski-topology, i.e., a closed (resp. open) subset Z С X is always strongly closed, (resp. strongly open), ii) all morphisms are strongly continuous, iii) if Z С X is a locally closed subvariety, then the strong topology on Z is the one induced by the strong topology on X, iv) the strong topology on X x Y is the product of the strong topologies on X and Y; v) the strong topology on A1 is exactly the given topology on k. (These are by no means independent requirements.) We leave it to the reader to check that such a set of strong topologies exists; it is obvious that there is at most one such set. Note that all varieties X are Hausdorff spaces in their strong topology. In fact, if Л : X -> X x X is the diagonal map, then A(X) is strongly closed by (i). Since X x X has the product strong topology by (iv), this means exactly that X is a Hausdorff space. From now on, suppose к = С with its usual topology. Then varieties not only have the strong topologies: they even have "strong structure sheaves", or in more conventional language, they are complex analytic spaces3. This means 3 The standard definition of a complex analytic space is completely analogous to our definition of a variety: i.e., it is a Hausdorff topological space X, plus a sheaf of C-valued continuous functions их on X, which is locally isomorphic to one of
58 I. Varieties that there is a unique collection of sheaves (in the strong topology) of strongly- continuous C-valued functions {Qx}, one for each variety X, such that: i) for each Zariski-open set U С X, ox(U)cQx(U), ii) all morphisms / : X ->■ Y are "holomorphic", i.e., /* takes sections of Qy to sections of Qx, iii) if Z С X is a locally closed subvariety, then Qz is the sheaf of C-valued functions on Z induced by the sheaf Qx on X, iv) if X = An = Cn, then Qx is the usual sheaf of holomorphic functions as a set onCn. (Again these properties are not independent.) We leave it to the reader to check that this set of sheaves exists: the uniqueness is obvious. Moreover, it follows immediately that (X, Qx) is a complex analytic space for all varieties X. The first non-trivial comparison theorem relating the 2 topologies states that the strong topology is not "too strong": Theorem 1. Let X be a variety, and U a nonempty open subvariety. Then U is strongly dense in X. Proof (Based on suggestions of G. Stolzenberg). Since the theorem is a local statement (in the Zariski topology), we can suppose that X is affine. By Noether's normalization lemma (geometric form), there exists a finite surjec- tive morphism 7Г : X —> An . Let Z = X — U. Then tt(Z) is a Zariski closed subset of An. Since all compo- components of Z have dimension < n, so do all components of tt(Z), hence ir(Z) is even a proper closed subset of An. In particular, there is a non-zero polynomial g (Xi,..., Xn) such that tt(Z) С {(xi,...,xn) \g(xi,...,xn) = 0} . Now choose a point x G X — U. Let's first represent ж(х) as a limit of points y@ = (x^\..., х{пЛ e An such that g (yW) ф 0. To do this, choose any point y(i) e An such that g (yA)) ф 0, and let h(t) = g (A-t) -ф) +t -у (i.e., ir(x) and y^ are regarded as vectors). Then h ф 0 since h(l) ф 0. Therefore h(t) has only a finite number of zeroes, and we can choose a sequence of numbers ti G С such that U ->■ 0, as i ->■ oo, and h(ti) ф 0. Then let the standard objects: i.e., the locus of zeroes in a polycylinder of a finite set of holomorphic functions, plus the sheaf of functions induced on it by the sheaf of holomorphic functions on the polycylinder. For details, see Gunning-Rossi, Ch. 5.
§10. Complex varieties 59 Then yW ->■ 7г(ж) strongly, as i ->■ oo, and p B/^) 7^ 0. The problem now is to lift each y^ to a point z^ G X such that z^ ->■ ж. Since j/W ^ tt(Z), all the points z^ must be in £/, hence it will follow that x is the strong closure of U. We will do this in 2 steps. First, let тг~1(тг(а;)) = {x, X2,. ■ •, xn}. Choose a function g G Г (X,ox) such that g(x) = 0, but g(x{) ф 0, 2 < г < n. Let F (X\,... ,Xn,g) = 0 be the irreducible equation satisfied by X\,...,Xn and g in Г(Х,ох). We shall work with the 3 rings and 3 affine varieties: Г(Х,ох) X и k[Xi,...,Xn,Y]/(F) - k[Xu...,Xn,g] U ^2 ..., Хп\ ' АП ' "" == " ' 7Tl • Since g is integrally dependent on к [Xi,... ,Xn], F has the form: Writing (X\,..., Xn) as a vector, we abbreviate F (X\,..., Xn, Y) to F(X, У). Now since p(rc) = 0, 0 = F{ir(xIg(x))=Ad{ir(x)). Therefore, Ad (y^) ->■ 0 as г ->■ oo. On the other hand, Г of the roots of the equation in t Therefore we can find roots № of F (y^, t) = 0 such that £(*) ->■ 0. Then (у(г),^(г)) is a sequence of points of V(F) converging strongly to ni(x). This is the 1st step. Now choose generators hi,..., /i/v of the ring Г (Х,ох). Via the /i»'s, we can embed X in A^, so that its strong topology is induced by the strong topology in A^. Each hi satisfies an equation of integral dependence: h™ + an - /i™~ + .. • + aim — 0 with aij G k[Xi,...,Xn,g]. Therefore, if £ G V(F) is a relatively compact subset, all the polynomials a»j are bounded on E, so each of the functions hi is bounded on ?rf 1(i7). Since \hi\ < C, all i, is a compact subset of A^, ?rf 1(XI) is a relatively compact subset of X. On the other hand, tti is a surjective map since tti
60 I. Varieties is a finite morphism. So choose points z^ G X such that tti (z^) = (y^\t^). Since the points (y^\t^) converge in V(F), they are a relatively compact set. Therefore {z^} is a relatively compact subset of X. Suppose they did not converge to x: then some subsequence z^ would converge to some x' ф х. Since () (() тг(гг'), тг(гс') = x, so x' = Xi, for some 2 <i <n. But then —> g(x') Ф0 (as A; ->■ oo). This is a contradiction, so 2^ -* re. П Corollary 1. If Z С X is a constructible subset of a variety, then the Zariski closure and the strong closure of Z are the same. The main result of this section is: Theorem 2. Let X be a variety over С Then X is complete if and only if X is compact in its strong topology. Proof. Suppose first that X is strongly compact. Let Y be another variety, let P2 : X x Y -)■ Y be the projection, and let Z С X x Y be a closed subvariety. Since X is compact, P2 is a proper map in the strong topology. Therefore P2 takes strongly closed sets to strongly closed sets (cf. Bourbaki, Topologie Generate, Ch. I, §10). Therefore p<i{Z) is strongly closed. Since it is also constructible (§8, Th. 3, Cor. 3), it is Zariski closed by the Cor. to Theorem 1. Conversely, we must show that complete varieties are strongly compact. First of all, it is clear that Fn(C) is strongly compact. For example, it is a continuous image of the sphere in the space of homogeneous coordinates: U={(zo,...,zn) surjective Therefore all closed subvarieties of Fn(C) are strongly compact. The general case follows from: Chow's Lemma. Let X be a complete variety (over any algebraically closed field k). Then there exists a closed subvariety Y of Pn(&) for some n and a surjective birational morphism: 7Г :Y —>X .
§10. Complex varieties 61 Proof. Cover X by open affine subsets Ui with coordinate rings Ai for 1 < i < m, and let U* = U\ П ... П Um. Embed all the C/j's as closed subvarieties of An (for some n). With respect to the composite inclusion: тт. г кп г F AA Ui is a locally closed subvariety of Pn(&); let Ui be its closure in Tn(k). Note that Ui x ... x Um is isomorphic to a closed subvariety of Рлг(&) for some N by Theorem 3, §6. Consider the composite morphism: U* —> U* x ... x U* С Ui x ... x Um • The first morphism is an isomorphism of U* with a closed subvariety of (U*)m - the "multidiagonal"; the second morphism is the product of all the inclusions U* С Ui С Ui, i.e., it is an isomorphism of (U*)m with an open subvariety of Ui x ... x Um. Therefore the image is a locally closed subvariety of U\ x ... x Um isomorphic to U*. Let Y be the closure of the image. Y is certainly a projective variety and we will construct a morphism тг: Y ->■ X. To construct тг, consider the morphism U* -^U* xU* CX xY induced by a) the inclusion of U* in Y, b) the inclusion of U* in X. Let Y be the closure of the image. Since X and Y are complete, therefore X x Y and Y are complete. The projections of X x Y onto X and У give the diagram: U* С у 4 . и* с x и* с у This shows that the projections p and g are both isomorphisms on [/*, hence they are birational morphisms. Moreover, since Y is complete, p(Y) and q(Y) are closed in X and У respectively; since p(Y) D U*, q(Y) D U*, this implies that p and q are surjective. I claim (*) q is an isomorphism. When (*) is proven, we can set ж = p ■ g~l and everything is proven. У is a closed subvariety of the product X x U\ x ... x Um. We want to analyze its projection on the product X x Ui of only 2 factors. Look at the diagram: ri projection onto 1st + (i+l)st factors у _ XxUi
62 I. Varieties Since the projection гДУ) is closed (since Y is complete) and contains the image of U* as a dense subset, it follows that п(У) is just the closure of U* in X x U{ via the bottom arrows. Sublemma. Let S and T be varieties, with isomorphic open subsets Vs С S, Vr С Т. For simplicity identify Vs with Vr and look at the morphism V -Ay xV CS xT . If V is the closure of the image, then V С (S x V) = V(l (V x T) = A(V) . V not in htrt 9 not in her» Proof. It suffices to show that A(V) is already closed in V x T and in S x V. But A(V) П (V x T), say, is just the graph of the inclusion morphism V -¥T. Hence it is closed (cf. Remark II, following Def. 2, §6). □ Therefore n(Y)n(XxUi) = ri(Y)n{UiXUi) = {{x,x)\xeUi}. Therefore Y(l(X xU1x...xUix...xUrn)=Yn(UixUiX...xUrn) . Call this set !*• From the second form of the intersection it follows that IyA is an open covering of Y. Prom the first form of the intersection, it follows that
§10. Complex varieties 63 if Yi = Y(l(UiX...xUiX...x UM) • Since q is surjective, this implies that {Yi} must be an open covering of Y. But now define: 0-i(ub...,um) = (ui,ub...,um) , (which makes sense exactly because the ith component щ is in Ui, hence is a point of X too). Then Ui is an inverse of q restricted to Yi'. a) q(<Ji (ub...,um)) = q(ui,Ui,...,Um) = (ub..^,um) b) by the sublemma, all points (v, u\,..., um) of Yi satisfy v = щ, hence Therefore q is an isomorphism and тг can be constructed. QED for Chow's lemma and Th. 2.
II. Preschemes The most satisfactory type of object yet derived in which to carry out all the operations natural to algebraic geometry is the prescheme. I fully agree that it is painful to go back again to the foundations and redefine our basic objects after we have built them up so carefully in the last chapter. The motivation for doing this comes from many directions. For one thing, we have so far neglected a very essential possibility inherent in our subject, which is, after all, a marriage of algebra and geometry: that is to examine and manipulate algebraically with the coefficients of the polynomials which define our varieties. It does not make much sense to say that a differentiable manifold is defined by integral equations; it makes good sense to say that an affine variety is the locus of zeroes of a set of integral polynomials. Another motivation for preschemes comes from the possibility of constructing via schemes an explicit and meaningful theory of in- infinitesimal objects. This is based on the idea of introducing nilpotent functions into the structure sheaf, whose values are everywhere zero, but which are still non-zero sections. Schemes with nilpotents are not only useful for many appli- applications, but they come up inevitably when you examine the fibres of morphisms between quite nice varieties. Thirdly, it is only when you use schemes that the full analogy between arithmetic and geometric questions becomes explicit. For example, there is the connection given by the general theory of Dedekind do- domains which unites the theory of a) rings of integers in a number field and b) rings of algebraic functions in one complex variable. A much deeper connection is given by class field theory, between the tower of number fields and the tower of coverings of an algebraic curve defined over a finite field. The analogies sug- suggested by this approach can be carried so far that they even give a definition of the higher homotopy groups of the integers, (i.e., of Spec(Z)): The vision of combined arithmetic-geometric objects goes back to Kronecker. It is interesting to read Felix Klein describing what to all intents is nothing but the theory of schemes: "Ich beschranke mich darauf, noch einmal das allgemeinste Problem, welches hier vorliegt, im Anschlufi an Kroneckers Festschrift von 1881 zu charakterisieren. Es handelt sich nicht nur um die reinen Zahlkorper oder Korper, die von einem Parameter Z abhangen, oder um die Ana- logisierung dieser Korper, sondern es handelt sich schliefilich darum, fur Gebilde, die gleichzeitig arithmetisch und funktionentheoretisch sind, also von gegebenen algebraischen Zahlen und gegebenen algebraischen D. Mumford: LNM 1358, pp. 65-136, 1999. © Springer-Verlag Berlin Heidelberg 1999
66 II. Preschemes Funktionen irgendwelcher Parameter algebraisch abhangen, dasselbe zu leisten, was mehr oder weniger vollstandig in den einfachsten Fallen gelungen ist. Es bietet sich da ein ungeheurer Ausblick auf ein rein the- oretisches Gebiet, welches durch seine allgemeinen Gesetzmafiigkeiten den grofiten asthetischen Reiz ausiibt, aber, wie wir nicht unterlassen dtirfen hier zu bemerken, alien praktischen Anwendungen zunachst ganz fern liegt. §1. Spec (R) It is possible to associate a "geometric" object to an arbitrary commutative ring R. This object will be called Spec (R). If R is a finitely generated integral domain over an algebraically closed field, Spec (R) will be very nearly the same as an affine variety associated to R in Chapter I. However in this section we will be completely indifferent to any special properties that R may or may not have - e.g., whether R has nilpotents or other zero-divisors in it or not; whether or not R has a large subfield over which it is finitely generated or even any subfield at all. We insist only that R be commutative and have a unit element 1. First define a point set: (I) Spec (R) = the set of prime ideals P \R itself is not counted a [R itself is not counted as a prime ideal but @), if prime, is counted ]. In order to have an unambiguous notation, we shall write [P] for the element of Spec (R) given by the prime ideal P. This allows us to distinguish between the times when we think of P as an ideal in R, and the times when we think of [P] as a point in Spec (R). Secondly, define a topology on Spec (R), its Zariski topology: (II) Closed sets = Sets of the form {[P] \ P D A} for some ideal ACR. This set will be denoted V(A). It is easy to verify that 0 ii) V(A(lB)=V(A)UV(B) . So the collection of closed subsets {У(Л)} does define a topology. Thirdly, let 4 Die Entwicklung der Mathematik im I9*en Jahrhundert, reprint by Chelsea Publ. Co., 1956, Ch. 7, p. 334.
§1. Spec (Я) 67 (III) Spec (R)f = {[P] \fiP). Since Spec (R)f = Spec (R) - V((f)), Spec (R)f is an open subset of Spec (R): we shall refer to these open subsets as the distinguished open sub- subsets. They form a basis of the open sets of Spec (R) because any open subset Spec(i?) — V(A) is simply the union of the distinguished open sets Spec (R)f, for all elements / G A. Spec (R) need not satisfy axiom Tl. In fact, the closure of [P] is exactly V{P), i.e., {P1 \P' DP}. Therefore [P] is a closed point if and only if P is a maximal ideal. At the other extreme, when R is an integral domain, (O) is a prime ideal and [(O)] is called the generic point of Spec (R) since its closure is the whole of Spec R. More generally, we define: Definition 1. Suppose Z is an irreducible closed subset of Spec (R). Then a point z G Z is a generic point of Z if Z is the closure of z, i.e., every open subset Zq of Z contains z. Proposition 1. Ifx G Spec (R), then the closure of{x} is irreducible andx is a generic point of this set. Conversely, every irreducible closed subset Z С Spec (R) equals V(P) for some prime ideal P С R, and [P] is its unique generic point. Proof. Let Z be the closure of {x}. If Z = W\ U W2, where Wi is closed, then x G W\ or x G W2. In each case, W\ or W2 is a closed set containing x and contained in its closure, i.e., W\ or W2 equals Z. Conversely, suppose V(A) is irreducible. Since V(A) = У(л/~А), we may as well assume that A = \f~A. Then we claim A is prime. If not, there exist f,g G R such that f-geA,f£A,g£ A. Let В = A+(f), С = A+(g). Then A = BnC: in fact, if h = af + a,\ = Cg + ai is an element of В П С (a,\, a^ G A; a,/3 G R), then h2 = aP-f-g + ai(Pg + a2) + a2(af) G A , hence /1 G A also. Therefore = V(B) U On the other hand, since A = л/А, A is the intersection of the prime ideals P that contain it. In particular, there is a prime ideal P such that PDA, but P$f. Then [P] G V{A)-V{B), i.e., V(A) 2 V(B). Similarly V(A) 2 V(C). We conclude that V(A) is not irreducible. This contradiction shows that A must have been a prime ideal. But then V(A) is the closure of the point [A] as mentioned above, so [A] is a generic point of V(A). If [P1] were another generic point, then [P1] G V(A) implies А С Р'. On the other hand, [A] would also be in the closure of [P1], so AD P'. This proves that A = P', hence there is only one generic point. □ Proposition 2. Let {fa \ ot G S} be a set of elements of R. Then Spec (R) = (J Spec (R)fa if and only if 1 G (...,/«,...), the ideal generated by the fa 's.
68 II. Preschemes Proof. Spec (R) is the union of the Spec (R)fjs if and only if every point [P] does not contain some fa. This means that no prime ideal contains (...,/«,...), and this happens if and only if Notice that 1 G (...,/«,...) if and only if there is a finite set fai,..., /<*„ of the /a's and elements gi,... ,gn of R such that: This equation is the algebraic analog of the partitions of unity which are so useful in differential geometry. The fact that one always has such an equation whenever one is given a covering of Spec (R) by distinguished open subsets is the reason for the cohomological triviality of affine schemes, (the so-called Theorems A and В - cf. Chapter 6). Corollary. Spec (R) is quasi-compact. Proof. It suffices to check that every covering by distinguished open sets has a finite subcover. Because of the Proposition, this follows from the fact that 1 G (...,/«,...) =Ф- 1 G (/«!, • ■ ■, fan) for some finite set a.\,..., an G S. О An easy generalization of this argument shows that Spec (R)f is also quasi- compact. But unless R is noetherian, for example, there may be some open sets U С Spec (R) which are not quasi-compact. We shall now endow the topological space Spec (R) with a sheaf of rings, Qspec (я)> called its structure sheaf. For simplicity of notation, let X = Spec(-R). At first, we will work with the distinguished open sets of X. We need a few properties of these sets: a) XfnXg = Xfg, all f,geR. (This is easy to check.) b) Xf D Xg if and only if g G /(/) • (In fact, since л/ijj = f]{P \ f G P}, it follows that 3P, feP, giP 3P, [F\ixh [P]exg. ) Now we want to associate the localization Rf of the ring R with respect to the multiplicative system (/, /2, /3,...) to the open set Xf. Note that if Xg С Xf, then by (b) there is a canonical map Rf ->■ Rg. (Explicitly, we know gn = h • f for some h and n, and we map
§1. Spec (Я) 69 a a- hr In particular, if Xg = Xf, we have canonical maps Rf ->■ Rg and Rg ->■ Rf which are inverse to each other, so we can identify Rf and Rg. Therefore we really can associate a ring Rf to each open set Xf. Furthermore, whenever Xk С Хд С Xf, we get a commutative diagram of canonical maps: Rg Furthermore, if [P] G Xf, then f £ P and there is a natural map Rf -* Rp, since the multiplicative system R — P contains the multiplicative system {/, /2,...}. It is easy to check in fact that Rp = lim Rf = lim Rf . feR-p f such that [P] e xf Lemma 1. Suppose Xf = \J Xfa. If an element g G Rf has image 0 in all aes the rings Rfa, then g = 0. Proof. Let g = b/fn. Let A = {c G R \ cb = 0}. The following are clearly equivalent: i) g = 0 in Rf ii) 3m such that /m • b = 0 in R iii) f eVA iv) PDA^feP. Therefore if g ф 0, we can choose a prime ideal P D A with f £ P, i.e., [P] G Xf. Then [P] G Xfa for some a. Using the commutative diagram Rf RP it folllows that g goes to 0 in Rp. Since b = g • fn, so does b. Therefore there is some ce R- P with с • b = 0, i.e., сбА This contradicts the fact that P D A and the lemma is proven. □
70 II. Preschemes Lemma 2. Suppose again that Xf = (J Xfa. Suppose that we have elements a£S 9a £ Rfa such that ga and gp have the same image in Rfaf0- Then there is an element g G Rf which has the image ga in Rfa for all a. Proof. We prove this only for / = 1; the general case is left to the reader. It suffices to assume the covering is finite, say X = X^ U ... U Xfk. For we may choose a finite subcovering of the Xfa; and if g G R goes to gi G R^, 1 < i < k, then for all a both ga and image (g) in Rfa have the same images down in the rings Rfafi, 1 < i < k, and hence are equal by Lemma 1. Write out gi = biff™ G Rf{ (since there are only finite many i's, we can choose a single n). The images of gi and gj in Rfifi are b*f? A bifi and J These are equal by hypothesis, which means that there is an тц such that: (/г/;ГЧМ?-&,7Г]=0 in Д. Let M be bigger than all rriij, set N = m + M, and b'{ = fo ■ /fM. Then we have 9i = b'i/f^ in Rfi and b'Jf -Vrflf = Q in R. On the other hand, we have г=1 г=1 Therefore 1 G (ff,..., /^), so 1 = E^i/^ f°r some h{ G R. Let g = Then г=1 г=1 i.e., g goes to gj in Rfr □ The lemmas show that assigning Rf to Xf gives us something as close to a sheaf as we can come when we are only considering a basis of open sets. Moreover there is one and only one way to extend this assignment to all open sets so as to get a sheaf. Explicitly, for each open U С X, let Г (U,ox) be the set of elements {sp} G Д RP [P]eu for which there exists a covering of U by distinguished open sets Xfa together with elements sa G Rfa such that sp equals the image of sa in Rp whenever [P] G Xfa. This is easily seen to be a ring. Moreover, if V С U, it is easy to see that the coordinate projection П Rp -» П R" [p]eu [p]ev
§1. Spec (Я) 71 takes Г(и,ох) into r(V,ox). Taking this as the restriction map, we get a presheaf ox. 1.) ox is a sheaf. Proof. Suppose U = (J Up. If we have an element of Г (£/, ox) going to 0 in all r(Up,ox), then its component at each Rp is 0, so it's 0. If we have elements sp G r(Up,ox) agreeing on overlaps, they clearly determine a unique element of flfpiec/ Rp'> this element is locally given by elements from Rf's since each of the S/j's was. □ 2.)r(Xf,ox)=Rf. Proof. We have a map Rf ->■ flrpiex ^p w^ich is injective by Lemma 1 and which lands us in F(Xf,ox) by definition. The map is surjective by Lemma 2. □ 3.) The stalk of ox at [P] is RP. Proof. We can get the stalk by talking a direct limit over a basis of open sets containing [P], so we get: (Qx)[p] = Hm r(Xf,ox) = Urn Rf = RP . Note that once we have Lemmas 1 and 2, the rest is purely sheaf-theoretic. One further point which is useful: since our topological space has non-closed points, we have some maps between the stalks ox of ox. Suppose Pi С Pi are 2 prime ideals. Let X{ = [Pi]. Then ar2 G {%i}, so every neighbourhood U of x^ contains xi; this gives us a map: oX2 = lim r(U,ox) —> lim Г(У,ох)=оХ1 . x2eu Recalling that a,.. = RPi, we see that this is just the natural map Rp2 ->■ Rpt. Example A. Let A; be a field. Then Spec (k) has just one point [(o)], and the structure sheaf is just к sitting on that point. More generally, let R be any commutative ring with descending chain condi- condition. Then R is the direct sum of its primary subrings: R= © Щ i=l (cf. Zariski-Samuel, vol. 1, p. 205). Now, quite generally, the spectrum of a direct sum of rings is just the disjoint union of the spectra of its components (each being open in the whole):
72 II. Preschemes Spec (Я 0 S) = Spec (R) II Spec E) . In our case, Spec (R) is the union of the Spec (Ri); since Ri is primary, it has one prime ideal Mi = {x G Ri | x nilpotent } , and Spec (Ri) is one point. Therefore, Spec (R) itself consists in n points with the discrete topology. Moreover, the structure sheaf just consists in Ri sitting as the stalk on the ith point. Geometrically, the presence of nilpotents in Mi should be taken as meaning that the ith point is surrounded by some infinitesimal normal neighbourhood. Example B. Spec (fc[X]): the affine line over k. This is denoted A^. k[X] has 2 types of prime ideals: (o) and (f(X)), f an an irreducible polynomial. Therefore Spec (fc[X]) has one closed point for each monic irreducible polynomial, and one generic point [(o)] whose closure is all of Spec (fc[X]). Assume к is algebraically closed. Then the closed points are all of the form [(X — a)]: we call this "the point X = a", and we find that A\ is just the ordinary X-line together with a generic point. The most general proper closed set is just a finite union of closed points. The stalk of oAi at [(X — a)] is: k[X]{X-a) = | -щ f,9 polynomials, g{a) ф 0 \ . The stalk at [(o)] is k(X), which we called before the function field. Note that whereas previously (Ch. I, §3) the generic stalk was only analogous to the stalks at closed points, it is now just another case of the same construction. Example C. Spec (Z). Ъ is a P.I.D. like k[X], and Spec (Z) is usually visualized as a line: generic point There is one closed point for each prime number, plus a generic point [(o)]. The stalk at [(p)] is Z(p) and at [(o)] it is Q, so Q is the "function field" of Spec (Z). The non-empty open sets of Spec (Z) are gotten by throwing away finitely many primes p\,..., pn. If m = Цpi, then this is the distinguished open set Spec (Z)TO, and r(Spec(Z)TO,OspecZ) = {— aGZ, к > 0\ . The residue fields of the stalks ox are Z/2Z, Z/3Z,..., Q: we get each prime field exactly once.
§1. Spec (R) 73 Example D. Almost identical comments apply to Spec (R) for any Dedekind domain R. In fact, all prime ideals are maximal or @); hence again we have a "line" of closed points plus a generic point. (However, if R is not a P.I.D., we cannot conclude as before that all non-empty open sets are distinguished; we know only that they are gotten by throwing out a finite set of closed points.) A very important case is when Л is a principal valuation ring5. Such a ring has a unique maximal ideal M, hence Spec (R) has 2 points [(o)] and [M]. Then [(o)] is an open point and [M] is closed: UO)] Imagine it as the affine line after all but one of its closed points has been thrown away. Valuation rings should always be considered as generalized one- dimensional objects. Example E. A^ = Spec {k[X, Y]), к algebraically closed. We get the maximal ideals (X — a, Y — b), the principal prime ideals (/(X, Y)), for / irreducible, and @). By dimension theory there are no other prime ideals. The set of maximal ideals gives us a set of closed points isomorphic to the usual X, F-plane. Then we must add one big generic point; and for every irreducible curve, a point generic in that curve but not sticking out of it: 5 i.e., a valuation ring with value group Z. I call this principal because in the literature an ambiguity has arisen between those who call these rings "discrete valuation rings" and those who call them "discrete, rank 1 valuation rings".
74 II. Preschemes То get a proper closed set, we take a finite number of irreducible curves, generic points and all, plus a finite number of closed points. Clearly, adding the non- closed points has not affected the topology much here: given a closed subset of the set of closed points in the old topology, there is a unique set of non-closed points to add to get a closed set in our new plane. Example F. There is a somewhat more startling way to make a scheme out of the non-closed points in the plane. Let '}■ Then О is the stalk of ox at @,0) if X = Spec (k[X,Y]). О has the maximal ideal (X, Y), the principal prime ideals (f(X,Y)) where / is irreducible and /@,0) = 0, and @). Therefore Spec (O) has only one closed point: KOH UX.YI1 If you throw away the closed point and the F-axis, you get the distinguished open set X ф 0, which now contains none of the original closed points. On the other hand, if К = k(X), then this scheme is Spec (К[F]s) for a certain multiplicative system 5 С if [^], i.e., it is part of the affine line over K. Example G. Spec ( П & ) > & a field. Those familiar with ultrafilters and similar y=i far-out mysteries will have no trouble proving that this topological space is the Stone-Cech compactification of Z+. Logicians assure us that we can prove more theorems if we use these outrageous spaces. Example H. Spec (Z[X]). This is a so-called "arithmetic surface" and is the first example which has a real mixing of arithmetic and geometric properties. The prime ideals in ЩХ] are: i) @). ii) principal prime ideals (/), where / is either a prime p, or a Q-irreducible polynomial written so that its coefficients have g.c.d. 1,
§1. Spec (R) 75 iii) maximal ideals (p.f), p a prime and / a monic integral polynomial irreducible modulo p. The whole should be pictured as follows: [C.X I! [E.X*2H K3.X-DJ |lE.X*1)J ' К5.ХЯ Их)] V(B» V«3» V(E» V«7» Exercise. What is V((p)) П V((/)), / a Q-irreducible polynomial? What is ^((/)) n ^((flO)> / and 9 distinct Q-irreducible polynomials? Each closed subset V((p)) is a copy of A^ , z, but they have all been pasted together here. The whole set-up is called a surface for 2 reasons: 1) all maximal chains of irreducible proper closed subsets have length 2, just as in A|. 2) If О is the local ring at a closed point x, then О has Krull dimension 2. In fact, if x = [(p,/)], then its maximal ideal is generated by p and /, and there is no single element g G О such that m = \Дд). The elements of R can always, in a certain sense, be interpreted as functions on Spec (R). For all x = [P] e Spec (Я), let k(x) = Rp/P-Rp = quotient field {R/P) = residue field (ox) . Then if a G R, a induces an element of Rp, and then in k(x): we call this the value a(x) of a at x. These values lie in different fields at different points. This
76 II. Preschemes generalizes what we did before for varieties. In that case, ox was a sheaf of k- algebras when we associate to each a £ к the constant function on X with value a. And for all points x of the variety X, к is mapped isomorphically onto the residue field k(x) of ox. We could therefore pull all values back to a single field k, and the к (ж)-valued function associated to a G R above becomes the k-valued function associated to a G R in Chapter I. The other extreme is illustrated by Spec (Z). If m G Z, then its "value" at [(p)] is "ra (mod p)" in Z/pZ. More generally, if a G Г1 (f/, o^) and ж G U, we can let а(ж) be the image of a in k(x). In the case of varieties, we actually embedded ox in this way in the sheaf of k-valued functions on X. This does not generalize. In fact, if a G R: a(x) = 0, all x G Spec (R) <=> a G P, all prime ideals P С R a is nilpotent . The significance of nilpotent elements in R is best understood by considering rings R that arise as к [X\,..., Xm] /A, where A is an ideal which is not neces- necessarily equal to y/A. We will see in §5 that whenever one ring Л is a quotient of another ring 5, then Spec (R) is a "subscheme" of Spec E) in a suitable sense. This is exactly what happened with varieties so let's assume it will work this way for schemes for this discussion. In our case, this means that we can find Spec (.R)'s with plenty of nilpotents embedded inside the very innocent looking A£ =Speck[Xu...,Xn]. Example I. Let Z = Spec (к[Х]/(Х2)), к algebraically closed. Then Z is a single point, but in addition to the constant functions a € k, Z supports the non-zero function X whose value at the unique point of Z is 0. On the other hand k[X]/(X2) is just a quotient of k[X] obtained by "very nearly" setting X = 0. Geometrically this means that Z can be embedded in A1 so that its unique point goes to the origin 0 G A1 and so that its sheaf - which is just fc[X]/[X2] - can be obtained by taking all functions / G OojAi modulo functions vanishing to 2nd order at 0. Then the function X which just vanishes to 1st order is still non-zero as a function on Z. And if / is any polynomial in Z, then one can compute from the restriction of / to the subscheme Z not only its value at 0, but even its 1st derivative at 0. This example will be taken up more fully in §5. In the most general case, Spec (к [Х\,..., Xn] /A) can be interpreted as the subset V(A) in A£, but with a sheaf of rings on it suitably "fattened" so as to include the first few terms of the Taylor expansion of polynomials / in directions normal to V(A). Another case would be Spec (k[X, Y]/(g2)), g being an irreducible polynomial. This Spec is the curve g = 0 with "multiplicity 2" and from the restriction of a polynomial / to it or, what is the same, from the image of / in k[X, Y]/(g2), one can reconstruct all 1st partials of / including the one normal to g = 0. Now by analogy if R is any ring with nilpotents, one can visualize Spec (R) as containing some extra normal material, in the direction of which one can for example take partial derivatives, but which is not actually tangent to a dimension present in the space Spec (R) itself. All this will, I hope, be much clearer in §5, where we will return to this discussion.
§2. The category of preschemes 77 The following useful fact generalizes Prop. 4, Ch. I, §3: Proposition 3. Let R be a ring and let f G R. Then the topological space Spec (R)f together with the restriction of the sheaf of rings Ospec (д) to Spec (R)f is isomorphic to Spec (Rf) together with the sheaf of rings Ospec (Д/). Proof. Let i : R -> Rf be the canonical map. Then if P is a prime ideal of R, such that f £ P, i(P) • Rf is a prime ideal of Rf; and if P is a prime ideal of Rf, г~г(Р) is a prime ideal of R not containing /. These maps set up a bijection between Spec (R)f and Spec (Rf) (cf. Zariski-Samuel, vol. 1, p. 223). The reader can check easily that this is a homeomorphism, and that in fact the open sets Spec (R)fg с Spec (R)f and Spec (Rf)g с Spec (Rf) correspond to each other. But the sections of the structure sheaves Ospec (д) and oSpec ,R \ on these two open sets are both isomorphic to Rfg. Therefore, these rings of sections can be naturally identified with each other and this sets up an isomorphism of i) the restriction of Ospec (д) to Spec (Я)/, and ii) Ospec (д ) compatible with the homeomorphism of underlying spaces. □ §2. The category of preschemes There is only one possible definition to make: Definition 1. A prescheme is a topological space X, plus a sheaf of rings ox on X, provided that there exists an open covering {Ua} of X such that each pair \Ua,oX\Ua J is isomorphic to (Spec (Ra), 0spec (Ra)) ^or some commutative ring R. Definition 2. An affine scheme is a prescheme (X, ox) isomorphic to (Spec (Д), ospec (д)) for some ring R. Notice that an open subset U of a prescheme X is also a prescheme if we take as its structure sheaf the restriction of ox to U. To see this, first note that an affine scheme (F, oy) has a basis of open sets U such that Ш, Оу^и) is again an affine scheme: i.e., take U = Yf = {y e Y \ f{y) Ф 0}, where / € Г(у,оу) and use the Prop, of the last section. Therefore if X is covered by open affines Ua, UC\Ua can be further covered by open affines for any open set U. The definition of a morphism of preschemes is slightly less obvious. Definition 3. If (X,ox) and [Y,oY) are 2 preschemes, a morphism from X to Y is a continuous map / : X -> Y, plus a collection of homomorphisms: one for each open set V С Y, such that
78 II. Preschemes a) whenever V\ C Vi are 2 open sets in Y, then the diagram: ft r(V2,oY) res res r(VuoY) and a G F(V,oY), then а(/(ж)) = О commutes, and b) if V С Y is open and x € / implies fy(a)(x) = 0. The difficulty here is that, unlike the situation with varieties, it is now neces- necessary to give explicitly the pull-back /* on "functions" in the structure sheaf o_Y in addition to the map / itself. Of course for any set 5 there is always a natural pull-back from the sheaf of 5-valued functions on Y to the sheaf of 5-valued functions on X. But in the first place the function associated to a section of oY does not determine that section back again, and in the second place the values of these functions, and the corresponding functions on X lie in completely un- unrelated fields. Condition (b) expresses the only possible compatibility of / and /*. This condition may be expressed in other ways, too. Suppose x € X and у = f{x). For all open neighbourhoods U and V of ж and у respectively such that f(U) С V, we have a homomorphism: Passing to the limit over such U and V, we get a homomorphism between stalks: Um Г (V,oy) y€V x£U r, Notice that these stalks are local rings (since the stalks on affine schemes are always local rings). Then condition (b) asserts that /* is a local homomorphism, i.e., equivalently /* (my) С тх or
§2. The category of preschemes 79 Whenever this is the case, /* also induces a homomorphism: = Qy/my -Л ojmx = k(x) . Call this kx. By definition, kx has the property: (b') For all a G Г (V,oY), for some neighbourhood V of Y, ft(a)(x) = kx[a(y)] . In other words, if kx is used to relate k(y) and к (ж), then the pull-back /* agrees with the pull-back of the functions associated to the sections of o_Y. Given 2 morphisms X —> Y and Y -^-> Z, we can define their composition g • / : X -> Z in an obvious way. This gives us the category of preschemes. The first result that assures us that we are on the right track is: Theorem 1. Let X be a prescheme and let R be a ring. To every morphism f : X —>■ Spec (R), associate the homomorphism: Д - Г (Spec (R), ogpec (я)) А Г (X, ox) . Then this induces a bijection between hom(X, Spec (R)) and hom (R, Г (X, ox)). Proof. For all /'s, let Af : R -> Г(Х,ох) denote the induced homomorphism. We first show that / is determined by Af. We must begin by showing how the map of point sets X -> Spec (R) is determined by Af. Suppose x G X. The crucial fact we need is that a point of Spec (R) is determined by the ideal of elements of R vanishing at it (since P = {a G R \ a([P]) = 0}). Thus f(x) is determined if we know {a G R \ a(f(x)) = 0}. But this equals {a G R \ f*(a)(x) = 0}, and /*(a) is obtained by restricting Af (a) to ox. Therefore f{x) = [{a G R | (Afa) (x) = 0}] . Next we must show that the maps /£} are determined by Af for all open sets U С Spec (R). Since /* is a map of sheaves, it is enough to show this for a basis of open sets (in fact, if U = UUa and s G Г Ш, Ospec (д))) then fi;{s) is determined by its restrictions to these sets /-1 (Ua), and these equal fija (resu,uas).) Now let Y = Spec (R) and consider /* for the distinguished open set Yj,. It makes the diagram f* iHx)* r(Yb,oY) = Rb res res Г(Х1вх) f- r(Y,oY) =
80 II. Preschemes commutative. Since these are ring homomorphisms, the map on the ring of frac- fractions Rb is determined by that on R: thus Af determined everything. Finally any homomorphism A : R -> Г(Х,ох) comes from some morphism /. To prove this, we first reduce to the case when X is affine. Cover X by open affine sets Xa. Then A induces homomorphisms Aa:R—>Г(Х,ох)^>Г{Ха,оХа) . Assuming the result in the affine case, there is a morphism fa : Xa -> Spec (R) such that Aa = Afa. On Xa DXp, fa and fp agree because the homomorphisms Г{ХаПХр,ох) agree and we know that the morphism is determined by the homomorphism. Hence the fa patch together to a morphism / : X -> Spec (R), and one checks that Af is exactly A. Now let A : R -> 5 be a homomorphism. We want a morphism / : Spec E) -> Spec (R). Following our earlier comments, we have no choice in defining / : for all points [P] € Spec E), This is continuous since for all ideals / С R, /~1(F(/)) = V(A(I) • 5). Moreover if U = Spec (Я)о, then f~1(U) = Spec E)д(а), so for /£} we need a map Ra -> ^(o)- We take the localization of A. These maps are then compatible with restriction, i.e., A(a).A(b) commutes. Hence they determine a sheaf map (in fact, if U = UUa, Ua dis- distinguished, and s G Г [ U, Ospec (д)) then the elements /^ {resu,ua s) patch to- together to give an element /^(s) in Г (/~1(^))^sPec E)))- Fr°m our definition
§2. The category of preschemes 81 of /, it follows easily that /* on <2u-ip] takes the maximal ideal ra^-ipj into □ Corollary 1. The category of affine schemes is isomorphic to the category of commutative rings with unit, with arrows reversed. Corollary 2. Spec (Z) is the final object in the category of preschemes, i.e., for every prescheme X, there is a unique morphism f : X —> Spec (Z). This is very important because it shows us that every prescheme X is a kind of fibred object, with one fibre for each prime p, and one over the generic point of Spec (Z). More concretely, this fibering is given by the function x i—> char, (к(ж)) associating to each x the characteristic of its residue field. If X = Spec (Л), it is given by [P] i—>■ the prime defined by the ideal P П Z . Each particular fibre of X over Spec (Z) will turn out to be a prescheme in its own right whose structure sheaf is a sheaf of fc-algebras, where к = Q or Z/pZ: in other words, a more geometric object. Prop. 1 of the last section generalizes to arbitrary preschemes: Proposition 2. Let X be a prescheme, and Z С X an irreducible closed subset. Then there is one and only one point z G Z such that Z = {z}. Proof. Let U С X be an open affine set such that Z П U ф 0. Then any point z G Z dense in Z must be in Z C\U; and a point z G Z П U whose closure contains Z П U is also dense in Z. Therefore it suffices to prove the theorem for the closed subset ZnU. But by Prop. 1 of §4, there is a unique z G ZПU dense in Z П U. D This point z will be called the generic point of Z. As a result of this Proposi- Proposition, there is a 1 — 1 correspondence between the points of X and the irreducible closed subsets of X. A local version of this is often useful. If x G X, then there is a 1 — 1 corre- correspondence between the following sets: i) irreducible closed subsets Z С U, such that x G Z, ii) points z G X such that x G {z}, iii) prime ideals P С ox x. The proof is left to the reader. We want to give next 2 examples of preschemes which involve non-trivial patching:
82 II. Preschemes Example F bis. Let О be an arbitrary noetherian local ring. Since О has a unique maximal ideal M, Spec (O) has a unique closed point x = [M]. Let X be the open subscheme Spec (O) — {x}. The closed points of X correspond to prime ideals P <JLM such that there are no prime ideals between P and M. X is only very rarely affine itself: In fact, to cover X by affines, choose elements Д,..., /„ G M such that Then X=|JSpec i=l where Spec (Of{) and Spec @/,-) are patched along Spec (О/{^). If О = C[[Xi,..., Xn]] for example, the resulting X turns out to have topological prop- properties identical to the ordinary Bn — l)-sphere. Example J. Pg. This prescheme is to be the union of (n+l)-copies Uq, U\,..., Un of integral affine space: To simplify notation, introduce variables Xjj, 0 < i,j < n and i ф j and set Ui = Spec Z [Xoi, Хц, . . . , Xj-i^, ^г+1,г, • • • , ^m] • Then Ui П Uj, as a subset of Ui, is to be the open set (Ui)x..; and as a subset of Uj it is to be the open set (Uj)x... Since (Uj)Xi. = SpecZ[Xoj,...,Xnj,Xr.i] these 2 preschemes can be identified by the map of rings: Xkj = Xki/Xji, 0<к<п, кфг,к Xij = 1/Xji with inverse Хы = Xkj/Xij, 0<к<п, кфг,2 This identification of variables is usually abbreviated by introducing n + 1 vari- variables X{, 0 < i < n, and replacing X^ by Xi/Xj. Then as usual Ui = Spec Z [X0/Xi,..., Xn/Xi] and the identification of Ui and Uj is given by the identity map between rings Z [X0/Xi,..., XJXi, Xi/Xj] and Z [X0/Xh ..., Хп/Х3,Х3/Х{]. Then if F (Xo,... ,Xn) is a homogeneous integral polynomial, one can talk of the set F = 0, where, by definition, if d = degree (F).
§2. The category of preschemes 83 There is one exceedingly important and very elementary existence theorem in the category of preschemes. This asserts that arbitrary fibre products exist. It is much easier to prove this statement than to understand all the applications that it has. It is a much more far-reaching fact than the mere existence of plain products was in the category of prevarieties, and we will see some of the key uses of this theorem in the rest of this chapter. Recall that if morphisms: X Y / r s are given, a fibre product is a commutative diagram with the obvious universal property: i.e., given any commutative diagram X Y there is a unique morphism t : Z -> X XsY such that qi = pi • t, q2 = P2 • The fibre product is unique up to canonical isomorphism. Theorem 3a. // A and В are С-algebras, let the diagram of affine schemes: Spec {A®CB) Spec (A) Spec (Б) Spec (C)
84 II. Preschemes be defined by the canonical homomorphisms С -> А, С -> В, А -> A ®c B, В -> A ®c B. This makes Spec (А ®с В) into a fibre product. Theorem 3b. Given any morphisms r : X -> S, s : Y ^ S, a fibre product exists. Proof of 3a. It is well known that in the diagram (of solid arrows): A®CB ^ D the tensor product has the universal mapping property indicated by dotted ar- arrows, i.e., is the "direct sum" in the category of commutative C-algebras, or the "fibre sum" in the category of commutative rings. Dually, this means that Spec (A (g>c B) is the fibre product in the category of affine schemes. But if T is an arbitrary prescheme, then by Theorem 1, every morphism of T into an affine scheme Spec (E) factors uniquely through Spec (Г(Т,от)): Spec(r(T,oT)). Using this, it follows immediately that Spec (A<g>c B) is the fibre product in the category of all preschemes. □ Note for example, that if A£=Spec (Z\Xu...,Xn]), then for all rings jR: Ag x Spec (R) = Spec {R(XU...,Xn]). We name this important scheme: Definition 4. A£ = Spec (R[Xi,...,Xn]). Proof of 3b. We shall leave most of this to the rader, since it is almost completely a mechanical patching argument. The main point to notice is this: suppose
§2. The category of preschemes 85 XxsY is some fibre product and suppose that Xq С X, Yo С Y and So С S are open subsets. Assume that r(X0) С So and s(Y0) С So- Then the open subset is always the fibre product of Xo and Yo over So- This being so, it is clear how we must set about constructing a fibre product: 1st cover 5 by open affines: Spec (Ck) = WkcS. Next, cover r~1{Wk) and s~1(Wk) by open affines: Spec (AM) = Uhti С X, Spec (BkJ) = VkJ С Y. Then the affine schemes: Spec (Акл ®Ck Bk,j) = Фк,г,з must make an open affine covering of X x s Y if it exists at all. To patch together Фк,%,з and Фк1 ,v j', let p\, pi and p[, p'2 stand for the canonical projections of Фк,1,з and фк',1',з' onto its factors. Then one must next check that the open subsets: Pi1 (uk,i П Uk',i>) Прз1 (Vk,j П Vk',j') С Фк,г,з and р'Г1 {Uk',i' П Uk,i) np'f1 {Vk>,j> П Vfc)j) С Фк>,г>,з> are both fibre products of Uk,i П f/fc',i' and Vkj П Vfc'^/ over 5. Hence they are canonically isomorphic and can be patched. Then you have to check that everything is consistent at triple overlaps. Finally you have to check the universal mapping property. All this is at worst confusing and at best obvious: in the former case, cf. EGA, Ch. 1, pp. 106-107 for assistance. □ Example J bis. To illustrate this patching, what is the scheme: Pg x Spec (R) R any ring? Since Pg is the union of (n + l)-affines Ui = Spec %[Хо/Хг,...,Хп/Хг) , Pg x Spec (R) is just the union of the affines Ul = SvecR[Xo/Xii...,Xn/Xi] with the standard patching. All homogeneous polynomials F (Xo,..., Xn) over R define closed subsets F = 0 of Pg x Spec (R).
86 II. Preschemes Definition 5. P£ = Pg x Spec (R). Problem. For all homogeneous F G R [Xo,..., Xn], show that {x el where E di = d = 0} is isomorphic to Spec R degree (F). " Yd F '" nxf{ F Here is an example of how one can use the Universal Mapping Property of fibre products to prove something tangible: Proposition 4. Ifr-.X^-Sis surjective, thenpi : XxgY -¥Y is surjective. Proof. Let у G Y be any point; since r is surjective, there is a point x € X such that r(x) = s(y). The maps r* and s* define inclusions of fields k(s(y)). Let ft be a composition of these fields. Define morphisms a : Spec ft -> X, /5 : Spec i? -> F by a) mapping Spec (ft) - which is one point - to x and у respectively, b) defining a* and /5* to be the compositions o_ у —> k(x) —> ft and respectively. Then r • a = s • /5. Therefore there is a morphism 7 : Spec ft -> X Xs F such that pi ■ 7 = /5. Let z = Image G). Then P2(z) = У- □ §3. Varieties and preschemes I want to make it crystal clear how our new category of preschemes contains our old category of varieties. When this is done, we shall redefine the term variety, and use it to mean instead the corresponding prescheme (or more precisely, prescheme/ k). Definition 1. Let R be a ring. Then a prescheme X over R is a morphism 7Г : X -4 Spec (R).
§3. Varieties and preschemes 87 Note that by Theorem 1, §2, this is the same as giving Г (X, ox) an jR-algebra structure. This then gives all the rings F(U,ox) jR-algebra structures in such a way that the restriction maps are jR-algebra homomorphisms. For instance, if R = к is a field, we will have an injection of к into each Г (U,ox). Another example: take R = Z. Then every prescheme is a prescheme over Ъ in exactly one way. Definition 2. Let X and Y be preschemes over R. An jR-morphism from X to Y is a morphism / : X -¥Y such that X Spec(R) commutes. Alternatively, this just means that the maps fv (V С Y open) are all R- algebra homomorphisms. Definitions 1 and 2 give us the category of preschemes over R. Definition 3. A prescheme X over R is of finite type over R, if X is quasi- compact and for all open affine subsets U С Х, F(U,ox) is a finitely generated jR-algebra. Proposition 1. LetX be a prescheme over R. If there exists a finite affine open covering {Ui}, 1 < и < n, of X such that R{ = Г (Ui,ox) is a finitely generated R-algebra, then X is of finite type over R. This proposition is the archetype of a large number of similar propositions all of which assert: to check a property that refers to some behavior for all open affines somewhere, it suffices to check it for an open affine covering there. Proof. Since Ui = Spec (Ri), Ui is quasi-compact, hence X is quasi-compact (since there are only a finite number of Ui's). Now suppose U = Spec E) С X is any affine open subset. If fi e Ri, then {Ui)f. = Spec Ri [1//*], and Ri [1//*] is also a finitely generated jR-algebra. But the (£/*)*. are a basis of the topology on X and U is quasi-compact (since it is affine), so U is a union of finitely many sets UI = Spec Щ, with R'{ a finitely generated jR-algebra. Since Щ С U, we have a map res : 5 -> Щ. If / € 5, then Uf П U[ = (£/-)res f (deleting the prime ideals of 5 containing / clearly gives the same result on U[ as throwing out the primes of R!{ containing res /). We can cover U[ with (finitely many) smaller open sets of the form Uf, since the latter are a basis of the topology on U. Moreover if Uf С Щ, Г (Uf,ox) = Г ((^)res / »0x) = R'i [1/res /] which is again finitely generated. Thus we may assume we have U covered by finitely many sets Uft, fi G 5, such that Г (Uf{, ox) = S[l/fi] is finitely generated. We now construct a subring
88 II. Preschemes 5 of 5. Put in 5 all the fc, and enough other elements so that the image of 5, together with 1/fi, generates 5[1//»]; only finitely many elements are needed. Finally, since the Uf{ cover U, we can write 1 = Efigi with gi € 5; put the gi in 5 also. Now we claim that S = S (hence 5 is finitely generated). For take a e S. Then in 5[1//*], a = elff?P, i.e., f?a = ft in S[l/fi] for some ft € 5. That means f^ (ft*a - ft) = 0 in 5, so f?i+kia € 5. Taking N large enough, we have f^a € 5, for all i. But a = 1 • a = (£figi)k a. Taking к large enough, we can make every term in (Efigi)k contain some //^, so a times it will be in 5. □ Corollary. If X is an affine scheme and U С X is an open affine subscheme, then Г (U,ox) is finitely generated over Г (X,ox). Warning. X may be of finite type over R, even for R = C, and yet Г (Х,ох) may not be a finitely generated jR-algebra. This is the fact that Hilbert's 14th Problem was completely false; Zariski gave a systematic way of constructing counterexamples. Definition 4. A prescheme X is reduced if ox contains no nilpotent sections, i.e., Г (U,ox) has no nilpotent elements for all open sets U С X. One can check that this holds if and only if all the stalks ox x have no nilpotents, and that it also holds if there is a covering of X by open affine sets Ui such that Г (Ui,ox) has no nilpotents. Theorem 2. Let к be an algebraically closed field. Then there is an equivalence of categories between: 1) the category of reduced, irreducible preschemes of finite type over k, and k- morphisms. 2) the category of prevarieties over к and morphisms of these (as in Ch. I). Proof. We shall construct functors in both directions which are, up to canonical identifications, inverse to each other. A) Suppose X is a prevariety. For all irreducible closed sets W С X with dimW > 0, let [W] be a symbol. Let X be the union of X and this collection of symbols [W]. For all open sets U С X, let U* be the union of U and the set of symbols [W] for which W П U ф 0. (The idea is that if U meets W at all, it meets it in an open dense subset, so U should contain the generic point [W] of W). It is easy to see that (U< = IK {ихг\и2у = и; пи; u*nx = и. Therefore we have a topology on X which induces on the subset X its Zariski topology. Moreover, U -> U* and U* -> U* П X set up a bisection between the
§3. Varieties and preschemes 89 set of all open subsets of X and the set of all open subsets of X. Therefore finally we can just push the sheaf ox across via def Then (X,Qx) is a ringed space with fc-algebra structure. We leave it to the reader to check that (X,ox) is a prescheme of finite type over k: in fact it has the same affine coordinate rings as X had. It is reduced since ox has no nilpotent elements; it is irreducible since [X] is in every non-empty open set U* and is therefore a generic point. To get a functor, suppose F : X\ -> X2 is a morphism in the category of prevarieties. Extend F to map: T : X2 the prescheme associated to Xi) as follows: 2) For all irreducible closed subsets W С Х\ (dimW > 0), let U JJ - | [dosure of F(W)] if dim¥(W) > 0 If f/2* is any open subset of X2, then one checks that Therefore T is continuous. Finally, to define the required homomorphism of fc-algebras: T*' * * we just use the "pull-back" of functions, i.e., an element s € r(U2,oX2) is a k-valued function on U2, and we define B) To go backwards, we need the following: Lemma 1. Let X be a prescheme of finite type over k, and let x E X. Then x is closed if and only if the composition k{x) is surjective.
90 II. Preschemes Proof. If we prove this for affine X's, it follows easily in the general case. On the other hand, if X = Spec (R), the lemma asserts that a prime ideal P С R is maximal if and only if к maps onto R/P. The non-obvious implication is: P maximal =*► к = R/P, and this is just the Nullstellensatz. □ This lemma implies the following for preschemes X of finite type over к: а point s6lis closed if it is closed in some open neighbourhood of itself. This is definitely false for general preschemes: cf. Example D. Lemma 2. Let X be a prescheme of finite type over k. Then the closed points of X are dense in every closed subset of X. Proof. It suffices to prove that every locally closed subset Y of X contains a closed point. Let U be an open subset of X such that Y П U is non-empty and closed in U. Let у G Y П U be a closed point ofYDU. Then (y) is still closed in U, and by Lemma 1, {y} is still closed in X. □ Now let A' be a reduced irreducible prescheme of finite type over k. Let X be the set of closed points of X, with the induced topology. Then the map u—>unv sets up a bijection between the set of open sets of X, and the set of open sets of X, in view of Lemma 2. Via this bijection, we can carry the sheaf ox over to a sheaf ox on ^> i-e-> yia Г(ипХ,ох) = for all U С X open . By Lemma 1, we can give sections of ox values in к at each point of X, i.e., if U С X is open, / G Г (U,ox) and x G U, then let ,, ч Г the element a £ к such that fix) = s vy [/ — ckG mx, the maximal ideal of oa Note that if f(x) = 0, all ж G U, then / = 0. (In fact, it suffices to check this for affine f/'s; and then if U* С X is the open affine such that U* П X = U, f is also identically zero on U* since U is dense in U*; therefore as we saw in §1, / is nilpotent, hence 0 since X is reduced.) Now it's easy to verify that (X,ox) is a variety. To get a functor, suppose T : X\ -> X2 is a fc-morphism of preschemes of the type being considered. If X G X\, recall that T* defines an injection of fc-extension fields:
§3. Varieties and preschemes 91 Therefore / = k(x) implies к = к(f(x)), hence by Lemma 1 T takes closed points to closed points. Therefore T restricts to a continuous map F : X\ -> X2 of the corresponding prevarieties. One checks easily that F is actually a morphism. It remains to put (A) and (B) together by showing that our 2 functors are mutually inverse. We leave this to the reader. □ Note carefully; Re-definition 5. If A; is an algebraically closed field, a prevariety over A; is a reduced and irreducible prescheme of finite type over k. In the few occasions when we have to refer back to our old notation of prevariety, we will call them old prevarieties. A detail that must be checked is: Proposition 3. The product of 2 prevarieties X and Y over k, in the category of prevarieties over k, is the same as their fibre product X x Y, in the Spec (fc) category of all preschemes. Proof. This follows easily from Statement B), Prop. 1, Ch. 1, §5, and we will omit the details. □ Here is an application which illustrates the use of the generic point of a variety. What we do here should be compared with the proof of Chow's lemma, §9, Ch. I, where we were using these techniques but in disguise. Suppose X and Y are 2 prevarieties over k, and suppose we are given fc-isomorphisms of their function fields with a third field K. Then a and /5 define morphisms: Spec К А X Spec К A Y by requiring that the images of A and В be the generic points x, у of X and Y (Spec (K) consists in one point) and that A* : ox = k(X) -> К be a, and that B* : oy = k(Y) -¥ К be /5. By the proposition, then, we get a morphism: Let t be the image point and let T = {t}, a closed subset of X x Y. T is irreducible, as it is the closure of a point, hence T is a closed subprevariety of X xY. Moreover, the function field k(T) of T is isomorphic via (A,B)* to К also. We have constructed a diagram:
92 II. Preschemes Spec(К) in which pi and pi are birational morphisms of prevarieties. We describe this situation by saying that f3~l -a is a birational correspondence between X and Y and that T is its graph. If we use Th. 4 of §7, Ch. I, we can see what is happening more clearly: a) There is a non-empty open set U С X such that p^l{U) is isomorphic to U via pi. b) There is a non-empty open set У С У such that р^1{У) is isomorphic to V Set Wo = Uo = Vo = Then pi and pi define isomorphisms of the 3 open sets Uq,Vo,Wq in X, Y and T. In other words, the isomorphism fi~l • a of the function fields of X and У extends to an isomorphism between the open dense subsets Uo, Vo of X and У; and T is obtained by taking the graph of its isomorphism in X x Y and closing it up. T itself is, in general, a many-many correspondence between X and У. To see what's happening explicitly, we must as usual look at affine pieces. Cover X by Щ = Spec Ai, Y by Vj = Spec Bj. Then X x Y is covered by Ui x Vj = Spec (Ai <g)j Bj). For all i,j, t G Ui x Vj, as its image in X [resp. У] is the generic point and so lies in all Ui [resp. Vj]. Then T П (Ui x Vj) is the closure of {t} in Ui xYj-, these are an open affine covering of T, and we want to see what they are. Spec (K) —У Ui x Vj corresponds to К -f-^- Ai <g> Bj, and ipij is just the ( R\ composition Ai <g> Bj —> k(X) <g> &(У) —Ц if. Then t is [V"-1 @)], so its closure is V (^PijX@)), and T П (Ui x Vj) is the affine variety Spec (Ai <g> Bj/ip~^(Q)). But this is just Spec of the composite ring a(Ai) • fl(Bj) С if that we get by pushing the two rings together. Example K. Here is one of the oldest and prettiest examples of this definition. Take X = P2, and let xo,xi,X2 and Уо,У\,У2 be homogeneous coordinates in X and У. Let Uo С X and Vo С У be defined as the open sets xo • x\ • X2 фО and Уо • У\ ' У2 ф 0. Define an isomorphism between Uq and Vo by the map: l/i =
§3. Varieties and preschemes In fact, this is just an extension of the isomorphism of function fields: ' X\ 2 93 xo Уо Уо »—>• 2/i/2/i 1—> 2/0/2/2 • But the closure T С Р2 x P2 of this isomorphism is a very remarkable affair. We leave it to the reader to work out the full set-theoretic many-many correspon- correspondence that we get: let's just check one piece of T to get an idea. Let U = Spec к V = Spec к —, — — Л. J/0 xo 2/1 Lx0 What is T П (U x V)? In any case, it is ABC У1 D Spec where P is the ideal of relations we get by putting all functions in the same function field. In particular, А — С G P and BD — С & P. But dividing out by A — С and BD — C, the quotient ring is k[B, D], already an integral domain of the right dimension. So P = (A - C, BD - C) and ГП (U x V) = Spec (k[B, £>]). Finally, the projection gives: Tn(UxV) k[B,D] U V k[A,B] k[C,D] where pi (A) =B-D,p\(C) = BD. Thus the line В = 0 in Tn(UxV) collapses to the point A = В = 0 in U; and the line D = 0 in T П (U x V) collapses to the point С = D = 0 in V:
94 II. Preschemes Tn(Ux V) §4. Fields of definition Let к be an algebraically closed field and let X be a closed subvariety of An, defined by equations: fi(X1,...,Xn)=0, l<i<m. Suppose that we are particularly interested in some subfield ко of к (usually because ко is important for arithmetic reasons, or because ко has an interesting topology). Then if the coefficients of the fi all lie in ко, it will be very important to make use of this fact: it may, for example, pose Diophantine or rationality questions. When this happens one says that X is "defined over &o". Let A=(f1,...Jm)=I(X). Then if the coefficients of the fc lie in ко, it follows firstly that A = к • Aq where Aq = А П ко [Xi,..., Xn]', and secondly, if Ro = ко [Х\,..., Xn] /Aq, then the affine ring R = к [X\,..., Xn] /A of X is of the form Rq <8>k0 &• This shows us the scheme-theoretic significance of X being defined over ко: there exists an affine scheme Xq = Spec (Ro) of finite type over ко such that
§4. Fields of definition 95 (*) X = X0 xSpec (fco) Spec (k) . In fact, conversely, suppose X is an affine variety over k, and that Xq is a prescheme of finite type over ко such that (*) holds. Then let Ro = Г (Х0,оХо) and write Ro as a quotient of a polynomial ring: ко [Xi,..., Xn] (/i,..., /m). It follows that if R = Г (X, ox), then R £ к [Xi,..., Xn] / (/i,..., /m), hence X is isomorphic to the closed subvariety of An defined by /» = 0, 1 < г < m. And since the coefficients of the fa are in ко, X is "defined over &o" in our original sense. On the other hand, notice that the relationship given by (*) between X and Xo does not involve any specific affine embedding and, in fact, can be consid- considered for any X and Xo, not necessarily affine. It suggests that by considering preschemes of finite type over ко we can set up a whole /го-geometry even when ко is not algebraically closed. In this fco-geometry we can make definitions of a Diophantine type that have no analog over k. But whenever we want to visualize what is going on, we can form fibre products with Spec (k) and obtain a "classi- "classical" geometric set-up. In fact,'the easiest way to think of a prescheme Xo over ко is as a prescheme X over k, plus an extra "fco-structure" given by expressing X as a fibre product Xo xSpec (fco) Spec (k). Conversely, we will be able to express a given prevariety X as such a fibre product whenever it is defined for us by equations all of whose coefficients lie in ко. Assume that we are given any prescheme Xo over ко and that we define X as Xo xSpec (fco) Spec (к): we shall often write X = Xo х*.о к for simplicity. Explicitly, if Xo is the union of open affine sets (£/o); = Spec (Ri), then X is just the union of the affine schemes Ui = Spec (Ri <8>fco k) . Definition 1. Let a be an automorphism of к over ко. The conjugation map ax '■ X -> X is the underlying map of the morphism: Ixo x0 X0 XSpec (fc0) SPeC (*) X0 XSpec (fc0) SPeC (*0 X X where ф : Spec (k) -> Spec (k) is defined by taking a~l as the homomorphism ф* : к -» к. One checks immediately that (cr-r)x = ах'Тх, i-e., the Galois group of к/ко is acting on the topological space X. In particular, each ax is a homeomorphism of X. In simple cases, this conjugation is exactly what you expect it to be: Assume Xo = Spec fc0 [Xi,. ..,*„]/ (/i,... ,/m) X = Speck[X1,...,Xn]/(f1,...,frn) ж G X is the closed point X\ = a.\,..., Xn = an where ai,..., an G k; /»(«i,..., an) = 0, all г .
96 II. Preschemes Then ax{x) = {the closed point X\ = a (a>i),...,Xn = a (an)} . Proof. By definition ax (x) = (lx0 x Ф) 0*0, and (lx0 x Ф)* maps g (X\,..., Xn) to ga (Xi,... ,Xn), where ga is the polynomial g with a~l applied to its coefficients. But x = [(Xi — a\,...,Xn - an)], so if ax{x) = [P], Р=Aхо*Ф)* (Xi -au...,Xn-an) . Since (lx0 x Ф) (Xi — cra>i) = Xi—a>i, it follows that P = (Xi — aa±,..., Xn — aan) D Theorem 1. Let Xq be a prescheme over ко, let X = Xo х*.о k, and letp : X -> Xo be the projection. Assume that к is an algebraic closure of ко. Then 1) p is surjective and both open and closed (i.e., maps open/closed sets to open/closed sets). 2) For all x,y E X, p(x) =p(y) if and only if x = сгх{у), some a G Gal(fc/A;o). In other words, for all x G Xo, p~l{x) is an orbit of GaX{k/ko). Moreover, p~l{x) is a finite set. Proof. Since all these results are local on Xo, we may as well replace Xo by an open affine subset Uo and replace X by p~1(Uo). Therefore assume Xo = Spec (R), X = Spec (R <S>k0 &)• First of all p is surjective by Prop. 4, §2. Secondly, I claim p is closed: let V{A) be any closed subset of X, where A is an ideal in R <S>k0 k. Let Ao = А П R, and consider the pair of rings: R <8>fc0 k/A и R/Ao. Since к is algebraically dependent on ко, R<S>k0 k/A is integrally dependent on R/Ao. By the going-up theorem, every prime ideal Po С R/Ao is of the form P П (R/Ao) for some prime ideal P С R ®fc0 k/A. Therefore every prime ideal Po С Л such that Po D Aq is of the form РП R for some prime ideal P С R ®fc0 к such that P D A: this means that p(V(A)) = V(A0), hence p is closed. Thirdly, let's prove B). We must show that if РьРг С R<S>k0 к are 2 prime ideals, then Pi П R = P2 П R <=> P2 = AД <8> a) (Pi), some a G ■$= is obvious. Now assume Po = Pi П R = P2 П Л. Let if be the quotient field of R/Pq- Consider the diagram: и и Л К .
§4. Fields of definition 97 Since К ®fc0 к is the localization of (R ®k0 k) /Po • (R ®k0 k) with respect to the multiplicative system R — Pq, and since a) Pi D Po • (R ®k0 k) and (b) Pi is disjoint from R — Po, it follows that P[ = j(Pi)-K®k0 к is a, prime ideal and that Pi = j-1 (P!). Therefore it suffices to prove that P'2 = AK ® v) (P{) and it will follow that Pi = Ar ® a) (Pi). Now consider the integral domains К ®k0 k/P{ over K: let Q be a big extension field of К containing both of them. We obtain a picture like this: K®kok/P{ (Consider the solid arrows as inclusion maps to simplify notation.) Then a>i (k) and а^(к) must both be equal to the algebraic closure of ко in Q. Therefore there is an automorphism a G Ga\(k/ko) such that a? = a\ • a. But then if Xi G K, Vi € k, [yi) = 0 in П (a(yt)) = 0 in П Pi so AK® a) Moreover, if P С R ®k0 ^ is a prime ideal, then P has only a finite number of distinct conjugates: in fact, let P be generated by /i,...,/m and let fi = ^2j fij ® o/-ij- Then P is left fixed by all cr's which leave the a^'s fixed, and this is a subgroup in Gal(fc/A;o) of finite index. Finally, p is also an open map. In fact, let U С X be any open set. Then U'= U ax(U) <reGal(fc/fc0) is also open. But by B), p(U) = p(U') and U' = p~l (p(U')). Therefore Xo - p(U) = p(X — U') which is closed since p is a closed map. Therefore p(U) is open. □
98 II. Preschemes Corollary. Xq, as a topological space, is the quotient of X by the action of G&\(k/k0). Definition 2. The ко-topology on X is the set of Gal(fc/A;o)-invariant open sets, i.e., the set of open sets {p~1(U) | U open in Xo}. Theorem 2. Let Xo be a prescheme over ко, let X = Xo ®ko к and letp : X -> Xo be the projection. Assume that Xo is of finite type over ко (hence X is of finite type over k). 1) For all U С Xo open, the canonical map is bijective. Moreover, the functions f G Г (U, oXo) (*) f(ax(x))=a(f(x)) for all closed points x G p~1(U), all a G Gal(fc/A;o). 2) If ко is perfect and X is reduced, then Г (U, oXq ) is exactly the subring of elements f G Г (p~1(U),ox) satisfying (*). Proof. Again it suffices to prove the theorem when U is affine. In fact, if it is proven in that case, take your arbitrary U and cover it by a finite set of open affine sets U{. Since oXq is a sheaf, we get an exact sequence: 0 —> Г (U,oXo) ^ (Ui,Qxo) ~ hnUjM -тевщмпиЛ^)}- Tensoring 0 — Г (U,Qx0) ®ko k-lKraUbOx) ®ко к) -+ П (Г (Щ П Uj,oXo) ®ko k) where the 2nd arrow maps {sj} to with к over ко, we get a diagram: о Now we know ft is bijective by the affine case. This implies that ск is injective. Since this is now proven for every open U, it follows also that 7 is injective. Therefore ск is even bijective. The remaining results follow immediately for an arbitrary U once they are proven in the affine case. Now assume U = Spec (R), hence p~l(U) = Spec (R®ko k). Then
§4. Fields of definition 99 Г {U,oXo) ®k0 к * Г {P-\U),ox) R®kok R <8>fco к is bijective by the very definition of fibre product. Moreover, suppose M С R<S>k0 к is a maximal ideal. We know k([M]) = R <S>k0 к/М = к since R is of finite type over ко. If а G Gal(fc/fco), then ax([M]) = [AД <g> a) Af] = [1Д <8> <т(М)]. Therefore (*) asserts: f eR=> f mod AД B) <т(М)) = <j{/ mod M} . But say f = а + g, а & k, g & M so that / mod M = cc. Then hence / mod Aд (8) сг(М)) = сг(а;). Now assume that R®k0 к has no nilpotent elements and that ко is perfect. Since R <8>k0 к has no nilpotent elements and is finitely generated over к, @) = p| {M I M maximal ideal in R <8>fco к} . Suppose f £ R ®k0 & satisfies (*). Following backwards the argument we just gave shows that this means exactly that / — Aд ® а) f is in every maximal ideal M. But therefore / = Aд (8) a) /, all cr 6 Gal(fc/A;o). Since ко is perfect, ко is the set of си G A: invariant under Gal (к/ко). Therefore R is the set of си G R®k0 к invariant under Gal {к/ко). Therefore / G R. □ It follows that Xo, as a prescheme over ко, can be reconstructed from 3 things: i) the prescheme X over к ii) the action of Gal (к/ко) on X via conjugation iii) the subsheaf of ox, defined only on the fco-open sets, which is oXq. We call oXo, regarded as a sheaf in the fco-topology on X, the sheaf of ko- rational functions. It is the analog of the ring of polynomials with coefficients in ко. Moreover in good cases (X reduced, ко perfect), (i) and (ii) alone suffice to give us back Xo. Therefore, presenting X as a fibre product Xo х*.о к is the same thing as endowing X with a ko-structure consisting of A) the conjugation action {ox}, and (if necessary) B) the subsheaf oXo of fco-rational functions. Note that if x G X is closed, then p(x) is closed in Xo; and if у G Xo is closed, then p~l (y) is a finite set of conjugate closed points of X. It is worthwhile making this situation more precise: Proposition 3. Let у G Xo be a closed point. Then k(y) is a finite algebraic extension of ко, and there is a natural bijection between the set of points x G p~l{y), and the set of ко-isomorphisms ofk(y) into k.
100 II. Preschemes Proof. For all x G p~1(u), we get maps: к (ж) U ко к (у) hence we get a fco-isomorphism of k(y) into k. Since p 1{у) ф 0, one such exists and k(y) is algebraic over ко. Also for any у G Xo, k(y) is a finitely generated field extension of ко, since Xo/ko is of finite type. Conversely, if we are given Чу) о ко define ф : Spec (к) -> Xo by Image(V>) = y, and ф* : oyY -> A; given by 0. By the functorial meaning of fibre product, we get a morphism {Ф, Ispec (*)) • Spec (к) —> Xo xSpec (fco) Spec (к) = X . If x is the image point, this gives an inverse to the first procedure. Hence the set of X's, 0's are isomorphic. □ Definition 3. A closed point у G Xo is rational over ко if k(y) = ко. Corollary. // ко is perfect and x G X, then p(x) is rational over ко if and only if x is left fixed by all conjugations. Example L. The circle. Take к = С, ко = M, Xo = Spec (ЩХ,У]/ {X2 + Y2 - l)) . Then X is a complex affine conic, with 2 points at infinity. Aside from its generic point, it looks like this:
§4. Fields of definition 101 at oo real points strictly complex points atoo Xq, aside from its generic point, is the quotient space by conjugation: point at oo points x with IK(x)«C points x with IK(x) S|R Here the boundary of the disc is the circle of points rational over R corresponding to the maximal ideals (X-a,Y-0) where a,{3 e R and a2 + ft2 = 1, i.e., the real locus defined by X2 + Y2 = 1. The interior points (a,/3), where a, ft 6 R and a2 + fl2 < 1, correspond to the maximal ideals: (X2+Y2 -l,aX + 0Y-l) and are not rational over R (To prove this, note that over C, every pair of conjugate complex roots of X2 + Y2 = 1 is the intersection of X2 + Y2 = 1 with a unique real line whose real points lie outside the circle.) Example M. One further example of this type the details of which we leave to the reader: ко = R, к = С, Xq the real plane curve Y2 = X(X2 — 1).
102 II. Preschemes X: J real points -**""^ J Here -1 ^ X(x)<0 L^~~\J I /—^Cj \. —■**" "~ "~~ "T "** *^ y' points x with points x with IKIxIxIR I point at oo Г ^ real points point at oo "Here x( lsi i What is the connection between Xq being reduced and irreducible and X being so? Example N. Take k = C, k0 = Ш, Xo = Spec (ЩХ,У]/ (X2 + Y2)). Since M[X, Y]/ {X2 — Y2) is an integral domain, Xq is reduced and irreducible. How- However, X = Spec(C[X, Y]/(X + iY) ■ {X - iY)) is the union of 2 lines. Therefore X is reduced, but not irreducible. Note inci- incidentally that Xq has exactly one point rational/R- namely the intersection of two lines. Example 0. Take ко imperfect of characteristic p, a G &o — Ajq • Let £(X, Y) be a linear polynomial and let Xo = Spec(ko[X,Y]/(l(X,Y)*-a)) X = Xo is reduced and irreducible, but X is just a "p-fold line". Set-theoretically, X is the line t = a1//p, but t — al/p is a non-zero nilpotent function on X. Proposition 4. Let Xo be a reduced and irreducible prescheme over ко. Let к be an algebraically closed extension of k0. Let x G Xo be its generic point and let ko(Xo) be the stalk of oXo a^ x- Then i) Xq Xfc0 к is reduced <=Ф- ko(Xo) is separable over k0
§5. Closed subpreschemes 103 ii) Xo Xfc0 к is irreducible <=3> ко is separably algebraically closed in ko(Xo)- Proof. We may as well assume Xo = Spec (R), R an integral domain over ко with quotient field ko(Xo)- For any extension к D k0, notice: ko(Xo) ®к0 к satisfies @) = у/Щ 1 ^^ ( R®kok satisfies @) = [ resp. у/Щ prime] / \ [ resp. у/Щ prime] Xo Xfc0 к reduced resp. irreducible] . In fact, R®fc0 к С ko(Xo) <S>k0 k, and the latter is the localization of R®к0 к with respect to the multiplicative system of non 0-divisors а<8>1, a £ R, а ф 0. This gives the 1st =^" easily. Then (i) follows from the assertion: Given L D ко, к D ко, к algebraically closed, then L <g>fc0 к has no nilpotents <=3> L separable/A^. (Cf. Zariski-Samuel, vol. I, Ch. 3, Th. 39; Bourbaki, Modules et Anneaux Semi- simples, §7.3.) Moreover (ii) follows from: Given L D ко, к D ко • к algebraically closed, then L <8>k0 к/у/Щ is a domain 4=Ф> ко separably algebraically closed in L (Cf. Zariski-Samuel, vol. I, Ch. 3, Th. 38 + 40; Bourbaki, Modules et Anneaux Semi-simples, §7.3.) □ Definition 4. Let Xo be a prescheme of finite type over &o- Then Xo is a prevariety over k0 if it is reduced and irreducible and if a) its function field ko(Xo) is separable over ко, and b) ко is algebraically closed in ko(Xo). (Compare Lang, Ch. 3, §§1-2.) §5. Closed subpreschemes We begin with a digression on some general facts about sheaves. Let T, Q be sheaves of abelian groups on the topological space X, ip : T -> Q a morphism. For U open, we let K(U) = ker(^(C/) -> G(U))\ it is easy to check that К is a sheaf, which we call the kernel of (p. We say (p is injective of /C = 0, i.e., if F(U) -> Q(U) is injective for all U. It is easily seen that this holds if and only if all the maps ipx : J-x —> Qx on stalks are injective. The cokernel is a little tricky to define. The presheaf C* given by C*(U) = 0{и)/ф(Т{и)) need not be a sheaf; we let С be the associated sheaf, and call it the cokernel of ip. Recall that by definition of C, there is a presheaf map С* -> С inducing isomorphisms on all stalks. Hence we have Cx = QXI^*{TX) for all x. We say then that <p is surjective if С = 0, or equivalently if (px : Tx —> Qx is surjective for all x. This does not imply that <pu is surjective for all U.
104 II. Preschemes One can check that these agree with the usual categorical definitions, and that the category of sheaves of abelian groups on X is an abelian category. In particular, this includes the assertion: Proposition 1. // <p : T -> Q is injective and surjective, then it is an isomor- isomorphism. Proof. As <p is injective, we have T(U) -> Q{U) injective for all U, and we simply must show these maps are onto. Let 5 G Q(U). For all x G U, let 5Ж be the image of 5 in Qx. (px is by hypothesis an isomorphism, so there is a unique tx G Tx such that (px(tx) = sx. There is an open neighbourhood Ux С U of x such that tx is the image in Tx of an element tx G T{UX). Then 5 and (p(tx) induce the same element in Qx, so they agree locally; replacing Ux by a smaller neighbourhood, we may assume resu,uxs = v(tx) in Q(UX). Now the Ux are a covering of U. On UXI r\UX2, ip (txi - tX2) = s - s = 0 in Q (UXI П UX2); since <p is injective, txi = tX2 on UX1 П UX2. Hence the tx patch together toatG T(U). Then (p(t) and 5 agree on each Ux, so <p(t) = s. □ Suppose Y С X is a closed subset. Then the sheaves of abelian groups on Y correspond bijectively to the sheaves of abelian groups on X such that Tx = {0} if x $. Y. We actually have canonical functors in both directions: If To is a sheaf on У, we extend it by zero by defining, for U С X open, @), UnY = ® The inverse to this is given by restricting a sheaf on X to one on F6. Since we are dealing with sheaves T such that Tx = @), x £ У, it is easy to check that the restriction Я of T to Y satisfies Г(С/, T) = ГA1 П Y, Я), all open U in X, hence it is an inverse to the process of extending by 0. Now let X be a prescheme; we want to define the concept of a closed sub- prescheme. Let У be a closed subset of the underlying space of X. Since we may have nilpotent sections in ox that are not determined by their values on points, we cannot "restrict" sections of ox to Y; we must specify the sheaf Oy explicitly. If oY is a sheaf on F, we can extend it by zero; then we want there to be given a map ox -> oy; also this map should be surjective since an allowable function on the subobject should extend locally to the ambient space. [Again beware - ox(U) -> qy(U) need not be surjective for all U; cf. Ch. I, §4]. Thus we want: Definition 1. A closed prescheme of X is Recall that if У is any subspace of X and T is a sheaf on X, then the restriction of T to У is defined by: For all U open in У, For all x € U, 3 a neighbourhood Vx of x in X Л and t € Г{ух,Т) such that t and s give the > same element of Ty, all у €VX(~)U. J
§5. Closed subpreschemes 105 1) a closed set Y С X, 2) a sheaf of rings oY on Y such that (Y, oY) is a prescheme, 3) a surjective homomorphism 7Г from ox to oY (extended by zero). Let Q = ker7r, so Q is a sheaf of ideals in ox, i.e., Q is a subsheaf of ox such that for all U, Q(U) is an ideal in ox(U): we shall call such an object an ox-ideal, for short. We claim that Q determines the closed subpreschemes up to canonical isomorphism. For first of all, Y = {x G X \ Qx ф ож}, as the sequence 0 -> Qx -> oxX -> ox Y -> 0 is exact; and secondly oY extended by zero is canonically isomorphic to the cokernel of Q —> ox. Thus a closed subprescheme is really just an ox -ideal. [But not all ox-ideals can occur; we will return to that later.] Definition 2. A morphism of preschemes / : Y -> X is a closed immersion if A) /is injective, B) / is closed, i.e., Z С Y closed =Ф f(Z) closed, C) fy '• o/(y) -> oy is surjective for all у G Y. Note that this is equivalent to saying that / factors via A) an isomorphism of Y with a closed subprescheme of X, followed by B) the canonical injection of the closed subprescheme into X. The canonical example of a closed subprescheme is this: Proposition 2. Let R be a ring, А С R an ideal. The canonical map ж : R -> R/A defines a morphism f : Spec (R/A) -> Spec (R). Then f is a closed immersion and the corresponding sheaf of ideals Q satisfies: i) r(Spec(R)f,Q) = A-Rf ii) Qx=A-oXjSpec(R). Proof By definition, if [P] G Spec {R/A), f([P]) = [n^P]. But P -» тг^) is a bijection between the primes of R/A and the primes of R containing A, so / is an injection with image V(A). More generally, if В D A is any ideal and В = В/A, then / (V(B)) = V(B). Therefore / is closed. Suppose x = [P], f(x) = [P]. We get the diagram: O/(x),Spec (Д) = RP Qx,sPec (r/a) = WA)p = Rp/A ■ RP . Thus f* is the localization of 7Г : R —> R/A, hence it is surjective. Therefore / is a closed immersion. Also, it follows that the stalks of the kernel Q are A-Rp. Finally we have
106 II. Preschemes Г ((Spec R)f,Q) = кег {Г ((Spec R)f,Qspec (д)) —■» ((Spec R)f,oSpec {R/A))} = A-Rf . □ We will now prove that the converse of this is true too! Thus, while the Nullstellensatz gave us a correspondence between those ideals A such that A = y/A and the closed subsets of affine varieties, we now get a correspondence between all ideals A and closed subpreschemes. Theorem 3. Let X = Spec (R), and let Y С X be a closed subprescheme, f : Y -> X the inclusion. Let Q be the ox -ideal defining Y, A = F(X,Q). Then Y is canonically isomorphic to Spec (R/A), i.e., there is a commutative diagram: Yc >■ Spec R Spec (R/A) Proof. First of all, note that / factors through Spec (R/A). For, /* : R -> F(Y,oY) factors through R/A by definition of A; hence / factors through Spec (R/A), by Th. 1, §2. So we may assume A = @), i.e., f*:R-+ F(Y,oY) is injective. Y is quasi-compact, since it is a closed subspace of Spec (R). Hence it is covered by a finite number of open affines, say Spec (Si). We now claim / is surjective. Since f(Y) is closed in Spec (R), f(Y) = V(B) for some ideal В in R. If 5 G B, s(x) = 0 for all x G V(B), so f*s = 0 at all points of Y. In particular, resy^pec (Si)(f*s) is given by an element <x; £ Si which is 0 at every point of Spec (Si), and hence is nilpotent: suppose а™{ = О. There are only finitely many Si, so a™ = 0 for all i if n is big enough. Then (f*s)n is an element which restricts to 0 on each open piece Spec (Si), so it is 0. As /* is injective, sn = 0. Thus В С л/Щ, hence V(B) = B(@)) = Spec (R), and / is indeed surjective. It follows that / is a homeomorphism, since by definition it is closed and injective. That is, we can suppose that we have a single topological space and two sheaves of rings ox, oY on it, together with a surjective homomorphism ox -* Qy', we must show that it is injective. Say у G Y, and f(y) = [P]; we have a map Qf^x = Rp -> oyY which we must show injective. Suppose not: then there is an element a £ R in the kernel. We want to show a = 0 in Rp, i.e., ab = 0 for some b £ R — P. We know f*a goes to 0 in oyY. Set U = {y' G Y \ f*a goes to 0 in oy,Y}; U is open, since an element goes to 0 in a stalk if and only if it is 0 in a
§5. Closed subpreschemes 107 neighbourhood. As / is a homeomorphism, f(U) is an open neighbourhood of f(y} = [P]. Hence f(U) contains a set (Spec R)t for some t £ R — P, i.e., f*a is 0 in the open set F/.(f) = {у' \ f*t(y') ф 0}. Let en = TeaY,spee(Si)U*t)- We have then Yf*t П Spec(Si) = (Spec(Si))a.. Now Г ((Spec Si),. ,oy) = (&)„., and f*a is 0 in (Si),.. If ац G Si gives f*a on Spec Si, this tells us ац-сг™' = 0 in Si for some щ. Hence, as there are only finitely many Si's, оца™ = 0 for all i and big enough n. Thus (f*a) • (f*t)n restricted to each Spec (Si) is 0, so f*{atn) = 0 in r(Y,oY). Since /* : R -» Г(У,оу) is objective, ain = 0 in R. As i ^ P, a does go to 0 in Rp. □ Corollary 1. Let f :Y -> X be a morphism of preschemes. Then the following are equivalent: (i) f is a closed immersion, (ii) for all affine open sets U С X, f~l{U) is affine and the map is surjective, (Hi) there exists an affine open covering Ui of X such that f~l (Ui) is affine and the map r(Uhox)-*r(f-l(Ui),oY) is surjective for all i. Proof, (i) ==>■ (ii) by the theorem; (ii) ==>■ (iii) obviously; (iii) ==>■ (i) by Prop. 2. □ Corollary 2. Let X be a prescheme and let Q be an ox-ideal. Let Y = {y G X | 1 ^ Qy} -a closed subset of X. The cokernel of О -> Q -> ox is О outside of X, so consider it as a sheaf of rings oY on Y. Then (F, oy) is a subprescheme of (X,ox) if and only if For all у £ X, there is a neighbourhood U of у and sections {sa} in r(U,Q) such that (*) for all x G U. Proof. If (F,oy) is a subprescheme, and у G X, then by the theorem (*) holds if you take U to be any affine open neighbourhood of y, and sa to be generators of F(U, Q) over Г (U,ox). Conversely, if (*) holds, and U is taken to be affine, say U = Spec (Л), then (U П F,oy \u) = Spec (R/Ssa • R) . D (*) is the condition for Q to be quasi-coherent. We shall investigate this fully in Ch. Ill, §1.
108 II. Preschemes Example I bis. Closed subschemes of Spec (k[t\), к algebraically closed. Since k[t] is a PID, all non-zero ideals are of the form The corresponding subscheme Y of A1 = Spec (k[t\) is supported by the n points ai,..., an, and at щ its structure sheaf is where rrii = mai^i = (t — щ). Y is the union of the a^'s "with multiplicity г»". The real significance of the multiplicity is that if you restrict a function / on A1 to this subscheme, the restriction can tell you not only the value / (a^) but the first Ti — 1-derivatives: -^ (ai), i<r«-l . In other words, Y contains the (r» — l)st order normal neighbourhood of {аг-} in A1. Consider all possible subschemes supported by {0}. These are the schemes Yn = Spec (k[t}/ (tn)) . Y\ is just the point as a reduced scheme, but the rest are not reduced. Corre- Corresponding to the fact that the defining ideals are included in each other: (*)э(*2)э(*3)э...э(Пэ...э@) , the various schemes are subschemes of each other: Fi С F2 С Y3 С ... С Yn С ... С А1 . Example P. Closed subschemes of Spec (k[x,y]), к algebraically closed. Every ideal А С &[ж,у] ^s °f the form: (/)ng for some / G k[x, y] and Q of finite codimension (to check this use noetherian decomposition and the fact that prime ideals are either maximal or principal). Let Y = Spec ([x,y]/A) be the corresponding subscheme of A2. First, suppose n A = (/). If / = {"I fp, with fi irreducible, then the subscheme Y is the union of the irreducible curves fi = 0, "with multiplicity r»". As before, if g is a function in A2, then one can compute solely from the restriction of g to Y the first Ti — 1 normal derivatives of g to the curve fi = 0. Second, look at the case A of finite codimension. Then A = Q1n...nQt
§5. Closed subpreschemes 109 where \/Qi is the maximal ideal (x — сц,у - bi). Therefore, the support of Y is the finite set of points (п{,Ь{), and the stalk of Y at (a{,bi) is the finite dimensional algebra k[x,y]/Qi. For simplicity, look at the case A = Q\, y/Qi = (x,y). The lattice of such ideals A is much more complicated than the one- dimensional case. Consider, for example, the ideals: (x,y) D (ax + Cy,x2,xy,y2) D (x2,xy,y2) D (x2,y2) D @) . These define subschemes: / @,0) with reduced structure С Ya,p С Y2 С F3 С Since (ax + /3y, x2,xy, y2) D (ax + fly), Уа,/з is a subscheme of the reduced line £a>p defined by ax + /3y = 0 : Ya)p is the point and one normal direction. But F2 is not a subscheme of any reduced line: it is the full double point and is invariant under rotations. Y3 is even bigger, is not invariant under rotations, but still does not contain the 2nd order neighbourhood of @,0) along any line. If g is a function on A2, g \ya 0 determines one directional derivative of g at @,0), g \y2 determines both partial derivatives of g at @,0) and g |y3 even determines the mixed partial f^ @,0). As an example of the general case, look at A = (x2,xy). Then A = (x) П (x2,xy,y2). Since y/~A = (x), the support of Y is the y-axis. The stalk ozy that has no nilpotents in it except when z = @,0). This is an "embedded point", and if a function g on A2 is cut down to F, the restriction determined both partials of g at @,0), but only щ at other points :
110 II. Preschemes Proposition 4. Let X be a prescheme and Z С X a closed subset. Consider the various closed subschemes Z\ = (Z,ox/Ii) that can be defined with support Z. Among all these, there is a unique reduced subscheme - call it Zq = (Z,ox/Io). Furthermore, if Z\ = {Z,ox/I\) is any other, then I\ С /о (so Zq is a subscheme of Zx) and I0 = Vh (Le., Г (U,/0) = y/r{U,h) for U affine). Proof. Define Iq by Now suppose U = Spec (Я), and Z C\U = V(A) where А С Л is an ideal such that A = y/A. Then I claim Iq)X = A • ox, all x G C/, hence Iq is quasi-coherent and defines a closed subscheme Zq. First of all, A • ox С /о,ж since all elements of A vanish on Z П U. Conversely, suppose 5 G (/о)ж; then 5 is the restriction of some t £ Г (£//, /о), where x G U/, since the U/ are a basis in U. But inside Uf = Spec (Rf), ZnUf = V(A-Rf) and moreover A • Rf = yjA ■ Rf. And t(x) = 0, all x G ZП Uf means that t is in all prime ideals P of Rf containing A-Rf, hence t G A-Rf. Therefore 5 G A• ox. Zq is reduced since for all x G Z, °x,z0 = (/о)ж and (/о)ж = а/(/о)ж by what we just proved. The uniqueness assertion will follow if we prove Iq = y/T[ for all ox-ideals I\ such that Z\ = (Z,ox/I\) is a closed subscheme. Again, suppose U = Spec (R). Z\ П U is a closed subscheme of U, hence Zx П U = Spec (R/B). But V (B) is the support of Spec (Д/JB), hence = V(A), hence V^ = A, i.e., y/r{U,h) =r(U,I0). □ A very important example of a closed subprescheme are the fibres of a mor- phism. Let X —> Y be a morphism and let у G Y be a closed point. We shall put a subscheme structure on the fibre f~1(y) over y. Let к = к(у). Consider the fibre product Xy in the diagram: i Spec (h) X Y where i is the morphism with image у and i* : oyY -> к the canonical map of oy Y to its residue field. Let V = Spec E) be an affine open neighbourhood of у and let Ui = Spec (Ri) cover /~1(У). Let у = [M], M С S maximal. Then by Theorems
§5. Closed subpreschemes 111 3a, b, §2, Xy is covered by open affine pieces Xy = Spec (Ri <S>s k). But к = S/M, hence Xjp = Spec (Ri/M • Ri). Therefore j is a closed immersion. In other words, j is an isomorphism of Xy with a closed subscheme of X with support f~1(y) defined by the sheaf of ideals /, where (for all affine open U С f I mentioned at the beginning of this chapter that one of the reasons why it is necessary to include schemes with nilpotents in our set-up is that they occur as fibres of morphisms between very nice varieties. To see this, let X = Speck[X1,...Xn]/(f1,...,fm) be an arbitrary affine scheme of finite type over k. Define ф : A£ _> AT by the homomorphism: fi Then X is exactly the fibre of ф over the origin! When the residue field к (у) of a point у £ Y is not algebraically closed, it is sometimes more important to embed k(y) in an algebraically closed field k, and to take the fibre product as above, but via the morphism i : Spec (k) -> Y, defined by Image (г) = у, and г* : oyY -> k(y) -> k. The prescheme Xy obtained in this way is called a geometric fibre of /. Problem. Show that the closed subpreschemes of P£ correspond bijectively with the homogeneous ideals Ack[X0,...,Xn] such that A:(X0,...,Xn)=A (i.e., if / G к [Xo,... ,Xn], Xi • / G A for all г, then / G A. This condition says that A is biggest among those ideals that induce the same ideals in each affine piece of Pn.)
112 II. Preschemes §6. The functor of points of a prescheme We have had several indications that the underlying point set of a scheme is peculiar from a geometric point of view. Non-closed points are odd for one thing. A serious difficulty is that the point set of a fibre product X x 5 Y does not map injectively into the set-theoretic product of X and Y. For example: Spec (C) xSpec (r) Spec (C) =* Spec (C xR C) ^ Spec (C) U Spec (C) (U denotes open disjoint union). As we will see, this prevents the point set of a group variety from being an abstract group! The explanation of these con- confusing facts is that there are really two concepts of "point" in the language of preschemes. To see this in its proper setting, look at some examples in other categories: Example Q. Let С = category of differentiable manifolds. Let z be the manifold with one point. Then for any manifold Z, home (z, X) = X as a point set . Example R. Let С = category of groups. Let z = Z. Then for any group G, homc(Z,G) = H as a point set . Example S. Let С = category of rings with 1 (and homomorphisms / such that /A) = 1). Let z = Z[X]. Then for any ring Л, home B, R) = R as a point set . This indicates that if С is any category, whose objects may not be point sets to begin with, and z is an object, one can try to conceive of homc(z,X) as the underlying set of points of the object X. In fact, X 1—у homc(z,X) extends to a functor from the category С to the category (Sets), of sets. But it is not satisfactory to call homc(z,X) the set of points of X unless this functor is faithful, i.e., unless a morphism / from Xx to X2 is determined by the map of sets: Example T. Let (Hot) be the category of CW-complexes, where hom(X, Y) is the set of homotopy-classes of continuous maps from X to Y. If z = the 1 point complex, then (ot)B5 X) = тго(.Х'), (the set of components of X) and this does not give a faithful functor.
§6. The functor of points of a prescheme 113 Example U. Let С = category of pre-schemes. Taking the lead from Examples 1 and 4, take for z the final object of the category С : z = Spec (Z). Now home (Spec (Z),X) is absurdly small, and does not give a faithful functor. Grothendieck's ingenious idea is to remedy this defect by considering (for arbitrary categories C) not one z, but all z: attach to X the whole set: In a natural way, this always gives a faithful functor from the category С to the category (Sets). Even more than that, the "extra structure" on the set \Jzhomc(z,X) which characterizes the object X, can be determined. It con- consists in: i) the decomposition of (Jz homc(z,X) into subsets Sz = home (z, X), one for each z, ii) the natural maps from one set Sz to another Sz>, given for each morphism g : z' —)■ z in the category. Putting this formally, it comes out like this: Attach to each X in C, the functor hx (contravariant, from С itself to (Sets)) via (*) hx(z) = home (z, X), z an object in С . ( \ и ( \ — / mduced map from homc(z,X) 1 {**) nx{g)- | tQ homx(ziiX) ] ' g : z' -¥ z a morphism in С Now the functor hx is an object in a category too: viz., Fund (C°, (Sets)), (where Fund stands for functors, C° for С with arrows reversed). It is also clear that if g : X\ —)■ X2 is a morphism in C, then one obtains a morphism of functors hg : hxx —>■ hx2 • All this amounts to one big functor: h : С —>• Fund (C°, (Sets)) . Proposition 1. h is fully faithful, i.e., if XX,X2 are objects of С, then, under h, home (X\, X2) —> hompunct (hxi, hx2) • Proof. Easy. The conclusion, heuristically, is that an object X of С can be identified with the functor hx, which is basically just a structured set. Return to algebraic geometry! What we have said motivates I hope:
114 II. Preschemes Definition 1. If X and К are preschemes, a K-valued point of X is & morphism / : К —t X; if К = Spec (R), we call this an R-valued point of X. If X and К are preschemes over a third prescheme 5, i.e., we are given morphisms px : X —t S, рк '■ К —» 5, then / is a K-valued point of X/S if px • f = Рк', if К = Spec (R), we call this an R-valued point of X/S. The set of all R-valued points of a prescheme X, or of X/S, is denoted X(R). "Points" in this sense are compatible with products. That is to say, if K, X, and Y are preschemes over 5, then the set of K-valued points of X x$ Y/S is just the (set-theoretic) product of the set of .ftT-valued points of X/S and the set of .ftT-valued points of Y/S. This is just the definition of the fibre product. In particular .ftT-valued points of a group prescheme X/K - to be defined in Ch. IV - will actually form a group! The concept of an .R-valued point generalizes the notion of a solution of a set of diophantine equations in the ring R. In fact, let: /lv)/m^2 [Xi, . . . , Xn] X = Spec {2>[Xu..., I claim an R-valued point of X is the "same thing" as an n-tuple a\,..., an 6 R such that /i (oi,..., on) = ... = fm (ai,..., an) = 0. But in fact a morphism Spec (Д) -^ Spec (Z [Xu ... ,Xn] / (/1}..., /m)) is determined by the n-tuple ai = g* (X{), 1 < i < n, and those n-tuples that occur are exactly those such that h !->■ h (oi,...,an) defines a homomorphism i.e., solutions of /i,..., /m. An interesting point is that a prescheme is actually determined by the functor of its R-valued points as well as by the larger functor of its K-valued points. To state this precisely, let X be a prescheme, and let hx' be the covariant functor from the category (Rings) of commutative rings with 1 to the category (Sets) defined by: h{x\R) = hx (Spec (Д)) = hom[Spec (R),X] . Regarding hx as a functor in X in a natural way, one has: Proposition 2. For any two preschemes X\,X2, hom(Xi,X2) -^ horn Hence /i(°) is a fully faithful functor from the category of preschemes to Funct{ (Rings), (Sets)) .
§6. The functor of points of a prescheme 115 This result is more readily checked privately than proven formally, but it may be instructive to sketch how a morphism F : h^] —> hjQ will induce a morphism / : X\ —у Xi- One chooses an affine open covering Ui = Spec (A{) of X\\ let Ii : Spec (Ai) = U{^XX be the inclusion. Then Ii is an Aj-valued point of X\. Therefore F(I{) = fi is an Aj-valued point of X2, i.e., fi defines Щ ¥ Spec (А{) —+ X2 . Modulo a verification that these fi patch together on U{ П Uj, these fi give the morphism / via Grothendieck's existence problem comes up when one asks why not identify a prescheme X with its corresponding functor K^, and try to define preschemes as suitable functors: F : (Rings) —>• (Sets) . The problem is to find "natural" conditions on the functor F to ensure that it is isomorphic to a functor of the type h^'. For example, let me mention one prop- property of all the functors hrx' which was discovered by Grothendieck (Compatibility with faithfully flat descent): Let q : A —^ В be a homomorphism of rings making В into a faithfully flat A-algebra, i.e., f V ideals I C A, W ' r- -Ч-^I-B, and q-1(I-B) = I " Then if pi, P2 : В —t В ®а В are the homomorphisms 0 ^ 0 <g> 1 and 0 !->■ 1 <g> 0, the induced diagram of sets: F(A) ^$ F(B) Z4 F(B®AB) F(P2) is exact, (i.e., F(g) injective, and Im F(g) = {x \ F(p\)x = F(p2)x}). In terms of preschemes, this means the following: Let А, В be as above and let X be a prescheme. Let / : Spec (B) 4lbea morphism and consider the diagram:
116 II. Preschemes Spec(S)xSpec(A)Spec(S) Pi VI \ Spec(B) —^ X q ' ^ - " Spec(A) " (where q denotes the morphism of schemes induced by the homomorphism q). Then there is a morphism q : Spec (A) —t X such that / = g • q if and only if f ■ Pi = f • P2, and if g exists at all, it is unique. To tie these R-valued points in with our usual idea of points, consider the case where R = /г, a field. What is a /г-valued point of a prescheme X? Spec (/г) has just one point, so a map / : Spec (/г) —» X has one point ж G X as its image. The ring map /* : oxX —* к must be a local homomorphism, so it factors through к (ж). Conversely, if we are given a point ж G X and an inclusion к (ж) С /г, we get a /г-valued point with image X, if we define /* by: all U С X open, x eU. Thus: (set of /г-valued \ ^ j set of points x € X, plus 1 points of ж / ~ I injections к (ж) <-»■ к J For example, for each point ж G X, there is a canonical morphism ix : Spec (к(ж)) —> X with image ж. If X is a prescheme over /го to start with and к D /го, then the /г-valued points of X over ко correspond to those ж G X and injections к (ж) м- к which reduce to the identity on the subfield /го: О С /Cq. For example, suppose к = k0 is algebraically closed and X is of finite type over k. Then {set of /г-valued points 1 ^ Г set of points x e X 1 ofX/k J ~ I such that к ^> к(ж) J II {set of closed points 1 xeX j '
§6. The functor of points of a prescheme 117 Here is another example: let ко be any field, but assume к algebraically closed. Let Xq be a prescheme of finite type over ко, and let X = Xq xSpec (fco) Spec (k). Then: {set of k- valued points 1 ^ (set of /г-valued I o{X0/ko j ~ \ points of X/ к j set of closed points ofX Thus if Xq is a prevariety/fco, we recover the underlying set of the old variety over к associated to X. For this reason, when к is an algebraically closed overfield of ко, к-valued points of Xo/ko are often called geometric points of Xq. Among prevarieties Xo/ko, Xq —* /ixo/fco(Spec (^)) ^s a faithful functor into (Sets); but this is false if X is allowed to have nilpotent functions on it. We can describe very simply the set of R-valued points of a scheme X when R is any local ring. In fact, if M is the maximal ideal of R, then Spec (R) has just one closed point, [M]. If / : Spec (R) —t X is a morphism, then (by simple topology) any open set in X containing /([.M]) contains the whole image of /. In particular the image always lies in one affine piece of X. (Cf. "Illustration" of Spec (R) in §1.) Proposition 3. Let x G X. Then there is a bijection between the set of R- valued points of X such that f([M]) = x and the set of local homomorphisms q : О.Д -> R. Proof. Notice that 0\щ Spec R = R, so from / we get a local homomorphism g. To show we have a bijection we may replace X by an affine neighbourhood Spec (A) of x; suppose x = [P]. We have a commutative diagram: {set of / = Spec (R) ->X 1 J set of local homomorphisms 1 such that f([M]) =x j ~* { g : AP = o^ -> R J \ s {set of homomorphisms <p : A —у R 1 such that tp~l{M) = P j ' So all arrows are bijections. □ The final topic that we want to consider is the Hausdorff condition in the category of preschemes. Let X be a prescheme. The basic idea is to study X by taking a test prescheme К and considering two K-valued points of X: If x G K, when should we say that / and g are "equal" at ж? Among varieties we just asked that f{x) = g{x). For preschemes, the approximate definition is:
118 II. Preschemes Definition 2. f(x) = g(x) if / • ix = g • ix, where ix : Spec (к(ж)) -^ К is the canonical morphism. Equivalently, this means that f(x) = g(x), and that the 2 maps fx,gl : к(/(ж)) —» к(ж) are equal. It is easy to check that if К is reduced, then / = g if and only if f(x) = fif(x), all ж G if. Proposition 4. For all f,g : К -t X, {xeK\ f{x) = g(x)} is locally closed. Proof. Call this set Z; assume x G Z. Let у = /(ж) = g(x), and let £/i = Spec (R\) be an affine open neighbourhood of у in X. Let £/2 = Spec (#2) be an affine open neighbourhood of x in К such that Then / and g induce homomorphisms: f*,g*:Ri Let A be the i^-ideal generated by the elements f*(a) — g*(a), at, G R\. Then I claim Z П E/2 = V"(A). In fact, if [P2] G t/2, ^2 С Д2 being prime, then /*"' (P2) = 5* (P2) = Л say, and the homomorphisms / ,g~* : R\/P\ —>■ R2/P2 are equal /*(«) " 5*(a) € P2, all a G Д1 P2 D A. Definition 3. A prescheme X is a scheme if for all preschemes .ftT and all K- valued points f,g of X, {x € К \ f(x) = fif(x)} is closed. Proposition 5. If X is a prescheme over a ring R, then the criterion for X to be a scheme is satisfied for all К, /, д if it is satisfied in the case: К = X xSpec (д) X f = Pi 9 = P2 ■ Proof. Let /, g : К —у X be given, К arbitrary. Let тг : X —>■ Spec (R) be the given morphism. Let Zi = {xeK\f(x)=g(x)} Z2 = {xeK\ir(f(x))=ir((x))},
§6. The functor of points of a prescheme 119 and let Q be the o^-ideal generated by the functions f*ir*a - д*тт*а, all a G R. Then Q is quasi-coherent and 1 ^ Qx Ф> x G Z2. Therefore (Z2,oK/Q) is a closed subscheme Z of K. Moreover, Z\ С Z, so Z\ is closed in К if and only if it is closed in Z. But the restrictions of тг • / and it • g to Z are equal, since the homomorphisms are equal. Therefore / and g induce a morphism h=(f,g):Z^XxSpec{R)X . But for all x G Z, let у = /i(x), and then: /0*0 = 00*0 «=> Pi(l/) = Ыу)- Therefore £i = Л ({l/ 6 X xSpec (д) X | pi(y) = p2(y)}) , hence if the set in braces is closed, so is T\. D Corollary 1. If к is an algebraically closed field, then a prevariety over к is a variety in the sense of Ch. I if and only if it is a scheme. Proof. Let Z С X xk X he the set of points z such that pi(z) = P2(z). If z is a closed point then z G Z if and only if p\{z) = P2(z). But the locally closed set Z is closed if and only if its intersection with the set of closed points of X x k X is closed in the induced topology, i.e., if and only if X is a variety in the old sense. □ In Ch. I, §5, we asserted that varieties X also satisfied the "local criterion": If x, у G X, x ф у, there is no local ring О С к(Х) such that О > Qx and О > oy . We can now prove this. If such an О existed, then by Proposition 3 we can define 2 O-valued points of X: Spec such that if жо is the closed point of Spec (O), /(xo) = x, g(x0) = y. But if x\ is the generic point of Spec (Q), then f(x\) = g(x\) = the generic point of X, and /ajjfl'xi both give the identity map from k(X) to the quotient field of O. This shows that Xle{z\ f(z) = g(z)} xo<t{z\ f(z) = g(z)} . Therefore {z \ f(z) = g(z)} is not closed.
120 II. Preschemes Proposition 6. If X is a scheme and U, V С X are open affine sets, then UC\V is affine and the canonical homomorphism: r(U,ox)®r(V,ox)-*r(UnV,ox) is surjective. Proof. Let A : X -> X x X be the diagonal (X x X being the absolute product, i.e., over Spec (Z)). First, A(X) is closed since X is a scheme. And if you cover X by open affines Щ = Spec (Ri), then A(X) is covered by the affines Щ х Щ. But Щ = A~l (Ui x U{) and A* is the canonical surjection: This shows that Л is a closed immersion (cf. Cor. 1 of Th. 3, §5). But U x V is an open affine in X x X with ring Г (U,ox)<g> Г (V, ox). Therefore U П V = A~l (U x V) is also affine and its ring is a quotient of Г (U,ox) <g> i (V, ox). D Problem 1) Check that a fibre product of schemes is a scheme. 2) Prove that P^ ^s a scheme (cf. Example J). 3) Given a diagram of schemes: X Y where X\ and Y\ are closed subschemes of X and Y, show that X\ a closed subscheme of X xsY. (Hint: You must use the fact that Si is a scheme.) From now on, all our preschemes will be assumed to be schemes. Y\ is
§7. Proper morphisms and finite morphisms 121 §7. Proper morphisms and finite morphisms In this section, we will generalize the results of Ch. I, §8. Definition 1. Let / : X —t Y be a morphism. / is of finite type if for all open affines U С Y, f~l{U) is °f finite type over Г(£/,оу), i.e., f-1(U) is quasi- compact and Г(У,ох) is finitely generated over r(U,oY) for all open affine vcr\u). Proposition 1. Let f : X —» Y be a morphism. To prove that f is of finite type, it suffices to check the defining property for the open affine sets Ui CY of one covering {Ui} ofY. Proof. Let Ui = Spec (Щ) and assume /-1 (Ui) of finite Ьуре/Щ. Note that for all / G Щ, /-1 (Uij) is of finite type over (Ri)f. Let U = Spec (R) be any open affine in Y. Then U admits a finite covering by open sets of the type (Ui)p f G Ri, since it is quasi-compact. Therefore we can assume U = U\ U ... U Un, Ui affine and /-1 (Ui) of finite type over Г (Ui,oY), and hence of finite type over R. Therefore by Prop. 1, §3, U is of finite type over R. О Definition 2. Let / : X —t Y be a morphism. Then / is proper if / is of finite type and for all morphisms g : К ->Y, К any prescheme, the projection p2 :XxYK—*K is a closed map of topological spaces. Since p2 being closed is a local property on K, it certainly suffices, in this definition, to look at affine -ftT's. The following functorial properties of proper ness follow immediately from the definition: i) closed immersions are proper, ii) the composition of proper morphisms is proper, iii) if / : X —t Y is proper and g : К —t Y is any morphism, then p2 : X x у К —» К is proper. iv) Suppose X —> Y -^ Z are morphisms and that g is of finite type. / is surjective and g • f is proper. Then g is proper. We certainly should check that a variety X over к is complete if and only if 7Г : X —t Spec (k) is proper. This follows easily from Chow's lemma and the next theorem: Theorem 2. The morphism f : P^ —^ Spec (Z) is proper. Proof. We must show that the projection p : P£ —> Spec (R) is closed, for every R (cf. Ex. J bis, §2). We proved in Ch. I, §8, that p is closed when R is a finitely generated integral domain over an algebraically closed field
122 II. Preschemes к. However, with small modifications, the identical proof works in general. Let's follow this proof. Start with Z с PJ, a closed subset. As before, Z can be described as a homogeneous ideal AcR[X0,Xu...,Xn] . In fact, essentially the same argument shows Lemma. For all closed subschemes Z С Рд, for all i and all g G I (Z П Ui), there is a homogeneous polynomial G G R[Xo,... ,Xn] of degree r such that G/X' G / (Z П Uj), all j and G/ХГ e I (Z П U,), all j and G/XT = g. Next take a point у G Spec (R) — p(Z), and assume у = [P], where P С R is a prime ideal (no longer necessarily maximal). One can still conclude that q Xn Prom this one gets (*)' (SN/AN) ®д (Rp/P • RP) = @) , hence by Nakayama's lemma (**)' (Sn/An) ®д RP = @) , Since Sn is a finitely generated i2-module, this implies that there is an element / G Rp such that / • Sn С An- As in Ch. I, this implies that Z is disjoint from Pg x Spec (Д/), hence p(Z) С V((/)). This shows that p(Z) is closed. □ In particular, if Y is any scheme and X is a closed subscheme of Pg x Y, then P2 : X -¥ Y is proper. Proposition 3. Suppose f \ X —¥ Y is proper and R is a valuation ring, with quotient field K. Suppose we are given morphisms ф,фо: Spec (К) tX Spec(R) Then there is one and only one morphism ф : Spec (R) —t X such that ф eoctends to фо and f • ф = ф. Proof. If фо had 2 extensions ф', ф" : Spec (R) ->■ X, then apply the definition of scheme to ф' and ф" and it follows that ф' = ф". This takes care of uniqueness. Next apply the definition of proper to ф : Spec (R) ->■ Y and to the closed subset of X xy Spec (R) obtained by taking the closure Z of (фо,г) [Spec K]. Give Z the structure of reduced closed subschemes. Then pi{Zi) is a closed subset of Spec (R) containing its generic point, hence equal to Spec (R). I claim that Pi : Z —У Spec (R) is an isomorphism, hence Z is the graph of the ф we are looking for.
§7. Proper morphisms and finite morphisms 123 Everything follows from: Lemma. With R and К as above, let f : Z -¥ Spec (R) be a surjective bi- rational morphism, where Z is a reduced and irrational scheme. Then f is an isomorphism. Proof. Let x G Z be a point over the closed point of Spec (R), and let у be the generic point of Z. We get a diagram: oyZ -« D o^z К ч Э R Since /* is a local homomorphism, and R is a valuation ring, R = oxZ. But then the local homomorphism defines a morphism g : Spec (R) —t Z by Prop. 3, §6. It is clear that f • g = Ispec (R), and g • f = lz since g • f(y) = lz(y) and Z is a scheme. □ Corollary. Let X be a complete variety over a field k. Then for all valuation rings R С К, such that к С R and К = quotient field of R, R D ox for some xeX. The valuation criterion asserts that the property in Prop. 3 implies conversely that / is proper. We will not use this; for a proof, cf. EGA, Ch. II. Proposition 4. Let ф : R -t S be a homomorphism of rings such that S is integrally dependent on (f>(R). Then the corresponding morphism Ф : Spec E) —> Spec (R) is a closed map. Proof. This is exactly the going-up theorem of Cohen-Seidenberg. In fact, if V(A) is a closed subset of Spec E), then I claim Ф(У(А)) = V (ф-1(А)). То see this, let [P] 6 V (ф~1(А)). Then P is a prime ideal in R containing ф~1{А). By the going-up theorem applied to S/A and the subring Р/ф~1(А), there is a prime ideal P' С 5 such that ф~1(Р1) = P and P' D A. Therefore [P1] 6 V(A) and Ф([Р']) = [Р]. П Corollary. Let ф : R -t S be a homomorphism of rings such that S is a finite R-module. Then the corresponding morphism Ф : Spec E) —» Spec (R) is proper. It is very useful to have a class of morphisms / : X —t Y which over every affine in Y look like those in this corollary, but where the image itself is not necessarily affine.
124 II. Preschemes Definition 3. A morphism / : X —t Y is affine if for all open affine sets U С Y, f~1(U) is affine. / is finite if it is affine and if for all open affine sets U С Y, Г (f~1(U),ox) is a finite module over Г (U,oY). We will prove the following key point in Ch. Ill, §1, when we have the ap- appropriate machinery. Proposition 5. To prove that a morphism f \ X -¥ Y is affine or finite, it suffices to check the defining property for the open affine sets Ui С Y of one covering {Ui} ofY. According to the corollary to Prop. 4, every finite morphism / : X —t Y is proper. It is easy to check that finite morphisms also have finite fibres, i.e., f~l (y) is a finite set for every у eY. A deep result, due to Chevalley, asserts that when У is a noetherian scheme (see Ch. Ill, §2) then conversely every proper morphism / : X —У Y with finite fibres is a finite morphism. As an example of how the definition of "proper" works, consider the projec- projection taking (ж, у) to ж. If ж С A| is the closed subset x • у = 1, then p{X) = Aj. — @) which is not closed. Therefore p is not proper. Looking back to Chapter I, Ex- Examples N and О are examples of finite bijective morphisms which are not iso- isomorphisms. But Example P is clearly not a finite morphism and it is worthwhile seeing that it is not even proper. We started with a finite morphism / : A1 —► С , С an affine plane curve. Letting Do = A1 — A), and /' = restriction of / to Do, we obtained a bijection between Do and C. Now /' itself is certainly a closed map since the topology of a one-dimensional variety is so trivial. Therefore to show that /' is not proper, we have to take a fibre product. Let D\ = A1 — (—1). Consider the fibre product: > Do 4 n open open Al Ai P1 .1 A1 x<?AJ >■ A1 \P2 open ^ *" ^' Then p2 : Do Y.c D\ -t D\ is not closed. To see this, look at the structure of A1 ®c A1. It is a closed subscheme of A1 xk A1 whose closed points are the pairs (xi,x2) such that f(xi) = f(xi). Therefore it is the union of A) the diagonal {(x,x)}, and B) the 2 isolated points A, —1) and (—1,1). Do xq D\ consists in
§7. Proper morphisms and finite morphisms 125 the open subset U of diagonal points {(x,x) | x ф 1,-1} and the open subset V containing the single point (—1,1). Therefore U and V are closed too. But p2(U) = Z?i — {1} is not closed in D\. A natural way in which finite morphisms occur is via projections. To fix notation, let FR be the union of (n + l)-affines with the usual patching. Let li = Y^ o>ijXj, 0 < i < r j=o be any linear forms, with o^- G R. Recall that the open set W{ given by t{ ф 0 is the affine scheme Spec л |_,..., _ (cf. Ex. J bis). Let L be the "linear" subset £0 = ... = ir = 0. Now let FrR be the union of the (r + l)-affines: p^,,^ , 0<г<г. Define the projection: __ . 7П)Т1 Т » Jn>f* R by requiring that -Ki{Wi) С V{ and that vrj (Yk/Yi) = £к/£{, for all 0 < г, к < г. Then clearly ?rf 1(T^) = Wi, so tti is an affine morphism of finite type. Proposition 6. Let Z be a closed subscheme of FR disjoint from L. Let it be the restriction of tti to Z. Then it : Z —> Гд is a finite morphism. Proof, it is certainly affine since it is the restriction of tti to a closed subscheme. For all к such that 0 < к < n, Z П Uk and V ( ( j^,..., j^ J J are disjoint closed sets in Uk- Therefore Suppose in fact that г=0
126 II. Preschemes where fk G I(Z П Uk) and gik G R [f^,..., ^-1. By the lemma in Th. 2, there is a homogeneous polynomial Fk G R [Xo,..., X — n) of degree d such that a) Fk/X£ = fk b) Fk/X?eI(ZnUi), all г. Equation (*) shows that Fk must have the form i=0 degree (Gik) = d- 1. Now consider the ring extension given by тт* But because /(Z П Wi) contains Fo/£f,..., Fn/£f, it follows that the monomials , о < aj < d 3=0 form a module basis of R fe,..., ^H /i(ZnUi) over Д [4/^,..., 4/^]- There- Therefore 7Г is finite. П At this point, it is almost trivial to give the classical proof of Noether's normalization lemma so as to illustrate the power of this method (cf. Ch. I, §6). Let к be an algebraically closed field and let X be an affine variety over k. If IcAJ, embed A£ С P£ as the complement of the hyperplane Xo = 0 and let X be the closure of J in F]J. Let L be a maximal linear subspace of ¥n — An disjoint from X. Let Iq = Xo, l\,..., £r be a set of independent linear equations defining L. Use these to define the projection:
§8. Specialization 127 тг is finite by the Proposition. Since тг 1 (Vo) = Xfl{4 ф 0} = X, the restriction тг' of тг to a morphism from X to Vo is finite. Since Vo = Ar, this gives the normalization lemma, provided that тг (and hence тг') is surjective. But if ir(X) were a proper subset of Pr, it would be irreducible and it could not contain completely the hyperplane Pr — Ar either. Then choose a closed point x e¥r — it(X) - Ar: it follows that L U Trf г(х) is a linear subspace of Pn — An disjoint from X and bigger than L. This contradiction proves that тг' is surjective, so Noether's lemma comes out. §8. Specialization Let к be an algebraically closed field and let R С к be a valuation ring. Then the residue field L of R is also algebraically closed. Let тг : R —У L denote the canonical map, and let M = кег(тг). Let ¥n(k) and Pn(L) denote the set of closed points of P£ and P£ respectively (= the set of /г-valued and L-valued points). Then there is a remarkable map: ¥n(k) -A ¥n(L) defined as follows: let (ao, a\,..., an) be homogeneous coordinates of x G ¥n(k). Then for some Л G k, all the elements Xao,..., Xan will be in R, and not all of them M. Set p ((«o, • • •, an)) = (тг (Aa0),..., тг (Aan)) • Since Л is unique up to a unit in R, (it (Xao),... ,ir (Xan)) is unique up to multiplication by a non-zero element of L, so p makes sense. Note that p is surjective. Definition 1. For all closed points x €¥%, p(x) is the specialization of x with respect to R. For example, look at the case n = 1, and consider к = A1 (k) =¥1(k)-{oo}. Then p : к ->■ P:(L) = L U {oo} is just the place associated to R. Although p does not extend to a continuous map of the whole scheme P£ to P£, (Reader: why not?) it has the following remarkable property:
128 II. Preschemes Theorem 1. For all closed subsets Z С P£, there is a unique closed subset W С P£ such that p(Z(k)) = W(L). In fact, W can be constructed as follows. Consider the diagram of fibre prod- products: Spec к case of real valuation P Spec R Spec L , where a and b are the generic and closed points of Spec (R). Then j is a closed immersion, so we will identify P£ with the closed subscheme p F) in Рд. More- Moreover i is an isomorphism of P£ with the subset p~l{a) of Рд, plus the restriction of Opn to p~l{a). In other words, г is a homeomorphism, and for all x G P£, i* • Ои„\ mn —> o_ pn is an isomorphism. Theorem 1 bis. W = PnK П i(Z). To see why this is reasonable, let's check that for all closed points x {p(x)}=FnLn{i(x)}.
§8. Specialization 129 In fact, if ж = (a0,... , an) where we assume щ G R and a0 g M to simplify notations, then p(x) = (тгско,... ,iran). Cover Рд by open affines: Ui = Spec R —,...JlF [Aj Aj Consider the subset Clearly i(x) G 17, and ГПР£ = {p(x)}. Viap, 27 -—> Spec (Д), so 27 is the graph of a morphism / : Spec (R) —У Рд which is a section of p. Therefore £ is closed in Рд and the point г (ж), which corresponds to the generic point a G Spec (R), is dense in U. Therefore E = {i(x)}. (Compare Prop. 3, §7). It follows that p(Z(k)) cPJfl i(Z). The main point is the lifting problem: to show that every L-valued point of P£ П i(Z) is the intersection of P£ with an R-valued point of i(Z). We shall do this via the Going-Down Theorem of Cohen-Seidenberg: Going-Down Theorem. Let f : X —» Y be a finite morphism. Assume that Y is an irreducible normal scheme, i.e., all of its local rings oy are domains, integrally closed in its quotient field. Assume that for all x G X, no non-zero element of 0/(ж))У is a 0-divisor in ox7. Then for every pair of points x\ G X, yo G Y such that f{x\) G {yo}, there is a point xq G f~1(yo) such thatxi G {xo}. This is very nearly the same as saying that / is an open map. In fact, if / is open, the conclusion of the Theorem has to hold; and conversely, whenever you know that under / constructable sets go to constructable sets, then the conclusion of the theorem implies / is open. We also need a new form of Noether's Normalization lemma: Normalization lemma over R. Let Z С Р£ be an (irreducible closed subset of dimension r and let Z = i(Z). There exist (r + 1) linear forms £o(X),... ,£r(X) with coefficients in R such that the subset L С Рд defined by (,q = ... = lr = 0 is disjoint from Z. Let т be the projection Then giving Z any structure of closed subscheme, т : Z —> Рд is a finite surjec- tion morphism. Proof. Once we find £o,... ,£r such that L П Z = 0, the last part is easy: r is finite by Prop. 6, §7. Therefore, it is surjective if the image contains the generic point of Рд. But restricted to P£, r defines a finite morphism from the 7 In the language to be introduced in Ch. 3: all associated points of ox lie over the generic point of Y.
130 II. Preschemes r-dimensional variety Z to P£. If this were not dominating, its fibres would be positive dimensional by the results of Ch. I, §7. Therefore r is surjective. To construct the ^'s, let s be the smallest integer such that Z is disjoint from to = ... = ts = 0 for some set of s + 1 linear forms. Since Z is disjoint from the empty set defined by Xq = ... = Xn = 0, therefore s < n. Let L' be 4 = ... = 4 = 0 and let r' : P£ - V ->■ P^ be the projection. We want to show s < r (in fact, s < r is easily seen to be impossible by the first argument). Assume s > R, and let Z' = t'(Z). Z' is closed by Prop. 6, §7, and if Z' = t'(Z) in P|, then Z' must be just the closure of i(Z'). Since s > r, i(Z') is still a proper closed subset of P|. But then i(Z') is contained in some hypersurface F = 0 of P|. Normalize the coefficients of F so that they all lie in R, but not all in M. Then F = 0 defines a subset H С P^: i.e., let Я П Ui = V {{F/Xf)), d = degree F. Since H D i(Z'), it follows that H D Z' too. Furthermore H ^ TSL since the equation ir(F), reduced mod M, is not identically 0. Choose a closed point (ao,... ,as) in P^ — H: assume for simplicity that a0 ф 0. Let a; G R be elements such that тг(о;г) = ai and let 17 С Рд be the section of p defined by v al v" v as v Ai — Ao, • • • ,Л8 — Ao • a0 a0 Then U П Z' = 0, since {«o? • • • > Q«) is the only closed point of 17 and it is not in Z'. On the other hand, back in Рд, г i /-i/тл f locus of zeroes of 1 Call this L". Then L" is disjoint from .£ and is defined by only s — 1 equations. This shows that s was noi as small as possible, and the lemma is proven. □ The theorem now follows easily: suppose Z С Р£ is given. First of all, we may assume that Z is irreducible, since the general case reduces to this one right away. Let Z = i(R) and choose a finite surjective morphism: Spec (R) (r = dim.k Z). Let W = -EflPJ, let у be any closed point of W, and let yo = т(у). If y0 = (ao,... ,ar), then choose ai € R such that тг(а;) = Щ. The a's define a closed point (ao, ■ ■ ■, ar) 6 P£. Let xo = i ((a0,..., ar)). Then y0 б {жо}- By the Going-Down Theorem, choose a point x G t~1(xq) such that у G {ж}. Since жо lies over the generic point of Spec (R), so does x. Therefore x = i(x) for some point x G Z. т restricts to a finite morphism:
§8. Specialization 131 Spec (к). Then since Тк{х) is closed, x is a closed point too. In other words, x G Z(k) and у is the point {i(x)} П P£, i.e., p(x). This proves that у 6 p(Z(k)), hence П Definition 2. The subset VF of P£ is called the specialization of Z with respect to R. Corollary (of proof). If Z is irreducible and dimZ = r, then all components of its specialization W are r-dimensional. Proof. Assuming that Z is irreducible and r-dimensional, we constructed a finite surjective morphism r : i(Z) —t Рд. Intersecting with P£ this gives a finite surjective morphism rL : W —¥ P£ . We would like to know that every component of W dominates P£, which would show that it is r-dimensional. Suppose this did not happen: let wq G W be the generic point of a component of W such that tl{wq) ф у, у the generic point of Рд. The going-down theorem applies to r : i(Z) ->■ Рд. Since tl(w0) G {y}, there is a point w G т~1(у) such that wq G {w}. But then w must be a point of W over the generic point of P£ and wq G {w} implies that wq is not generic. □ To actually determine W, here is what you have to do: describe Z by its homogeneous ideal Аск[Х0,...,Хп] . Then take the intersection An R [Xo,..., Xn] and take the reductions mod M of all the polynomials in this intersection: B = ir{AnR[X0,...,Xn]} . Then W is the closed subset V(B). Proof. In fact, for all F G AnR[X0,..., Xn], F = 0 defines a closed subset H of Рд. Since FeA,HD i(Z). Therefore Я D 1{Z) and Я П P^ D W. But Я П P^ is the locus ttF = 0. Therefore W С Conversely, suppose x G P£ - W. Then ж G Рд - г"(^), and by the lemma in Th. 2, §7, there is a homogeneous polynomial F G R[X0,... ,Xn] such that F = 0 on i(Z), F ф 0 at x. Therefore F G An R[X0,... ,Xn] and ttF G Б. Since ttF(x) фО,х<£ V{B). U
132 II. Preschemes Example V. Let Z be a hypersurface F (Xo,... ,Xn) = 0. Normalize the coeffi- coefficients so they all lie in R, not all in M. Then W is the hypersurface nF = 0. (In fact, A = F-k[X]. Thus AnR[X] D F-R[X]. Conversely, if G G AnR[X], then G = F • H for some H G A;[X]. By Gauss' lemma, H G R[X], hence G G F • R[X]. Therefore тг{А П R[X]} = nF ■ L[X].) More specifically, take Z to be the hyperbola X1X2 = olXq in P^, where a G M. Then W is given by X1X2 = 0, i.e., it is the union of 2 lines. Thus W can be reducible even when Z is irreducible. As a matter of fact, the only other elementary property that W does have when Z is irreducible is that it is connected. More generally, the connectedness theorem of Zariski states that if Z С Р£ is any closed connected set, then its specialization over R is still connected. Example W. One of the reasons why specialization is so important is that char (L) may be finite when char (k) = 0. For example, take к to be the field of algebraic numbers, and let R be a valuation ring such that R П Q = Z(p). Then L is an algebraic closure of Ъ/рЪ. Take n = 2 and let Z be the Fermat curve: yk 1 yk yk Its reduction mod R is again the plane curve X± + X$ = Kq. This equation is irreducible, however, only when p / k. If pv \ k, pv+1 / k, then let к = pv • ко. We find: xf + xl - xl = (x}° + x%° - x£°y , and W is a plane curve of lower order &o, "taken with multiplicity pv". This example makes it clear that the process of specialization should be further refined to take this multiplicity into account: Definition 3. Let Z be a closed subscheme of P£ defined by a homogeneous ideal A. Let В = ж {А П R [Xo,..., Xn]}, and let В define the closed subscheme W of P£- Then W is the specialization of Z with respect to R. Proposition 2. Let Z be a closed subscheme o/P^. Then there is one and only one closed subscheme Z С Рд such that: 1) For all x G Z, the ring ox z is a torsion-free R-module, 2) Z = Z xPn P£ = Z xSpec '(л) Spec (k). Moreover, the specialization of Z is the closed fibre of Z over b. Proof. Suppose that a subscheme Z in Рд is defined by a collection of ideals Then condition B) on Z means that the ideal Ai • К с К fe,..., ^ч should be the ideal I (Z П Ui). And condition A) is equivalent to requiring that
§8. Specialization 133 be torsion-free for all i; or that for all a € R, а ф 0, and / G R [Xo/Xi,..., Xn/Xi], a • f G Ai implies f e Ai. But once the ideal A* = I (Z П Ui) is given, I claim the only ideal Ai with both these properties is precisely the ideal Ai = A*nR[X0/Xi,...,Xn/Xi] . In fact, say Bi has both properties. Then A* = Bi • К implies В; С А* П R[Xo/Xi,...,Xn/Xi] and that for all / G A*, a ■ f G Bi for some non-zero a e R; and if / G А* П R [Xo/Xi,..., Xn/X;] so that a- f e Bt, some a, then the torsion-freeness implies that / itself is in Bi. This shows that Z is unique. Now define Z by these ideals Af. we leave it to the reader to check that on Ui П Uj, these define the same subscheme. The fibre of Z over b is defined by the affine ideals А{ = ж {А* П R [Хп/Х{,..., Хп/Х{]} С L [Хо/Х{,..., Хп/Х{] . But W is given by the ideals j f-fjr f\TT.\ J f I vd J ^ ^-^ ' ' -^ 1/^-0) • • • j -Л-nJ I |J/ г / homogeneous, degree d J Г /f/xdx /€АПД[*о,...,Х„] ) \ W {> f homogeneous, degree d j = ж{I(ZпUi)пR[Xo/Xi,...,Xп/Xi}} D Example X. In P^, let Z be the union of the 3 points z = у = 0; x = 0, у = а; x = а, у = 0, where x = X\/Xq, у = X2/Xq are affine coordinates and a G M. Then the homogeneous ideal of Z is generated by: Xi (Xi - aX0) Reducing this ideal mod M, we get (X?,XiX2,X2): therefore W is the single point x = у = 0 in P£ with structure sheaf k[x, y]/ (x2,xy, y2) (cf. Ex. P, §5). It is interesting to work out other examples, where n points Xi are given in P^, such that p(x\) = ... = p(xn) = @,0), and see which ideal А С к[х, у] of codimension n comes out. Example Y (Hironaka). In P|, let Z be the twisted cubic which is the image of Xo = 1 Xi = t3 X2 = t2 X3 = at ,
134 II. Preschemes a G M. The ideal A of Z is generated by: - i — X2X3, a2X2Xo — aX2 — X\X3 . It also contains the element X2X0 - X|. Therefore ir {Ar)R[X0,Xi,X2,X3]} contains: X3, X2X3,X\X3, and XxXq — X2 . These elements define, as a set, the plane cubic curve C: X3 = 0, X^Xq = X2. Also, giving t a value in R, one sees that p(Z(k)) contains all the points (l, P3, /52,0), P G L. Since these are dense in C, it follows that C, as a set, is the specialization of Z over R. But the ideal В = (X3, X2X3, X\X3, X?Xo — X$) is itself the intersection: В = (X3,X*X0 - Xl) П (Хг,Х2,Х3J . It can be checked that В = тг{АП R [Xo,... ,X3]} so that the scheme spe- specialization of Z has nilpotent elements in its structure sheaf at the one point P = A,0,0,0). The reason for this can be seen in the affine coordinates x = Xi/XOi у = X2/X0, z = X3/XQ. Then Z, like C, contains A,0,0,0). After specialization, the whole curve Z is squashed into the horizontal plane z = 0. But at P, and only at P, Z has a vertical tangent. For this reason, it resists being squashed there and gets an embedded component in the end: More concretely, suppose / is a polynomial in x,y,z, over R, that vanishes on Z. Then not only is /@,0,0) = 0, but |f @,0,0) = 0. Therefore the same must hold for polynomials over L that vanish on the scheme-specialization of Z. In particular, z itself cannot vanish on the specialized scheme.
§8. Specialization 135 In practice, specializations usually arise like this: one is given a Dedekind domain D and a subscheme Z С Р^. For example, a set of homogeneous Dio- phantine equations define a subscheme Л С Р^- Or, in a geometric situation over an algebraically closed field L, one might have a subvariety Z С P£ х AL. Then suppose you want to compare the generic fibre of Z over D with some closed fibre of Z over D. Let ко be the quotient field of D and тг : D ->• Lq a map of D onto D/M, some maximal ideal M. Then by fibre product, we get П z П n Spec (A;o) D Spec (D) Spec (Lo) Then Z sets up a link between Zk0 and Zl0 which is very interesting. The point is that just as in Prop. 2, if oz is a sheaf of torsion-free D-modules, then Zk determines Z, and therefore determines Zl0. This is just the non-algebraically closed generalization of the concept of specialization. It can be reduced to the previous case by taking fibre products. In fact, let к be an algebraic closure of &o; using the place extension theorem, let R С к be a valuation ring, quotient field к, dominating Dm- The residue field L over R is an algebraic closure of Lo. We obtain Spec (A;o) Spec (D) Spec (Lo). Assuming for a minute that Z* is torsion-free over R, it follows from Prop. 2 that the subscheme Zl С Р£ is the specialization over R of the subscheme Zk С P£. Zl0 is just an Lo-structure on Zl and Zk0 is a A;o-subscheme on Z*.; so the added touch here is that having D around lets us specialize a &o-structure to an Lo-structure as well. That Z* is torsion-free over R follows from the following (by taking R2 = DM, Ri = R, M = oX:Z): Lemma. Let R2 be a valuation ring, and R\ an R2 -algebra. Let M be a torsion- free R2-module. Then M ®д2 Ri is a torsion-free R\-algebra.
136 II. Preschemes Proof. Recall that a finitely generated torsion-free module over a valuation ring is free. Therefore M, as a torsion-free module, is the direct limit of free R2- modules. Therefore M ®r2 R\ is a direct limit of free R\-modules. Therefore M ®r2 Ri is torsion-free. □
III. Local Properties of Schemes So far, besides mere definitions, the only properties of varieties that we have studied are i) their dimensions, and ii) whether or not they are complete. In this chapter, we will analyze 2 local concepts: i) when is a variety "manifold-like" at a given point, and ii) if it is not manifold-like, how many "branches" does it have at a given point. The main problem here is to carry over our intuitive differential- geometric and topological ideas into a purely algebraic setting. In some cases, as with the module of differentials, this involves setting up an algebraic theory in close analogy to the geometric one. In other cases, as with the concepts of normality and flatness, an idea is introduced into the geometry primarily because of the simplicity and naturality of the algebra. In these cases, the geometric meaning is hidden and often subtle, but the first aim of the theory is to elucidate this meaning: one should expect a true synthesis of algebraic and geometric ideas in cases like these. Many different techniques have been used to develop the local theory of schemes. For example, there are: i) projective methods. These involve reducing the question to graded rings - either by adding a new variable to make your polynomials homogeneous, or by passing from a filtered object to its graded object. This enables you to use global projective methods in a local situation. ii) power series methods. These involve taking completions of our rings in 7-adic topologies and a close comparison of the original ring with its completion. The Weierstrass preparation theorem is a key tool here. iii) homological methods. These involve the study of syzygies, i.e., projective res- resolutions of modules, the dual notion of injective resolutions, and are adapted to concepts like that of 0-divisor which have good exact-sequence transla- translations. These methods often overlap (e.g., (i) and (iii) in concept of Hilbert poly- polynomial). I will try to give a sampling of all of them, although we prefer (i) as being closer to the spirit of this book. To do this, however, I have had to assume many more standard results in Commutative Algebra than in previous chapters. My advice to the reader, if he has not seen these results before, is to skip back and forth between this book and books on Commutative Algebra, using the ge- geometric examples to enrich the algebra, and the clean-cut algebraic techniques to clarify the geometry. D. Mumford: LNM 1358, pp. 137-223, 1999. © Springer-Verlag Berlin Heidelberg 1999
138 III. Local Properties of Schemes §1. Quasi-coherent modules Almost all local theory involves modules as well as rings and ideals. The purpose of this section is to put modules in their convenient geometric setting. This is really just a technical digression before we get on to the more interesting geometry. Definition 1. Let (X,ox) be a scheme. An ox-module is a sheaf F of abelian groups on X plus, for all U С X open, а Г (U, ox)-module structure on F(U, F), such that if V С U, the diagram: F(U,ox) x r{U,F) - r(U,F) F(V,ox) x F(V,F) * F(V,F) commutes. A homomorphism of ox-modules is a sheaf homomorphism that pre- preserves the module structure on every open set. Basic Example. Let X =_ Spec R, and let M be an .R-module. We define an associated ox -module M in the same way in which we defined ox itself. To the distinguished open set Xj, we assign the module M/, checking that when Xf С Xg we get a natural map Mh ->• M/. We check that lim M/ = Mp, [P]£Xf and that our assignment gives a "sheaf on the distinguished open sets", i.e., satisfies Lemmas 1 and 2 of §1, Ch. 2. (The proofs are almost word-for-word the same as in the construction of ox.) We then can extend to a sheaf on X in the canonical way, letting F(U, M) be the set of all s G П Mp which are [P]€U given locally by sections over distinguished open sets. We put an ox-module structure on this set by restricting to F(U,ox) x F(U,M) the natural map П Rp x Y\ Mp ->■ П Mp and observing that it lands us in F(U, M)\ clearly, this commutes with restriction, so we do have an o^-module. Proposition 1. Let M,N be R-modules. The natural map hom£x (M,iV) ->• потд(М, N) gotten by taking global sections is a bisection. Proof. Given a homomorphism </? : M ->• N we get induced homomorphisms <Pf : Mf ->• Nf for every /. This gives us homomorphisms F(U, M) ->• F(U,N) on every distinguished open set U, compatible with restriction; that necessarily induces homomorphisms jor all open sets U. It is easily seen that this process gives an inverse to hom(M, N) -> hom(M, JV). □ Corollary. The category of R-modules is equivalent to the category of ox- modules of the form M.
§1. Quasi-coherent modules 139 Proposition 2. The sequence M -¥ N ->• P is exact if and only if the sequence M -t N ->■ P is exact. Proof The sequence of sheaves is exact if and only if it is exact at all stalks, i.e., for all primes P, Mp ->• Np ->• Pp is exact. This is equivalent to exactness of M -t N ->■ P by a standard theorem of commutative algebra (cf. Bourbaki). □ Corollary. If M —>• N is a homomorphism, then its kernel, cokernel, and image are of the form К for some R-module K. Theorem-Definition 3. Let X be a scheme, T an o_x-module. The following are equivalent: A) For all U С X affine open, T\v = M for some Г(U,оx)-module M. B) There is an open affine cover {Ui} of X such that for all i, T ц = Mi for some Г (Ui, оx)-module Mi. C) For all x £ X, 3 a neighbourhood U of x and an exact sequence of ox \u- modules Qx\u -^Q.x\u -^ (here the exponent in parentheses denotes direct sum). For allVcU open affines, the canonical map is an isomorphism. An Oj^-module with these properties is called quasi-coherent. Proof. Let's check D) =► C) =► B) =► A) =► D) . D) =$■ C): Before we prove this, we really ought to define a direct sum of ox-modules. In fact, if {Fa} is a collection of ox-modules, then @Fa is the sheaf associated to the presheaf: For all affine U, check that: Now assume D) and let x G X. Take any open affine neighbourhood U of x and let R = F(U,ox). The .R-module r(U,T) has a presentation: » r(J) _>. г(С/, T) —»• 0 . This induces the top sequence of homomorphisms:
140 III. Local Properties of Schemes a Дх It; о . There is a canonical homomorphism a : r(U,T) ->• .T7 ^ that then gives the bottom sequence. To prove C), it suffices to check that a is an isomorphism. But a is given, on distinguished open subsets Uf of U by: which is an isomorphism by D). C) ==>■ B): Cover X by open affines Ui = Spec (R{) on which there are exact sequences: 0. Since ох у = R\l\ ox v = R(Ji\ the cokernel of ф{ is a module of type K, by the Cor. of Prop. 2 and this is B). B) ==>■ A): Notice first that if U is an open affine set such that J7\JJ = M for some Г (С/, ox)-module M, then for all / G Г (U,ox), T\v = Mf. Therefore, starting with condition B), we deduce that there is a basis {Ui} for the topology of X consisting of open affines such that T\v_ = M{. Now if U is any open affine set and R = F(U,ox), we can cover each of these C/j's by smaller open affines of the type Ug, g G R. Since Ug = (Ui)g, T v is still of the form (Mi)g. In other words, we get a finite covering of U by affines U9i such that = Ni, Ni an R9i -module. For every open set V С U, the sequence 0 (V HUgi is exact. Define new sheaves Ti* and T*i by:
§1. Quasi-coherent modules 141 ,r{) = r{vnugt,?) Then the sequence of sheaves is exact, so to prove that T is of the form M, it suffices to prove this for T* and F?j. But if Mf is Mi viewed as an Я-module, then T* = Mf. In fact, for all distinguished open sets Ug, ,r*) = r(uanuat,p) The same argument works for the -T^/s. A) =$> D): Let U = Spec (R), V = Spec (S) be open affines such that resent th R(D ЛЦ, R(J) -^ м —». 0 . UDV. Let F\u = M. Present the Л-module M via: This gives us: {J) Qx\b -+Zx\i, и 0. By restriction, we get an exact sequence on V: But F\ v is an o^-module of type N, so we get an exact sequence of 5-modules: On the other hand, the first exact sequence gives, by tensor product, the exact sequence: M 0
142 III. Local Properties of Schemes But P and a' are the same homomorphisms. Hence N and M®r S are cokernels of the same map, hence they are isomorphic. □ Example A. Let X = Spec (R), where R is a principal valuation ring (= discrete, and rank 1). Spec (R) has one open and one closed point: Ж1,жо- Let U be the open set {x\}. Then a sheaf T on X just consists in 2 sets Mo = Г{Х,Т) Mi = r(U,F) and a restriction map res : Mo ->• M\. If К is the quotient field of R, and the ox -module T consists in an .R-module Mo, a if-module Mi, and an .R-linear restriction map res : Mo ->• M\. By condition D) of the theorem, it is quasi- coherent if and only if the induced map Mo ®r К —у Mi is an isomorphism. In this case, T = Mo. T could fail to be quasi-coherent in two different ways. There might not be enough global sections, e.g., Mi = К, Mo = @); on the other hand, all the sections might be supported only on the closed point, e.g., Mi = @), Mo = R. In a quasi-coherent sheaf, the closed point can support only the "torsion" part, i.e., the kernel of Mo ->• Mo ®r K. Note that an ox -ideal Q is quasi-coherent in this definition if and only if it is quasi-coherent in our former sense (Ch. 2, §5). Moreover, kernels, cokernels, and images of maps between quasi-coherent modules are quasi-coherent. For by the theorem quasi-coherence is a local condition, and we know the result for affine schemes. Yet another way of combining quasi-coherent ox-modules is this: Let Q be a quasi-coherent o^-ideal, T a quasi-coherent o^-module. Let Q • T be the least submodule of T containing all elements a • s (a G r(U, Q), s G Г(и,^)). Then Q • T is also quasi-coherent, and if U is any open affine, Q • T\v = I • M where / = r(U, Q), M = Г(и, Т). [The first statement follows from the second, whoose proof is left as an exercise.] As a less trivial example, we will define for every morphism X —У Y a very important quasi-coherent ox-module i?x/y» called the sheaf of relative differen- differentials. First, we recall the commutative algebra involved. Let A ->• В be a homomorphism of rings. We define a .B-module Bb/a by taking a free .B-module on the symbols {dC | /5 G B} and dividing out by the relations a) d(P1+p2)=dp1+dp2 b) For a,CeB, adfi + fida = d{aC) c) dp = 0 if P comes from A. Notice that for all .B-modules C, i / ^ ^i\ ~ Г module of Л-derivations homB (ПВ/А,С) = |
§1. Quasi-coherent modules 143 In fact, if т : Qb/a -^ С is a .B-homomorphism, then /3 ->• r(d/3) is an A- derivation from В to C. And if D : В ->• С is an Л-derivation, define r by Let В ®a В —V В be the map /3i ® /32 ->■ /З1/З2, and let / = kerS. / is a В ®aB module, and I/I2 is а В ®a ^//-module, that is, a Б-module (since multiplication by /3 ® 1 and by 1 ® /3 give the same result in ///2). Theorem 4. 7//2 is canonically isomorphic to Qb/a- Proof. To map Qb/a to i"//2, we let d/3 go to 1 ® /5 - /5 <£> 1 in /. Clearly a) and c) are satisfied. And d(afi) goes to: \ ® af3 - af3 ® \ = 1®аР- This is to be the image of @da + ad@, so the relations b) are satisfied too. So we get a map Qb/a -* I/I2- To get a map backwards, we define a ring R = В © Qb/a, where В acts on Qb/a through the module action and elements of Qb/a have square zero. Define a map В х В ->• E by (/Зь/Зг) '—> (/3i /?2, /3i rf/Зг) • This is clearly biadditive; it is Л-bilinear, for if a comes from A, and d (а/Зг) = ай/Зг- Hence this induces a map В ®а В -t E.It is easily checked that this is a ring homomorphism. Moreover, / goes into Qb/a (by definition, / is the set of those elements whose images have first coordinate 0; and all squares are 0 in Qb/a, so the map must factor via I/I2 ->• Qb/a)- It is easy to see that our two maps are inverses. □ In some books, the module of differentials is defined as the dual of the module of derivations, i.e., the double dual of Qb/a- Double dualizing will in general lose information, so the definition here, due to Kahler, is preferable. Example B. Let A = к, В = к [Х\,..., Xn]. Then Qb/a is a free .B-module with generators dX\,..., dXn, and More generally, if B = k[X1,..., then Qb/a is generated, as .B-module, by dX\,... ,dXn, but with relations:
144 III. Local Properties of Schemes Example C. Let A = k, and let В = К D к be an extension field. Then the dual of Ок/к (over K) is the vector space of ^-derivations from К to K. In particular, if К is finitely generated and separable over k, the dimension of both of these vector spaces is the transcendence degree of К over к (cf. Zariski-Samuel, vol. 1). If n = tr.d.(K/k) and /i, ...,/„€ A", then dfi,..., dfn are a basis of {all ^-derivations of К that 1 annihilate /i,..., /„ are 0 J {/1»• • •»/n are a separating 1 transcendence basis of К /к J On the other hand, if char к = p and К = k(b) where bp = a G к, then f2s/A is the free if-module generated by db. Example D. Let A = к, В = k[X,Y]/(X • Y). Then by Ex. B, dX and dY generate Bb/a with the one relation XdY + YdX = 0. Consider the element и = XdY = -YdX. Then Xu = Yu = 0, so the submodule M generated by u> is ku>, a one-dimensional fc-space. On the other hand, in ft/M, we have XdY - YdX = 0, so J2/M ^ В • dX © Б • dF. Note that B-dX * B/(Y) * QBx/k, where Bx = B/(Y) ^ ВД; likewise, Я-dF ^ /2By/fc. That is, the module of differentials on В (which looks like ~r) is the module of differentials on the horizontal and vertical lines separately extended by a torsion module. (One can even check that the extension is non-trivial, i.e., does not split.) Problem. Let R be a finitely generated A;-algebra. Show that flRjk = @) if and only if R is a direct sum of finite separable extension fields over k. Hint: Let P be a prime ideal in R and let L = quotient field of R/P. Show ^L(k) = @)? and hence L is finite separable over к. Thus show that R satisfies the d.c.c. Finally show that nilpotents can't occur either. All this is easy to globalize. Let a morphism / : X ->• Y be given. The closed immersion globalizes the homomorphism S : В ®а В ->• В. In fact, if U = Spec (В) С X and V = Spec (А) С Y are open affines such that f(U) С V, then U xv U is an open affine in X Xy X, A~l (U Xy U) = U, and we get a commutative diagram: F(U,ox) +-^Lr{UxvU В -* B®AB .
§1. Quasi-coherent modules 145 Let Q be the quasi-coherent oXxyX-ideal defining the closed subscheme Z = A(X). Then Q2 is also a quasi-coherent oXxyj^-ideal, and Q/Q2 is a quasi-coherent o^x^x-module. It is also a module over qXxyx/Qi which is oz extended by zero. As its stalks are all 0 off Z, Q/Q2 is actually a (Z, oz)-module. It is quasi-coherent by virtue of the nearly tautologous: Lemma. If S С T are a schema and a closed subscheme, and if T is an os- module, then T is a quasi-coherent os-module on S if and only if T, extended by О onT — S, is a quasi-coherent oT-module on T. Definition 2. f2x/Y is the quasi-coherent ox-module obtained by carrying Q/Q2 back to X by the isomorphism A : X -—t Z. Clearly, for all U = Spec (В) С X and V = Spec (А) С Y such that f(U) С V, the restriction of f2x/Y to U is just ^b/a- Therefore, we have globalized our affine construction. There are many unsolved problems concerning flx/y- For example, if Y = Spec (k) and X is a variety over the field k, when is Qx/y torsion-free? We can now fill in the gap in Chapter 2, §7: Proposition 5. To prove that a morphism f : X —t Y is affine or finite, it suffices to check the defining property for the open sets Ui С Y of one affine covering. Proof. Let Y = UUi, Ui = Spec (R{) and assume f~1(U{) is affine for all i. First of all, note that the sheaf f*(ox) is a quasi-coherent sheaf of oy-algebras such that, if f~l(Ui) = Spec (Si), then regarding Si as an i^-module, To see this, notice that we have natural maps: Г ((Ui)g,Si) = (Si)g —► Г (Г1 (Щд ,ox) = Г ((Ui)gJ*(ox)) for all g e R{. And since Z (Щд <* f-\Ui) xVi (Щд, Г (Г1 (Ui)g ,ох)=Г (ГЧЩ,ох) ®Ri (Ri)g - (Si)g therefore all these maps are isomorphisms. Now let U = Spec (R) с У be any open affine subset. Let S = Г (U, f*(ox))- I claim f~l{U) = Spec E), which will prove that / is affine. To begin with, the homomorphism defines a morphism of schemes over R:
146 III. Local Properties of Schemes * Spec (S) U = Spec(.R). Look at the restriction V>i of гр to the parts of f~l(JJ) and Spec E) over the open subset U П Ut of U. First of all, j~x{U) П f'^Ui), as a subset of Z^), is isomorphic to and since all of these are affine schemes, it is affine. If R\ = Г (U П Ui,oY), its coordinate ring is just <%®я. i^, which (by D), Theorem 3) is Г (U П [/;, f*(ox)). On the other hand, again by D), Theorem 3, which is the ring of Spec (S) Хц (U П Ui), i.e., the inverse image of U П Ui in Spec (S). Putting all this together means that tpi is a morphism on affine schemes such that -ф? is an isomorphism. Therefore if>i and hence ф are isomorphisms. Finally, assume Si is a finite i^-module. Then using the above notation, <% ®Ri Щ is a finite i^-module, hence so is S ®д R'{. Now U is covered by a finite number of these subsets UDUi: say by U DUi,... ,U r\Un- Then build up a finitely generated submodule S* of S by throwing in enough elements to generate S<S>rR'{ over R\ for 1 < i < n. Then the submodule S* of S is big enough so that for all prime ideals P С R, the induced map Sp ->• Sp is surjective (i.e., localize in 2 stages, from R to Щ and then Rp, for a suitable i). Therefore S* = S (cf. Bourbaki), so S was a finite .R-module anyway. □ Problem. Suppose / : X ->• Y is a morphism of schemes such that for all U С Y open and affine, f~1(U) admits a finite open affine covering (i.e., f~1(U) is quasi-coherent). Let T be any quasi-coherent o^-module. Prove that f*(T) is a quasi-coherent oy-module. §2. Coherent modules For one more section, we must continue to study modules for their own sakes, this time with finiteness assumptions. Definition 1. A scheme X is noetherian if for all open sets U С X, the partially ordered set of closed subschemes of U satisfies the descending chain condition. This implies first of all that for all open sets U in X the closed subsets of U satisfy the d.c.c, hence U is quasi-compact. In other words, X is a noetherian topological space. As in Ch. I, this implies that every closed subset Z С X can be written in exactly one way as a finite union of irreducible closed subsets
§2. Coherent modules 147 Zi,...,Zn such that Z{ ~fi Zj. These Zi are called, as before, the components of Z. Secondly, if U = Spec (R) is an affine open set, then in view of the order-reversing bijection of R, X being noetherian implies that the ring R is noetherian. Moreover, conversely, Proposition 1. Let X be a scheme. If X can be covered by a finite number of open affine sets Ui = Spec (Ri) such that each Ri is a noetherian ring, then X is a noetherian scheme. Proof. Let U be any open subset of X, and let Z\ D Z2 D Z3 D ... be a chain of closed subschemes. The open subset U П Ui of Ui can be covered by a finite number of distinguished open sets (Ui) , gij G Ri, since Ri is noetherian. Therefore U itself is covered by a finite number of open affines (Ui)g.. which are the Spec's of the noetherian rings (R%)9i.- By the noetherianness of (Ri)gi. the chain of subschemes of (Ui)g..: \zkn(Ui)g..} I 9t} ) k=l,2,3,... is stationary for each i, j. By the finiteness of the covering, this implies that the chain {Zk} itself is stationary. □ Definition 2. Let X be a noetherian scheme, T a quasi-coherent ox-module. Then T is coherent if for all open affines U С X, F(U,T) is a finite Г (U,ox)- module. As usual, to check that a quasi-coherent T is coherent, it suffices to check that r(Ui,T) is a finite Г (Ui,ox)-modu\e for the open sets Ui of one affine covering {Ui} of X. [In fact, if this is so, then by D), Theorem 3, §1, F(V, T) is a finite Г (V, ox)- module for all affine subsets V of any Ui. Then if U is any affine, M = F(U, T), and Mi = F(Ur\Ui,T). Mi is a finite Г (U П Ui, oA-)-module. Cover U by a finite set of U П C/i's: say U П U\ ,...,U П Un. Then to generate M over Г (U, ox), it suffices to take enough elements of M so that their images in each Mi A < i < n) generate Mi over Г (U DUi,ox).] Note that all quasi-coherent ox-ideals are coherent. More generally quasi- coherent sub and quotient modules of coherent ox-modules are coherent. If / : X ->• Y is a morphism of finite type, then flXjy is a coherent ox-module. If / is affine, then f*(ox) is coherent if and only if / is a finite morphism. Definition 3. Let T be a coherent ox-module on the noetherian scheme X, x G X is an associated point of T if 3 an open neighbourhood U of x and an element a G F(U,T) whose support is the closure of x, i.e., sy ф 0 «=» у G {x}, all у G U . In other words, for all open U С X and all s e F(U,T), look at the support of s : {y G U | sy ф 0}. Call this W. It is a closed subset of U since, by definition of
148 III. Local Properties of Schemes the stalk Ty, sy = 0 only if s is 0 already in an open neighbourhood of y. Then the generic points of the components of these W's are the associated points of Notice that if X = Spec (R), T = M, then for all prime ideals P С R, [P] is an associated point of T if and only if P is an associated prime ideal of M. (For the definition and theory of these, see Bourbaki, Ch. 4, §1; or Zariski-Samuel, vol. 1, pp. 252-3, where these are called the prime ideals "associated to the 0 submodule of M".) The only problem in proving this is to check that, when X is affine, to find the associated points of T it suffices to look at the supports of global sections s G Г(Х, Т); we leave this point to the reader to check. One of the main facts in the theory of noetherian decompositions is that a finite .R-module has only a finite number of associated prime ideals. Since any noetherian scheme can be covered by a finite set of open affines, this implies Proposition 2. Let T be a coherent ox-module on a noetherian scheme X. Then T has only a finite number of associated points. The associated points of ox itself are obviously very important: Proposition 3. Let X be noetherian. The generic points of the components of X - for short, the generic points of X - are associated points of ox • On the other hand, if U С X is open, s G Г(и,ох) and Z = {y G U \ sy ф 0} is the support of s, and if Z does contain any generic points of X, then s is nilpotent. In particular, if X is reduced, the generic points of X are the only associated points ofox. Proof. Since X itself is the support of 1 G Г (X, ox), the generic points of X are associated points of ox. Now suppose s G Г (U,ox). If s(x) ф 0 for some x EU, then Us is non-empty. But U8 is open, so it will contain some generic point of X. On the other hand, Us С Z since s(y) ф 0 =>■ sy ф 0. Therefore Z contains a generic point of X. Therefore conversely if we assume that Z does not contain any generic point of X, it follows that s(x) = 0, all x G U, and therefore s is nilpotent. □ When ox does have non-generic associated points, their closures are called the embedded components of X. The simplest case is in Example P, Ch. II: X = Spec (k[x,y]f (x2,xy)) , with the origin as an embedded component, since it is the support of the nilpotent x. Among coherent modules, the most important are the locally free ones: Definition 4. Let X be a scheme. An ox-module T is locally free of rank r if there is an open covering {[/;} of X such that
§2. Coherent modules 149 If T is locally free of rank 1, it is called invertible8. Locally free modules are the most convenient algebraic form of the concept of vector bundles familiar in topology and differential geometry. And invertible sheaves are the algebraic analogs of line bundles. To see this clearly, suppose we mimic literally the usual definition of vector bundle in scheme language9. We can then check the sort of object we get is the equivalent of a locally free module. Definition 5. Let X be a scheme. A vector bundle of rank r on X with atlas is 1) a scheme E and a morphism 2) a covering {Ui} of X, and 3) isomorphisms of schemes/U{: 7Г :E Ar x such that for all i,j, if we restrict 4>i and (f>j to Ui П Uy. so as to get a morphism (after composing p\): Ar x (Ui П Uj) ^4 Ar then the dual homomorphism ip*j takes the coordinates X\,..., Xr on Ar into linear forms in the X's: e=i where a^f e Г(UinUj,ox). This terminology stems from the fact that if Я is a ring, M an Я-module, then M is locally free of rank 1 if and only if there is an Я-module iV such that M <8>я N = R (cf. Bourbaki). The reader who is not familiar with vector bundles can skip the discussion that follows if he wants.
150 III. Local Properties of Schemes (Cf. Atiyah, К-theory, p. 1; Auslander and Mackenzie, Introduction to Dif- ferentiable Manifolds, Ch. 9). Thus a vector bundle is just a scheme over X locally isomorphic to Ar x X. The simplest way to define a vector bundle, without distinguishing one " atlas", is to simply add the extra condition that the atlas is maximal, i.e., every possible open V С X and isomorphism ф : тг (V) -^> ArxV compatible with the given {Ui, ф{] is already there. This makes the indexing set of i's unaesthetically large, but this is unimportant. The classical way of obtaining an o^-module from a vector bundle E/X is by taking the sheaf £ of sections of E: For all U С Х, . c\ _ \ set of morphisms s : U -¥ E 1 ^ ' ' ~ 1 such that 7Г • s = lr/ f To make £ into an ox-module, first look at sections of E over subsets У of a set Ui in the atlas: Then via фi sections of тг 1 (Ui) over V and sections of Ar x Ui over V correspond to one another. But sections s of Ar x Ui over V are given by r-tuples of functions /i = si (X\), ...,fr = sl(Xr) in Г (V,ox). In other words, we get isomorphisms: sections of the scheme 1 ^ Г sections of the r~l(Ui) over V } = \ sheaf orx of V hence This induces an ox-module structure in £\v . The reader can check that the compatibility demanded between фi and ф^ over UiHUj is exactly what is needed to insure that the 2 o^-module structures that we get on £\v.nU. are the same. The main theorem in this direction is that every locally free ox -module arises as the sheaf of sections of a unique vector bundle (up to isomorphism).
§2. Coherent modules 151 The point here is this: both E and £ are structures which are locally "trivial", i.e., isomorphic to Ar x U or ox\v. Therefore, to get either E or £ globally is only a matter of taking a collection of pieces тг~1(С/г) or £ v , each "trivial" as before, and patching them together. To prove that the two sets of global objects that we get in this way are isomorphic, it suffices to show that the patching data required to put together pieces of the type Ar x Ui or of the type orx „ are exactly the same. In the first case, what is needed is an isomorphism of the two open subsets: ArxUi с suitable isomorphism E с In the second case, since £ тт.пи. is isomorphic in two different ways with orx jj.nu. (i.e., by restricting the isomorphism over Ui and over C/j), we need to know the automorphism of Qrx\u nJJ by which these 2 isomorphisms differ. Now the first isomorphism is required to be linear and therefore it is given by an r x r matrix a^j! G Г (Ui П Uj,ox), 1 < k,£ < r. To be an isomorphism means exactly that the determinant of a,k,e is invertible in Г (Ui П Uj,ox). On the other hand, an ox-module homomorphism is also determined by the matrix of components of the sections x(@,.. .,0,1,0,.. .,0)). Again \ ls an isomorphism if and only if the deter- determinant of this matrix is invertible. The situation is even nicer: if E is the sheaf of sections of E/X, then the same rxr matrices give the induced automorphisms of Ar x (Ui П Uj) and of orx и.пи., whenever you have corresponding "trivial- izations" of E and S over Ui and over Uj. From these considerations, it follows easily that a vector bundle E is determined by its sheaf of sections 8, and that every locally free ox -module £ arises in this way. For most purposes, the sheaf £ turns out to be more convenient than the bun- bundle E. A word of warning though: Grothendieck has introduced a dual method of going back and forth between E and £. In his approach, the module £ associated to the bundle E is defined by:
152 III. Local Properties of Schemes s is a linear function r{u,€) = onE where a linear function is one such that, over UC\Ui, the induced element ф* (s) G Г (Ar x (U П Ui) ,oAr xU.) is of the form: ak-Xk, aker(UnUi,ox) . k=i His method has a good generalization to arbitrary coherent ox -modules (cf. EGA, Ch. II). A basic tool in dealing with coherent modules is Nakayama's lemma which we want to recall in several forms here: Nakayama's Lemma. Let X be a noetherian scheme, T a coherent ox -module and x e X. If then 3 a neighbourhood U of x such that Jr\u = @). Proof. Apply the Nakayama lemma of Ch. I, §8, to the ox-module Tx, and the maximal ideal mx. It follows that Tx — @). If U\ is an affine open neighbourhood of x, then Jr\u = M. Let a±,...,an be generators of M as Г (Ui,ox)-modn\e. Since ai i—> 0 in Tx, it follows that a; i—> 0 in F(U, F) for some neighbourhood U of x. Since the a^'s generate F, this implies that T „ = @). D Souped-up version I. Let X,F,x be as above. If U is a neighbourhood of x and ai,..., an G Г(и, Т) have the property: (*) the images Щ,... ,a^ generate Tx ®o к (ж) then 3 a neighbourhood Uq С U of x such that a\,... ,an generate Jr\u . Proof. Define a module homomorphism: by Ф {{bi,..., bn)) = Y^aibi- Let К be the cokernel of ф, a coherent module on U. Then Kx ®ox к (ж) = @), so by Nakayama's lemma K, v = @), some Uo С U containing x. Therefore ф is surjective on Uo. □ Souped-up version II. Let X, T be as above. Define e(x) = dim]k(a.) [Tx ®2ж к (ж)] all x £ X. Then e is upper semi-continuous, i.e., {x \ e(x) < r} is open, for all r. Assume further that X is reduced. Then for all x € X, T is a free ox-module in some neighbourhood of x if and only if e is constant near x.
§3. Tangent cones 153 Proof. Let xi £ X and r\ = e{x\). Then there are r\ elements a\,..., ari G FXl whose images generate FXl (g> k(xi). Lift the щ to sections of T in some open U\ containing x\. By version I, the щ generate T in some open f/2 С U\ still containing x\. But then of,..., a^7 generate J"y B)k(y), all у G C/2, i-e., e(y) < ri if у G £/2- To prove the second statement, note that if T is a free ox-module of rank r in a neighbourhood U\ of x, then e(y) = r, all у G U\. Conversely, assume e(y) = r, all у G f/i. As in the first part, construct a surjective homomorphism С/2 'С/г in some (possibly smaller) affine neighbourhood U2 of ж. Let К be the kernel of ■?/>. If К Ф @), К has a non-0 section s. Since, К С orx\V , s is an r-tuple of elements of r(U2,ox). Now since X is reduced, s will be non-zero at some generic point у of [/2. Moreover, since у is generic, the stalk oy of ox at у is a field. Looking at stalks at y, we get an exact sequence of vector spaces over oy: But e(y) = dim£ Ty = r, by assumption, so this is a contradiction. This proves that ф is an isomorphism, hence Т\„ is free. D Problem. Let X be an irreducible noetherian scheme all of whose stalks ox at closed points are principal valuation rings. If T is a coherent ox-module, show that T £ Tx Ф Ti , where T\ is a locally free Ojf-module, and Ti has support at a finite number of closed points xi,..., xn, hence Ъ= Ф ох./тгх\ i=l (here ox/mx should be considered an a,.-module by being extended by 0 outside §3. Tangent cones We are ready to go back to geometry. Let к be an algebraically closed field, let X be a scheme of finite type over k, and let ж be a closed point of X. The scheme X has a tangent cone at x defined as follows: 1. Let U С X be an affine open neighbourhood of x.
154 III. Local Properties of Schemes 2. Let i : U —> An be a closed immersion, making U isomorphic with the sub- scheme Spec (k [Xi,..., Xn] /A) of An. By adding suitable constants to the Xj's, we can assume that X\,...,Xn G I({x}), or equivalently that i(x) = ©, the origin in An. 3. For all polynomials f £ к [Xi,..., Xn], let /* be their "leading form", i.e., if N f = ^Л? /г homogeneous of degree г /г ф О then /* = /r. Let A* be the ideal of all polynomials /*, for all / G A. Provisional Definition 1. Spec (k [X\,..., Xn] /A*) = the tangent cone to X at x. A priori, it might look as though this definition depended on the particular embedding of a neighbourhood of x in affine space. We can easily make it intrinsic through: Definition 2. Let О be a local ring, with maximal ideal m. Then n=0 If к = О/т, then gr((9) is seen to be a graded fc-algebra, generated by m/m2, the elements of degree 1. In fact, if xi,... ,xn generate m/m2 over k, then some homogenenous Final Definition. If x G X is a closed point, Spec [gKoJ] is the tangent cone to X at ж. Why are these the same? With notation as above, let R = k[X1,...,Xn]/A = M = (X1,..., so Rm — ox • Then
§3. Tangent cones 155 if A*k = homogeneous piece of A* of degree k. But (Xi,..., Xn)k I (Xi,..., Xn)k+1 +A*k = fcth-graded piece of k[X1,..., Therefore Example E. Let f(x, y) be an irreducible polynomial such that /@,0) = 0. Then X = V((f)) is an affine plane curve through the origin. In this case, A* is generated by the leading form /* of /, and since /* is a homogeneous form in 2 variables, /* will factor into a product of linear forms U\ Therefore, the tangent cone is a union of lines through the origin taken with multiplicities. Case i). f has a non-zero linear term: / = ax + Cy + /2 + /3 + ... where a or /5 ф О, Д is homogeneous of degree k. Then the tangent cone is the one line ax + Cy = 0 , and the origin is called a non-singular point of X. Case ii). f begins with a quadratic term / = /2 + /3 + • • • where /2 Ф 0 and /2 is the product of distinct linear factors £(x,y), m(x,y). Then the tangent cone consists of 2 distinct lines with multiplicity 1, and X is said to have a node at the origin. Case in), f begins with a square quadratic term: where a or C Ф 0. The tangent cone is a double line. There are several subcases depending on the power of t dividing f(fit, —at). If /з(/3, —а) Ф 0, this is divisible only by t3, and t is said to have a cusp at the origin.
156 III. Local Properties of Schemes 0= lx*y)(x-y)*x3 (compare Example P ch.I) Figure to Case ii): X is said to have a node at the origin tangent cone Figure to Case in): f is said to have a cusp at the origin Example F. Let f(Xi,...,Xn) be an irreducible polynomial such that /@, ...,0) = 0, and let X = V((f)). Again A* is generated by the leading form /* of /. If /* is linear, then the tangent cone to X at О is a hyperplane and we say that X has a non-singular point at the origin. If / is a non-degenerate quadratic form, then the tangent cone is the simplest type of quadratic cone and we say that X has an ordinary double point at the origin.
§3. Tangent cones 157 Х2-У2-22 Warning: If X is an affine variety of higher codimension, defined by /i = ... = fr = 0, then its tangent cone may not be the locus Д* = ... = /* = 0. One may need the leading forms of some more of the polynomials ^ 9%f%- Since the tangent cone is defined by a homogeneous ideal A*, it is natural to projectivize it: i.e., look at the locus of roots (a\,..., an) of the polynomials in A* as the set of homogeneous coordinates of a subset T С ¥n~1. More precisely, A* defines a subscheme T in P71, which we call the projectivized tangent cone. An amazing fact is that there is a natural way to put X — {x} and T together into a new scheme Bx (X) in such a way that locally on Bx (X), T is the subscheme defined by the vanishing of a single function / (which is not a 0-divisor). This is known as blowing-up x because {x} is replaced in the process by T which is a picture of the infinitesimal behaviour of X at x. We first define the variety Bn obtained by blowing up the origin О in An. Let p : An - {©} in—1 be the projection morphism taking a closed point with affine coordinates (ai,..., an) into the closed point with homogeneous coordinates (a\,..., an). Look at the graph Г of p: Г С (An - {О}) х Define Bn to be the closure of Г in An x P' subset). By restricting the projection p\ : AT diagram: DTI —1 1 (as a subvariety as well as a x P71-1 -)■ An to Bn, we get a
158 III. Local Properties of Schemes С open Bn An - {©} С An Since Г is closed in (An - {©}) x P71, all the points of Bn not in Г lie over the origin in An. Note that Bn is an n-dimensional variety and q is a birational morphism. q is a proper morphism, too, since pi : An x P71 -» An is proper. To visualize Bn effectively, consider the totality of lines £ in An through O. £ is the union of О and p-1(t) for some closed point t G P71. Therefore Г contains the curve and Bn contains its closure, which is just £ x {t}. In particular, this shows that Bn contains all the points of An x p™-1 over the origin in An: so, set-theoretically, Вп=Ги({О} xP"-1) . More than that, this shows that whereas An is the union of all these lines £, with their common origins identified, Bn is just the disjoint union of the lines £. In Bn, we have replaced the one origin by a whole variety of origins, a different one for each line £ containing the original origin. This explains why Bn is called the result of blowing up О in An. The locus of origins, {©} x Pn-1, is called the exceptional divisor of Bn, and will be denoted by E. Identify Identify the case n = 2
§3. Tangent cones 159 To give a purely algebraic description of Bn, we must cover it by affine open pieces. Now Bn is embedded in An x Pn-1, and P71 is covered by the usual n pieces Ui,...,Un- Take X\,..., Xn as affine coordinates in An and Y\,..., Yn as homogeneous coordinates in P71. Then An x P71-1 is the union of the open affines: An x Ui = Spec к \Xi,...,Xn, Yi Yn у' • * *' у 1 г i« The projection p is defined by setting the ratios X\ : ... : Xn and Y\ : ... :Yn equal, i.e., we let X{ (Yi/Yi) =Xi,...,Xi (Yn/Yi) = Xn. Therefore Bn П (An x Ui) = and Ii This means that Yj/Yi, as an element of the function field к (Bn) = к (An) = k(Xi,... ,Xn), equals Xj/Xi, and that if we identify the affine ring Г (Вп П (An xUi) ,oBn) with its isomorphic image to к (Xi,...,Xn), we ob- obtain: Call this piece of Bn, Bn . We see from this description that each Bn , as a scheme in its own right, is isomorphic to An. On the other hand, inside Bn, Впг' and Bn are patched together along the common open subset: Xi Xi r X\ Xn Aj Aj = Spec A: = Spec к The birational morphism q corresponds, on the ring level, to the inclusions: Iy v- "I ■y -^-1 -Л-П Aj,—,...,— . A; Ai J Moreover, E П Б^} is the subset V ((Хг,..., Xn)) in Bnl). But all the X's are multiples of Xi in Г (£„ ,oBn J, therefore
160 III. Local Properties of Schemes In this way, E itself is just P71. The process of blowing up can be generalized to an arbitrary scheme X of finite type over k, and an arbitrary closed point x G X. The quickest way to define this new scheme BX(X) is to choose an affine open neighbourhood U of x, and a closed immersion i : U —► An such that i(x) = ©. Let i(U) = Spec (k [Xi,.. .,Xn] /A). Set-theoretically, we want to define Bx (X) as the union of X - {x}, and the closure of q~x [i{U) - {©}] in Bn, suitably identified. More precisely, for all г = 1,... , n, define an ideal A* mk[Xi,Xi/Xi,...,Xn/Xi]by At = {f ek[xiix1/xit...fxn/xi] \f-x?eA, ifiv»o} . It is clear that A* and A*- induce the same ideal in к [X{, Xi/Xi,..., Хп/Х{, which is the affine ring of Bn Г\Вп. Therefore, the ideals {A*} define a coherent oBn -ideal Q* С oBn. Let U* С Вп be the corresponding closed subscheme. Note that via q, we get a morphism q' С Bn и corresponding to the ring homomorphisms k[X!,...,Xn)/A i,...Xn/Xi] k[Xu...,Xn] . Note finally that just as q restricts to an isomorphism from Bn — E to An — {0}, so q' restricts to an isomorphism of U* — U* П E to U — {x}. Definition 3. BX(X) is the union of U* and X — {x}, patched along their isomorphic open subsets U* — U* ПЕ and U — {x}. Then q' extends to a morphism Q : BX(X) —>• X. Let £ = Q-\{x}) = U*C\E: as a set this is called the exceptional subscheme of BX(X). This set-up has the following properties:
§3. Tangent cones 161 (I.) Q induces an isomorphism of BX(X) — 8 with X — {x}. (II.) BX(X) is a scheme (not just a prescheme) and Q is proper. Proof. Let f,g : К \ BX(X) be a pair of test morphisms. Let Z\ = {s G К | f(s) == g(s)} and let Z2 = {s G К \ Q(f(s)) = Q($(s))}. Z2 is closed since X is a scheme, and Z2 is covered by its intersection with the 2 open pieces f/i = {s G К | /(s) and #(s) £ £}, and U2 = {s G К | /(s) and #(s) € [/*}. It suffices to show that Z\ is closed in Z2 Df/i and in Z2 П[/2. But Z\V\U\ — Zif\U\ since Q is an isomorphism over X — {ж}. And on f/2, instead of / and g, we can consider the compositions / f',g' : К Z^ £/* -4BncAnx P71 . </ П Then Z1r\U2 = {s e К \ f'(s) = g'(s)}, and this is closed since An x P71 is a scheme. This shows that BX(X) is a scheme. To show that Q is proper, it suffices to show that the restrictions of Q to Q~X{X - {x}) and to Q~X{U) are proper. But the first is an isomorphism and the second is the restriction of the proper morphism q to a closed subscheme. D (III.) For all у G £, there is an affine open neighbourhood V С ВХ(Х) of у such that the ideal of 8 П V in Г f V, oBx(X)) is equal to (/), for some non-zero divisor /. Proof. Suppose that the image of у in Bn is in E П Bn • Then let V equal U* П Бп . Let X{ G r(U,ox) be the restriction of the function X* to U. Since the ideal of {x} in r(U,ox) is (xi,...,xn), the ideal of £ in F(V,ox) is also (xi,..., xn). But all the X's are multiples of Xj in i f Bn ^, oBn J, so this ideal is just (xi). Referring to the definition of A*, it is obvious that Xi is not a 0-divisor mr{y,oBx{x)y a (IV.) If X is a variety of dimension r, and r > 1, then Дв(Х) is a variety of dimension r, Q is birational, and 8 is non-empty and pure (r — 1)- dimensional. In general, if r = sup{dimZ | Z a component of X through x} > 1, then a) r = sup{dimZ | Z a component of BX(X) meeting 8} b) r — 1 = sup{dim Z \ Z & component of 8}. Proof. First of all, by (III.), no component of Bx (X) is contained in 8. Therefore, Q induces an isomorphism between non-empty open subsets of all components of BX(X) and of X. To prove assertion a), we must check that if Z is a component of X containing x (with dimension > 1), then the component
162 III. Local Properties of Schemes of BX(X) meets 8. But Q{Z') is closed since Q is proper, and it contains Z—{x}, therefore Q(Z') = Z, so Z1 meets Q~1(x). This proves (a) and then (b) follows from (III.) and the results of Ch. I, §7. Now assume X is a variety. Then BX(X) is irreducible. Also A is a prime ideal, and it follows immediately from the definition of A* that it is a prime ideal too. Therefore BX(X) is reduced too. The fact that 8 is pure (r — l)-dimensional follows again using Ch. I, §7. □ (V.) (the whole point): 8 = the projectivized tangent cone to X at x. Proof. By construction, 8 is a closed subscheme of the projective space E. Its ideal in Bn is A* + (X{), and its ideal in the ith affine piece E П Bn of E is: ~Ai = A*- But A* is just the ideal of quotients f/X%, where / G A and the leading term /* of / has degree > r. Therefore ,* /* = leading term of ^ j- some / G A, \ + (X{) , * degree f*=r J so if В is the ideal in к [Xi,..., Xn] generated by the leading terms of elements Ai = < —: g G B, homogeneous of degree r > . The ideal on the right is nothing but the ideal defining the ith affine piece of the projectivized tangent cone. D Corollary. // X is an r-dimensional variety, then the tangent cone to X at x is pure r-dimensional. In general sup < dim Z _ , n ( Z a component of the Z a component II г ^ ., . > = sup < < of X at x = sup < dim Z [ tangent cone to X at x Example E bis. If we blow up A2 = Spec k[x,y], we get a Bi covered by: U = Spec k[x,y/x] V = Spec k[y,x/y] . E is a projective line, and E C\U and E П V are defined by setting x = 0 and у = 0 respectively. Now if / = x2 - у2 + ж3, and X = V((/)), then BX(X) will be a curve in Bi entirely contained in U and defined by the equation
§3. Tangent cones 163 = 1 - (x/yf + x in U The picture is, roughly: B,(X) y/x-axis Now if / = Y3 - x3, then BX(X) is entirely contained in U and is the curve: (II-- B,(X) y/x-axis Note that the subscheme of BX(X) defined by x = 0 is the point P doubled since BX(X) is tangent to the exceptional curve E at P. And, indeed, the tangent cone in this case is a double line, as (V) requires.
164 III. Local Properties of Schemes Example F bis. Let X = Spec k[x,y,z]/ (x2 - y2 - z2) С A3. Then B3 contains a copy of the projective plane as its exceptional divisor E, and Bx (X) is a surface in B3 that meets E in the circle: {y/xJ + {z/xJ = 1. Roughly, it looks like this: B,(X) Problem. Check that Bx (X) depends only on x and X and not on the neigh- neighbourhood U of x and the closed immersion i : U —>■ An. §4. Non-singularity and differentials From a technical point of view, the big drawback about the tangent cone is that it is non-linear. It is always easier to handle essentially linear objects. The most natural way around this is to study the "linear hull" of the tangent cone, which we will call the tangent space. Let X be a scheme of finite type over k, and let x G X be a closed point. 1. Let U С X be an affine open neighbourhood of x. 2. Let г : U —>■ An be a closed immersion making U isomorphic with the closed subscheme (k [X\,..., Xn] /A) of An. By adding suitable constants to the Xj's, we can assume that X\,...,Xn G I({x}), i.e., i(x) = O. 3. For all polynomials f e A, notice that /@,... ,0) =0 since О G V(A). Let fe be the linear term of /. Let Aq be the ideal generated by the linear forms f, all / G A. Provisional Definition 1. Spec (k[X\,... ,Хп]/А0) = the tangent space to X at x. Compare this with the non-intrinsic definition of the tangent cone in §3. It is clear that Aq С A*, the ideal of all leading forms, hence the tangent space contains the tangent cone as a subscheme; and in fact it is just the smallest linear subscheme of An containing the tangent cone as a subscheme. For this reason, we can say that the tangent space is just the "linear hull" of the tangent cone. When dealing with linear subspaces of An, and, more generally, with any "affine spaces", i.e., schemes isomorphic to An, it is possible to get confused
§4. Non-singularity and differentials 165 between the underlying vector space, and the whole scheme. It is important to realize that one can canonically attach to every &-vector space V a scheme whose set of closed points is just V, and which is isomorphic to An; and that under this canonical correspondence, the set of sub-vector spaces of kn is "equal" to the set of linear subschemes of An. This is simply a matter of making the inclusion of the vector space kn in the scheme An coordinate-invariant It is also the special case when the base scheme is Spec (k) of the functorial bundle, locally free sheaf correspondence outlined in §2. In particular, in dealing with the tangent space to a scheme X one sometimes wants to deal with it as a scheme, and sometimes as a vector space. This correspondence goes as follows: suppose У is a fc-vector space. Let Ry be the ring of polynomial functions from V tok (equivalently, the symmetric algebra on the dual space homfc(V, k)). Then let V8ch = Spec (Rv) . Notice that V "equals" the set of closed points of Vsch. In fact, by the Nullstel- lensatz, every maximal ideal of Ry is the kernel of a homomorphism / i—> /(ai,...,an) G к for some (ai,..., an) G kn. If we call this ideal Ma, the correspondence a —у [Ма] maps kn isomorphically onto the set of closed points of ysch. Bearing in mind, then, that linear subspaces of An are essentially the same as subvector spaces of kn, I want to show how easy it is to compute the tangent space to an affine scheme Spec (k [X\,..., Xn] /A) in An at any closed point. First look at the origin. Note the lemma: (*)// A = (Л,...,/»), then A, = (/f,...,/£). Proof. In fact, if / is any element of A, then / = $Z 9ifi, some gi G к [X\,..., Xn]. Then the linear term fl of / is just £ #@,..., 0)//, so f G (j[,..., /£). D г=1 (Whereas it is not always true that the ideal A* of all leading forms is generated by the leading forms of generators of A.) Proposition 1. Let X = Spec (k [Xi,..., Xn] / (Д,..., /m)) be a closed sub- scheme of An. Let x G X be a closed point with coordinates (ai,... ,an)> Then the tangent space to X at x is naturally isomorphic to the linear subspace of An defined by: г=1 (Note that the linear equations here define both a linear subscheme of An, and the corresponding subvector space of kn.)
166 III. Local Properties of Schemes Proof. If we translate X so that (a\,... ,a^) is shifted to the origin, it is then defined by the equations /i (Xi + ai,..., Xn + an) = ... = fm (Xi + ai,..., Xn + an) = 0 . The tangent space of the original X at (a) is isomorphic to the tangent space of the shifted X at O, and this, by (*), is the locus of zeroes of the linear terms of fi (Xi + a\,..., Xn + an). But the linear term of this polynomial is: 3=1 j so the Prop, follows. D The intrinsic definition of the tangent space is this: Final Definition 1 (due to Zariski). Let ж be a closed point of the scheme X of finite type over k. Let (mx/ml,k) . Then the vector space TXix and the corresponding scheme T*cx will both be called the tangent space to X at x. The dual vector space mx/mx, and its scheme (nix/ml) ° are both the cotangent space to X at x. Here is the canonical isomorphism of this tangent space with the previous one: As before, let U С X be isomorphic to Spec (k [X\,..., Xn] /A). Let R = k[X1,...,Xn]/A = Assume that so Rm = ox . Then mx/ml ^ M/M2 =i (X1,...,Xn)/(Xu...,XnJ Therefore, since Ao is generated by linear forms: / Symmetric algebra \ ^ , r „ v л i л < , о > =K Л1,....ЛП /Ло \ on mx/mx J L nJ/
§4. Non-singulaxity and differentials 167 Taking Spec of both sides, Tsch = g f Symmetric algebra 1 x*x def. \ on mx/ml J Г Non-intrinsic tangent We can even embed the tangent cone inside the tangent space in a completely intrinsic way. There is a canonical surjection: { Symmetric algebra onm,/m| This defines a closed immersion: Tangent cone to } = Spec (grfe)) c т.с* . Once again, we see that Fx^x is exactly a "linear hull" of the tangent cone. Definition 2. The (closed) point ж is a non-singular point of X, or X is non- singular at ж, if the tangent space to X at x equals the tangent cone to X at x (i.e., the tangent cone is itself linear). Look at this condition algebraically: it means that gr(ox) is isomorphic to the symmetric algebra on mx/mx. In other words: A. or в. к I fc+i ~ / ^th symmetric power 1 х/ х \ ofmx/m2x j if / (Xi,..., Xn) G ox [Xi,..., Xn] is homogeneous of degree к and if Ж1,..., xn G mx are independent mod mx, then / (ж1,..., xn) G mx+1 only if all coefficients of / are in mx. Note that this implies that ox is an integral domain: for if a, b G ox, а ф О, 6^0, then for some integers к and £ we would get a a G mx — M — xk+1, b G mx — mx+1. Then a and b would have non-zero images a, b in mx/mx+1 and mx/mx+1. Therefore a b G mx+i/mx+i+1 would be non-zero, i.e., a b $. mx+e+1, hence a fortiori a ■ b ф О.10 On the scheme X, this means that some neighbourhood U of x is a variety [to be precise: x is in the closure of only one associated point z of ox, namely the one corresponding to the ideal @) Co,.; and if W is the union of the closures of the other associated points of ox, then X — W is an open neighbourhood of ж which is a variety.] Therefore, in discussing non-singularity, we may usually restrict our attention to varieties. But conversely, ox may be a domain even if gr (o_x) is not: see Ex. E, §3.
168 III. Local Properties of Schemes Proposition 2. Let X be an n-dimensional variety. Then for all closed points xe X, dimTXtx > n , equality holding if and only if X is non-singular at x. Proof. In fact, we saw in §3 that the tangent cone to X at x is pure n- dimensional. Therefore the dimension of the tangent space is at least n, and if it equals n, the tangent space must equal the tangent cone. Corollary 1. Let X = Spec (k [Xi,..., Xn] / (/i, ■ ■ •, fmj)- Assume that X is an n-dimensional variety. Then for all closed points x = (ai,..., ам) С X, Г Я f 1 rank \-^-(ai,...,aN)\< N-n , equality holding if and only if X is non-singular at x. Proof. This is just Prop. 1 + Prop. 2. Note that this criterion was exactly what we used to define non-singular points on hypersurfaces in Ex. E,F of §3. Corollary 2. If к = С, X is a variety over C, and x G X is a closed point, then X is non-singular at x if and only if the analytic space X corresponding to X (as in Ch. I, §0j is an analytic manifold at x. Proof. Use Cor. 1 and the fact that an n-dimensional analytic subspace of Cn defined by Д = ... = /m = 0 is a manifold at (a) if and only if rank [dfi/дХк (ai,..., un)] = N -n . (Cf. Gunning-Rossi, Ch. V, A 13 and A 14 for example.) D Theorem 3. Let x G X be a closed point of a scheme X of finite type over k. The following k-vector spaces are canonically isomorphic: 1) the tangent space T = homfc {тх/т2х,к) to X at x, 2) the space of point derivations D : ox —t к over k, i.e., k-linear maps such that D{fg) = f(x) ■ Dg + g{x) ■ Df. 3) hom^ ((Пх/к)х,к{х))- As a set, these are also isomorphic to: 4) the set of morphisms f : I —t X with image x, where I = Spec (k[e]/(e2)). Proof. A) and B) are isomorphic, since every point derivation D kills mx, and therefore induces a linear functional £ : mx/mx —t k. Conversely, given any such £, define D by D(f) = £[f-f(x)}
§4. Non-singulaxity and differentials 169 and we get a point derivation. To compute C), let U = Spec (R) be an affine open neighbourhood of x, and assume x = [M]. Then (flR/k,R/M) . On the other hand, we know from §1 that for all -R-modules A module of ^-derivations Applying this with A = R/M, we find that B) and C) are isomorphic. Finally, according to Prop. 3. §6, Ch. II, morphisms f : I -¥ X with image X correspond to local homomorphisms Such /* always kill mx, hence define and are defined by linear functionals £ : mx/m2x -¥ (б) = к. Therefore, the sets A) and D) are isomorphic. D Corollary. For all closed points x G X, mx/mx is canonically isomorphic to (f2x/k)x®ox^-{x)- In this map, df (modulo mx-(Qx/k)x) corresponds to f — f(x) (modulo mx). Proof. Dualize the isomorphism of vector spaces B) and C) in the theorem. D This corollary shows a very significant and far-reaching thing: that the vector spaces mx/mx, which a priori are a collection of unrelated vector spaces, one for each closed point ж, can all be derived from one coherent sheaf Ох/к on X. This enables us a) to use the machinery of Nakayama's lemma (§2), and b) to handle non-closed points. Definition 3. Let X be a scheme of finite type over к. Then for all (not neces- necessarily closed) points x G X, the cotangent space to X at x is: We abbreviate this to OX/k{x), от Q{x). Moreover, let d(x) = dimk(a.)[i?(x)] , sometimes called the "embedding dimension" of X at x. Note that d is upper-semi-continuous by version II of Nakayama's lemma. Now assume that X is a variety and n = dimX. Then / generic point \ r -, \ of X ) = dmifcW I/2*W/*J vector space of ^-derivations = n .
170 III. Local Properties of Schemes (Cf. Ex. С, §1). It follows that {x £ X \ d(x) = n} is an open dense subset U of X. Again by version II, U is the maximal open subset of X on which Ox/k is a locally free o^-module. Since for closed points x e X, d(x) = n (mx/ml) = n X is non-singular at x, it is natural to extend the definition of non-singularity to all points x of X to mean that d(x) = n, or that Ox/k is a free ox-modu\e near x. The conclusion is: Proposition 3. Let X be a variety over к of dimension n. Then the set of non- singular points x G X is an open dense subset of X on which Ox/k г5 а locally free оx-module of rank n. In particular, if X is a non-singular variety (i.e., non-singular everywhere), then Ox/k is a locally free o^-module. In the correspondence between locally free modules and vector bundles discussed in §2, the bundle version of Ox/k is then exactly the usual cotangent bundle of X over k, and conversely the sheaf of differentials Ox/k is then the sheaf of sections of the cotangent bundle. The operation of taking the tangent space is a covariant functor, while the cotangent space is a contravariant functor: Given / : X —У Y, X and Y schemes of finite type/fc. For all closed points x G X, let у = f(x). We get this (A) ./* and this: and this: dfx ' TX)x —> Ty<Y ■ (B) Given / : X —у Y, X and Y schemes of finite type over k. For allxe X, let у = f(x). We get this hence this (OX/k)x $- (OY/k)y and this
§4. Non-singularity and differentials 171 Of course, at closed points, Ox{x) and Оу{у) are the dual spaces of Tx>x and TVtY respectively; and df* is the transpose of dfx. For example, suppose Y is a closed subscheme of a scheme X. Then for all x G Y, we get a surjection f* '• Q.x,x ~^ Q.x,Yi hence Qx{x) is a quotient of Qy{x). More precisely, one checks that: fei where / = ker (oxX —y o^y) . On the other hand, if ж is a closed point, we get a dual injection: TX,Y ^ Tx,x such that the image of TXiy is the subspace of Tx>x perpendicular to the differ- differentials df, (f e I). Example G. Let's compute the function d for a hypersurface H с Fn. Since Fn itself is non-singular, for all x G Fn, dimpn(x) = n. Since Он{х) is always a quotient of i?pn(x), its dimension is at most n. But a point x G H is non-singular if this dimension is n — 1, so we get: ,, > f n-1, if ж is a non-singular point ^ ~~ \ n, if ж is a singular point . Now let / = Spec k[e]/(e2). I itself has a one-dimensional tangent space with a canonical generator C, given by the linear functional a • б i—У а from (e) to к. We can give a suggestive interpretation to definition 4 of the tangent space, in Theorem 3: it means that for all closed points x G X and all tangent vectors t G Tx>x, there is one and only one morphism f : I -¥ X with image x such that df(C) = t. [Proof: Such /'s correspond to possible local homomorphisms /* : ox —» &[б]/(б2), hence to possible fc-linear maps mx/mx -¥ (e). But df(C) is the com- composite linear functional mx/mx —у (б) —у к , and since C is an isomorphism, the set of /*'s correspond 1 — 1 with the set of linear functional mx/mx —У к] In other words, / is a sort of disembodied tangent vector which can be em- embedded in any scheme X so as to lie along any given tangent vector to X. The set of all morphisms from / to X is a sort of set-theoretic tangent bundle to X,
172 III. Local Properties of Schemes being isomorphic to U TXtx- Using this approach to the tangent space, we closed points can give a more geometric explanation of the realization of TXtx as an actual linear subspace of Am, when X С Am. Suppose X = Spec (k [Xi,..., Xm] / (gi,..., </*)) and x = © is a zero of the <7i's. Then Tq,a"» is isomorphic to the set of morphisms and these are determined by the n-tuple («i,..., am) such that /* (Xi) = щ • e. Tq,x will be the subvector space of Tq,a™ corresponding to the morphisms / that factor through X. But for / to factor through X, it is necessary and sufficient that f*(gx) = ... = f*(gk) = 0. In general, though, f*(h) = /i(/*(Xi),...,/* (Xm)) = h(aie,...,ame) i \ since e2 = 0. Therefore and the vector space of («i,... ,am)'s making f*(gi) = 0, 1 < г < к, is exactly the space of solutions of the linear equations: If we identify the m-tuples («i,..., am) with the closed points of Am itself, we have again the tangent space to X at x as described in Proposition 1. The property defining non-singularity requires that the given scheme X be locally the set of zeroes of functions in A^ with enough independent differentials. It is even true that if X is non-singular of dimension n and /i,..., Jn-п are functions of A^ which vanish on X with independent differentials, then the locus of zeroes of the /'s is exactly X, plus perhaps some other components disjoint from X. The general result is this: Theorem 4. Let X be a non-singular variety, Y С X a closed subscheme, and x G Y any point. Then Y is non-singular at x if and only if there exists an affine open neighbourhood U С X of x and elements /i,..., Д G R = Г {U,ox) such that i,...,/*)] 2) df\,..., dfk define independent elements of fix/k ix) ■
§4. Non-singularity and differentials 173 In this case, dimF = dimX — k. Proof. We begin with a special case: Lemma. Let X = Spec (R) be a non-singular n-dimensional affine variety. Let f G R be an element such that the image of df in fix (#) is not zero, for all x G X. Then the subscheme Y = Spec (R/(f)) is a disjoint union of non-singular subvarieties of dimension n — 1. Proof. Let x G Y be a closed point. We saw above that TXjy is isomor- phic to the subspace of TXtx perpendicular to df. Since df ф 0 in fixix), dimTXtX = dimT^jj- = n - 1. But since Y = V((/)), Y is pure n - 1 dimensional, hence its tangent cone at x has a component of dimension n — 1. But the only subscheme of an (n — l)-dimensional affine space with an (n — l)-dimensional component is the whole space. Therefore the tangent space and tangent cones are equal and Y is non-singular at x. D Now assume Y П U = Spec [R/ (Д,..., Д)] and that df\,..., dfk are inde- independent in fix/к {у) for all у G U (replace U by a smaller neighbourhood of x if necessary). Let Z{ = Spec [R/ (Д,..., /,-)]. We can use induction to show that each Zi is a disjoint union of non-singular varieties. In fact, suppose Zi is such a union. Note that for all z G Zi, г fiZi/k{z) = Therefore the image of d/i+i in fizi/k(z) 1S not zero. Therefore by the lemma, Zi+i is also a disjoint union of non-singular varieties. Also, since the dimension goes down by 1 each time, Y П U = Zk is pure (dimX — к)-dimensional. Conversely, assume Y is non-singular at x. Let the ideal of Y as a subscheme of X be generated at x by /i,... /w. As before N fiY/k(x) S If /г = codimx(y)> then these dfi must span a subspace of fix/kix) °f dimension exactly k. Choose /i,...,/& such that df1,...,dfk in i?x/fc(ж) are independent. Extend these /j's to functions in some neighbourhood U = Spec (R) of ж in X. Let Y* = Spec (Д/ (/i,..., /*)). If f7 is small enough, these /i's will still vanish on Y DU, so Y Г) U will be a closed subvariety of Y*. But by the first half of the proof, if U is small enough, Y* will be a non-singular subvariety of U of codimension k. In particular, it is irreducible and reduced. Then since dimY* =dimYnU,YnU = Y*. D
174 III. Local Properties of Schemes Problems 1. If X is non-singular at x and Y is non-singular at y, then show that X xY is non-singular at x x y. 2. Describe f2x/k(x) fc>r a non-closed point x € X as follows: let i?o be the subspace generated by the elements df,f£ mx. Show a) i?o — rnx/ml, with d/ corresponding to / (mod ml), all / G m^. b) O\/Oq = &ъ.(х)/к, w^h d/ (mod i?o) corresponding to d/, all f € Qx, (/ being the image of / in к (ж)). Hence show X is non-singular at x if and only if dim (mx/m1) = codim {x}. 3. Let x be any point of a variety X (not necessarily closed), show that X is non-singular at x if and only if a,, is a regular local ring. [This result, although pretty, has historically been rather a red herring. The concepts of non-singularity and regularity diverge over imperfect ground fields.! §5. Etale morphisms In the last section, we have seen that many familiar concepts involving differ- differentials can be transferred from differentials and analytic geometry to algebraic geometry. But one very important theorem in the differential and analytic situ- situations is false in the algebraic case - the implicit function theorem. This asserts that if we are given к differentiable (resp. analytic) functions /i, ...,/& near a point x in Rn+k (resp. Cn+fc) such that del; (dfi/dXj)(x)^01 l<i,j<k then the restriction of the coordinate projection {Locus A = ... = fk = 0) —> W1 (resp. Cn) is locally an isomorphism near x. However, to take a typical algebraic situation, look at the projection: V(x\-x2) —► A1 (xi,x2) i—У х2 . At x = A,1), g|- (x\ — x2) is not zero, but the projection is not even 1 — 1 in any Zariski-open subset U of V (x\ — x2) since for all but a finite set of values X2 = a, U will contain both points (+</a, a) and (—^/a, a)- This indicates that there exists in algebraic geometry a non-trivial class of morphisms that are nonetheless "local isomorphisms" in both a differential-geometric and analytic
§5. Etale morphisms 175 sense. These are known as etale11 morphisms and are defined by mimicking the implicit function theorem as follows (we are now dealing with arbitrary schemes): Definition 1. 1st the particular morphisms: X = Specfl[Xi,...,Xn]/(/i,...,/n) Y = Spec (R) are etale at a point x € X if (*) det (dfi/dXj) (x) ф 0 . 2nd an arbitrary morphism / : X —> Y of finite type is etale, if for all x € X, there are open neighbourhoods U С X of x and У С F of /(ж) such that F{U) С V and such that /, restricted to U, looks like a morphism of the above type: U С res (/) open immersion V C. Spec Д[ХЬ...,ХП]/(A,...,/n) Spec(R) where det (dfi/dXj) (x) ф 0. This intuitively reasonable definition, like the provisional ones we made for the tangent cone and tangent space, is not really intrinsic. One has a right to ask for an equivalent form involving only the local rings of X and Y and not dragging in affine space. There is such a reformulation, but it involves the concept of flatness so we have to put it off until §10. This clumsy form is adequate for the present. The condition on the partials of the /;'s means exactly that Qx/y — @)- In fact, if 5 = R [Xu ..., Xn] I (A,..., /n), then Q X/Y = Q a S/R = S/R S-module generated by dX\,..., dXn modulo the relations ELi (dfi/dXj) ■ dXj =0, 1 < i < n and this module is @) exactly when det (dfi/dXj) is a unit in 5. Notice that if / : X —> Y is etale, and we take any fibre product: 11 The word apparently refers to the appearance of the sea at high tide under a full moon in certain types of weather.
176 III. Local Properties of Schemes X' = XxYY' Г Y' X Y then /' is still etale. This follows immediately from the definition. Thus, for example, the fibre of an etale morphism / : X —)■ Y over a point у £ Y must be a scheme etale over Spec k(y). We can easily work out the definition in this case: Proposition 1. Let X be a scheme of finite type over a field k. Then X is etale over к if and only if X is the union of a finite set of points Xi = Spec (ki), where each k{ is a separable finite algebraic extension of k. Proof. First of all, assume X is etale over k. Then if U = Spec (R) is an affine piece of X, flRjk = @). By the Problem in §1, this shows that R = ®ki, ki being finite separable//?. Conversely, if K/k is a finite separable extension, then by the theorem of the primitive element, K~k[X\/{f{X)) where df/дХ ф 0. But then df/дХ (£ (f{X)) either, so if Spec К = {ж}, df/dX(x) ф 0. This shows that Spec (K) is etale over Spec (/г), (taking n = 1 in the definition). D Corollary. If f :X ->Y is etale, then 1) for all у € Y, the fibre f~1{y), (as a scheme over Spec (k(y))) is the union of a finite set of points Spec (ki), ki finite separable over k(y). 2) for all x G X, the maximal ideal mx С о,, is generated by f* (rrif(x)) and the residue field k(x) is finite separable over k(f(x)). Proof The Prop, implies A) since f~l(y) is etale over Spec (k(y)). If x G X, and у = f{x), then B) is seen to be a restatement of A) since the point x defines a point x' in the fibre f~l{y) with local ring: = Q Qx,x /f* * °x,x D This condition on / looks nicer if stated in terms of geometric fibres instead of ordinary fibres - i.e., fibre products F >■ X Spec (J?) Y
§5. Etale morphisms 177 where Q is an algebraically closed field. In fact, if у = Image of i, then F = f-\y)xk{y)fl, so A) is the same as 1') all geometric fibres of / are finite sets of reduced points. Corollary 2. Let X and Y be varieties over an algebraically closed field k. Let f : X —»■ Y be an etale morphism. Then f is dominating, and k(X) is a finite separable extension ofk(Y). Proof. Locally, X is the locus of roots of n-equations in Y x An, for some n. Therefore dimX > dimF. Now if f(X) were a proper subvariety of F, then the fibres of / would have positive dimension and would be infinite. Therefore / must be dominating. But then Spec (k(X)) is the fibre of / over the generic point of Y, so Cor. 1 says that k(X) is finite separable over k(Y). D One of the key facts about etale morphisms is Hensel's lemma, of which here is a variant: Theorem 2. Let О be a complete (noetherian) local ring, with maximal ideal m, residue field k. Let f be a morphism of finite type: X -A Y = Spec (O) x i—у у = [т] . Assume к = k(y) —I k(x). Then f etale near x =$>■ f a local isomorphism near x. Proof. We may assume that X = Spec @[Xi,...,Xn]/(/i,...,/„)) x = [m + {Xi - oi,..., Xn - an)] det(dfi/dXj)(x)^O, where oi,... ,an G O. In fact, if x = [P], then modulo P, each Xi is equal to an element of k, hence for some a; G O, Xi = щ (mod P). Therefore if Pq = m + (Xi - oi,..., Xn - an), P D Po. But since O[Xu...,Xn] /Po = О/т = к, Pq is maximal so P = Po. The main part of the proof consists in constructing a section s : Y —> X of the morphism / such that s(y) = x. This is equivalent to constructing a subscheme Z С X isomorphic to Y under the restriction of /. We do this by a classical form of Hensel's lemma: Hensel's Lemma. Let О be a complete local ring, and let Д,..., fn G О [Xi,..., Xn]. Assume a\,..., an satisfy /i (oi,...,on) = ... = /n(oi,-.-,On) =0 (mod m) (det dfi/dXj) (oi,..., an) $ m .
178 III. Local Properties of Schemes Then there exist a\,..., an € О such that щ = щ (mod m) /i (ab...,an) = ... = /n(ai,...,an) =0 . Proof. We "refine" the approximation root (oi,...,on) by induction. Suppose at the rth stage that we have got an n-tuple (a^ ,..., о„ J such that of' = a; (mod m) /i (aSr),..., oW) =... = /„ (aSr),..., of)) = 0 (mod mT) . Vary the a\r' a little to a^ + e,, where 6^ G mr. Then 3 = 1 J But since det (dfi/dXj) @1,..., an) is a unit in O, we can find a matrix В {Bij}, with B{j G О such that B-[dfi/dXj(au...1an)]=In {In = the identity n x n matrix). If we set e, = - it follows that fa (a^ + ei,... ,о„ + enj = 0 (mod mr+1) for all г. Then ai = ai + 6г is closer approximate to a solution, and if we set ai = lim of', r—>oo the lemma follows. D Applying this to the theorem, we find elements ai,...,anG0, such that Z = Spec O[Xu...,Xn]/(X1-au...,Xn-an) is a subscheme of X through the point x. Since O[X1,...,Xn]/(Xl-au...,Xn-an)^O , Z is isomorphic to Y under the restriction of /. What remains is to prove that near x, X = Z. Let Q be the ox -ideal defining Z, and notice that the stalks at x fit into a diagram:
§5. Etale morphisms 179 Thus oxX = f*(Q) Ф Qx- In particular, its maximal ideal mXtx is /*(my) 0 Qx. On the other hand, since / is etale, mXtx С /*(wy) • ож x (Cor. 1 just above). Therefore Qx Cf*(my)-Qx Cmx-Qx . Therefore, by Nakayama's lemma, Qa; = @), hence Q = @) near ж, hence Z = X near ж. D Using this, we prove the following connecting the concept of etale with power series: Theorem 3. Let f : X —> Y be a morphism of fnite type of noetherian schemes. Let x G X, у = f(x), and assume that the induced map fl : k(y) —> k(x) is an isomorphism. Then f is etale in some neighbourhood of x if and only if the induced map: Jx : Qy —► ox of complete local rings is an isomorphism. Proof First we assume that / is etale near ж, and prove that /* is an isomor- isomorphism. This part can be reduced to Theorem 2 by making a fibre product: X Y where, by definition, i takes the closed point y' = [my] of y' to y, and г* is the canonical inclusion oy <-> oy. Let x' € X' lie over x £ X and y' G У. Then /' is etale near x', so by Theorem 2, we get a diagram:
180 III. Local Properties of Schemes local homo. °x,X f* (local homo.) —y % • Also, from Cor. 1 of Prop. 1, we know that mXtx = fx (my) 'Q.x,x- Therefore the theorem follows from the elementary fact: Given 2 local rings Oi, Oi and local homomorphisms if m2 = /(mi) • Oi-, then is an isomorphism . Now conversely, assume oy -^» ox. Since for any ideal А С ox, (A • g^) Г)ох = A (Zariski-Samuel, vol. 2, p. ), we find that r(jny)-ox = [/"(my)-kins* = mxC\ox = m In other words, the fibre of / over y, near ж, is just a copy of Spec {k(y)). Therefore /?/-i(y)/sPec k(y) 1S @) at x, and since (j7/-1(y)/SPec k(y))x - it follows from Nakayama's lemma that (OX/y)x — @)- Now describe X = Spec R[XU. ..,Xn]/Ac A% Y = Spec (R) . Then relations all'
§5. Etale morphisms 181 Therefore, there must be elements Д,..., /„ G A such that [dfi/dXj] forms an invertible matrix over o^, i.e., det (dfi/dXj) (x) ф 0. Define a subscheme X: by Spec R [Xi,..., Xn] I (/i,..., /„). By definition, X is etale over Y near x. Now compare all the local rings at x. Applying the 1st half of the theorem to X/Y, we get the diagram: П П .v у . by assumption \ / since / etale °y Therefore, ox ~ is a subring of ox x. But since X is a closed subscheme of X, oxX is isomorphic to a quotient °^ -x x' Thus Qx x' *n °^^er words, A and (Д,..., /„) must generate the same ideal in the local ring of x on Y x An. Therefore X and X are locally, near x, the same subscheme of Y x An, and / must also be etale near x. О For the rest of this paragraph, we return to the geometric case: we fix an algebraically closed ground field k. Corollary 1. Let X and Y be schemes of finite type over k. Let f : X -> Y be a morphism. Then f is etale if and only if for all closed points x G X, the induced map: fx '• °f(x) —> Qx is an isomorphism. Moreover, if f is etale, then for all closed points x G X, there is a natural isomorphism of the tangent spaces to X and Y at x, f(x) taking the tangent cone into each other: Tangent cone r Tangent space to X at x to X at x Tangent cone Tangent space to Y at f(x) u toY at x. In particular, X is non-singular at x if and only ifY is поп-singular at f(x).
182 III. Local Properties of Schemes Proof. The first statement follows from the theorem since k(x) = k(f(x)) = k. The second statement follows from the first since gr(oa.) = gr 1о^хЛ — gr [Qf(x)) as graded rings, and the tangent space and cone set-up is deduced formally from the graded rings gr(o). D Corollary 2. Let X and Y be varieties ofC, and let f : X —» F be a morphism. Let X and У be the corresponding analytic spaces and let T :X У be the corresponding holomorphic map (cf. Ch. I, §10). Then f is etale if and only if T is locally (in the complex topology) an isomorphism. Proof. First assume / is etale. We may as well assume then that Y is affine - say (Spec С [Yi,..., У*] / (#1,..., g£)) - and that / is one of the standard maps: SpecC[Y1,...,Yk,Xu...,Xn]/(g1,...,g£,fl,...,fn) SpecC[Yu...,Yk]/(gu...,g£) , where det (dfi/dXj) is nowhere 0 on X. Then T has the form: = V(fu...Jn)nV(9l,...,g£)cV(fu...,fn)cCr+k T У= V(gu...,9i) Ck . For all x G X, since det (dfi/dXj) (x) ф 0, T' is a local isomorphism near x by the implicit function theorem. Since T is the restriction of T' to the inverse image of V (gi,..., g£), it is a local isomorphism too. Conversely, assume T is a local isomorphism. Let fix and fly denote the sheaves of holomorphic functions on X and Y. f induces, for all closed points x G X, all the following maps: Q V,Y = a V,Y = Q x,X
§6. Uniformizing parameters 183 where у = f{x). But F* is an isomorphism. Therefore /* is an isomorphism, and / is etale by Cor. 1. D In case X and Y are non-singular varieties, the concept of etale simplifies remarkably: Theorem 4. Let f : X -> Y be a morphism of non-singular n-dimensional varieties over k. If for all closed points у &Y, the fibre f~l(y) is a finite set of reduced points (or, equivalently, if for all closed points x G X, mx = f* (m^^)) • ox), then f is etale. Proof. Everything being local, we may assume that X and Y are affine. Choose a closed immersion г : X <-> AN. This defines a closed immersion (i,f):X<-+ANxY. Then here we have an n-dimensional non-singular variety embedded in an N+n- dimensional non-singular variety. By Theorem 4, §4, x is locally the subscheme of zeroes of exactly N functions on A^ x Y. Therefore, if R = Г (F, oY) for all closed points x € X, there is an open neighbourhood U С X of x and elements .,..., Xn] such that U С »SvecR[Xli...1XN]/(fli...,fN) open restriction restriction of/ у ====== Spec R . Let у = /(ж), and let Д,..., /n e к [Xi,..., Xn] denote the images of the /i's when their coefficients are evaluated at y. Then the fibre f~l(y) looks locally near x like Spec к [Xi,..., Xn] / (Д,..., fN). By assumption, this fibre has no tangent space at all at x. Therefore the differentials df{ must be independent at X 16 det (dfi/dXj) (x) = det (dJJdXj) (x) ф 0 . Therefore / is etale. D §6. Uniformizing parameters In the differential and analytic geometry, n-dimensional manifolds M are distin- distinguished from singular spaces by the existence of coverings {Ui] such that each Ui is isomorphic, in the appropriate sense, to an open ball in En or in Cn. In our case, we cannot hope in general to find any Zariski open sets U in a variety V
184 III. Local Properties of Schemes which are isomorphic to Zariski open sets in An because this would imply that the function fields are isomorphic: What we can do, if V is non-singular, is to find sets of n functions /1,..., /n defined in Zariski open sets U С V, such that the morphism F = /i x ... x fn : U —> An induced by the /i's is etale. Because the implicit function theorem is false, this is the closest we can come to local coordinates, and such /i's are known as "uniformizing parameters". Theorem 1. Let X be an n-dimensional non-singular variety over к (alge- (algebraically closed). Let /i,...,/n £ ГA1,ох) for some open set U С X. The following are equivalent: A) Let F = /i x ... x fn : U —)■ An be the induced map. Then f is etale. B) For all closed points x € U, let t\ = f\ — fi(x),..., tn = /„ — fn{x). Then t\,- • • ,tn generate mx/m^.. C) For all closed points x € U, the natural map taking T{ -Л t{ is an isomorphism. D) Mx/k\u - © QX -dfi. Proof. A) <$=> C) by Theorem 2, §5. Since for all closed points о = (oi,..., an) G An, {Xi — oi,... ,Xn — an} generate the maximal ideal ma^n , the elements U = F* (X{ - at) generate F* (ma)A«). So A) <^=> C) by Cor.' 1 of Prop. 1 and Theorem 3, §5. Finally, for all closed points x € U, {ti,..., tn} generate mx/ml <^=> {dfi,..., dfn} generate OX/k (x) <^=> {dfi,...,dfn} generate QX/k in some neighbourhood of x . n Therefore B) is equivalent to QX/k being a quotient of Ф ox • dfi. But since г=1 г=1 it is locally free of rank n, it can only be a quotient of Ф ox • dfi if it actually 1 г=1 n equals 0 ox • dfi. □ г=1 Definition 1. Elements /i,..., /n G Г (U,ox) with these properties are called uniformizing parameters in U.
§6. Uniformizing parameters 185 Since Qx/k is locally free of rank n and is spanned by the d/i's, any non- singular variety of X can be covered by open sets for which a set of uniformizing parameters exists. Also, n elements /1,..., /n £ k(X) are uniformizing parame- parameters in some non-empty open set U if and only if d/i,..., dfn generate Qk(X)/ki i.e., k(X) is separable and algebraic over к (Д,..., /n). Example H. Let X С A2 be the curve y2{l — x2) = 1 and assume char(fc) ф 2. Then Qx/k is generated by drc and dy with the relation: 2y A - a;2) dy - 2y2xdx = 0 . Multiplying by у/2, this becomes dy = xy3dx, so drc alone generates Qx/k- Therefore ж is a uniformizing parameter everywhere on X and the projection p\ onto the ж-axis is etale: y-axts On the other hand, dy only generates Q when x ф 0 (у is nowhere 0). So if {7 = X — {@,1), @, — 1)}, the projection p^ : X —> {y-axis} is etale and у is a uniformizing parameter only on U. All the standard machinery of Calculus can be carried over to our algebraic setting. For example, one can generalize the Jacobian criterion for a map to be a local isomorphism as follows: First of all, if X is a non-singular variety, /i,..., /„
186 III. Local Properties of Schemes are uniformizing parameters in U С X, and gi,...,gn are any functions in Г (U,ox), define the Jacobian as follows: » if # = oy er{u,ox) . Proposition 2. Let F : X -> Y be a morphism of non-singular n-dimensional varieties. Let x € X and у — F(x). Choose uniformizing parameters /1,..., /n and gi,... ,gn in some neighbourhoods of x and у respectively. Then f is etale in some neighbourhood of x if and only if d(F-gi,...,F-9n) *№,-,/») l)* Proof Suppose the /j's and ^'s are parameters in U and V respectively, and that F(U) С V. Then . . tl_ , тт Г for all closed points z € U , ] / is etale on U <(=> < „„, / ч , > { F* yrn^z)) generates mz J {for all cl F* (ffi- closed points z € U, if w = F{z) , ),..., ^* (fl'n - 9n(w)) generate m2 for all closed points z G U , n) generate Replacing U by smaller open neighbourhoods of ж, the Prop., as stated, follows. D This complex of ideas has a very important application to intersection theory on a non-singular variety. This is based on the observation: Proposition 3. Let X be a non-singular n-dimensional variety, and let /].>••• »/n be uniformizing parameters in an open set U С X. Then there is an open set U* С U x U containing the open piece A(U) of the diagonal such that the elements fc <g> 1 - 1 <g> fi G Г (U x U,oXxX), for 1 < i < n, generate the ideal Q С oXxx °f ^e diagonal in U*. Proof. By definition, the o^-module fix/k is Just the oXxX-module Q/Q2, re- regarded as a module over QXxX/Q and carried over to X. In the process, the differential df comes from / (g> 1 — 1 <8> /. Since we assumed that d/i,... ,dfn generate Ox/к in U, it follows that Д ® 1 - 1 ® Д,..., /n ® 1 - 1 ® /n gen- generate Q/Q2 in Л(С/"), i.e., for all x e Л(С/"), they generate the stalk {Q/Q2)x over QxXxX. But (Q/Q2)^ = Qx/Q2, so a fortiori they generate Qx/mx • Qx.
§7. Non-singularity and the UFD property 187 Therefore, by Nakayama's lemma, the elements fi <g> 1 — 1 <g> fi generate Q itself in some neighbourhood of each of these z's. D The weaker fact that A(X) is locally on X x X the scheme of zeroes of some n functions follows also from Th. 4, §4. Proposition 4. Let X be a non-singular variety of dimension n. Let Y and Z be irreducible closed subsets of X and let W be a component ofYDZ. Then dim W > dim Y + dim Z - n (or, codim W < codim Y + codim Z). Proof Y П Z is isomorphic to the intersection Y x Z П A(X) taken in X x X. Therefore A(W) is a component of Y x ZnA(X). But for all x G A(W), A(X) is defined in some neighbourhood U* of x by the vanishing of n functions g\,..., gn (Prop. 3). Therefore dimW = dim A(W) > dim(Y x Z)-n = dim Y + dim Z — n by Cor. 3ofTh. 2, Ch. I, §6. D dimW can, of course, be bigger than dimF + dimZ — n: e.g., i£Y = Z. When equality holds, Y and Z are said to intersect properly on X. Example I. The Proposition really requires the non-singularity of X. To see this, let X С A4 be the cone by xy = zw. X is a 3-dimensional variety. Let Y be the plane x = z = 0 and let Z be the plane у = w = 0. Then Y П Z consists of the origin alone, so §7. Non-singularity and the UFD property We want to prove in this section the important: Theorem 1. Let X be a non-singular variety over k. Then for all x 6 X, o^ x is a UFD. Varieties X such that all o^^'s are UFD's are sometimes called factorial. The geometric meaning of this property is this: Theorem 1'. Let X be a non-singular variety over k. For all irreducible closed subsets Z с X of codimension 1, there is an open affine covering {U{} of X and elements f{ G r(Ui,ox) such that (fi) = I(ZП Щ).
188 III. Local Properties of Schemes In fact, ox is a UFD if and only if all minimal prime ideals P С ох are principal; and these minimal prime ideals are the ideals I{Z), for irreducible closed Z С X of codimension 1, containing x. Notice that to prove the geometric property, it suffices to prove that ox is a UFD for all closed points x. Therefore Theorem 1 for all x follows if it is proven for closed points. One case of the Theorem is very elementary: when Z is non-singular too, it follows from Th. 4, §4. To prove Theorem 1 all 3 of the methods mentioned in the introduction can be used. There is a projective method, based on Seven's idea of projecting cones. Or using complete local rings, we can reduce the UFD property for ox (x closed) to the UFD property for ox which is isomorphic by Th. 10, §6, to к [[ti,..., tn]]. The most far-reaching method is the cohomological one, due to Auslander and Buchsbaum, by which it can be proven that all regular local rings are UFD's. We shall present the first two methods. For the last, cf. Zariski-Samuel, appendix, vol. 2. Proof No. 1, via completions. As we saw at the beginning of this section, it suffices to prove that ox is a UFD, for closed points x. We shall assume then that к [[t\,..., tn]] and hence a,, is known to be a UFD (cf. Zariski-Samuel, vol. 2, p. ). Lemma 1. Let R be a noetherian integral domain. Then R is a UFD if and only if, for all f,g G R, f фО, д ф 0, the ideal U)--(9)={heR\hge(f)} def. is principal. Proof. Assume R is a UFD. Let where e, rj are units, /i,..., /n are prime, and r,, Si > 0. Then hg G (/) if and only if h is divisible by г=1 so (/) : (g) = (e). Conversely, assaume this property of principal ideals. We need to show: / I gh ==► / = g'h' where g' \g,ti \h all f,g,h G R, non-zero . Assume that bf = gh. We know that (/) : (g) = (h1) for some h' G R. But / and h are in the ideal (/) : (g). Therefore / = g'h' and h = aih', for some g', ai G R. Also h' G (/) : (g) means h'g = a-if-, some a-i G R. Therefore
§7. Non-singularity and the UFD property 189 tig = a2f = a2g'ti 9 = a2g' ■ Thus Ы | h and g' \ g. D Lemma 2. If О is a noetherian local ring such that О is a UFD, then О is a UFD. Proof. We use the basic fact that б is flat over O, i.e., if M —>N —>P is an exact sequence of 0-modules, then is an exact sequence of (^-modules (cf. §9 below, for a summary of the basic facts about flatness). Let /, g € О be non-zero elements. Let A = f • О : g • O. (We write / • О instead of (/) to distinguish between the ideal generated by / in О and in 6). First, I claim A-6 = f-6:g-6 . But by definition A is the kernel of multiplication by g as a map from О to Off ■ O, i.e., 0 —► A —► О -^ O/f • О is exact. Therefore, 0 —► A ®o б —► б -^ б/f • б is exact. Therefore, A (g>о б is isomorphic to its image A • б in 6, and this is the kernel of multiplication by g as a map from б to Off • 6, i.e., it equals f6 : дб. Now since б is a UFD, / • б : g ■ б is generalized by some element Неб. We must deduce that A is generated by an element of O. But A ■ б principal implies that A • 6/A • m ■ б is one-dimensional over the residue field к of О (т = maximal ideal of O). Also, A/mA ^ A ■ 6/A -m-6 . If h G A generates A/m • A over k, then by Nakayama's lemma, h generates A. Therefore Л is a principal ideal. By Lemma 1, this shows that О is a UFD. D Proof No. 2, via projections. We base this proof on the useful fact: Theorem 2. LetX be a variety of dimension n and letx € X be a non-singular point. Then there exists an open neighbourhood U С X of x, an irreducible hypersurface H С Pn+1, and an isomorphism ofU with an open subset U' С Н: open ,open X D U ~ U' С Н .
190 III. Local Properties of Schemes Proof. First we can replace X by an affine neighbourhood of ж. Then embedding X in A^ we can replace this affine X by its closure in P^. Therefore assume that X is a closed subvariety of P^. Also we can assume that ж is a closed point. Now suppose L С P^ is a linear space of dimension N — n — 2 disjoint from X. Projecting from L gives a morphism p0 : fN - L ->■ Pn+1. We saw in §7, Ch. 2, that po restricts to a finite morphism from X to Pn+1. Let H = po(X): Я is a closed n-dimensional subvariety of Pn+1, i.e., an irreducible hypersurface. Let p denote the restriction of pq: The real point of the proof is to show that if L is chosen suitably, there will be an open neighbourhood U С Н of p(x) such that p is an isomorphism from p~l{U) to U. We use the following criterion for this: Lemma 1. Let f : X —)■ Y be a finite morphism of noetherian schemes. For some у € Y', assume that the fibre f~l{y) consists of one reduced point, with sheaf k(y) on it. Then there is an open neighbourhood U С Y of у such that ves(p) :p-l(U) —>U is a closed immersion. Proof of lemma. Let U = Spec (R) be an affine open neighbourhood of y. Then f~l(U) is affine and if 5 = Г (f~l{U),ox), S is an Д-algebra which is a finite .R-module. Let у = [P]. Let L = k(y), the quotient field of R(P). Then, since f~l(y) = Spec E ®д L), our assumption is that the natural homomorphism ф : R^ S induces an isomorphism L = R®RL —> S®RL . Let M = 5/0(Д). Then M is a finite Д-module such that M ®RL = @). By Nakayama's lemma, this means that f • M = @) for some / G R — P. Then M <8>r Rf = @), so the natural homomorphism Rf —► 5 ®я Rf = Sf is surjective. This means that if p is a closed immersion over £7/, QED for lemma. Therefore, the theorem will follow if we can construct L such that p has the 2 properties: 1) {x} = р-'Ш), 2) mx>x = P* (mp(x),H) ox . The next step is to translate these into synthetic-geometric conditions on L. 1st of all, for any closed point у efN - L, p^1 {po(y)) is the join of L and {y}, i.e., the smallest linear space containing L and y. Write this J(L, {y}). Therefore, A) is equivalent to
§7. Non-singularity and the UFD property 191 1*) J(L,{x})f]X = {x} . As for B), it is equivalent to: mx,x/m2x>x ■£- mp{x)tH/m2p{x)yH being surjective. Dualizing, this is the same as J-x,x —> 2р(х),н being injective (T"s being the tangent spaces). Since Тр(х),н С Tp(x^pn+i, this is the same as restriction of dpo lx,x —> ^(i),p«+i being injective. But the set of linear subspaces of PN has the following property: for all closed points у € P^, and all subvector spaces V С Ty>FN, there is a unique linear subspace M CfN containing у whose tangent space at у is V . For this reason, if Z С P^ is any variety and у G Z is any non-singular closed point of Z, one can talk of the linear subspace M С P^ tangent to Z at y: i.e., that linear space through у with the same tangent space at у as Z. This gives us a global formulation of the last statement. Let M be the subspace of P^ tangent to X at x. Then either a) M П L = 0, in which case po restricts to a linear isomorphism of M with a subspace of Pn+1, hence dpo : Тх>м —> Гр(ж),рп+1 is injective. or b) M П L ф 0, in which case po restricts to a projection of M onto a lower dimensional subspace of Pn+1, hence it maps the whole join J = J(MC\L, {x}) to one point. Then dpo is 0 on TXyj, and is not injective on ТХум- Therefore B) is equivalent to: 2*) if M С Р-^ is the linear subspace tangent to X at ж, then It remains to show that there is a linear space L satisfying 1*), 2*), and disjoint from X too. Let q : WN - {x} —► P^ be the projection with center x. Then q{X — {x}) is a constructable set of p-^-1 whose closure is an n-dimensional variety W. And q~1(W) is an (n + 1)- dimensional subvariety of Pn — {x}. (In fact, if Wo is W minus any hyperplane, q-l(Wo) = Wo x A1). Let S = XUMUq~1(W) . This is a closed subset of ¥N, all of whose components have dimension < n + 1. Now use:
192 III. Local Properties of Schemes Lemma 2. Let S С Р^ be a closed subset; whose components have dimension < k. Then there exists a linear subspace L С P^ of dimension N — к — 1 disjoint from S. (Proof left to the reader.) Now choose L С P^ disjoint from 5, of dimension TV — n — 2. B*) is obvious. If A*) were false, then there would be a closed point у in X П J(L, {x}). Then line £ joining x and у would be in 5. But the line I is in J(L, {ж}), so it also meets L which has codimension 1 in J(L, {x}). Therefore L meets 5, a contradiction. So A*) is true and we are finished. □ Now to prove the UFD property for the rings ox, we can assume that x is a non-singular closed point of a hypersurface Я С Pn+1. Let Z С Я be an irreducible closed subset of dimension n — 1, containing x. The first part of our proof is an extension of the argument just given, involving choosing a projection: To be precise, choose у subject to 3 conditions: a) yiH b) if t = line joining ж, у, l П Z = {x} c) if Но С Pn+1 is the hyperplane tangent to H at x, then у £ Щ. [It is obvious that we can realize conditions a) and c). As for b), if q : Pn+1 — {x} -> Pn is the projection, it amounts to у £ q~l[q(Z - {x})], and since dim {closure of q~1[q(Z — {x})]} < n , this is possible too.] Then by (a) po restricts to a finite morphism p : Я —> Pn . By (c), interpreted exactly as above, p* (mp^) generates mx, so by Th. 3, §5, p is etale at x. And by (b), p~l (p(x)) f)Z = {x}. Now p(Z) is an irreducible hy- hypersurface in Pn: let F = 0 be the irreducible homogeneous polynomial defining it. Upstairs in Pn+1, F = 0 defines the hypersurface {y} \JpQl{p(Z)), which is the cone over p(Z). Restricted to Я, F = 0 defines a subschema Z* С Я, whose underlying set is the locus F = 0, i.e., p~1(p(Z)), and whose ideal is generated everywhere by one of the functions F/Xf, d = degree (F). Then Z is a closed subscheme of Z*. Our claim is that: (*) Z = Z* near x . Since I{Z*)X is principal, then I(Z)x will have to be principal too, and the Theorem follows. Intuitively, Z* should equal Z because of the fact that p is etale at x and Z stays away from all points of p~l{p{x)) except x:
§7. Non-singulaxity and the UFD property 193 H H p not etale Z meets yep" (p(x)) We can give a rigorous proof using norms. Introduce coordinates: У = @,0,...,1) p : (Xo,..., Xn+i) —> (Xo,..., Xn), Xi = Xi/X x = A,0,...,0) . r л к w \ °Pen affine of U = Spec (Alan,..., xn+1] /{a)) = | where Xq ф Q . „ . , , f open affine of Pn Ao = Spec*[*i,..•>*«] = | whereXo^0 p is dual to the inclusion of rings: R = k[xi,...,xn+i]/(g) U S = k[xi,...,xn] . Since у £ H, the equation g of H has the form = a£+i + ai (xi,..., Жп)<+\ + • • • + ае (жь ... ,xn) In particular, R is a free ^-module, with basis l,zn+i,... ,#n+i- ^ow ^ / = F/X$ e She the affine equation of p(Z). Then in U, I(Z*) = /• R by definition. Our contention is that I {Z)x = f • o^. Point 1. Let р^СрСж)) = (ж,жг,... ,Хк)- Let P = I(Z) be the ideal of Z in Л. Then P is generated by elements а е R such that а(жг)^0,2<г</г. Proof. Let Mj = / (ж»). Since Xi £ Z,P <f. Mi. Let Po С Р be the ideal generated by elements а 6 P — |J Mj. Then every element of P — Po must be in some Mi, i=2 so P С Po U M2 U ... U Мк . But then P с Po (cf. Zariski-Samuel, vol. 1, p. ), hence P = Pq. □
194 III. Local Properties of Schemes Point 2. Let a € R satisfy a (x{) Ф 0, 2 < i < k, and a(x) = 0, and let Nm(a) be its norm in S. Then Nm(a) = a • a' , where a' 6 R and a'{x) ф 0. Proof. Let Ta : R -+ R denote the 5-linear map given by multiplication by a. Let Хк° +Ь\Хк~1 + .. .+bk be its characteristic polynomial. Then Nm(a) = (—l)fc6fc. By the Hamilton-Cay ley Theorem ak - b\ak~l + ... bk = 0, so (-1)*-1 • Nm(a) = a ■ (o* + M* + ... + 6fc_i) . Therefore a' = (—1) • [ah~1 + ... + bk-i], and since a{x) = 0, a'(x) = (—1) • bk-i(p(x)). Let M be the ideal of p(x) in 5 and take tensor products: R и 5 ++■ R/M-R U ++■ SIM . To compute fe^-i mod M, we may as well use the characteristic polynomial Ta of multiplication by a in R/M • R. But Spec (R/M • R) = p^ipix)), and this is {x} with reduced structure and some other stuff. Thus R/M-R = k(x)®A where Spec (A) = {x2, ■ ■ ■, Xk}- Thus the component of a in к (a) is 0, and in A is a unit. Thus Ta breaks up into a 0 map and an invertible map Ta\A. Thus Char.polyn. where ck-i ф 0. Thus bk-i(p(x)) Ф 0. □ Let's put these together. We know that Z С Z*, hence / • o^ С I{Z)X. To show the converse, it suffices to take an element a 6 / (Z)x such that a (xi) ф 0, 2 < i < к, and show that a e f • a,, (by Pt. 1). But Nm(a) is zero on p(Z), so Nm(a) ef-S. Thus by Pt. 2, in ож, a = ^^ e / • ox. QED and end of proof no. 2 What happens to the UFD property at singular points? In one case, there is a converse to our Theorem:
§7. Non-singularity and the UFD property 195 Proposition 2. Let X be a variety, and let x £ X. Then if there is one closed subvariety Z of codimension 1 through x which is non-singular at x and whose ideal I(Z)X in oxX is principal, X is non-singular at x. Proof. Let I(Z)X = (/). Then ftz/k(?) = f2x/k(x)/k(x) ■ df . If Z is non-singular at x, then dim ftz/k 0*0 = dim Z, so dim ftx/k(z) < dimf2z/k(x) + 1 = dimX so X is non-singular at ж. □ Example J. Assume that char (к) ф 2 and let X be the subscheme of An defined by: Since char ф 2, ^ Xf is a non-degenerate quadratic form, and therefore, if n > 3, it is irreducible and X is a variety. Since the origin О is the only singular point. Let О be the local ring of the origin on X. Then О is not a UFD if and only if there are subvarieties W С X of codimension 1 through the origin, whose ideal in О is not principal. Note first of all, that every linear form ^ aiXi is an irreducible element in О (in fact, if m = maximal ideal of O, then ^ aiXi ^ m2, so if ^ aiXi = f • g, then either / or g is ^ m and is a unit). Case 1: n = 3. Then X is an ordinary cone in A3, with apex at the origin. Let W be one of the lines on X, e.g., X\ = г'Хг, Х3 = 0. Then X is singular, but W is non-singular at O. Thus О is not a UFD by Prop. 2. Put another way, we get 2 factorizations into irreducible elements Xl + X\ = (Xx + iX2) (Xx - iX2) = -X3 ■ X3 I(W) not principal here
196 III. Local Properties of Schemes Case 2: n = 4. Again we have problems because if a = X\ + X|, then a = (X\ + iX2) (X\ — iX2) = — (X3 + 1X4) (X3 — 1X4) . Similarly, if W is the plane X\ = iX2, X3 = iX±, then this plane has a non- principal ideal at О on our 3-dimensional X, since it is non-singular at O, while X itself is singular at O. Case 3: n> 5. Now we're OK. We can use: Lemma (Nagata). Let R be a noetherian domain, and let x € R be an element such that x • R is prime. If R[l/x] is a UFD, then R is a UFD also. Proof. Let P С R be a minimal prime ideal. If P = x • R, it is principal. If P ф x-R, then P = P-R [i] f)R. Moreover PR [^] is principal, since R [i] is a UFD, so for some у e P,P-R[±] = y-R [±]. Write у = yQ-xr where yo ^ x • R. Then one checks easily that (y • R [^]) П R = yo • R, hence P = yo • R. Thus all minimal prime ideals are principal and R is a UFD. □ Apply this to R = k[X1,...,Xn]/J2Xi x = Xi + iX2 . ^2 Let x' = iX2 - Xi, and note that £i=1 X? = £Г=з Xf - x • x'. Thus R/(x) = к [x1, X3,..., X±] I Х]Цз Xf, and this is an integral domain when n > 5, so (x) is a prime ideal. But since the equation asserts that x' = ^IXf/x. Therefore by Nagata's lemma, R is a UFD and a fortiori О is a UFD. §8. Normal varieties and normalization In this entire section, we assume that к is an algebraically closed field. Definition 1. Let X be a variety over к. Then a point x € X is called a normal point of X, or X is said to be normal at z if the ring ox is integrally closed in its quotient field k(X). X is normal if it is normal at every point.
§8. Normal varieties and normalization 197 Note, for example, that a factorial variety X (i.e., ox always a UFD) is normal, since all UFD's are integrally closed in their quotient fields. In particular, non- singular varieties are normal. If a variety X is normal, its affine coordinate rings are integrally closed in k(X) too. In fact, an intersection of integrally closed rings is integrally closed and for any domain R, R = П R 12 lip all prime ideals p Recall the important: Structure theorem of integrally closed noetherian rings. If R is a noethe- noetherian integral domain, then R is integrally closed r= minimal prime ideals P (ii) for all minimal prime ideals p, Rp is a principal valuation ring This shows that every integrally closed noetherian ring is an intersection of valuation rings, and reduces the study of such rings to the theory of fields with a distinguished family of valuation rings in them. This result is the essential step in the classical ideal theory of Dedekind rings. (For a proof cf. Zariski-Samuel, vol. I, Ch. V, §6; or Bourbaki, Ch. 7). In our context, it shows us that if X is a normal variety, and Z is a subvariety of codimension 1, with generic point ж, then ox is a principal valuation ring. If ordz is the associated valuation, then this means that for all functions / € k(X), we can define ordz(/) , known as the order of vanishing of the function f along Z. It is negative if / ^ ox, in which case we say that / has a pole along Z; it is 0 if / € ox — mx so that / is defined on an open set U meeting Z but does not vanish on U Г) Z; and it is positive if / € mx, i.e., if / vanishes on Z. We also find: Proposition 1. Let X be a normal variety and let S С X be its singular locus (i.e., the set of singular points). Then codimx(S) > 2 . 12 In fact, assume that x,y € R, у £ 0, and x/y G Rp for all prime ideals P. Let A = (y) '■ (x) = {z € R \ x/y = w/z, some w G R}. Since x/y G Rp, x/y can be written w/z, with w G R, Z G R — P. Therefore A <jL P. Therefore A is not contained in any prime ideal, so A = R. This means that 1 G A, i.e., x/y G R.
198 III. Local Properties of Schemes Proof. If x is the generic point of a subvariety ZcXof codimension 1, then we must show that X is non-singular at x. But the ideal of Z at x is the maximal ideal of a,., which is principal since a,, is a principal valuation ring. And Z itself is non-singular at its generic point. So by Prop. 2, §7, X is non-singular at x. □ Corollary. Let X be a 1-dimensional variety. Then X is non-singular <=$■ X is factorial <=$■ X is normal. In this proposition, we have used only the property that ox is a principal valuation ring, if {x} has codimension 1. Conversely, if codimxE) > 2, then X is non-singular at such points x, hence normal at such ж, hence ox is a principal valuation ring. Let's give this property a name: Definition 2. A variety X is non-singular in codimension 1 if it is non-singular at all points x such that {x} has codimension 1. In some cases, such a variety is normal: Proposition 2. Let IcAn be an irreducible affine hypersurface. If X is non- singular in codimension 1, then X is normal. Proof. Let R = Г(Х,ох). We must show that R is integrally closed. By the structure theorem, we need only show that R= f( RP . minimal prime ideals P Let f,g e R and assume f/geRp for all minimal prime ideals P. Let Ы = Qi П ... П Qi be a decomposition of (g) into primary ideals. Let Pi = \Щ1. What we must show is that all these Pi are minimal. Because if Pi is minimal, then f /g 6 Rp{, hence / € g-Rpi, hence / e (Qi • Rp^DR. But since Qi is primary, Qi = (Qi i Therefore, if all Pi are minimal, /6 f] Qi, hence / 6 (g), hence f/g 6 R. i=l Now suppose R = к [Xi,..., Xn] /(h) and let тг : к [Xi,..., Xn] -> R be the canonical map. Let g' be a polynomial such that тг(д') = д- Then in к [X\,..., Xn], the ideal (h,g') decomposes as follows: ~l(Qi) primary, with л/т^ЧО?) = k~1 (Pi) . But now since h and g' are relatively prime (in fact, h is prime and g' £ (h) since g ф 0), Macauley's Unmixedness Theorem asserts that all the associated prime ideals of (h,g') have codimension 2. (Cf. Zariski-Samuel, vol. 2, p. 203.) Therefore Pi has codimension 1 in R, i.e., is minimal. □
§8. Normal varieties and normalization 199 On the other hand, it is easy to construct varieties of any dimension which are normal at all but a finite set of points as follows: Example K. Start with any normal affine variety Y = Spec E) - for example An. Let R С 5 be a finitely generated subring such that S/R, as a vector space over k, is finite dimensional. Then 5 will be integrally dependent on R and with the same quotient field. Therefore X = Spec (R) is definitely not normal. Let the inclusion of R in 5 define the finite morphism f:Y—>X . Then since 5 and R are so nearly equal, it turns out that there is a finite set of closed points x\,..., xn 6 X such that res f :Y - f~l ({жь ... ,xn}) —у X - {xi,...,xn} is an isomorphism. In particular, X is normal at all points except x\,...,xn. Proof. Consider S/R as .R-module. Since it is finite-dimensional over k, it is annihilated by an ideal А С R of finite codimension. Then V(A) is a finite set of closed points. And if / e A, then Rf = Sf, so the restriction of / means Y/ isomorphically onto Xf. □ To be more specific, take (A) 5 = k[x], Y = AX R = k[x2,x3} = k + x2-S . Then if we let у = x2, z = x3, so X = Spec (R) is the affine cubic curve with a cusp at the origin - cf. Ex. E in this Chapter and Ex. 0, Ch. I. X has one singular point, which is also its one non-normal point. (B) 5 = k[x,y], Y = A2 R = k[x,xy,y2,y3] . Then X = Spec (R) is an affine surface in A4, with one non-normal point - the origin. We can summarize the discussion by drawing a picture illustrating the hier- hierarchy of good and bad varieties:
200 III. Local Properties of Schemes Non-singular in codim 1 varieties Let's list examples showing how each successive class does include more va- varieties than the one before: 1. A factorial variety which is singular - take £ X? = 0 in A5 (Ex. J). i-\ 2. A normal variety which is not factorial - take the cone xy = z2 in A3. It is not factorial (cf. Ex. J), but it is normal by Prop. 2. 3. A variety non-singular in codimension 1, but not normal. - Cf. Ex. K. 4. A variety not non-singular in codimension 1 - take y2 = x3 in A2. Just as one can associate to every integral domain its integral closure in its quotient field, so to every variety, there is another variety called its "normaliza- "normalization" . This normalization is at the same time normal, yet birationally equivalent to the original variety. The existence of such a simple way of "making" every variety normal is one of the reasons why normal varieties are an important class. Life would be much simpler if there were an analogous way of canonically con- constructing a non-singular variety birationally equivalent to any given variety. Definition 3. Let X be a variety and let L be a finite algebraic extension of k(X). A normalization of X in L is a normal variety Y with function field k(Y) = L, plus a finite surjective morphism:
§8. Normal varieties and normalization 201 Y X such that the induced map тг* : k(X) -> k(Y) = L is the given inclusion of k(X) in L. If L = k(X), so that тг is birational, Y and тг are simply called a normalization of X. Theorem 3. For every variety X and every finite algebraic extension L D k(X), there is one and (essentially) only one normalization of X in L: i.e., if ttj : Yi -> X were 2 normalizations, there is a unique isomorphism t: TTi \ / ТГ2 such that tti = ттг • t and such that t* is the identity map from L to L. Proof. To show uniqueness of (У, тг), cover X by affines Щ = Spec (Ri). If 5i,i = Г (Trf: (Ui) ,QYl), 5i,2 = Г (тг^1 (С/г) ,Qy2), then both Si,i and Si,2 are integrally dependent on R, are subrings of L = к (Yi) = к (Уг)? and are integrally closed. Therefore 5г>1 = 5j,2 = integral closure of R in L. Let ti : ttj (Ui) -^ тт^ (С/г) be defined by the identity map from 5i,i to 5i,2- Since ti is the only isomorphism inducing the identity map L -> L of function fields, the £г'в patch together into the required t. To show existence, first assume X is affine. If X = Spec (R), let 5 be the integral closure of R in the field L. By a classical (but not easy) theorem, 5 is a finite .R-module (cf. Zariski-Samuel, vol. 1, p. 267). Let Y = Spec E) and let тг : Y -> X be dual to the inclusion of R in 5. Then тг is finite, Y is normal, and k(Y) = quotient field of 5 = L. To show existence in general, cover X by affines Ui. Let TTi : Vi -> [/г- be a normalization of Ui in L. By the uniqueness part of the result, we can patch the Vi's together into a Y and the тгг'в to а тг so that (У, тг) is a normalization of X in L. □ Corollary. Lei X be a variety. Then {x £ X \ X is normal at x} is open. Proof. Let тг : Y -> X be a normalization of X. Then тг* (оу) is a coherent ox-module. The map /* from sections of ox to sections of oY defines a homo- morphism f* :ox —► тг* г* (o Let /C = cokernel (/*), and let 5 = Support (/C) = {x e X \ Kx ф @)}. Then 5 is a closed subset of X and
202 III. Local Properties of Schemes Г /* ] x £ S <=> < oxX —^» 7г* {оу)х is surjective > . But 7г* (оу)х is just the integral closure of ox x in k(X) so /* is surjective if and only if ox is integral closed. □ Example L. Let's look back at various non-normal varieties we have seen and work out their normalizations. In Ex. K, just above, we started with a normal Y = Spec E) and constructed a morphism / : Y —> X = Spec (R) via a ring R of finite codimension. (Y, f) is just the normalization of X. In particular, we have repeatedly looked at the special case A1 JL>v(X3-Y2)=C where f(t) = (t2,t3) (cf. Ex. 0, Ch. I). Also in Ch. I, Ex. P, we looked at A1 -A- V (Y2 - X2(X + 1)) = D . Again (A1, /) is the normalization of D. This is a case of Ex. K, corresponding to the ring k[t] and the subring к [t2 — l,t {t2 — l)] of finite codimension. We looked at these same 2 plane curves in Ex. E in §3 of this chapter, and if you look back at Ex. E bis you will see that we actually constructed the morphism / by blowing them up, i.e., canonical map canonical map B0(C) Bo(D) Another example. Take X to be the normal surface xy = z2, but let L = к (yfx,z). Take coordinates u,v in A2 and define / : A2 —> X by f(u,v) = (u2,v2,uv). Via /*, Г(Х,ох) is identified with the subring к [u2,v2,uv] in A;[u, v]; since и and v depend integrally on this subring / is finite. A2 is non-singular, hence normal. And, via /*, и = y/x, and v = z/y/x so к (A2) = к (\/ж, z). Thus (A2, /) is the normalization of X in L. Incidentally, this gives a direct proof that X is normal, since
§8. Normal varieties and normalization 203 k[x,y,z\/ (xy - z2) = k[u,v}Dk(u2,u/v) subring of elements / € k[u, v] invariant under f(u,v) —у f(-u,-v) and the intersection of an integrally closed ring with a subfield of its quotient field is always integrally closed. Theorem 4. If X is a projective variety, then its normalization in any finite algebraic extension L D k(X) is a projective variety. Before proving this theorem, we must introduce the Segre transformations of a projective embedding. Start with Pn and let d be a positive integer. Let Xq, ... ,Xn be homogeneous coordinates in Pn. Then take all the monomials in Xo,...,Xn of degree d and order them: ith monomial = TT ХТУ , \\ тц = d j-0 j-0 for 0 < i < N. Here n\d\ is fact. Let Yq, ..., Yn be homogeneous coordinates in ¥N. Define a morphism: фа ■ pn —у p^v so that if ж is a closed point with homogeneous coordinates (oo,..., on), then Ф{х) is a closed point with homogeneous coordinates In fact, suppose we assume that the 1st n + 1 monomials are the powers Xg,Xf,...,X*. Let Щ С Pn and V- С Р^ be the open sets Xt ф 0 and Yt ф 0 respectively. Then we define Ф by requiring that Ф (Ui) С Vi, for 0 < i < n, and that Ф*г :r(Vi,OpN) -> Г (Ui,Qpn) be: o j=o - - In fact, Ф~1 (Vi) then turns out to be exactly Ui, and Ф* is surjective since that Yk/Yi for which the &th monomial is XjXf~l goes over to Xj/Xi. Therefore Ф is even a closed immersion. Now if X С Pn is any projective variety and d is a positive integer, the image Фа{Х) С Р^ is called its $h Segre transformation. This operation is very useful for simplifying a projective embedding. If the original X has a homogeneous coordinate ring:
204 III. Local Properties of Schemes R = k[X0,...,Xn]/P and if Rk denotes the kth graded piece of R, then the new embedded variety <&d{X) has the homogeneous coordinate ring R(d) = ф Rdk k=0 = к We are now ready to begin: n IP-. Proof of Th. 4-' Let R = к [Xo,..., Xn] /P be the homogeneous coordinate ring of a projective variety X с Pn. Let a be a non-zero element of R\, and let £ С R be the multiplicative system of non-zero homogeneous elements. Then the localization Re is still graded and k(X) is isomorphic to the subring (ReH of elements of degree 0. Moreover, Re itself is just a simple polynomial ring /j(X)[q;]. Let 5 be the integral closure of R in the ring L[a}: S С L[a] U U R С к(Х)[а] . Then 5 is a graded subring of L[a]. Moreover, since R is finitely generated over k, so is 5. We would like to construct a projective variety with the ring 5, but there is one obstacle: 5 may not be generated by the vector space Si of elements of degree 1! Here is where the Segre transformation comes in. I claim that if we replace X by a suitable Segre transformation Фа{Х), the resulting 5 will be generated by S\. First we need: Lemma. Let S be any finitely generated graded k-algebra, where к = So. Then there exists a positive integer d such that oo S(d) = ф Skd is generated by the elements S(d)i = Sd- Proof of lemma. Let x\,...,Xk be homogeneous generators of 5 and let di = degree (xi). Let d' be the least common multiple of the d/'s, let d' = c^e*, and к let d = к • d'. Now Sn is just the linear span of the monomials П х\г such that = n. Note that if fc < e* for all i, then n < d. Therefore, if n > d, the monomial decomposes:
§8. Normal varieties and normalization 205 (where j[ = fi if г ф г'о, Д, = Д, - бг0). Pursuing this inductively, it follows that if we start with a monomial of degree n = t • d!, we can write it as the product of terms x^ each of degree d', and a "remainder" monomial of degree d. In particular, if the monomial had degree n = £ • d, we could, by grouping these х\{ 's, write it as a product of monomials each of degree d. Therefore Su is spanned by the Ith symmetric power of 5^. □ Now suppose that S{d) is generated by S{d)\. We replace X by the Segre transforms Фа{Х). This means that its new homogeneous coordinate ring is just R{d). Notice that Rz(d) is the subring k(X)[ad] of k(X)[a]. Therefore R(d) = R П k(X)[ad]. I claim that S(d) is the integral closure of R(d) in L[ad). Study the diagram: L[<x] S(d) С R L[od] k(X)[o] R(d) С k(X)[od] If ж € L[ad] is integral over R(d), then as an element of L[a], it is integral over R. Therefore x € Sf) L[ad], therefore x € S(d). Moreover, if ж € S(d), then x is integral over R and writing out its equation of dependence, it follows that the coefficients have to lie in R(d), so it is integral over R(d). The result of all of this is that we can assume 5 is generated by Si. Therefore, choosing a basis of S\, we can write S = k[Y0,...,Ym]/P* for some homogeneous prime ideal P*, with the Yi's all of degree 1. Let Y с Pm be the variety V(P*). We now have the following rings: k[Y0,...,Ym} U k[X0,... ,Xn] S U R Y is normal: In fact, 5 is integrally closed in L[a], hence in its quotient field. Therefore 5^ is also integrally closed. But if Ui С Y is the open set Yi ф 0, then r(Ui,QY) equals
206 III. Local Properties of Schemes \fesm} i.e., the subring of Sy{ of elements of degree 0. Therefore it is the intersection Sy{ П L taken inside L[a], and this intersection is integrally closed in L. Construction of 7Г :Y -> X: Each Xi equals a linear form in the l^'s. Let M С Pm be defined byXo = ... = Xn = 0. Then (a0,..., am) ►—> (X0(a),..., Xn(a)) defines a projection p : Pm - M —> Pn . Note that Y П M = 0: in fact, suppose (ao,..., am) 6 Pm is a closed point on Y Г) M. Then Xo(a) = ... = Xn(a) = 0 and f(a) = 0 for all homogeneous / € P*. Since the image of Yi in 5 is integrally dependent on R, it follows that for some polynomials aij € к [Xo,..., Xn], Y/'+ail(Xo,...,Xn)Y^+... + aiN(Xo,...,XN) = 0(mod Р*). Substituting oo,..., om, it follows that Yi(a) = сц = 0 too, so (ao,..., am) is not a real point. Now restrict p to Y and it must define a finite morphism from Y to Pn. The image will be the projective variety with homogeneous ideal Р = Р*Пк[Х0,...,Хп] i.e., X. It is clear that the function field of Y is exactly L, so Y is the normal- normalization of X in L. □ Theorem 5. Let К be a finitely generated field extension of к such that tr.d.K/k = 1. Then there is a unique complete, non-singular "model" С of K, i.e., a variety С such that k(C) is isomorphic/k to K. Moreover, С is a projective variety. Proof. Existence: Let x € К be a separating transcendence base. Then К is a finite separable extension of k(x), so К = k{x,y), for some у G K. Therefore, К contains a subring k[X, Y]/(f) where / is an irreducible polynomial. Homogeniz- Homogenizing /, we obtain a plane curve Co С P2 defined by / = 0, such that k(Co) — K. Let С be the normalization of Co. Then С is normal and 1-dimensional, hence non-singular by the Cor. to Prop. 1. Uniqueness: I claim that any complete non-singular model, regarded as a topo- logical space plus a sheaf of rings, is nothing but the following: i) take all principal valuation rings R С К such that к С R: call this set 71. ii) let С be the union of 7Z and a generic point. iii) topologize С by taking as open sets aHhe empty set, and h^C minus a finite subset of R.
§9. Zariski's Main Theorem 207 iv) put a sheaf Qq on С via r(C-{Ru...,Rn},oc)= f| R R€R-{Ri Rn} for all finite subsets {R\,..., Rn} С 71. If С is any complete non-singular model, then for all closed points x € C", ox is a principal valuation ring in K, so we can map С into С (taking the generic point of С to the generic point of С to the generic point of C). It is 1 — 1 since the local rings of a variety are all distinct. It is onto since each ring R 6 7Z must certainly dominate some ox by the valuative criterion for completeness (Cor. of Prop. 3, §7, Ch. II.) And if one valuation ring dominates another, and they have the same quotient field, they are equal. It is clear that this is a homeomorphism and that Qjq^Ojq, can be identified. Therefore (C,oc) = (C'^Qq,). □ This is completely false if tr.d.K/k > 1, essentially because a local ring ox at a closed point of a variety X can be a valuation ring only when dim X = 1. Example M. Let x e Pn be a closed point, and let В = Bx (Pn). Let p : В -> Pn be the canonical map. Then (a)both В and Pn are non-singular (in fact, В and Pn are both covered by open pieces isomorphic to An), ^p is proper and Pn is complete, so В is complete too, and ^p is birational. In other words, we have 2 non-singular and complete varieties, and a birational map between them, which is not an isomorphism if n > 2. Among higher dimensional varieties, however, we can put together the ideas of Th. 2, §7, and of this section in: Proposition 6. Let X be a normal protective n-dimensional variety. Then X is the normalization of a hypersurface H С Pn+1. Proof. Assume X is a closed subvariety of P^. In the proof of Theorem 2 we constructed a projection Po such that X П L = 0, and such that under po an open set in X and an open set in H = po(X) are isomorphic. Let p : X -> H denote the restriction of po- Then p is finite since it is a projection, and p is birational since it is an isomorphism in an open set. Since X is normal, (X,p) is the normalization of H. □ §9. Zariski's Main Theorem In §8 our discussion of normality has been largely a matter of carrying over into geometry the algebraic ideals and algebraic constructions involving integral closure. However, normality turns out to have a hidden geometric content as well, which is not so easy to discover. This involves the concept of the "branches" of a variety at a point. To understand this idea, let's first look at the case к = С and try to describe our naive topological notion of a branch: i>N _ г . pn+1
208 III. Local Properties of Schemes Let X be a variety over С Let x € X be a closed point. Let 5cXbe the singular locus. Let U С X be some sufficiently regular and sufficiently small neigh- neighbourhood of x in the complex (or strong) topology (consisting of closed points only). Look at U — U П 5, and decompose it into components in the complex topology: u-uns = v1u...uvn . Then the closures Vi,...,Vn are the branches of X at x. Example N. Let X be the plane curve 0 = x2 - y2 + x3 in C2, and let U be the neighbourhood of the origin (= the one singular point) {(x,y) | N <e, \y\ <e} for some small e. Then unx-{(o,o)} always breaks up into 2 pieces. In fact, in U П X: \x-y\-\x + y\ = \x\3 < e-\x\2 . Therefore, either |x — y| < л/ё • И or 1Ж~У| < лА'М- Obviously both cannot occur (if 6 < 1). Each piece separately turns out to be connected so we get 2 branches at @,0), described by \x — y\ <C |ж| and \x + y\ <C |z|. On the other hand, take X to be the cone xy = z2. Let U = {(x,t,z) \x\ < 6, |y| < 6, \z\ < б}. Then define a continuous surjective map: —> XHU- {@,0,0)} (s,t) i—> (s2,t2,st) . Therefore X f)U — {@,0,0)} is the continuous image of a connected set, hence connected. Thus X has only one branch. Is there any purely algebraic way to get ahold of these branches? One way to detect the existence of several branches at a point x € X is to look for covering spaces of the following general type: / : X' —> X such that 1) f~1(y) is a finite set, for all у € X 2) / birational.
§9. Zariski's Main Theorem 209 Then if / x(x) = {x\,.. .,xn}, and U С X is a small complex neighbourhood of x, we know that f~l{U) will have to break up: f-1(U) = U1U...UUn, where Ui is a small neighbourhood of Xi. Then in fact / (Ui) will be a union of some subset of the branches through x. In other words, in the set of branches of J at z, we will get n disjoint subsets, each one coming from the branches of X' at one of the X{. There is one canonical way to do this: let (X',f) be a normalization of X. In that case, if 5 С X is the singular locus and 5' = /-1E), then / defines an isomorphism of X' — S' and X — S. Therefore U — U П 5 is homeomorphic to the disjoint union of the sets Ui — Uif] S' and we get a canonical decomposition of the set of branches at x depending on the various points X{ € X' over x. The essential content of Zariski's Main Theorem, in all its forms, is once a variety is normal, there is only one branch at each of its points. Therefore, for any variety X and closed point x € X, the set of points x\ in its normalization X' over x is in 1 — 1 correspondence with the set of branches of X at x. Now let к denote an algebraically closed field. I. Original form: Let X be a normal variety over к and let / : X' -> X be a birational morphism with finite fibres from a variety X' to X. Then / is an isomorphism of X' with an open subset U С X. II. Topological form: Let X be a normal variety over C, and let x € X be a closed point. Let 5 be the singular locus of X. Then there is a basis of complex neighbourhoods of x such that is connected, for all i. III. Power series form: Let X be a normal variety over к and let x € X be a normal point (not necessarily closed). Then the completion ox is an integral domain, integrally closed in its quotient field. IV. Grothendieck's form: Let / : X' -> X be a morphism of varieties over к with finite fibres. Then there exists a diagram: where У is a variety, X' is an open set in Y and g is a finite morphism. V. Connectedness Theorem: Let X be a variety over к normal at a closed point x. Let / : X' -> X be a birational proper morphism. Then /-1(ж) is a connected set (in the Zariski topology).
210 III. Local Properties of Schemes Let's consider first the original form (I). As a simple special case, it contains the important assertion that a bijective birational morphism between normal varieties is an isomorphism (compare Ch. I, §4, Ex. N,O, and P). Form (I) can be proven by a very direct attack involving factoring / through morphisms Spec R[X]/A —> Spec R with only one new variable at a time. This is Zariski's original proof: cf. Lang, Introduction to Algebraic Geometry, p. 124. But these direct methods have never been generalized to other problems, so we will not present this proof. In the case where X is a factorial variety, however, a direct proof is so elementary that we do want to give this: Proposition 1. LetX be an n-dimensional factorial variety and let f : X' -> X be a birational morphism. Then there is a non-empty open set U С X such that 1) res(/) : f~l{U) -> U is an isomorphism, 2) if Ei,...,Ek are the components of X' — f~1(U), then dim Ei = n — 1, for all i, while dim / (Ei) <n — 2. In particular, if x is a closed point in X — U, all components of f~1(x) have dimension at least 1. Proof. Let U be the set of points x € X such that x has a neighbourhood V С X for which res(/) : r1 is an isomorphism. U is clearly open and A) holds. Now let x' € X' be any closed point and let x = f(x'). (A) If Ox x = o^, x,, then x eU. Let V С X' and V С X be affine neigh- neighbourhoods of ж' and x such that /(V) С V. Let R = Г (V, ox), R' = Г (V, ox,), x = [P], and x' = [P1]. Then, via /*, R is a subring of R', while the localizations Rp and R'P, are equal. Therefore, we can find a common denominator g € R for the generators of R' as .R-algebra such that g £ P. In particular, Rg = R'g. Replacing V and V by Vg and Vg respectively, / becomes an isomorphism from V to V. Suppose V were not /-100- Then let z e f~l{V) - V\ let x = f{z), and let у € X' be the point in V such that f(y) = x. Then, as subrings of the common function field of X and X', we have: °z,X' > °x,X = °y,X' • If z фу, this contradicts the "Local Criterion" of Hausdorffness (cf. Ch. II, §6). Thus f~l{V) = V and xeU. (B) Suppose ox x <£. Q_x,x,. Let s € o^',*' ~~ Q-x,x- Write s = t\/t2, where Zi,t2 € Qxx are relatively prime (using the assumption: oxX a UFD). Let ti extend to sections of ox in a neighbourhood V of x, and let s extend to a section
§9. Zariski's Main Theorem 211 of ox, in a neighbourhood V of x', where v' С f~1(V). Then the subsets V\ and V2 of V defined by t\ = 0 and £2 = 0 are pure n — 1-dimensional; and since £1 and £2 are relatively prime, they have no common component through x. Replacing V by a smaller neighbourhood of ж, we can assume that V\ and V2 have no common components at all. Therefore V\ f\V2 is pure (n—2)-dimensional. Upstairs in V, let V2 be the locus £2 = 0. Since in V, t\ = s • £2, £1 also vanishes on V2. Therefore / (V2') С V\ П V2. But V2' is pure (n - l)-dimensional, so every component of V2 dominates under / a lower dimensional variety of X. Therefore all components f~1(f(x')) through x' have dimension > 1, and This shows that every point of X' — f~l (U) is on an (n — l)-dimensional compo- component of X' — f~1(U)', hence X' — /~г(и) is pure (n — l)-dimensional. And that all its components are mapped onto subsets of X of codimension > 2. □ The set-up of this Proposition should be familiar from the blowing-up mor- phisms of §3: f:Be(X)—>X. In this case, let U = X — {x}. Then res(/) : f~1(U) -> U is an isomorphism, and we proved in §3 that all components E of BX(X) — f~l{U) had dimension n - 1. Since {x) = f(E), dim f(X) < n - 2 also, (if n > 2). The Proposition shows that these features are fairly typical of birational morphisms. Of course, if / has finite fibres, then no components Ei can occur, so X' = f~1(U) and Form (I) of Zariski's Main Theorem follows. If X is not factorial, the Proposition is false to the extent that it is stronger than the Main Theorem: Example 0. Let X = V ((X1X2 - X3X4)) С A4. As we saw in §8, the origin is a normal, but not factorial point on X. Define /:A3 —> X by (УьУ2,Уз) '—> (УъУ2Уз,У2,У1 Уз) • (What we are doing here is taking the closure of the graph of the rational function y3 = XA/Xi = X2/X3.) If U is the locus (XUX3) ф @,0), then Z^) is the open set B/1,2/2) Ф @,0) and is isomorphic to U. But if P is a closed point of A3 such that 2/1=2/2 = 0, then f(P) = @,0,0,0). Thus / shrinks a line to a point, and is 1 — 1 elsewhere. But a line has codimension 2 on X, so this is a case where the conclusion of Prop. 1 is false. The most powerful approach to the proof of (I.) is through the apparently innocuous form (III.). This method is also due to Zariski, who proved both (III.) and (III.) =» (I.). (III.) itself is actually quite a job to prove; however, it is a matter of pure local algebra, so we refer the reader to Zariski-Samuel, vol. 2, pp. 313-320. Here is how we get back to geometry:
212 III. Local Properties of Schemes Proof of (III.) =>■ (I.): Let / : X' -»• X be a birational morphism with finite fibres, and assume X is normal. First, by argument A in the proof of Prop. 1, it will be enough to show that for all x' £ X', if x = f(x), then fx> '• Q-x,x —* йх',х> is an isomorphism. If this is proven, we can proceed exactly as in Prop. 1 to construct a neighbourhood U of x such that f~1(U) and U are isomorphic, and (I.) follows. For simplicity we can even assume x and x' are closed, although this is not essential. The proof uses 2 lemmas: Lemma 1. Let О be a complete local ring, with maximal ideal m. Let M be an O-module such that 1) f]mn-M = @) n 2) M/m • M is finite-dimensional over O/m. Then M is a finitely generated O-module. (Cf. Zariski-Samuel, vol. 2, p. 259.) Lemma 2. Let О be a noetherian local ring with completion O. Then dim((9) = dim(d). (For the definition of dim((9), cf. Ch. I, §7; for the lemma, cf. Zariski-Samuel, vol. 2, p. 288.) Now look at the local homomorphism /*,. It induces a homomorphism ф of the completed rings as follows: x,X **" °x>, П П У-х,Х ■— У-х',Х> ■ Ф The basic fact we need is that o^. x is an integrally closed domain, by (III.). First of all, ox, x, is a finite module over bxtx- This follows from Lemma 1 with О = ox x, M = ox, x,. In fact, С f)(mx>)n-Qx,tX, n = @) by Krull's Theorem (Zariski-Samuel, vol. 1, p. 216). And
§9. Zariski's Main Theorem 213 M/mx -M = ox,x, /ф (mx) • ox, x, But Qx,x,/f* (mx) -ох,Х1 is the local ring of x' in the fibre f~1(x). Since x' is an isolated point of f-1(x), this local ring is finite dimensional. Thus all hypotheses of Lemma 1 are satisfied. Secondly, ф is injective. In fact, let A = о,. х/кет(ф). If ф is not injective, then dim A < dim ox x: in fact, for any chain of prime ideals Pq <£. P\ <£. ... <£. Pt С А, we get a longer chain (using the fact that o^ x is a domain). Moreover, since ox,x, is integrally de- dependent on A, dim (o^x') < dim A. Then, by Lemma 2: = dimoxX = dim a,. x > dim A > dhnox,x, and this is a contradiction. We can now consider all our local rings as subrings of ox,x,. Taking inter- intersections in the total quotient ring of this big ring, we can deduce: ox,x, С (quotient field of ЬхХ) П^д/ . Since a,. x is integrally closed in its quotient field, and a^x' is integrally de- dependent on Од. >x, this means ox,x, С oxX. But then °x>,x> С (quotient field of ox>x) ПохХ = oxX . U As for form (II.), using results in Gunning and Rossi, that comes out of (III.) too: Proof of (III) => (II.): The basic fact is С 16, Ch. Ill, p. 115 in Gunning-Rossi: they show that if X is the analytic space corresponding to the variety X, and Q is the sheaf of holomorphic functions on X, then (II.) is correct at a point x £ X if Qx is an integral domain. But all the rings: ox С Qx С ox = Qx are included in each other. If X is normal, (III.) tells us that ox is a domain; so Qx must be a domain too. □ Finally, forms (IV.) and (V.) of the Main Theorem are even deeper. (V.) is a much more global statement since the properness of / is involved. There is a cohomological proof, due to Grothendieck (cf. EGA, Ch. Ill, §4.3) and a proof using a combination of projective techniques and completions, due to Zariski. As
214 III. Local Properties of Schemes for (IV.), the interesting point here is that it asserts the existence of plenty of finite morphisms, rather than asserting that normal varieties have some strong property. (The connection loosely speaking is that the more finite morphisms there are, the stronger restriction it is to be normal.) In fact, to see that (IV.) =^ (I.) just apply (IV.) as it stands to a morphism / : X' -> X in (I.). We may as well assume that the Y in (IV.) is a variety and that X' is dense in Y. Then g is birational and finite, hence its rings are integrally dependent on those of X. If X is normal, g must be an isomorphism, so X' is just an open subset of X as required. For a good proof of (IV.), cf. EGA, Ch. 4, (a proof relying very heavily on (III.)). §10. Flat and smooth morphisms The concept of flatness is a riddle that comes out of algebra, but which techni- technically is the answer to many prayers. Let's recall the basic algebra first: Let R be a ring, M an Я-module. Then M is flat over R if, for all elements mi,..., mn G M and ai,..., an G R such that n 52 a>i • ^г — 05 there are equations г=1 for some m'j G M, bij G R such that n 52 bijui = 0, all j . г=1 It is clear that a free Я-module has this property (i.e., take the m^'s to be part of a basis of M); and that any direct limit of flat Я-modules again has this property, hence is flat. Conversely, it was recently proven that any flat Я-module is a direct limit of free Я-modules. Intuitively, one should consider flatness to be an abstraction embodying exactly that part of the concept of freeness which can be expressed in terms of linear equations. Or, one may say that flatness means that the linear structure of M preserves accurately that of R itself. On the other hand, to see an example of flat but not free modules, suppose R is a domain and let M be its quotient field. Since M= M -R , w a aeR M is a direct limit of free Я-modules, hence is flat. But M is certainly not a free Я-module itself. The defining property can be souped up without much difficulty to give the following:
§10. Flat and smooth morphisms 215 Given an Я-module M and an exact sequence of -R-modules 0 —► Ni —► N2 —► N3 —► 0 the sequence 0 —> Ni®RM —)■ N2®rM —> N3®RM —> 0 is exact, if either M or N3 is flat over R. This is the form in which one usually uses the definition. In the special case where M is flat over R, N2 = R and N\ is an ideal I C R, this implies that the natural map / <8>я M -> I • M is injective. Conversely, this special case implies that M is flat over R. Here are some of the basic facts: A. If M is a flat Я-module and S is an Я-algebra, then M ®rS is a flat S- module. B. If M is an .R-module, then M is flat over R if and only if for all prime ideals P С R, the localization Mp is flat over RP. B'. (Stronger) If M is an 5-module and 5 is an Я-algebra, via the homomor- phism i : R -»• 5, then M is flat over R if and only if for all prime ideals P С 5, Mp is flat over Ri-цру C. If .R is a local ring and M is an Я-module, and either R is artinian or M is a finitely presented Я-module (i.e., of the type Rn/finitely generated submodule), then M flat over R implies M is a free Я-module. D. If R is a domain, and M is a flat Я-module, then M is torsion-free. (More generally, every non-0-divisor / £ R annihilates no non-O-elements in a flat .R-module.) Conversely, if R is a valuation ring, then torsion-free Я-modules are flat. E. Let M be a Б-module, and В an algebra over A. Let / G В have the property that for all maximal ideals m с A, multiplication by / is injective in M/m • M. Then M flat over A implies M/f • M flat over A. F. If О is a noetherian local ring, then its completion О is a flat (9-module. For a full discussion, cf. Bourbaki, Alg. Comm., Ch. I. Putting A. and D. to- together reveals another important point about flatness: if M is flat over a domain R, then not only is M torsion-free, but for all homomorphisms R -> S, where 5 is a domain, M <8># S is still torsion-free as 5-module, i.e., M is "universally torsion-free". Definition 1. Let / : X -»• Y be a morphism of schemes and let T be a quasi-coherent o^-module. Then T is flat over o_Y for all x G X, Tx is a flat Oy^-module. The morphism / itself is flat if ox is flat over oy. Notice that whether or not a morphism / : X —)• Y is flat or not involves only the Oy-module structure of ox and not the ring structure of ox. Properties A. - D. above can be translated into geometric terms:
216 III. Local Properties of Schemes A*. Let X' Г Y' P X Y be a fibre product, and let T be a quasi-coherent ox-modu\e. If T is flat over Oy, the quasi-coherent ox, -module p* {T) is flat over oY,. [Here, p*{F) is an ox, -module defined locally on an affine Spec (R1) С X' \ Spec (R) С X by M ®R R', if M = T(Spec (R In particular, if / is flat, then /' is flat. B*. Let / : Spec (R) -> Spec E) be a morphism, and let T = M be a quasi- coherent module on Spec (R). Then T is flat over Ogpec E) if and only if M is flat over S. C*. Let X be a noetherian scheme, and let T be a coherent ox-modu\e. Then T is flat over ox if and only if T is a locally free ox-module. D*. Let / : X -> Y be a morphism. Assume Y is irreducible and reduced with generic point y. Let T be a quasi-coherent o^-module flat over oY- Then for all x G X, Tx is a torsion-free 0/^) y-module. If X is noetherian and T is a coherent o^-module, this means that all associated points of T He over y. Conversely, this property implies that T is flat over oy if all stalks oyY are valuation rings (e.g., Y a non-singular curve, or Spec (Z)). The best intuitive description of when a morphism / : X —)• Y of finite type is flat is that this is the case when the fibres of /, looked at locally near any point x G X, form a continuously varying family of schemes. Suppose, for example, X and Y are varieties. Then, by D, if / is flat, X dominates Y. We saw in Ch. I, §8, that there is an open set U С Y such that all components of all fibres over points of U are n-dimensional, where n = dimX — dimF. On the other hand, fibres over other points of Y may have dimension > n. This increase in dimension is clearly a big discontinuity of fibre type, and it can be shown that if / is flat, all components of all fibres of / have dimension n. In the other direction, for any morphism / : X -> Y of varieties one would expect that almost all the fibres do form a continuous family, and indeed it can be shown that there always is some non-empty open U С Y such that res(/) : f~1(U) -> U is flat.
§10. Flat and smooth morphisms 217 Another way of testing this intuitive description of flatness is via the fact that a continuous function has at most one extension from an open dense set to the whole space. The analogous fact about flatness is: Proposition 1. Suppose g : Z -»• Spec (R) is a morphism and X\,X2 are 2 closed subschemes of Z. Assume A) that for some поп-0-divisor f G R, X\ and X2 are equal over Spec (R)*, and B) that the restrictions of g to X\ and X2 are fiat morphisms from X\ and X2 to Spec (R). Then X\ = X2 • Proof. Let U = Spec E) be an open affine in Z, and let Xi = Spec (A/Ai). By assumption A), A\ ■ Sf = A2 ■ Sf, and by assumption B), S/Ai and S/A2 are flat over R. I claim Ai = S П (Ai ■ Sf), hence A1 = Sn(A-l-Sf)=Sn(A2-Sf) = A2 . But since / is not a 0-divisor in R, the sequences: mult, by 0—>■ R -Д R is exact. Therefore mult.by 0 —► S/Ai -^ S/Ai is exact, i.e., if /n • a € Ai, then a G Ai. This means exactly that Ai = Sn(Ai-Sf). D Look back at Prop. 2 of Ch. II, §8. Here we had exactly the situation of the Prop, above with R a valuation ring, and we asserted that given a closed subscheme ICf1 (Spec (Rf)), there was one and only one closed subscheme X С Z extending X and flat over R. In other words, when R is a valuation ring, we get existence as well as uniqueness. Example P. Any "family" of affine hypersurfaces should be considered a "con- "continuous" family, since it defines a flat morphism. To be precise, let be a hypersurface, with coefficients щг,...,*„ in an arbitrary ring R. Let П = Spec R[XU . ..,Xn]/ Then H is a scheme over Spec (R) whose fibres over Spec (R) are the hyper- hypersurfaces obtained by mapping the coefficients from R into a field by the various homomorphisms R —> Rp/P ■ RP
218 III. Local Properties of Schemes (P С R a prime ideal). Let's assume that none of these fibres is all of An, i.e., that the equation doesn't vanish identically after applying any of these homomorphisms to its coefficients. Then H is flat over Spec (R) by Property E. above: in fact, R[Xi,... ,Xn] is a free Я-module, so it is flat over R, and for all maximal ideals m С R, f = £)ан,•••,*„-Xl1 '"' '-^nn is a non-O-divisor in R/m [Xi,...,Xn]. Therefore, in E., take В = M = R [Xx,..., Xn] and A = R. Now let's look at the case of a finite morphism / : X -»• Y. Proposition 2. Let f : X -> Y be finite, and assume Y noetherian. Then f is flat if and only if /* (ox) is a locally free oY-module. Proof. To prove this, we may as well assume Y = Spec (R). Then X = Spec E), where S is an Я-algebra, finitely generated as Я-module. Then S = /* (ox) so / is flat 4=> S is flat over R (by B*) <==> 5 is a flat oY-module (by B*) /* (их) is a locally free oy-module (by C*) . □ Corollary. Assume also that Y is reduced and irreducible. Then f is flat if and only if the integer dinik(y) [/* (йх) <g>2y k(y)J is independent of y. Proof. Add version II of Nakayama's lemma to the Proposition. Note that if Ay = /* (ох)у ®о &(y), then Spec (Ay) is the fibre of / over y. So the Corollary asserts that flatness is equivalent to the fibres of / being Spec's of Artin rings of constant length - again a natural continuity restriction. Example Q. Let к be an algebraically closed field and define / : A1 -»• A1 by f (x) = x2. f is & finite morphism since it is dual to the inclusion of rings k[X] D k[X] X2^-X . f* Let a G k, and look at the fibre of / over the point X = a. It is Spec (k[X]/ {X2 — a)) So if а ф 0, it is the disjoint union of 2 points; if а = 0, it is the disembodied tangent vector / = Spec (k[e]/e2) of §4. In all cases, dimkk[X]/(X2-a) =2 so / is flat.
§10. Flat and smooth morphisms 219 Example R. Look at the morphism / : A2 -»• V ((XiX3 — X^)) c -^3 defined by f(x, y) = (x2,xy, y2) as in Ex. L. This is finite, and if a, b, с G к satisfy ас = Ъ2, the fibre of / over (a, b, c) is Spec(k[X,Y]/(X2-a,XY-b,Y2-c)) . If a or с is not 0, this is the disjoint union of 2 points; but if a = с = 0, hence b = 0, it is Speck[X,Y]/ (x2,XY,Y2) and this ring is 3-dimensional. Therefore / is not flat. Given a finite morphism / : X -»• Y, where У is an irreducible and reduced noetherian scheme, the preceding discussion and examples suggest introducing several subsets of Y: a) Let Yq С Y be the open set over which /* (ox) is locally free, hence res(/) : /-1 (Уо) -> Yo is flat.
220 III. Local Properties of Schemes b) For any point у £ Yq, let U С Yq be an affine neighbourhood of у such that f-1^) = Spec (R),U = Spec E) and R is a free 5-module. Taking a basis ai,..., an of R over 5, we can form the discriminant in the usual way: d = det (Tr (at ■ а3)) . Then for all points y\ £ Yo, the value d{y\) is a discriminant of the finite dimensional к (yi)-algebra whose Spec is the fibre f~1(yi). Therefore, d{y\) ф 0 if and only if the fibre /-1 (yi) is a union of the Spec's of separable extensions of к (yi). Therefore, we have found an open subset of points уi £ Yo whose fibres are like this; equivalently whose geometric fibres are reduced. Definition 2. Y — Y\ is the branch locus of /, and points у £ Y — Y\ are ramification points for /. Moreover, I claim that Y\ is the maximal open set such that res(/) : /-1(yi) -»• Y\ is etale. To prove this, we need the main result of this section, which is the intrinsic characterization of etale morphisms referred to in §5: Theorem 3. Let f : X -> Y be a morphism of finite type. Then f is etale if and only if f is flat and its geometric fibres are finite sets of reduced points. Actually it is no harder to prove a stronger theorem stating an equivalent fact for morphisms with fibres of positive dimension. This result involves the natural generalization of etale: Definition 3. 1st, the particular morphisms: X = Spec i2[X1>...,Xn+fc]/(/i>...,/„) Y = Spec (R) are said to be smooth at a point x £ X (of relative dimension k) if (*) vmk(dfi/dXk(x)) = n. 2nd, an arbitrary morphism / : X -> Y of finite type is smooth (of relative dimension k), if for all x £ X, there are open neighbourhoods U С X of x and V С Y of f(x) such that f(U) С V and such that / restricted to U, looks like a morphism of the above type which is smooth at x:
§10. Flat and smooth morphisms 221 SpecR[X1,...,Xn+k]/(f1,...,fn) open immersion res (/) V С ^ Spec (R) . open immersion Theorem 3'. Let f : X —*Y be a morphism of finite type. Then f is smooth of relative dimension к if and only if f is flat and its geometric fibres are disjoint unions of k-dimensional non-singular varieties. (N.b. This makes sense because the geometric fibres of / are schemes of finite type over algebraically closed fields, and it is for such schemes that we have de- defined non-singularity.) Proof. First assume / is smooth. It is clear that the definition of smoothness is such that whenever / : X -> Y is smooth, all morphisms /' obtained by a fibre product: X' are still smooth. In particular, if F is a geometric fibre of / over an algebraically closed field i?, F is smooth over Q. But by Theorem 4, §4, when Y = Spec i?, Q algebraically closed, a morphism of finite type / : X -»• Y is smooth if and only if X is a disjoint union of non-singular varieties. To see that / is flat, it suffices to check that is a flat Я-module when rank (dfi/dXj(x)) = n, all x e Spec E). Set Ri = R [X\,..., Xn+k] / (/i,..., fi) . Then Ri+i = Ri/fi+i • Ri, and Ro = R [X\,.. .,Xn+k] is flat over R, so we are in a situation to use Property E. of flatness inductively. We need only check that for all maximal ideals m С R, fi+\ is a non-0-divisor in Let Q D R/m be an algebraically closed field. Then we just saw that Spec i? [X\,... ,Xn+k] I (/i,..., fi) is a union of n + к — г-dimensional vari- varieties Vij. In particular, i? [Xi,...,Xn+k] / (/i, ••-,/») is a direct sum of integral
222 III. Local Properties of Schemes domains. Therefore, /;+i is a non-O-divisor here if its component in each factor is non-zero. Since all components of Spec Q [Xi,..., Xn+k] I {h■>••••,fi+i) have lower dimension, /;+i does not vanish on any of the Vij. Thus Property E. is applicable and S is flat over R. Conversely, assume / is flat with non-singular geometric fibres. Let x € X, and express / locally near x by rings: X D U = SpecR[Xu...,Xn+k]/A res(/) Y D V = Spec (R) . Let f(x) = [P], and embed R/P in an algebraically closed field Q. Look at the geometric fibre over Q: R[X1,...,Xn+k]/A R * Q[Xu...,Xn+k]/A R/P с Q Choose a point x of the fibre F = Spec Q [X\,..., Xn] /A over the point x. Then using the fact that F is non-singular at x, of dimension k, we know by Th. 4, §4, that there are elements Д,..., fk G A such that df1,..., dfk are indepen- independent in i?An/fi{x). In fact, we can even take the f{ to be images of elements fi G A. Then since Spec Q [X\,...,Xn+k] / (Д,..., fk) is itself a non-singular A;-dimensional variety near x, and since it contains F as a closed subscheme, F = Spec Q [Xi,..., Xn+k] / (/j,..., fk) near x. Now the fact that the df{ are independent in &ап/п(%) means that rank =k . Hence rank (dfi/dXjix)) = к , since x lies over x. Note that U is a closed subscheme of Spec (R [X\,...,Xn+k] / (/i,..., fk)). It will suffice to show that these schemes are equal near x and then / will have been expressed in the standard form and hence is smooth. This last step follows from: Lemma. Given a diagram: Xi
§10. Flat and smooth morphisms 223 where X\ is a closed subscheme of a noetherian scheme X2 and /1 is flat, then for all x G X\, if the geometric fibres of /1 and /2 over f(x) are equal near some point x over x, then X\ = X2 near x. (Note the analogy with Prop. 1). Proof of lemma: Algebraically, we have the dual picture R/I -« R Let x = [P], where P С R is a prime ideal containing /. We want to prove that / • Rp = @). By assumption, we can embed S/I~1(P) in an algebraically closed field i?, and find a prime ideal P С R (8M f2 such that 1) I-(R (8M O)p = @) 2) if j : R -»• R (8M f2 is the canonical map, Г1(Р)=Р. Now tensor the exact sequence 0—>■/—>■#—>Д//—>-0 over S with i?. Since Я/7 is flat over i?, it follows that /(8M i? -^> / • (R (8M /2). Now let R\ denote the local ring Rp and let R2 denote the local ring (R (8M i?)p. Let Pi and P2 denote their maximal ideals, j induces a local homomorphism j' : R\ -»• R2, hence an injection j : R\/P\ С Дг/Рг- Since I ■ R2 = @), therefore Using the fact that the 1st module is just / (8M /2, this means that: /<8>дД2/Р2 = @) . But R2/P2 is just an extension field of R1/P1, so also. But since / <8>д (Ri/Pi) is the same as / • Rp/P • (/ • Rp), Nakayama's lemma shows that / • Rp = @) too. QED for lemma and Th. 3'. Problem. Here is a Main Theorem-type result for flat morphisms: Let / : X -»• У be a birational flat morphism between varieties. Show that / is an open immersion.
Appendix Curves and Their Jacobians D. Mumford: LNM 1358, pp. 225-291, 1999. © Springer-Verlag Berlin Heidelberg 1999
Preface13 This appendix is a slightly expanded version of the series of four Ziwet Lectures which I gave in November 1974 at The University of Michigan, Ann Arbor. The aim of the lectures and of this volume is to introduce people in the mathematical community at large - professors in other fields and graduate students beyond the basic courses - to what I find one of the most beautiful and what objec- objectively speaking is at least one of the oldest topics in algebraic geometry: curves and their Jacobians. Because of time constraints, I had to avoid digressions on any foundational topics and to rely on the standard definitions and intuitions of mathematicians in general. This is not always simple in algebraic geometry since its foundational systems have tended to be more abstract and apparently more idiosyncratic than in other fields such as differential or analytic geome- geometry, and have therefore not become widely known to non-specialists. My idea was to get around this problem by imitating history: i.e., by introducing all the characters simultaneously in their complex analytic and algebraic forms. This did mean that I had to omit discussion of the characteristic p and arithmetic sides. However it also meant that I could immediately compare the strictly ana- analytic constructions (such as Teichmuller Space) with the varieties which we were principally discussing. When I first started doing research in algebraic geometry, I thought the sub- subject attractive for two reasons: firstly, because it dealt with such down-to-earth and really concrete objects as projective curves and surfaces; secondly, because it was a small, quiet field where a dozen people did not leap on each new idea the minute it became current. As it turned out, the field seems to have acquired the reputation of being esoteric, exclusive and very abstract with adherents who are secretly plotting to take over all the rest of mathematics! In one respect this last point is accurate: algebraic geometry is a subject which relates frequently with a very large number of other fields - analytic and differential geometry, topology, if-theory, commutative algebra, algebraic groups and number theory, for instance - and both gives and receives theorems, techniques and examples with all of them. And certainly Grothendieck's work contributed to the field some very abstract and very powerful ideas which are quite hard to digest. But this subject, like all subjects, has a dual aspect in that all these abstract ideas would collapse of their own weight were it not for the underpinning supplied by concrete classical geometry. For me it has been a real adventure to perceive 13 Written in 1975.
228 III. Local Properties of Schemes the interactions of all these aspects, and to learn as much as I could about the theorems both old and new of algebraic geometry. Dafydd ap Gwilym's The Lark seems to me like the muse of mathematics: High you soar, Wind's own power, And on high you sing each song, Bright spell near the wall of stars, A far high-turning journey. If this book entices a few to go on to learn these "spells", I'll be very pleased. I'd like to thank Fred Gehring, Peter Duren and the many people I met at Ann Arbor for their warm hospitality and their willingness to listen. A final point: in order not to interrupt the text, we have omitted almost all references and attributions in the lectures themselves, and instead written a separate section at the end giving references as well as suggestions for good places to learn various topics.
Lecture I: What is a Curve and How Explicitly Can We Describe Them? In these lectures we shall deal entirely with algebraic geometry over the complex numbers C, leaving aside the fascinating arithmetic and characteristic p side of the subject. In this first lecture, I would like to recapitulate some classical algebraic geometry, giving a leisurely tour of the zoo of curves of low genus, pointing out various features and their generalizations, and leading up to my first main point: the "general" curve of genus g, for g large, is very hard to describe explicitly. The beginning of the subject is the AMAZING SYNTHESIS, which surely overwhelmed each of us as graduate students and should really not be taken for granted. Starting in 3 distinct fields of mathematics, we can consider 3 types of objects: a) Algebra: consider field extensions К D C, where К is finitely generated and of transcendence degree 1 over С b) Geometry: First fix some notations: we denote by Pn the projective space of complex (n + l)-tuples (Xo,..., Xn), not all zero, mod scalars. XQ,...,Xn are called homogeneous coordinates. Pn is covered by (n + l)-affine pieces. Ui = (pts where Xi ф 0) and xq = Xo/Xi,..., xn = Xn/Xi (xi omitted) are the affine coordinates on Ui. Consider algebraic curves CcPn: loci defined by a finite set of homogeneous equations fa(Xo,... ,Xn) = 0, and such that for every x £ X, С is "locally defined by n — 1 equations with independent differentials", i.e., 3 /ai,... /an_x plus g, with g(x) ф 0, such that for all a, n-l gfa = ^2 ha,ifai, some polynomials ha^ i=l and rk(dfaJdXj(x)) =n-l. c) Analysis: consider compact Riemann surfaces14. The result is that there are canonical bijections between the set of isomorphism classes of objects of either type. [A word about isomorphism in case (b): the simplest and oldest way to describe isomorphism in the algebraic category is that C\ С Pni and Ci С РП2 are isomorphic if there is a bijective algebraic 14 Perversely, algebraic geometers persist in talking about curves and analysts about surfaces when they mean essentially the same object!
230 A. Curves and Their Jacobians correspondence between C\ and C2, i.e., there is a curve D С С\ х С2 defined by bihomogeneous equations ga (Xo,... Xni; lo, • • • УП2) — 0 which projects bi- jectively onto C\ and onto C2] To go back and forth between objects of type a), b), c), for instance, we 1) associate to a curve С the field К of functions / : С -»• С U @0) given by restricting to С rational functions p ((Xo,...,Xn) /q (Xo,...,Xn))), degp = deg q; and the Riemann surface just given by С with the induced complex structure from Pn. 2) associate to a Riemann surface X its field of meromorphic functions; and any curve С which is the image of holomorphic embedding of X in Pn. 3) from the field K, we recover С or X as point set just as the set of valuation rings R,CCRCK. To X or С or К we can associate a genus g as usual: g = no. of handles of X or [space of holomorphic differentials uj on X] ^ [space of rational differentials w = adx, (a,x G K, x ^ C) on С with no poles] or 2# — 2 = (no. of zeroes) — (no. of poles), of any differential u. For each g, we shall let 9Rg denote the set of isomorphism classes of X or С or К of genus g: we shall discuss the structure of 9Rg in the second lecture. So much for generalities. Most of what I shall say later is best understood by considering the computable explicit cases of low genus. Let's take these up and see what we have: g = 0: there is only one object here: X = Riemann sphere С U @0) С = P1 itself K = C(X). g = 1: Here we have the famous theory of elliptic curves: X = C/L, L a lattice which may be taken to be Z + Z • w, Im и > 0. С = any non-singular plane cubic curve, i.e., С С Р2 defined by f(x,y,z) = 0, / homogeneous of degree 3, with some partial non-zero at each root; in affine coordinates, x, у, С is given as the zeroes of a cubic polynomial f(x,y) = 0. К = С ( X, y/f(X)j, where / is a polynomial of degree 3 with distinct roots.
A. Lecture I 231 The connections between these are given as follows: given X, form the Weier- strass p-function: and map C/L into P2 by A, р(г), p'(*))» * £ L @, o, l), zeL, (i.e., (C — L)/L is mapped to the affine piece Xo ф 0 by p and p\ and the one point L/L is mapped to the "line at infinity" for this affine piece.) Then p and p' generate the field К of X and since p'2 is a cubic polynomial in p, К is as above. Or starting with any plane cubic C, take affine coordinates x, у so that the line at infinity is a line of inflexion. Then С is readily normalized to the form: y2 = /(*), deg/ = 3. Therefore the field of rational functions on С is С f x, y/f(x)j. To go back to X, look at the abelian line integral fK 'y) dx w = — J(xo,yo) У taken on C; then (x,y) i—»■ w is well-defined up to a period which lies in a lattice L, hence defines: С A C/L. A few comments on this set-up: C/L is clearly a group, and hence so is С - here the group law is characterized geometrically by the beautiful: x + у + z = 0 <=$■ x, y, z collinear. For instance, Sx = 0 <£► x a point of inflexion. Since Зх = О Ф> х £ \L/L, there will be 9 of these. Now via X « C/L, we get flat metrics on X with curvature = 0. But if we instead look at a metric on X induced from the standard metric on P2 via X = С С Р2, we get a metric whose curvature at the 9 points of inflexion equals that of P2, which is positive; and by the Gauss-Bonnet theorem, it must be negative at other points. The wobbly curvature points up the fact that X does not fit symmetrically in P2 - we will discuss this further in Lecture III. Another indication of the antagonism between C/Z +Z -u and С is the Gelfand-Schneider result: with a few exceptions for very special w's, (i.e., w G Q(\/—n)), w and the coefficients of any isomorphic cubic С are never simultaneously algebraic.
232 A. Curves and Their Jacobians Start with the fields K: these are all of the form 9 = 2: К = С (X, \/f(X)\ , where degree / = 5. What this means is that the corresponding curve С admits a 2 — 1 mapping onto P1 ramified at 6 points: the 5 roots of / and the point at infinity. This does not quite give us С embedded in Pn though. We can do 2 things: let 7г : С —>• P1 be the above map. Fix xi,X2 with ttxi = тгжг- Then one can prove that С can be mapped to a plane quartic curve Co bijectively except that x\ and xi are identified to a double point of Co- This means that at the double point Co is given by an equation 0 = xy + f3(x,y) + f4(x,y) where the double point equals the origin. In this form, тг(х,у) = x/y; or geo- geometrically, 7г : С -»• P1 is defined by "projecting from @,0)." This still doesn't represent С embedded in Pn! In fact, to do this, you need n = 3, and at least 3 equations too. You start with a line fcP3, then take quadric and cubic surfaces F,GcP3 containing £. Then FПG will fall into 2 components - £ plus a quintic curve C, and it can be proven that every curve of genus 2 occurs as such a C. Given such a C, there is only one 2 — 1 map тг : С —> P1 and the most important points on С are the 6 points Xi where it ramifies. They have 2 signif- significances a) they are the Weierstrass points of C, i.e., the points x G С such that there is a rational function f on С with a double pole at x and no other poles, (if t is the coordinate on P1, let / = (t — t(x))) b) they represent the "odd theta-characteristics," i.e., look for differentials uj with no poles and zeroes only with even multiplicities: one writes this И = 221 if {uj) = divisor of zeroes and poles of u. In this case, there are w» with one double zero at Xi, i.e., (ui) = 2xi and no others (in fact if ai = t(xi), oj{ = /ттр—£i— dt). У ll*-ai Analytically, С can be represented by a Fuchsian group: С^Н/Г where: Я = {z | Im z > 0} Г = discrete subgroup of 5LB, R)/(±l)
A. Lecture I 233 or by various Kleinian groups: where D = open subset of С U (oo) Г = discrete subgroup of SLB, Cj/(±I) which acts discontinuously on D. I want to make only one remark on these representations in connection with my main question of how explicitly one can describe C. Start with a Fuchsian Г. Choosing a standard basis of tti(C), Г is generated by hyperbolic transforma- transformations A\, B\, Ai, £?2 satisfying AlBlA{lB^lA2B2A^lB^1 = e. It is quite clear from the work of Fricke-Klein and of Purzitsky and Keen that there is a small number of inequalities on the traces of small words in A\,..., B<i which are always satisfied for Fuchsian .T's, such that conversely if Ai,..., B^ £ SLB, Ж) satisfy these inequalities, they generate a Fuchsian Г. (It would be nice to know these inequalities precisely.) This means that one can actually find all Fuchsian /"s quite explicitly. For Kleinian .T's no such simple inequalities are known and presumably do not exist. In the simplest case, the problem arises - describe explicitly the set of pairs (А, В) б 5LB, Q2 which generate a free group of only hyperbolic elements acting discontinuously at some zq G C, i.e., the Schottky groups. This looks very hard.15 g = 3: Here we encounter first the phenomenon of not having one easy description of all C's at once: "almost all" C's can be described one way, but some are a special case and must be described a different way. The general type are the C's which are non-singular plane quartic curves. The embedding of С in P2 is canonical and is given in the following simple way: let tp\ ,(^2,^зЬеа basis of the differentials of first kind (= with no poles) on C. For all x G C, let dt be a differential near x with no zero at x so that ipi(x) = a,i(x)dt, п{ a function. Define: C—^P2 by x i—>• (ai(x),a2(x),a3(x)). This is independent of the choice of dt because changing dt multiplies the triple by a scalar. This procedure works in any genus and defines the so-called canonical map Ф:С—►P5, given, loosely speaking, by: 15 Cf. forthcoming book, "Indra's Pearls" by D. Wright, C. Series and myself, Camb. Univ. Press.
234 A. Curves and Their Jacobians where {ipi} is a basis of differentials of 1st kind. [Note that there is a natural correspondence between linear functions in the homogeneous coordinates on P5 and arbitrary differentials ]T Xnpi of 1st kind on C] As is well known, there are 2 types of C's: those for which Ф is an embedding (i.e., Ф injective and Ф(С) non-singular), and those for which Ф is 2—1, and the image is isomorphic to P1. All C's which admit 2 — 1 maps to P1 fall into the 2nd category and are called hyperelliptic. Thus for g = 3, either С = Ф(С) - then because each tpi has 2g — 2 = 4 zeroes, each line in P2 meets Ф{С) in 4 points and Ф{С) is a quartic - or Ф{С) is a non-singular conic "with multiplicity 2," i.e., Ф is 2 — 1. As all non-singular conies are isomorphic to P1, С is then hyperelliptic. In general, in the non-hyperelliptic case, Ф(С) will have degree 2g — 2, because the hyperplanes H С Р5 correspond to differentials <p of 1st kind in such a way that: Ф (zeroes of ip) = H П Ф{С). Plane quartic curves С are intricate objects. They have lots of special points on them: a) their 24 points of inflexion are the Weierstrass points of C: the points x such that there is a function f on С with a triple pole at x and no other16, b) their 28 bitangents - lines tangent to С at 2 points - correspond to the odd theta-characteristics. Because if £ is tangent to С at ж and y, then the differential ip corresponding to t has a double zero at x and y: (<p) =2x + 2y. In fact, projective geometry yields a vast constellation of "higher Weierstrass points" too, such as the 108 points x for which there is a conic D touching at x with contact of order 6. More generally, for any degree d, look at the points x for which there is a curve D of degree d touching С at X with contact "one more than is expected," i.e., one more than is possible at most points of D. One can think of this as some kind of analog on С of the finite set of points of order d, \LjL С C/L in the genus 1 case. This analogy goes quite far. For 16 You can find the function as follows: let x be a point of inflexion, let I be the tangent line to С at x. Then I meets С at one further point y. Let u, v be affine coordinates such that у is the origin и = v = 0, and I is the coordinate axis и = 0. Consider the function / = v/u. Since и and v are zero at y, f is regular at y\ but at x, / has a triple pole.
A. Lecture I 235 instance, as d —>■ oo, one can show that these points are dense in С and even fairly evenly distributed in the "Bergman metric," i.e., for any curve C, choose a basis <pi,...,<pg of differentials of 1st kind for which / Jc / c Then using such a basis, we can normalize our canonical embedding ф . с —у F^-1 up to unitary transformations, in which case the standard metric ds2 on P5 has a restriction dsB to С independent of the choice of the ipi's: this is the Bergman metric. An interesting question that arises in this connection is the relationship be- between the Bergman metric dsB and the Poincare metric ds2p of constant negative curvature induced from the standard metric on H: ds2 = dx2 + dy2/y2, z = x + iy via the Fuchsian uniformization С = Н/Г. Kazdan suggested that if Гп С Г are subgroups of finite index and cofinal among such subgroups, if Cn = Н/Гп, and if ds2B is the Bergman metric on Cn pulled back to H, then with suitable scalars An, lim An ds2B = ds2 n—too n P' We won't say much about the hyperelliptic case: in genus g, if С —>• P1 is 2 — 1, then there are exactly 2g + 2 branch points, and the corresponding fields are just С (x, \/f(X)\, where deg/ = 2g+1 or 2g+2. [If / has degree 2g + 2, by a linear fractional transformation in X, taking some root to oo, С [X, y/J) = С [X1, y/J7) where deg /' = 2g +1.] These curves are special however in the following precise sense: one can build a big algebraic family of curves of genus g: such that all curves of genus g occur as fibres Xs = f~1(s). Then the set of s such that Xs is not hyperelliptic will form a dense Zariski-open subset of 5. g = 4: Let Ф : С —У P3 be the canonical map. If С is not hyperelliptic, we saw that Ф(С) was a space curve of degree 6. In fact, Ф(С) is the complete intersection F П G of a quadric and cubic surface meeting transversely. One could also ask, however, is С a plane curve or is there a map тг : С —> P1 of low degree? The answer to the first question is that С must be given singularities before it can be put in P2: the simplest way is to identify 2 pairs of points making С into a plane quintic Co with 2 double points; as for тг, one can always find а тг of degree 3. As the genus g grows, it gets harder and harder to represent the general curve С of genus g either as a plane curve with relatively few singular points, or as a
236 A. Curves and Their Jacobians covering of fairly low degree of P1. For instance, it can be shown that the lowest degree curve representing such а С has degree In general, its singularities will only be double points but the number of these will be (d • 6=— 2 which is asymptotic to 2/9(g2). If g < 10, one can work backwards and write down all equations /(Xo, -Xi, X2) defining curves of this degree d and this number S of double points, hence having genus g. This is because the vector space of such /'s has dimension (d + l)(d + 2)/2 (count the coefficients), and for any point (uo, 01,02), if we require the coefficients of / to satisfy the 3 linear equations: ■—-@0,01,02) = 0, then / = 0 has a singularity at @0,01,02). Now if g < 10, then 35 < ' 2 ' (see table below), hence we can pick an arbitrary set of 6 points P{ = (ag , Oi , O2 ) in P2 and always find at least one curve С of degree d with singular points Pi,..., P$; in general these will be double points and С will have genus g. However if g > 11, if we choose the 5 singular points generically, there will be no such /, i.e., the coordinates of the 5 double points will always satisfy some obscure identities. The upshot is that there is no reasonably explicit way to write down the equations of these plane curves: one is in a realm of unexplicitness almost as bad as with Kleinian groups. Next, it can be shown that the lowest degree map тг : С —У Р1 has degree (This is equivalently the smallest number of poles of any non-constant function on the general curve C.) This also, to my knowledge, does not lead to any explicit polynomial presentation of C, but it does lead to a very explicit topological presentation of C. Namely, assuming the branch points of тг are all simple, then one can reconstruct С in 5 steps: a) Choose the branch points {xi} arbitrarily: there are 2(g + d — 1) of them. b) Choose a set of "cuts" joining xi to a base point z:
A. Lecture I 237 c) Choose 2(g + d — 1) transpositions ai acting on {l,...,d} such that (<Ti ЧТ2 ••••) = e. d) Make a topological covering space Co of P1 by glueing together d copies of P1 via the transposition Ui on the iih cut. e) By Riemann's existence theorem, Co has a unique algebraic structure, i.e., there is a unique curve С and map тг : С —> P1 such that С is homeomorphic to Co as covering of P1. Unfortunately, step b) is essentially topological and seems very deep from an algebraic point of view. For instance, if you want to algebraize this construction, you are led to ask: given prescribed branch points, cuts and transpositions, find an explicit multi-valued algebraic function with these branch points and transpositions. Thus if d = 2, \/П(ж ~~ xi) is a function; if d = 3 or 4, the solvability of 5з,54 (the permutation groups) allows one to find such explicit functions too. But I don't know of any general method for larger d. We summarize these discussions in the table on the next page. For general g, the simplest explicit polynomial presentation of С seems to be one due to K. Petri in a paper that was until recently almost forgotten. He was M. Noether's last student and collaborated with E. Noether and appears to have written only 2 papers. I want to conclude this lecture by describing his results in one of these published in 1922. This is unavoidably a bit messy, but just to be able to brag, I think it is a good idea to be able to say "I have seen every curve once." Let С be a non-hyperelliptic curve of genus g. Petri starts by choosing g points xi,...,xg on С in a reasonably general position (we won't worry about this). Let (pi,. ,.,(pg be a dual basis of differential forms, i.e., (Pi(xj) = 0 if i^j ф 0 if i=j. Let X\,..., Xg be the corresponding homogeneous coordinates in fg~1 for the canonical map Ф : С —> P5. Also, if 3 < i < g, write <pi = dti, U a local coordinate at X{ and then expand 4>\ = XiUdU + ... (We may assume X{ ф 0 if 3 < i < g.) Then Petri's first step is to write down a basis for the vector space of k-fold holomorphic differential forms on С for every k: these are differential forms a{x){dx)k with no poles. For к > 2, they form a vector space of dimension Bk—l)(g — 1). The table below summarizes his results. Look at it carefully - each column displays a basis for fc-fold differentials, 1 < к < 5. Within each column however, we group the differentials in rows according to the multiplicity of their zeroes on 31 = ж3 + ... xg. Thus the first def row is always <р%,..., tpkg as each of these has no zero at one of the xi, whereas all other monomials in the ip^s will be zero at least to 1st order at each point of 31. The second column arises like this:
238 A. Curves and Their Jacobians a) one checks that every quadratic differential which is 0 on 31 is on the form b) hence if 3 < i < j < g, ipi<fj can be rewritten as ip\ • ( ) + ip% • ( ). c) Omitting these <fi<fj, the remaining 3g — 3 monomials form a basis as indi- indicated.
Table of representations of the general curve С of genus g degree of map 7Г : С -> P1 no. of branch points degree d of plane curve C0CP2 no. double points 8 of Co canonical curve vs. 0 1 2 3 4 5 6 10 11 0 4 6 10 12 16 18 30 34 1 3 4 4 5 6 6 9 10 0 0 1 0 2 5 4 18 25 c4 c6 c8 Сю Ci8 C20 - - - CP2 CP3 CP4 С P5 CP9 CP10 0 0 3 0 6 15 12 54 75 vs vs vs vs vs vs vs vs vs 3 10 15 15 21 28 28 55 66 100 51 300 69 2178 C198 С
240 A. Curves and Their Jacobians The third column arises like this: a) one checks the triple differentials <p\( ) + у\уъ{. ) + V'K ) are °f codimension 1 in the vector space of triple differentials ш, with double zeroes on 31! This is a reflection of the "fundamental class on C": the condition for such an ш to be formed out of ipf, ipnf>2,4>\ alone is that y(/viv2) = 0. all zeroes У of ipi except хз,..., xg b) Writing rji as indicated, this has a double zero on 31 and every difference rji — rjj satisfies (♦). c) Hence rji — rjj can be rewritten as indicated and this leaves exactly bg — 5 remaining triple differentials as a basis. The remaining columns are quite mechanical: the 2 ways of rewriting differentials reduce us to the attached list, and, by counting, leave us with exactly the right number to be a basis! Let us write out the 2 sets of identities by which these reductions are made. They will be: k=3 uijk VP jfc=3 (here the a is linear, the a' is quadratic, the г/s are scalars, and 3 < г, j < g, i Ф j). But what this means in terms of equations in P5 is precisely that 2 sets of homogeneous equations: 9 k=3 jfc=3 of degrees 2 and 3 generate the ideal of C! Petri now goes on to prove 3 beautiful results I) These equations are related by syzygies: a) fij = fji* 9ij + 9jk = 9ik
A. Lecture I 241 9 9 b) Xkfij - Xjfik + J2 <*ijifk£ - J2 <*ikefje = Pijk9jk 1 = 2, £ = 3 Чфк 1фз where 3 < i,j,к < д, i,j,к distinct, and the /fife's are scalars symmetric in г, j, and к. II) There are 2 possibilities: either pijk = скул = 0 whenever г, j, к are distinct, and then С is very special - it is a triple covering of P1 or if g = 6 it may also be a non-singular plane quintic; or else most of the p's and en's are non-zero (precisely, one can write {3,..., g} = I\ U h so that for all j 6 Д, к 6 Iii there exists an i with p^k ф 0, ацк Ф 0), and then the /if's alone generate the ideal of C. III) Given any set of /if's, g^s as above related by the syzygies in (I), where all Ai Ф 0, and at least one p^k ф 0, there exists a curve С of genus g whose canonical image in P5 is defined by these equations. In my mind, (III) is the most remarkable: this means that we have a complete set of identities on the coefficients a, a', v, v', v", A, //, p characterizing those that give canonical curves. It would be marvellous to use this formidable and precise machine for applications.
Petri's basis for the canonical ring ¥>3,.--,¥>я - - - д simple differentiali Here, use ¥>§,••■,¥>* 2 2 - - 3# — 3 quadruple differentials Here, use W - *7i = (-)Vi з < » < i < # ¥>з, ■■■,¥>? 2 2 - 5^ — 5 triple differentials , = (, 4 4 <Рз,---,<Рд 3 2 2 (p2<pi, 3 <C i <C g lg — 1 A-tuple differentials tpitpi,..., Vi^g simple zero on 21 ViV3> - • ■ > ViVg double zero on 21 ViV3) ■ ■ •; ViVg triple zero on 21 2 2 iPiiPi,iPiiP2<Pi,<PiiP2(Pi 4~f°ld zero on 21 3 4 q ^ ■ ^ Vi; V1V2, Vivi; Vivlj ViV2)V2 5-fold zero on 21 9# - 9 Ъ-tuple differentials c-i 8 о cr
Lecture II: The Moduli Space of Curves: Definition, Coordinatization, and Some Properties In the previous lecture, we studied each curve separately. We now want to discuss in its own right the space of all curves of genus g, which we denote by 9Rg. Also very important is the allied space: {isomorphism classes of objects (C, x\,..., xn) Л where С is a curve of genus g, and x\,..., xn are > distinct ordered points of C. J Let us begin as before by looking first at the simplest cases: I) mo,n = [F1 - @,l,oo)]n~3 - (all diagonals). In fact, if we have n distinct points xi,..., xn € P1, a unique automorphism of P1 takes x\ to О, Х2 to 1, and X3 to oo. The remaining n — 3 are arbitrary except for being distinct and not equal to 0 or 1 or oo. II) S0ti)O = ЯЯ1Д = A]- (the affine line17 with coordinate j). Because curves of genus 1 are groups, their automorphisms act transitively on them, hence 3Jt5)o = 3Jt5)i- To determine this space, recall that all such curves are isomorphic to one of the plane cubics C\, defined by y2=x(x-l)(x-X). Equivalently, C\ is the double cover of P1 ramified at 0, l,oo,A. One proves easily that C\x и С\2 if and only if there is an automorphism of P1 carrying {0,1, oo, Ai} (unordered sets) to {0,1, oo, A2}. This happens if and only if A2 = Ai, 1 - Ab 1/Ai, (Ai - 1)/Ab Ai/(Ai - 1), or 1/A - Ax) [e.g., note that the map (x,y) carries C\ to Ci_a; and the map (x,y) 17 Out of habit, I find it more comfortable to call affine n-space A" instead of C71: because A" also denotes affine n-space over other ground fields.
244 A. Curves and Their Jacobians carries C\ to Ci/a]- One must cook up an expression in Л invariant under these substitutions and no more. It is customary to use: (It is readily checked that this j is invariant under these 6 substitutions and since 6 = max(deg of numerator and denominator), no other A's give the same J.) We then get a bijection between the isomorphism classes of genus 1 curves С and the complex numbers С by taking С to j(X) if С « С\. Analytically, if С = C/L, the ./-invariant of С can be calculated from L in the following way: define18 92 = 60-^1/A4 93 = 140 - Then it can be shown that: = 1728-g32/{g32-27gl). In particular, if L = Z + Z • a;, then j(u) = j(C/Z + Z • u) is the famous elliptic modular function. Its most important property is its invariance under 5LB,Z), which can be explained from a moduli point of view as follows: ( ЗаеС such that \ f 3 (c d) e SLB,Z) such that \ a(Z + Z-LJ1) = Z + Z-u2 j ^^ \ u2= auJl+b (This is trivial to check.) But {3 isomorphism 1 Г 3 a G С such that I I J \ \ / j(C/Z + Z • o>i) = j(C/Z + Z • cu2). Hence: 3 (c!) €SLB,Z) such that Ш2 = CLO\ + d 18 The stream of funny constants can best be explained as making a certain Fourier expansion have integral, not just rational, coefficients. This makes the theory work well under "reduction modulo p".
A. Lecture II 245 III) 9Jt2,o: this space was studied classically by Bolza among others and in recent years was analyzed completely by Igusa, and was attacked as follows: describe a curve С of genus 2 as a double cover of P1 ramified in 6 points Ai,..., Ae. This sets up a bijection: {T . r >. ( unordered distinct 6-tuples Isom. classes of I ^ I > > m1 , , x , . „ . _ > = < Ai,..., A6 € P modulo automorphisms С of genus 2 / ^fV/u., pGLB,C) Describe an unordered 6-tuple {Aj} by its homogeneous equation f(Xo,X\) of degree 6, a so-called binary sextic, and we arrive at the problem: find polynomial functions of the coefficients of a binary sextic f(Xo,Xi) invariant under linear substitutions in Xo, X\ of determinant one. This is a problem worked out by the classical invariant theorists. These invariant functions are then coordinates on 90*2,0 • Without going into any more detail, suffice it to say that the simplest way to describe the answer you get is: /modulo Z/5Z acting by (z,*/,z)h->(C4CVC3z) where £5 = 1 this, in turn, may be embedded in A8 by the 8 functions x5, x3y, xy2, y5,x2z, xz3, z5,yz For all g > 3, 90t5)o has never been explicitly described}. This rather discour- discouraging fact does not mean that the other 90t5)Tl's have not been studied however. The lack of an explicit description is rather a challenge i) to find one and ii) to find the properties of ЯЯ5)П even without such a description! The first point to be made about ЯЯ5)П in general is why you call it a "space" and expect it to be a variety in the first place. Recall that a protective variety X С Pn is defined to be the complete set of zeroes of a set of homoge- homogeneous polynomials fi which generate a prime ideal p С С [Xo,..., Xn], and that a quasi-protective variety X С Pn is defined to be the difference X — (Yi U ... U Yn) where X, Yi are projective varieties. We then say that a normal19 quasi-projective variety M5)Tl is the moduli space if i) we are given a bijection between ЗЯ5)П and the set of points of M5)Tl, ii) for every algebraic family of curves of genus g with n distinct points, i.e., every "proper smooth morphism 7Г : X -> S of varieties whose fibres are curves of genus <720, plus n disjoint sections <jj : S -> X," the induced set-theoretic map ф : S -> M5)Tl defined by 19 This means that the affine coordinate rings of M9,n axe integrally closed in their quotient field. This is a mild condition needed only for technical reasons. 20 Again it is not essential to know in detail what these terms mean: the idea is to generalize, for instance, the family of curves y2 = x(x — l)(x — \i)(x — Аг)(ж — Аз), which would represent an algebraic family of curves of genus 2 parametrized by A .
246 A. Curves and Their Jacobians jli \ _ :. of M5)Tl corresponding via (i) to the curve 7Г 1(s), and points ai( is a morphism of varieties. It is not hard to show that any 2 such М5)П, М'д п are canonically isomorphic as varieties: hence we may speak of the variety Шд>п. It is a non-trivial theorem however that such a variety M5)Tl exists at all. The second point is to explain the relationship between 3Jt5)Tl and the Teichmuller space 35)Tl- Define free group on 2g + n generators A\,..., Ag, B\,..., Bg, П = { C\,..., Cn mod one relation kj\ ... un — e Define set-theoretically: set of objects (C, a, x\,..., xn), where С is а curve of genus g, xi,...,xn are distinct points of С and is an isomorphism such that a(Ci) is freely homotopic to a small loop around xi in positive sense [and if n = 0, a is "orientation preserving," e.g., via the intersection pairing (.), (a(Ai) -a(Bi)) = +1], modulo (C,a,x) ~ (C",a',a/) if there is an isomorphism ф : С -^ С such that ф(х{) = x\ and such that @*) о a differs from a' by an inner automorphism Via the deformation theory of compact complex manifolds, it is easy to put a complex structure on 35)Tl: this is the Teichmuller space. It is a deep theorem that 35>n is, in fact, a bounded, holomorphically convex domain in С3з-3+П. Let automorphisms a of П such _ I that (т(С{) = conjugate of Ci 9'n ~ ' (and if n = 0, a is orientation Inner automorphisms preserving in a suitable sense) Then it follows easily that Гд^п acts discontinuously on 35)Tl via (C, a,Xi)\—>• (С, а о a, xt) and that JJVg,n — ^g,n/ *- g,n- In the case g — n = 1, we just have again the situation mentioned above: viz.
A. Lecture II 247 3i,i = {u € С | Im и > 0} Г1Д - SLB,Z)/(±J) 9*1,1 = {j€C}=A}. In fact, given weC, define (C,a,xi) as follows - С = C/Z + Z • и x\ = image of 0 and if we let the image у of 1/2 6 С be the base point C, define a : П -^ щ(С -xi,y) by o;(Ai) = loop in С obtained by projecting: 0 V2 = loop in С obtained by projecting: @ (O+V2 @+1 7. 1/2 1 The third point we want to discuss is how one proves that ЗЯ5)П is, in fact, a quasi-projective variety, i.e., how one finds global homogeneous coordinates for £Ш5)П. То tie this in, for instance, with Petri's approach in Lecture I, one can view his ideas as leading to coordinates on some Zariski-open subset U С ЗЯ5M: (i.e., not on all of ЗЯ5M because the curve С had to be non-hyperelliptic and the g auxiliary points xi,...,xg had to be carefully chosen not in too special a position). In general, the hard part of this problem is to make the coordinates work everywhere on 3Jt5)Tl and not just on a Zariski-open U however. These coordinates can be viewed as automorphic forms on the Teichmtiller space 35)Tl with respect to the Teichmtiller modular group Гд^п; however this approach to their construction has not been pursued. I know of 3 methods to obtain coordinates: I. via "theta-null werte," II. via the cross-ratios of the higher Weierstrass points, III. via invariants of the Chow form.
248 A. Curves and Their Jacobians The first method will be discussed in Lecture IV, and we will pass over it for now. Method II is like this: let С by any curve of genus g. For any n > 3, let Rn(C) be the vector space of n-fold differential forms with no poles - it has dimension dn = Bn — l)(g — 1) - and let lj\ , 1 < г < dn, be a basis. Define Фп : С —> P^ by \U>\ J{x),...,LJKdnJ{x just like the usual canonical embedding. Regardless of whether С is hyperelliptic or not, these are all projective embeddings of C. On ФП(С), there is a finite set of points x of hyperosculation, i.e., points where for some hyperplane H, H touches Фп{С) at x with order > dn. Allowing these x to be counted with suitable multiplicity, there are en = gd^ of them: call them x\n\ 1 < i < en. These are the n-fold Weierstrass points (our definition here is slightly different from that of Lecture I, but is equivalent). Consider the en x dn-matrix giving the coordinates of the Weierstrass points: (n) ( (n)\\ For every /c{l,...,en}, #/ = dn, consider the minor: Mi = det Un) (Уп)I . l<i<d L V 3 П Note that the M/'s are not numbers, but rather products of differential forms at the various points Xj, j G /. Now for large N look at monomials in these minors: where r/ > 0 and ^2ieI ri = N for all i. Then these monomials are products over all Xj of nN-fold differentials at Xj. It follows that although the Mr's are not complex numbers, their ratios are! Or if there are fi possible choices of exponents r/ satisfying 77 > 0 and ^2i€lri = N, the set of values Mr, as r varies, is a well-defined point in P^. Finally we must symmetrize under permutations of the Xj which are not naturally ordered: perm. <r / Of {1 en} Then the ratios М'Г1/М'Г2 depend only on C, and not on the bases lj^ or on the ordering of the Xj 's. Thus we get
A. Lecture II 249 depending only on C. One proves a) that not all M'r(C) are zero, and b) that if C\ ф С2, M'r{C\) is not proportional to M£(C2), all r. Thus we have coordinates on SDt5jo- %ftg,n is very similar. Method ///is not so explicit. In general, for any curve С С Pm, we can de- describe С by its "Chow form": let Xq, ..., Xm be coordinates on Pm and consider 2 hyperplanes: Hu defined by ^ U{Xi = 0 and Hv defined by ^ ViXi = 0. Then it turns out that there is one equation Fc(u; v) such that Fc(u;v)=0^=^CnHunHv^ 0. Fc is called the Chow form of С and it determines C. (For curves in P3, this idea goes back to Cayley.) Consider the Chow form F<£n(c)- This depends on С and on the choice of basis и± of Rn(C). However, changing the basis \<^\ \ changes the Chow form F$n(^)(u; v) by the contragrediant linear substitution in и and v. Writing out this means that there is a natural representation of SL(dn,<C) in the space of forms F or of the space of coefficients Fap. One proves that there are "enough" invariant polynomials Ci(Fap) so that a) for each curve C, at least one Ci (/^„(c),»/?) is not zero, and b) if C\ tfi C2, then the set of numbers Ci (/^„(c),»/?) is not proportional to сг {Рфп(С2),а/з)- Thus again the map С \ >• (...,Ci fan(C),ap) ,-••) embeds 9Rg into projective space. The fourth point I want to make about ЗЯ5)П is that although it is not compact, because a sequence of curves may "degenerate", 3Jt5)Tl has a natural compactification 3Jt5)Tl obtained by casting out your net further and attempting to make into a moduli space not only the non-singular curves, but also some singular ones too. In fact one looks at curves С С Pn which may have "ordinary double points" and may even have several components. To be precise, we mean either a) that as an analytic set, С is connected and everywhere is isomorphic locally either to the unit disc Л, or to 2 copies of the unit disc A\ U Ai crossing transversely or equivalently
250 A. Curves and Their Jacobians b) that in the Zariski topology, С is connected and everywhere is defined locally either by n — 1 equations /1,..., /n-i with independent differentials dfc or by n — 1 equations g, /2,..., /n-i where g vanishes to second order with leading term (x,y) and the /j's vanish only to 1st order, with dx,dy,df2,... ,dfn-\ all independent. For instance, we could take 2 non-singular curves and let them cross transversely at one or more points; or we could take 1 non-singular curve and map it to Pn so that it crosses itself transversely at one or more points. Or we combine both operations! Then 3Jt5)Tl is to be the space of objects (C,x\,... ,xn), up to isomorphism, where С is a projective curve with only ordinary double points as defined above and the X{ are distinct non-singular points of С and g is the sum of genuses of the components of С treated as non-singular curves, plus the number of double points, minus the number of components, plus one: g = ^2(gi — l)+S+l; and finally if any component Co of С is isomorphic to P1, then there are at least 3 points of Co which are x^s or where Co meets other components of C. It is a theorem that 3Jt5)Tl is, in a natural way, a projective variety, esp. it is compact. The last topic I would like to discuss at some length is the curious ambivalence in the variety 3Jt5)Tl to be in various senses somehow hyperbolic on the one hand, yet in other senses it wants to be elliptic. To explain this, it's best to go back first to ЗЯ1Д. We can factor the map: H A) £ H/SLB, Z) by considering subgroups Г С SLB,Z) of finite index: H —-> Н/Г —-> H/SLB, Z). The curves Н/Г are finite coverings of ЯЛхд and are called "higher level" moduli spaces: I'll denote Н/Г by SDlfд. It too can be naturally compactified by adding a finite set of points; so we get finally the diagram 4 с ЯЯхд С Now of course all curves lie in 3 classes:
A. Lecture II 251 Elliptic Class: <? = 0; admits positively curved no holo. &-forms metric; Parabolic Class: 9=1', admits flat metric; one holo. &-form for each к Hyperbolic Class: g > 2; admits negatively curved lots of holo. metric; &-forms giving proj. embedding The point is that ЗЯхд is P1, hence is elliptic, while if Г is moderately small, SQtj j is hyperbolic. The reason this flip is possible is that /3 is ramified: in fact there are 2 finite points j = 0 and j = 123 at which Эхд —> ЯЯхд is respectively triply and doubly ramified, and 1 infinite point j = oo over which the /3's are arbitrarily highly ramified. Prom another point of view, Эхд admits a canonical metric with negative curvature, i.e., ds2 = dx2 +dy2/y2, (if z = x+iy 6 H ~ Эхд is the coordinate). This induces a negatively curved metric on each SQtf 1. In this metric, SQtf г has finite volume, but the metric has singularities, a) at points where a is ramified and b) at points of SQtjд — ЯЯ{д. (If Г is small enough, a will be unramified and only (b) occurs.) It is this constellation of facts that to some extent generalizes to 3Jt5)Tl. In our present state of knowledge, the generalization is very partial. To begin with, we get the same diagram: 9in a I V for each Г С -Г5)П of finite index. Let me begin with the known elliptic-type properties which are unfortunately weak: we assume n = 0 for simplicity. a) Assume also g > 4 for simplicity21. Then the singular set S С fflg is the set of points of 9Rg where 3g -> 9Rg ramifies and is the set of points corresponding to curves С with automorphisms. Then В\{Шд — 5), the first betti number, is zero, hence so is B\ of Шд, Шд and any non-singular blow-up ЗЯ* of %Rg. This means, e.g., that the so-called Albanese variety of ЗЯ* is trivial. b) 3Jt5 has lots of rational curves in it. In fact for any algebraic surface X and rational function / on X, let Ct С X be the curve f(x) = t, and let [Ct] € 9Rg denote the corresponding point. Then 21 If g = 2 or 3, Sing(9E)T9) Ф (Ram.Pts. of 3g -> Шд) С {С with automorphisms}. Always Bi(mg - Sing Жд) = 0, hence Bi(pKg) = В\Щд) = В^{Щ) = 0.
252 A. Curves and Their Jacobians is a morphism mg. c) If 9 < 10, fflg has the much stronger property of being unirational. This means equivalently that the field CCJt5) of rational functions is a subfield of C(ti,t2, ■. • ,tn) for some n or that there is a Zariski-open set U С An and a morphism /:[/—»■ 9Rg with dense image22. In terms of moduli, fflg being unirational means that one can write down a family of curves of genus g depending on parameters t\,..., tn which can be arbitrary complex numbers satisfying some inequalities fi(t) ф 0, such that "almost all" curves of genus g appear in the family: e.g., if g = 2, take the family y2 = x5 + txx4 + t2x3 + t3x2 + Ux + h and if g = 3, take the family y4 + y3 (tix + t2) + y2 (t3x2 + Ux + t5) +y {kx3 + t7x2 + t4x + ts) + (t9x4 + t10x3 + tnx2 + tux + t13) = 0. In fact, if g < 10 we may use the remarks in Lecture I about realizing curves as plane curves with double points to write down a family of plane curves of degree d = [^p^] with free parameters almost all of which represent curves of genus g and which include almost all curves of genus g. Whether more SDt5's, g > 11, are unirational or not is a very interesting problem, but one which looks very hard too, especially if g is quite large. Now consider the hyperbolic tendencies of 3Jt5)Tl. First of all, we can put 2 types of metric on 3gy. one of these is the famous Teichmuller metric рт- This is a Finsler metric, so it's a bit messy. However, it equals the Kobayashi metric of 35,n, so all holomorphic maps / : Л -> Э5)П are distance decreasing for рт and the Poincare metric on Л : a hyperbolic property. Its unit balls have been determined and are quite amaz- amazingly wrinkled and creased: this led Roy den to prove the rigidity theorem that if dim35)Tl > 1, Гд^п = Aut C5)Tl); esp. 35)Tl is not at all a homogeneous domain in C3s~3+n. On the other hand, in this funny metric, 35)Tl is a straight space in the sense of Busemann, i.e., has unique indefinitely prolongable geodesies, but con- contrary to a conjecture does not have negative curvature in Busemann's sense (this fly in the ointment shows that my general picture is not entirely accurate!). 35)Tl carries another metric pp-w, the Peterson-Weil metric23, which is a Kahler metric, hence locally much nicer. Moreover, it has strictly negative Ricci cur- curvature and holomorphic sectional curvatures. In particular, holomorphic maps / : Л -> 35)Tl will also be distance decreasing for pp-w (suitably normalized) by the Ahlfors-Pick lemma. All the spaces ЗЯ5)П inherit both metrics (with pos- possible singularities where a : 35)Tl -> 90t£n is ramified), and, esp. with pp-w, this makes them rather hyperbolic. A closely related hyperbolic property of 3Jt5)Tl is: 22 If this holds, one can assume n = Zg — 3 by restricting /„ to U П L, L a sufficiently general Ъд — 3-dimensional subspace of A"; hence С(Шд) С C(*i,... ,£зэ-з) with finite index too. 23 If n > 0, more precisely, there is a family of P — W metrics depending on assigning branch numbers 04, 2 < 04 < 00, to the base points Xi.
A. Lecture II 253 The Rigidity Theorem of Arakelov-Parsin-Manin-Grauert. (also called the "Safarevitch-Mordell conjecture in the function field case"). Fix g > 2 and let С be any curve, S a finite set of points of C. Then there are only finitely many families of curves of genus g over С — S, i.e., 7Г-.Х —>C-S which are "non-constant" (i.e., the fibres tt~1(s) not all isomorphic), and if 2(genus C) - 2 + #5 < 0 there are none at all; moreover, for each such family there are only finitely many sections, and even for "constant" families, there are only finitely many non- constant sections. Corollary. Fix g,n,C,S as above. Then there are only finitely many non- constant morphisms Ф: с - s —> mg,n which are locally liftable to ЭД)П: i.e., if x G С — S, and ф{х) is a ramification point for 3g>n —>• Шд,п, one asks that in a small neighborhood of x, ф factor through 3g,n- A sketch of the proof is given in an appendix below. Finally I want to conclude by giving a conjecture which I am hopeful will very soon be a theorem! Conjecture. For each g,n, there is а ГЬ С Л,,п of finite index such that for all Г С Го, ЯТ^П is a variety of general type in Kodaira's sense. Here "general type" for a variety X of dimension n means that you compact- ify X to X, then blow-up X to X* which is non-singular, and then you look for differential forms of type uj = a(x) {dx\ Л ... Л dxn) on X*, with no poles. It means that if к is large enough, you can find n + 2 such forms whose ratios generate the field of rational functions C(X) on X. Since on unirational varieties, there are no non-zero differential forms of any type, the conjecture means that for Г small, 9Jt£n is more or less the opposite of being unirational. [Added in 1997 edition] After these notes were written, a remarkable fact was discovered: 9Яд itself, for g > 24, is of general type. In other words, one finds in the sequence of spaces 9Яд this transition from elliptic to hyperbolic. For g < 13, Шд is unirational, *DT15 has Kodaira dimension — oo (no such w's, any к), ШТ23 has positive Kodaira dimension B such a/s for some k) and general type thereafter.
254 A. Curves and Their Jacobians Appendix: The idea of the proof of rigidity The proof has 2 steps. The first consists in showing that the set of all families тг : X —>• С — 5, and the set of all sections s : С — S —>• X of families тг, itself consists in a finite number of families. The second consists in showing that given one тг : X —>• С — S or one section s : С — S —> X of such а тг, then one cannot deform тг or s, i.e., that the only families тг or s lie in are O-dimensional. Since a finite number of O-dimensional families is just a finite set, we are done. To carry out the first step, one can use an explicit projective embedding of 9ЭТД)П, and for all ф : С - S ->• Шд>п with ф(С — S) ф point, extend ф to ф : С —>• Шд>п and seek a bound on degree ф{С). Then by general results, the set of morphisms ф : С — S —>• Шд>п with degree ф{С) bounded can be grouped into a finite number of nice families, the parameter space of each of which is some auxiliary variety. Equivalently, this means take a particular ample line bundle L on 9Яд>п and seek a bound on с\{ф L). (In fact, the nicest line bundle to pick is not quite ample, but near enough to make the proof go through: we will ignore details like this here.) Choosing a nice L, the next step is to identify ф L from the geometry of the family тг : X —>• С — S and the section s. One extends the family тг of non-singular curves over С — S to a larger family W:~X—>C over С of curves, some of which have double points (as in the definition of Шд>п). Then it turns out that for the most natural L on 9ЭТ„.п, ф*Ь^ where fi-xjC denotes the line bundle whose sectinons are differential forms on the curves tt-1(s), i.e., the cotangent bundle to the fibres of тг, except that where tt-1(s) has a double point, the forms may have simple poles with opposite residues at the 2 branches of tt-1(s) at this double point. If one is dealing with n sections si too, hence a morphism ф : С — S —>• Шд>п with n > 0, then ф L is a tensor product of powers of this bundle and the line bundles where ^ : С —>• X is the extension of s^. Now, in fact, by using the theory of algebraic surfaces, one gets a very good bound: 1-9o) where q = genus С s = #5 go = dimension of biggest abelian variety which appears in the Jacobian of every curve tt-1(s) of the family.
A. Lecture II 255 C\ [s*^jc/c) seems harder to bound: I don't know a nice small explicit bound. However, following Grauert one can show that one exists by showing first that the cotangent bundle i?-^ (of rank 2) is ample on almost all fibres tt~1(s) of X over С and then applying general results on ample vector bundles. A good explicit bound here would be very interesting. To carry out the second step, one applies Kodaira-Spencer-Grothendieck de- deformation theory to calculate the vector space of infinitesimal deformations of 7Г : X —>• С — S and of s : С — S —»• X. More precisely, one looks at deformations of X such that the map тг : X —>• С extends to this deformation and all singular fibres remain concentrated in n~1(S). It turns out that: Space of 1 infinitesimal I ~ rxi (~y q-\ deformations of [ ~~ \ ' x/s v 7Г-.Х—vC-S ) and Space of infinitesimal deformations of s-.C-S—>X To show these vector spaces are @), one shows - and this is Arakelov's deepest contribution - that M-x/C is an ample line bundle on X. Then the first space is @) by Kodaira's Vanishing Theorem, and the second space is @) because the line bundle has negative degree. Amazingly, Arakelov's proof here involves studying the curve D С X such that D П тг (s) = the Weierstrass points of тг (s) and identifying via differentials the line bundle on X of which D is a section.
Lecture III: How Jacobians and Theta Functions Arise I would like to begin by introducing Jacobians in the way that they actually were discovered historically. Unfortunately, my knowledge of 19th-century literature is very scant so this should not be taken too literally. You know the story began with Abel and Jacobi investigating general algebraic integrals I = f f(x)dx where / was a multi-valued algebraic function of X, i.e., the solution to g(x,f(x)) =0, g polynomial in 2 variables. So we can write / as / = у dx where 7 is a path in plane curve g(x, y) = 0; or we may reformulate this as the study of integrals Ha) - Г ^(Ж' У) jx p> Q polynomials У) X ' a,aoe plane curve С : д(х, у) = 0 of rational differentials ш on plane curves C. The main result is that such integrals always admit an addition theorem: i.e., there is an integer g such that if uq is a base point, and a\,..., ag+\ are any points of C, then one can determine up to permutation b\,...,bg G С rationally in terms of the a's24 such that I и + ...+ I 00= и + ...+ I u, mod periods of / u. J ao J a,o «во «/во «/ For instance, if С = F1, uj = dx/x, then g = 1 and: fai dx Г2 dx _ faia2 dx J1 x Jx x Jx x ' Iterating, this implies that for all a,\,..., ag, 6i,..., bg EC, there are a,..., cg € С depending up to permutation rationally on the a's and b's such that 24 E.g., one can find polynomials gi(x,y;a) in x,y and the coordinates of the a's such that the bi's are the set of all b G с such that gi(b; a) = 0.
258 A. Curves and Their Jacobians 9 pa / (mod periods). 9 ра{ 9 i-bi 9 pa il / ш ~*~ Z_/ / ш ^ Z_/ / =l 1'ao i=i •'oo j=i Jao Now this looks like a group law! Only a very slight strengthening will lead us to a reformulation in which this most classical of all theorems will suddenly sound very modern. We introduce the concept of an algebraic group G: succinctly, this is a "group object in the category of varieties," i.e., it is simultaneously a variety and a group where the group law m : G x G ->• G and the inverse i : G ->• G are morphisms of varieties. Such a G is, of course, automatically a complex analytic Lie group too, hence it has a Lie algebra Lie(G), and an exponential map exp : Lie(G) —>• G. Now I wish to rephrase Abel's theorem as asserting that if С is a curve, and uj is any rational differential on C, then the multi-valued function J an UJ 'a0 can be factored into a composition of 3 functions: С - (poles of uj) -A J 4^ Lie J А С where: i) J is a commutative algebraic group, ii) t is a linear map from Lie J to С iii) ф is a morphism of varieties; and, in fact, if g = dim J, then if we use addition on J to extend ф to ф^ : [(C-poles uj) x ... x (C-poles uj)/ permutations] —> J sg then ф(д) is birational, i.e., is bijective on a Zariski-open set. In our example then J = F1 — @, oo) which is an algebraic group where the group law is multi- multiplication, and ф is the identity. The point is that J is the object that realizes the rule by which 2 p-tuples (a\,... ,ag), Fi,... ,bg) are "added" to form a third (ci,..., cg), and so that the integral $^i=i Hx*) becomes a homomorphism from J to С A slightly less fancy way to put it is that there is а ф : С-(poles и) —> J and a translation-invariant differential 7] on J such that hence гф(а) ran j г)= I uj (mod periods). Jф(ап) Jao Among the a/s, the most important are those of 1st kind, i.e., without poles, and if we integrate all of them at once, we are led to the most important J of
A. Lecture III 259 all: the Jacobian, which we call Jac. From property (iii), we find that Jac must be a compact commutative algebraic group, i.e., a complex torus, and we want that ф : С —> Jac , should set up a bijection: translation— iv) ф* : Thus . , , . , rational differentials invariant 1-forms , | wonC w/o poles г] on Jac L ' *■{ dim Jac = dimRi(C) = genus g of C. To construct Jac explicitly, there are 2 simple ways: v) Analytically: write Jac = V/L, V complex vector space, L a lattice. Define: V = dualoffli(C) set of £ G V obtained as periods, i.e., £{uj) = J a; for some 1-cycle 7 on С Fixing a base point uq G C, define for all a G С {image in V/L of any £eV defined by £(uj) = С w, where we fix a path from ao to a Note that since Jac is a group, т,„^ , translation — invariant \ _ / cotangent sp. to Jac at a 1-forms on Jac J \ any a G Jac vi) Algebraically: following Weil's original idea, introduce S9C = С x ... x C/Sg and construct by the Riemann-Roch theorem, a "group-chunk" structure on S9C, i.e., a partial group law: m : U\ x C/2 —> U3 UiC S9C Zariski-open. He then showed that any such algebraic group-chunk prolonged automat- automatically into an algebraic group J with S9C D U4 С J (some Zariski-open £/4). An important point is that ф is an integrated form of the canonical map Ф : С —>• F9~1 discussed at length above -
260 A. Curves and Their Jacobians vii) Ф is the Gauss map of ф, i.e., for all x G С, <1ф(Тх,с) is a 1-dimensional subspace of Тф(х)^&с, and by translation this is isomorphic to Lie(Jac). If Рд-1 z= [space of 1-dimensional subsp. of Lie(Jac)], then в,ф : С ->• Ffl-1 is just Ф. (Proof: this is really just a rephrasing of (iv).) The Jacobian has always been the corner-stone in the analysis of algebraic curves and compact Riemann surfaces. Its power lies in the fact that it abelianizes the curve and is a reification of Яь e.g., viii) Via ф : С —>• Jac, every abelian covering тг : C\ —> С is the "pull-back" of a unique covering p : G\ —> Jac (i.e., C\ = С х G\). Jac Weil's construction in vi) above was the basis of his epoch-making proof of the Riemann Hypothesis for curves over finite fields, which really put characteristic p algebraic geometry on its feet. There are very close connections between the geometry of the curve С (e.g., whether or not С is hyperelliptic) and Jac. We want to describe these next in order to tie in Jac with the special cases studied in Lecture I, and in order to "see" Jac very concretely in low genus. The main tool we want to use is: Abel's Theorem. Given x\,..., Xk, y\,..., у к € С, then 3 rational function / "j on С with I JU \h / (/) = (zeroes of /) - (poles of /) ( «=* 2^ ^Xit = 2^ ™)' def I i=\ i=\ When this holds, we say Exi = Uyi, or Uxi,Uyi are linearly equivalent. For instance, when С = F1, any 2 points a, b are linearly equivalent via the function x~a For every /г, we consider the map: к times С x ... x С —> Jac к г=1 If SkC denotes Ck divided by permutations, i.e., the fcth symmetric power of C, then this map factors via ф(к) . skC _^ Jac Define
A. Lecture III 261 surjective if к > д). The fibres of this map25 are called the linear systems on С of degree k, and by Abel's theorem they are the equivalence classes under linear equivalence and can be constructed as follows: a) Pick one point 21 = £?=1 x{ e SkC. b) Let {v. sp. of fens, f on С with (/) + 21 > 0, i.e., poles only at £i, order bounded by mult, of Xi in 21 c) Let |2l| = {set of divisors EVi = (/) + SXi, f € LBl), / ф 0} С SkC. d) Then |2l| = ф^~1 {ф(кЦЩ. Note that it follows |2l| = projective space of 1-dimensional subspaces of LBl). e) We also want to use the Riemann-Roch theorem that tells us that dim |2l| = к — g + i where {dim of v.sp. Ri(—21) of differentials 1 w 6 i?i(C), with zeroes on 21 J " Now let's look at low genus cases: g = 0 : Jac = @) g = 1 : (а) ф : С —>• Jac is an isomorphism, i.e., С = Jac. In fact, for any genus : С -> Jac is an embedding, hence an isomorphism of С with W\. {Proof, the fibres of ф(к) being Fn's, ф^ would be either an embedding or С itself would be P1.) (b) If A; > 2, ф(к) . Skc _^ Jac makes SkC into a Pfc-1-bundle over Jac, whose fibres are the linear systems of degree k. In general, if к > 1g — 2, ф(к) .SkC —> Jac makes SkC into a Pfc~fl-bundle over Jac. {Proof. This is a conse- consequence of the Riemann-Roch theorem since no differential can have more than 2g — 2 zeroes.) 25 A technical aside: the complete ideal of functions on SkC vanishing on ф^ 1(а) is generated by the functions on Jac vanishing at a - this is needed to make rigorous some of the points made below.
262 A. Curves and Their Jacobians g = 2 : The interesting case is 1 < к < 2g — 2, i.e., к = 2: the map Recall that there is a degree 2 map тг: С —>• F1. Since the points of F1 are all linearly equivalent to each other, the degree 2 cycles 7г-1(ж) are also linearly equivalent. This gives us a copy E of F1 inside S2C. The result is that Jac is isomorphic to the quotient of S2C after identifying all points of E; i.e., that Jac is obtained by "blowing down" E С S2C. Here is a picture: S2C 0B) where, as is customary in the theory of algebraic surfaces, we draw real 2-dimensional manifolds in place of manifolds of 2 complex dimensions, which are 4 real-dimensional, hence undrawable! Going backwards, we may say that S2C is obtained from Jac by "blowing up" e = ф^(Е): this is a process applicable to any variety X that replaces one of its points x by the set of tangent lines to X at x, giving you a new variety BX(X) birational to the first. We see here clearly that if we take the group law m : Jac x Jac —>• Jac and try to transfer it to S2C, we get merely a group chunk as in Weil's treatment because of E. Consider first к = 3: Jac> For any x G C, consider the differentials oo on С zero at x: they form a 2-dimensional vector space and have 3 zeroes besides x. These zeroes form a degree 3 cycle, and as uj varies all these are linearly equivalent (use the functions uj\/uj2)'. this gives us a copy Ex of F1 in S3C. It turns out: Jac = (S3C modulo collapsing each Ex to a point ), or putting it backwards if 7 = locus of points ф№\Ех), then S3C = ( Jac, with a curve 7 С Jac, isom. to C, blown up). Most interesting is the case к = 2: . S2C —H W2 С Jac.
A. Lecture III 263 Then if С is not hyperelliptic, there are no non-trivial degree 2 linear systems, so S2C ^ W2. But if С is hyperelliptic, you get one degree 2 linear system as in the g = 2 case, so W2 = (S2C with a copy E of F1 blown down). The image e G W2 of E is a now double point and it looks like this: 0B) g = 4 : In this case, I want to consider because of its importance in Lecture IV only the case к = 3: ) Ssc __» щ c jac. We mentioned briefly in Lecture I that either а) С is hyperelliptic, or b) С was an intersection in F3 of a quadric F and a cubic G. Now we also distinguish bi) F singular, hence a quadric cone, and Ъ2) F non-singular. Ъ2) is the most common case. Using the 2 rulings on a non-singular quadric, it is a standard fact that such a quadric is isomorphic to F1 x F1. Thus С = (F1 x F1) П G, and since G is a cubic, С meets the curves (F1 x pt.) or (pt. x F1) in 3 points. Thus the 2 projections of F1 x F1 to F1 induce 2 maps 7ri,7r2 from С to F1 of degree 3. The 2 families of degree 3 cycles and {тг2~1(ж)} 1 {^} {2 form 2 linear systems E1,E2C S3C, with E1 £ E2 = F1. Then: case b2: S3C with Ei,E2 blown down to 2 points ei, e2 and ei,e2 = ordinary double points of W3. In case Ъ\, the 2 rulings "come together"; in fact, S3C contains only one non-trivial linear system E, and
264 A. Curves and Their Jacobians case 6i: W3 = (S3C with E blown down to e) and e = higher double point of W%. In the hyperelliptic case a, it turns out that there is a whole curve of linear systems Ex С S3C depending on a point x G C: in fact, take the degree 2 linear system, and just add x to each of its members. Thus тхг ~ ( S3C with the surface UEX blown case a: W3 = , . ,. , „ \ down to a curve 7 isomorphic to С and 7 = double curve of W3. Enough examples: the moral is that HVs and their singularities display like an illustrated book the vagaries of the curve С from which they arise. The general result is the following: Theorem (Riemann-Kempf). Let a eWk С Jac, let L = ф^~1(а) С SkC and suppose L = Ff. Then Wk has a singularity at a of multiplicity (д~£+ ), and the tangent cone to Wk inside Ta> jac (— tangent sp. to Jac at a) is equal to: Here £)</>(fc) is the differential of ф^ and it gives rise to an exact sequence: (*) 0 > T^L > Т^3кС У Ta, Jac- In fact, this sequence actually "displays" the Riemann-Roch formula in a beau- beautiful way: using the fact that Rx (C) = translation-invariant differentials on Jac — T*t Jac (= cotangent sp. to Jac at a), for all a, it is not hard to check that if и G R\{C) corresponds to [и] е T*t Jac, then: [uj]=0 in Т^>3кС<=^ш is zero on 21. Therefore Coker Бф{к) ^ dualof-Ri(-Sl), the space of differentials zero on 21. Therefore counting the dimensions of the vector spaces in (*): dim L = dim SkC - dim Jac + dim Coker
A. Lecture III 265 which is the Riemann-Roch theorem! What comes next is going to be harder to follow, but we can go much further: let {uji} be a basis of R\(—21) and let {fj} be a basis of LBl). Then a general member of L is given by and a basis of Ri(—2lt) is given by (Z^=o *?/j ) ' u*- Therefore span the dual of the cokernel of Ja,Jac, or ^2j=otj[fjUJi] = 0 are linear equations on Ta,jac which define the subspace D<}>{k) {Twit,skc)-lt follows that if we put together a big (£+1) x (g - к+£) -matrix of linear functions on Ta,jac out of [fj-ил], then all its (£ + 1) x (£ + 1) minors vanish on each Бф^ {T^uskc)i hence vanish on the whole tangent cone to Wk- Kempf proved that these equations suffice, and that Wk itself has equations of this type: Theorem (Kempf). There is a (£ + 1) x (g — к + £)-matrix of holomorphic functions (fij) on Jac near a such that Wk is the set of zeroes of all its (£+1) x (^4- 1) minors: i.e., Wk is a determinantal variety. Moreover, [fjOJi] = linear term of fij and the tangent cone to Wk is the set of zeroes of the (£+1) x (£+1)- minors of the matrix \fjOJi] of linear functions. The feature of the Jacobian, however, which really gives it its punch is the theta function. There are 3 very good reasons to look next at the function theory of Jac - a) to define projective embeddings of Jac, hence understand better the algebraic structure, moduli, etc. b) because Jac is a group, one hopes that its function theory will reflect this in interesting ways, c) by pull-back, functions on Jac will define functions on S9C, hence on C, and may give a good way to expand functions on C, prove the Riemann-Roch theorem, etc. So write Jac =£9/L. Instead of constructing L-periodic meromorphic functions / on О, one seeks L-automorphic entire functions /, i.e.,
266 A. Curves and Their Jacobians f(x + a) = ea(x) • /(ж), a e L, x € {ea} = "automorphy factor6 Equivalently, such / are holomorphic sections of a line bundle L{Cay on Jac and clearly the quotient of 2 such / is always L-periodic. The simplest choice of {ea} is something in the general form: (ea = (constant) is too simple, because no /'s will exist.) Now if g > 2, most complex tori C9 /L have no non-constant meromorphic functions on them at all, and are not algebraic varieties, and do not carry any but "trivial" {ea}'s27. In the case of a curve C, however, special things happen; let's look for bilinear forms as candidates for B. We saw above that on R\(C) one has a positive definite Hermitian form: , , f _ Jc hence its dual, which is the universal covering space О of Jac, gets a Hermitian form that we will write H. But also Hi (C, Z) carries an integral skew-symmetric form given by intersection pairing. As we saw in (v) above, there is an isomorphism Hi(C,Z) = L, hence L carries such an E. It is not hard to show that H and E are connected by: 2) = Im Я(ж1,ж2), all si,Ж2 G L, and that when (*) holds there is a (nearly canonical28) choice of {ea}, viz. ea(z) = ±e^(*.«)+^K«)]. Moreover, one has the beautiful theorem: 26 i.e., entire functions on C9, nowhere zero, such that ea+p(x) = ea(x + P) ■ ep(x). 27 {ea} is trivial if ea(x) = e(x + a)/e(x) for some e. 28 The sign ± is not canonical; it satisfies some funny identities that I don't want to discuss; any 2 choices, however, are related by a transformation e'a(x) = £(a)ea(x), £ e Horn (L/2L, ±1).
A. Lecture III 267 Theorem. The existence of a positive definite Hermitian H on <C9 and an inte- integral skew-symmetric E on L satisfying E = Im H is necessary and sufficient for a complex torus <C9 /L to carry g algebraically independent meromorphic func- functions and if it has such functions, it admits an embedding into Fn, some n, hence is a projective variety29. Here we see the principle emerging that a complex torus does not fit easily in Fn: non-trivial identities (*) are required before it will fit at all. Now define a theta-function30 of order n to be an entire function / on О such that f(X+a) = and let Sn be the space of such /. Then 5 = ^ Sn is a graded ring. Elementary Fourier analysis combined with the fact that E is a unimodular pairing leads to (**) dim Sn=n9, (n>l). In particular, there is exactly one first order theta-function, up to scalars. This important function, written $(ж), is called Riemann's theta function31. If, in- instead, we take any n > 3, and let ф\,..., фпд be a basis of 5n, we get: Lefschetz's embedding theorem. C9 /L is embedded in P71" by (ф1(х),...,фп9(х))=Фп(х). This makes sense because ф%1ф^ are single-valued functions on C9 /L. This solves problem (a) raised above, (b) however is even more remarkable. In fact, to introduce the group structure into the picture, for all f3 G C9, define (!>/)(*) = /(* + /?). For all nowhere zero holomorphic functions eonC9, define 29 By Chow's theorem, if you embed it in projective space at all, the image is projective a variety; and if you embed it in 2 ways, the 2 projective varieties are isomorphic algebraically as well as analytically. 30 Since this is not exactly the classical definition, let me indicate the connection. Classically, one splits L = L\ + L2, when Li = Z9 and E(x\,X2) = 0 if x\,X2 are both in Li or both in L2. For all a E L2, define a complex linear £a : C9 —> С by £a(x) = E(x,a) if x G L\. Require instead f(x + a) = f(x), qGLi f(x + a) = е2«»<М»)+$Ма))у(я^ Vq € L2. Then these /'s differ from the other /'s by an elementary factor. 31 In the classical normalization, it is
268 A. Curves and Their Jacobians (Uef)(x)=e(x)f(x). Then refining the analysis leading to (**), one finds Lemma, i) V/? G C9, there exists e such that UeTpSn = Sn if and only if ii) Choosing such an e(/3) for each 0 G \L, /3 »-> t/e(/3) -Tp defines a projective representation32 of ^L/L on Sn: this representation is irreducible. It seems to me remarkable that although C9/L is an abelian group, its function-theory is full of irreducible representations of dimensions bigger than one: in fact, these are ordinary representations of a finite 2-step nilpotent group 1 —> Z/nZ —> <5n —> -L/L —>- 1 n analogous to the nilpotent Lie group: 1 —>R —>•£> —>V ®V —>-l (V = real vector space) whose Lie algebra is the Heisenberg commutation relations33. This rather easy lemma has lots of consequences. Corollary, i) In the embedding \Pn, translation by f3 on C9 /L extends to a linear transformation P71" -)■ F71" if and only if C G ^L/L. ii) Modulo the choice of distinguished generators for the associated finite group <6n, we get, up to scalars, a distinguished basis of Sn, hence a normaliza- normalization of 91 under projective transformations. In this normalization, translations by 0 G ^ acts on \Pn(C9 /L) by a simple set of explicit n9 x n9-matrices. To be more explicit, start with •& G S\. Choose ф : Ъ9 х Ъ9 —ь L such that Е(ф(п,т),ф(п',т')) = (n,m'} — (m,n'), and extend ф to Q9 xQ9 -^ L <g> Q. Then if n = m2, a typical distinguished basis of Sm2 is of the form 32 I.e., g •-> Ug is a projective representation of G if Ugig2 = COnSt. Ugi ■ Ug2, all gi,g2 eG. 33 5) = set of triples (a, x,£), a € R, x e V, £ € V, with group law: (a,x, 0 .(a1, *', £') = (а + а'
A. Lecture III 269 a (x) = exponential of suitable ' linear fen. д(тх + ф(а, where a,P range over coset representatives of ~Ъ9 modulo Ъ9. Thus is the normalized projective embedding of C9 /L. The most important point here is that while translations by P G ^L/L are normalized, Фп@) is not normalized. Hence #n@) = (...,$ о @),...J is an invariant of the torus C9/L and the distinguished generators of <5n. These # я @) are classically called the theta- null werte. We will discuss their role as moduli at more length in Lecture IV. Summarizing this discussion, you can say that you can take а) C9 /L, and b) Fn: both innocent homogeneous complex manifolds. You marry them via \Pn and the children they produce are these highly unsymmetric and intricate functions $ о : С9 /L Fn We pass on now to problem (c): when C9 /L is the Jacobian of C, pull back functions on C9 /L to С and see what you get. We follow Riemann and consider the basic functions: where e G C9, and a; = (uj\,... ,uj9), uJi the basis of R\{C) (recall that $ is naturally a function on the dual space to Ri(C), and we have identified this space with £9; so {ш{\ is the dual basis of R\{C)). For fixed у and e, this is a multi-valued function on С that changes by a multiplicative factor ( I u>+ const. I \\Jy ) when analytically continued around a cycle. Riemann showed that when not identically zero it had exactly g zeroes z\,..., zg and that there was a point Л G Jac (depending on the choice of sign ± in the definition of the automorphy factor {ea} for ■#) such that in Jac:
270 A. Curves and Their Jacobians In fact, we saw that ф{я) , s9C —> Jac was birational, and this shows that if e0 = ф(у) + Л we get an inverse to almost everywhere by e i—> (/-tuple of zeroes of ft I / a; — e -I- e0 \Jy Moreover, we find: some Zi 3zi,..., zg-\ G С equals у such that YX=l Ф&) = ^ + e This means that if we define the codimension 1 subset & С Jac by в = {x G Jac | ti(x) = 0}, then, up to a translation, & is just Wg-\\ This is the basic link between theta- functions and the geometry discussed earlier. Moreover, if we fix e such that $(e) = 0, and fix z\,..., zg-\ G С such that ^ Ф^г) = Л + е, then consider the function Ee(x, y) for variables x and y. It follows that so long as x »-> Ee(x, y) is not identically 0, its zeroes are x = у and x = z\,..., zg-i, i.e., ignoring certain bad points Z{ independent of у, Ее(х, у) is a Prime Form as a function of x: has a unique variable zero at y. Using this, we can show that every rational function f on С has a unique factorization: If ai = zeroes of / bi = poles of / then г-, „ , (for some a; G i?i(C)). This beautiful decomposition is the higher genus analog of the factorization: of rational functions on F1. Nor do these factorizations depend much on e, be- because if $(ei) = i?(e2) = 0, then Eei (ж, у) = (fen of x alone)(fen of у alone)Ee2 (ж, у). Using the £?e's, we also get beautiful expressions for differentials on С with various poles too, e.g., ■d £?.(i,a) \ is a rational 1-form on C, with simple poles at a, 6 only, residues ±1 respectively; log Ee(x,y) dx y=a is a rational 1-form on C, with a double pole at x = a and no others.
Lecture IV: The Torelli Theorem and the Schottky Problem The purpose of this lecture is to consider the map carrying С to its Jacobian Jac from a moduli point of view. Jac is a particular kind of complex torus and the Schottky problem is simply the problem of characterizing the complex tori that arise as Jacobians. The Torelli theorem says that Jac, plus the form H on its universal covering space, determine the curve С up to isomorphism. First of all, we saw that if g > 2, not all complex tori X = C9 /L are even projective varieties: in fact, necessary and sufficient for X to be a projective variety is that there exists a positive definite Hermitian form H on O, such that E = Im H is integral on L x L. The varieties that arise this way are called def abelian varieties. The forms H are called polarizations of X. Since rk L = 2g, and E is skew-symmetric and integral on L, det E = (—l)9d?, for some d G Z, d > 1: d is called the degree of the polarization. A polarization of degree 1 is called a principal polarization. Jacobians come with a natural polarization in which E is just the intersection form on L = H\(C,Zj): this form is unimodular, so this is in fact a principal polarization. In general, if (O /L, H) is any polarized abelian variety, one can find L\ С L of finite index and n > 1 such that (O /L\, £ H) is a principally polarized abelian variety - so in studying all abelian varieties, the principally polarized ones play a central role. Secondly, we saw in Lecture III that starting with any principal polarized abelian variety (C9 /L, Я), we get Riemann's theta function ■# : C9 -> C, hence & = (zeroes of $) С С9 /L. A more succinct way to describe how C9 /L and H canonically determine the codimension 1 subvariety & up to translation34 is the following: & = any codim. 1 subvariety D of C5 /L whose fundamental class [D] e H2(C9 /L, Z) is represented by E, under the canonical identification: H2(C9 /L,Z) = (skew-symmetric, integral forms on L). Up to a translation, the only such & is the set of zeroes of Id. This shows that H, or & (up to translation) are equivalent data. Moreover it is also possible to describe which codimension 1 subvarieties D С С9 /L arise from H and a ■#: for 34 From Lecture III, © looks like it is unique even without a possible translation: however, remember the annoying ambiguity of sign in {ea } - this means we actually only found © up to translation by a point x G \ L/L.
272 A. Curves and Their Jacobians any a G C9 /L, let Da = translate of D by a. For any D, choose ai,...,ag so that Dai,..., Da meet transversely and consider the number of intersections: This is denoted (D9) and is always divisible by g\ Then D = some 0 iff (D9)/g\ = 1. We say such D's are of degree one. Therefore, instead of describing principal polarizations on C9 /L as forms H with Im E integral and unimodular on L, we can describe them as codimension 1 subvarieties & С С9 /L with @9) = g\ given up to translation. This gives a completely algebraic way to describe such polarizations. There are also quite simple ways to describe algebraically polarizations of higher degree, but we do not need to know these35. We can now introduce the moduli space of principally polarized abelian varieties: set of pairs (X, (9), where X is an abelian variety and & С X is a codimension 1 subvariety such that (в9)=д\ isomorphisms such that but / need not take the identity 0 € Хг to 0 € X2 As in Lecture II, it turns out that Ag has a natural structure of normal quasi- projective variety. Moreover, we obtain a morphism: by defining t(C) = (Jac, Wg-\). The Torelli theorem simply says that t is injec- tive and the Schottky problem can be rephrased as asking for a characterization of the image t(9Rg). Before studying these in more detail, I would like, in parallel with the treatment in Lecture II, to i) indicate the analytic description of Ag via an infinite covering, ii) indicate how to explicitly coordinatize Лд, and iii) describe the closure of t(9Rg) in Ag. In regards to (i), we consider set-theoretically: set of 4-tuples (V, L, Я, a), where V = a complex vector space L = a lattice in V H = a positive definite Hermitian form on V a = an isomorphism Z9 xZ9 -> L with Im H(a(n,m),a(n',m')) = (n,m') — (n',m) isomorphism 35 The 2 methods are i) by a suitable line bundle £ on C9 /L given up to translation, or ii) by a suitable homomorphism ф:С9/Ь-+ (€F/L)A where (C9/L)A is the "dual" abelian variety.
A. Lecture IV 273 set of 3-tuples (X, 0,a), where X = an abelian variety 0 = codim 1 subvariety with @fl) = g\ a = an isomorphism Z9 x Z9 -> ЯХ(Х, Z) where if then [0](o;(u,m),Q;(u, m')) = = fundamental class of 0, isomorphism (The connection being given by X = V/L, O~^_H as above). Clearly, forgetting a defines a map У)д У А-д and for all a € SpBg,Z) = [group of 2g x 2g integral symplectic matrices], defines an action of SpBg, Z) on ¥)g such that On the other hand, given (V, L, H, a), there is a unique isomorphism Я/ : V = C9 such that Ф(а(п,0)) = n, i.e., such that the first g generators of L are just the unit vectors in C9. Define the g x g complex matrix п by Ф(а, (О, m)) = п • m, i.e., the second g generators of L are just the g columns of Q. Write H via a g x g Hermitian matrix h via Then the condition on Im H written out is: Im *n • h • m = 0 Im tntf2 • h-m = ьп-п which works out as simply: Vn, meZ9 О = 1О, h=(ImO)-1. On the other hand, if Q is any symmetric g x g complex matrix with Im i? positive definite, then V = C9 17? Q | f\ 17? Q '=■ OJ + 11 * OJ H(x,y)=tx-(lmf))-1-y a(n,m) = n + Q -m is an element of $)g. This proves that л ~ / °Pen subset of Cfl(fl+1^/2 of g x g complex symmetric 1 fl ~~ 1 matrices i?, with Im ft positive definite /
274 A. Curves and Their Jacobians which is called the "Siegel upper half-space". Bringing in Teichmuller space again, it is not hard to see that we get the big diagram: £3g~3 £g(g+l)/2 U U Зд _ ¥)g Bounded Domains on this line Ag Quasi-Projective Varieties on this line where i is even an equivariant holomorphic map for a homomorphism r : Гд —> SpBg,Z) Гд = Teichmuller modular group. In regards to (ii), we want to mention how to use theta-null werte to explicitly embed Ag in a big projective space P^. To be precise, the ideas of Lecture III lead to the following: there is a subgroup Гт С SpBg, Z) of finite index such that for all m > 2, the functions $ * @), a,/? G ^Z9 running over cosets mod Z9, which are called the theta-nulls of the abelian variety X, are global homogeneous coordinates not on Ag but on the covering space i.e., define an embedding Ад"" «->• F Suitable polynomials in the theta-nulls $ я @), invariant under the finite group SpBg,Z)/rm, will then be coordinates on Ag itself. Assuming the injectivity of t, this gives by composition coordinates once more on fflg: this is method I alluded to in Lecture II. Other ways to get coordinates on Ag are to use other modular functions on Ag, i.e., holomorphic functions automorphic with respect to SpBg, Z), such as Poincare series or Eisenstein series. The coordinates gotten in this way seem harder to interpret algebraically: in particular, finding an algebraic interpretation of the Eisenstein series in terms of the definition of Ag via moduli seems to be a very interesting problem. In regards to (iii), although unfortunately t(fflg) is not closed in Ag, it is very nearly so. We can look at the compactification fflg of fflg mentioned in Lecture II and study the "limit" of the Jacobian of a non-singular curve С as С approaches a singular curve Co representing a point of Шд — Шд. It turns out these Jacobians have limits which are still abelian varieties if and only if Co is made up of a set of non-singular components {Di} connected together like a
A. Lecture IV 275 tree, and that in this case the limit of the Jacobian of С is the product of the Jacobians of the Di. From this one proves: set of pairs (X, 0) of the following type: X = Jac(.Di) x ... x Jac(-Dfc), t(mg) = Di non-singular curves of genus = Ui=i Jac (Di) x ... x 0^ x ... x Note that inside t(9Rg), t(9Rg) is readily characterized as the set of (X, 0) with irreducible 0. We now come to the final and most fascinating point (for me): exploration of the special properties that Jacobians have and that general principally polarized abelian varieties do not have. I would like to thank Harry Rauch, as well as Alan Mayer and John Fay, who introduced me to these questions and helped me see what a subtle thing was going on. The first step is to reconstruct С from Jac, i.e., prove t is injective (Torelli's theorem). Once this is proven, it follows that dim t(mg) = hence: The second point is to try to characterize £(9ЯД) by some special properties (Schottky's problem). I know of 4 essentially different approaches to these closely related questions. At the outset, however, let me say that none of them seems to me to be a definitive solution to the second question and that I strongly suspect that although many special things about Jac are known, there is much more to be discovered in this direction. Approach I: Reducibility of 0 П 0a Recall that 0a denotes the translate of 0 by a. Almost all classical work on Torelli's theorem is closely related in some way to the lemma: Lemma. Let Jac be a Jacobian, 0 its theta-divisor. Then given a G Jac, a^O {0 П вa С 06 U 0c "I (t г П \ с * \ т I j-^ l for some rr, у G C, 1 for some о, с G Jac > <=$> < i/\j./\t- ,. ,. , - ' л I \ a = ф(х) - ф(у) J distinct from 0, a J k ^v ' ^vy/ } In fact, what this means is that if a = ф(х) — ф(у), then 0П0а breaks up into 2 components W\,W2 of dimension g — 2, and W\ С 0&, W? С 0С- This lemma is fairly elementary: let's check the easiest implication "<*=". Using lecture 3, we recall that 0 = Wg-\, esp. 0 is the image of:
276 A. Curves and Their Jacobians ф(я-1) . з9~гС —у Jac . Then 0 П 0a is the image under ф^9'1^ of: ^C 3*eS9~lc with с hence if a = ф(х) — ф(у), by Abel's theorem: 1 = 21 — rr + у Clearly one way for 21' to exist is if x is one of the points in the divisor 21: i.e., 21 = 2l0 + ж, 21' = 2l0 + y: thus W D Wy = { set of divisors 2l0 + rr, 2l0 G 55C} . The only other way is if 21 + y, 21' + x are distinct linearly equivalent divisors of degree g; but by Riemann-Roch, dim |2l + y\ > 1 if and only if there is a 1-form ш, zero on 21 + y. Such an и must have # — 2 more zeroes: call these 2lo- Then w -> w' _ / se^ °^ divisors 21, where 21 + 2lo + у = zeroes 1 D ^ ~ \ of some w G i2i(C7), 2l0 G 55C J and W = Wx\JW'y. Therefore 0 П 0a = @E^ж) U and it is not hard to see that: (where —Wg-2 is the set of points — x, x G Wg-2i and к = £]г=7 Ф{хг), {%i} the zeroes of some и e Ri (C)). Finally, if b = ф(х) — ф{г), с = ф(и)) — ф(у), then the same argument shows: 0П0С = (\Уд-2)ф{у})и(-\Уд-2)к_ф{у) hence 0D0aC0bU0c. Weil investigated the deeper problem of classifying all cases where 0П(9а was reducible: it appears that for most curves, this only happens if а = ф(х) — ф{у) again. But for some curves of genus 3 or 4 or for curves С which are double coverings of elliptic curves, there are other a's for which 0 f) 0a is reducible.
A. Lecture IV 277 However, for general principally polarized abelian varieties X, it seems very likely that 0 П &a is irreducible for all a £ X. There are various ways to use variants of the lemma to prove Torelli's the- theorem: one can stick to the implication "<*=", and generalize it substantially, playing an elaborate Boolean algebra game with all the translates of all the Wr С Jac. This leads eventually to the conclusion that there are only two possi- possible ways to set up this whole Boolean configuration inside Jac, given the divisor 0: one being obtained from the other by reflection in the origin. Or using the full strength of the lemma, one sees that 0 determines the surface У = {ф(х)-ф(у)\х,уеС}с Jac. If С is not hyperelliptic, it turns out that the tangent cone Ty,o to V at 0 G Jac is just a cone over the curve С itself: more precisely, if P is the projective space of 1-dimensional subspaces of Tjac,o5 then the canonical curve Ф(Х) sits in P, and union of lines £, I corresponding to Tv'° ~ * points of Ф(С) If С is hyperelliptic, other arguments are needed. Or one may use variants of the lemma where a is infinitesimal. The geometric meaning of 0П0а, a infinitesimal, is the following: let P be the projective space of (g — l)-dimensional subspaces of TjaCH. Then we get the so-called Gauss map: 7Г : 0-(singular pts. of 0) —У Р defined by ( , ( tangent space T@ x, translated \ 7Г ( CIC ) — I J I J \ to a subspace of TjaC)o J ' The divisors ж~1(Н), Н С Р a hyperplane, are the limits of the intersections 0 П &a as а -> 0. Thus the lemma says that at least a 1-dimensional family of divisors тг (H) is reducible: in fact, note that hyperplanes in P are points in P, and if we let Hx С P denote the hyperplane corresponding to x G P, the lemma says that тг^Ях) is reducible for x G Ф{С). Andreotti showed that one could say more: let В С Р be the branch locus of тг, then В = "envelope" of the family of hyperplanes {Ях}, i.e., В = M (Нх П Hx+sx) (Sx = infinitesimal change of x) хеФ(С) U( linear (g — 2)-space of P dual to the line in P tangent to Ф(С) at Ф(х)). Again for non-hyperelliptic C's, this enables one to reconstruct Ф{С) immedi- immediately from (Jac, 0). Fay has given an analytic form of the lemma: if Jac = €? /L and д : С9 —У С is the theta function whose zeroes are 0, then he shows:
278 A. Curves and Their Jacobians / fx \ / fy E(x,v)-E(u,y)"dlz+ w)-i9(z+/ V Ju / \ J v ( fx \ f fy + E(x,u)-E(v,y)-'dlz+ u> )•# z+/ \ Jv / \ Ju / /-sb+V \ = E(x,y)-E(u,v)-0[z+ w)-i?(z) where E(x,y) is a certain "Prime form" on С х С In particular, it follows that if rr 7^ u: i?(z) = i? (z + / w)=0=>^lz+/ wUOor tfz+/ w =0 \ Ju / \ J v / \ J и / which is the "<^=" of the lemma. Another pretty way to interpret this half of the lemma is via the Kummer variety: one uses the set of theta-functions of order 2 to map Jac to a projective space. All these functions are even, so the map factors through К = Jac/(±1) (here — 1 = inverse of group law on Jac), and defines: Ф : К м- P*\ A; = 2* — 1. Then we find that *P(K) has trisecants; more precisely, for any x,y,u,v G C, fix a point a £ Jac such that: 2a = ф{х) + ф{у) - ф{и) - ф(ь). Write а = \{х + у- u — v) for clarity. Define \(x — y + u — v) and \{x — у — u + v) as а — ф(у) + ф(и) and as а — ф(у) + ф(у). Then: -(х -у + u-v) 1 , «^ ( -(x-y-u + v) are collinear. Contrast this with the situation for generic principally polarized abelian varieties: because dim К <?С dimF* for g large, it seems very likely that &{K) has no trisecants. In connection with the Schottky problem, I would like to raise the following questions: given a principally polarized abelian variety (X, 0), suppose there is a 2-dimensional set of points a & X such that 0 П 0a is contained in ©ь U 0c{{b,с} П {0,а} = ф). Then is X a Jacobian? Or if not, are there some small extra conditions that suffice to characterize Jacobians? Approach II: 0 of translation type Since 0 = Wg-i, О is just the sum, using the group law of Jac, of the curve ф(С) = W\ with itself (g - l)-times. One can localize this property and come up with the following: - Let Я be a germ of a hypersurface at 0 G Cn.
A. Lecture IV 279 - Then H is of translation type if there are (n — 1) germs of analytic curves ji at 0 G Cn such that H = ji + ... + jn-i- ( + represents vector addition pointwise of these subsets of Cn.) In fact, since up to translation 0 is symmetric, i.e., — 0+k = 0 for some к G Jac, whereas ф(С) is symmetric only when С is hyperelliptic, we find that for non- hyperelliptic C, 0 is doubly of translation type: i.e., for all x G 0, represent 0 near x as a sum: (germ of 0 at x) = j[x' + ... + 7^_1 + x 7г- = a 1-dimensional germ at 0. Then if -0 + A; = 0, so that к-хеё (germ of 0 at x) = -j[k~x) - ... - 7^~x) + x gives a 2nd representation of 0 as a hypersurface of translation type. The beau- beautiful fact, which was conjectured by Sophus Lie and proven by W. Wirtinger, is that the only hypersurfaces doubly of translation type are the theta divisors in Jacobians and certain degenerate limits. Moreover, the theta divisor is never of translation type in a third way, which then proves Torelli's theorem! The fol- following sort of answer to the Schottky problem is presumably a consequence - although the details have not been written down: given a principally polarized abelian variety (X, 0), 0 is doubly of translation \ / (X, 0) is the Jacobian of a type at some point x G 0 J \ non-hyperelliptic curve The only thing lacking here is a nice differential-geometric criterion for a hyper- hypersurface to be singly or doubly of translation type. Also, since 0 is symmetric, one would hope that "usually" being simply of translation type by analytic pro- prolongation would force it to be doubly so: but this is not clear. Recently, Saint-Donat discovered a very beautiful proof of the Lie-Wirtinger results that I want to sketch. It is based on the following beautiful criterion: Theorem. Let Я С Tie a hyperplane, let xi,... ,Xd G H and let 71,..., 7^ be germs of analytic curves at xi,...,Xd which cross H transversely. Suppose t\,...,td are coordinate functions on 71,..., jd such that for all hyperplanes H' near H: Then by analytic continuation, the 71,..., jd are part of an algebraic curve Г С Pn of degree d (possibly reducible), so that in some neighbourhood U of H, Г П U= б г=1
280 A. Curves and Their Jacobians This can be proven, e.g., by reducing to n = 2 and in this case showing that in some neighbourhood U of Я, there is a meromorphic 2-form w on U, with simple poles on \Jji Г) U, such that ti(z) = I Res7iu>, J Xi and finally using the pseudoconcavity of U to extend w to a rational 2-form on P2, whose poles will be Г. To apply the theorem, let ЯСС1 be a germ of a hypersurface such that n—1 n—1 Я = 2^ 7г = 2^ ^*> T*> ^* Serms of analytic curves. г=1 г=1 Let P denote the projective space of lines in Cn through 0. Associating to each point x of 7i or Si the tangent line to ji or Si at rr and translating to the origin, we get germs of analytic curves 7», <Ji С P. For each z e H, write n-l n-l Then Tff,z defines a hyperplane H(z) С Р, and since T#,z D TluXi UTsijVi, it follows that 7г(жг),^г(гЕг) € Я(^). Now parametrize the branches 7» and ^ by any linear function L on Cn, i.e., )) = х, L(Si(y))=y. Then it follows from (*) that: ) () г=1 Assuming that all hyperplanes H' near Я@) are of the form Я (г), this proves that the 2n — 2 branches ji,Si with coordinate functions Xi,—yi satisfy the condition of the theorem! Analytically prolonging, the ji,Si therefore are part of a curve С С P and one goes on to prove that С is a canonically embedded curve of genus n (or a singular limit of such) and Я is its theta divisor.
A. Lecture IV 281 Approach III: Singularities of 0 We use the fact that 0 = Wg-\ and apply the results of Lecture III: it follows that every a G 0 is equal to ф^9~^(Щ for some divisor 21 = Yli=i xi m C- If ■c ^— dim 21 i g (ф^^Л (ск)^Р^, then Kempf's results show in this case: 1) 0 is defined near a by an equation det (Д,) = 0, where fa is an matrix of holomorphic functions at a, 2) the multiplicity of 0 at a is I +1 and in fact the tangent cone to 0 is defined by the polynomial equation det (dfij) = 0 on Tajjac. Let Sing 0 denote the set of singular points on 0. Then it is not hard to see that Sing2<9, the set of double points, is dense in Sing 0; also, by the results quoted in Lecture I, g > 4 =Ф- 3 at least one map тг: С —У P1 of degree d < g — 1 3 at least one 21 G S9~1C with dim |2l| > 1 Sing 0 ф 0. But if a is a double point of 0, 0 is defined near a by an equation /11/22 — /12/21 = 0. Therefore 0 is also singular at any point x where fn{x) = /12B;) = hi{x) = /22B;) = 0. But 4 equations define a set of points of codimension at most 4, hence 3) Sing 0 ф 0 and all components have dim > g - 436. In fact, a closer analysis shows that: 3') (C hyperelliptic ) ==> (Sing 0 irreducible of dim. exactly g — 3) all сотр. of Sing 0 have dim. (C not hyperelliptic ) —T , ,, A v ' l exactly g - 4 Notice, for instance, that this was exactly what we found in Lecture III if g = 3 or g = 4. This immediately distinguishes Jacobians from generic abelian varieties when g > 4, because for almost all (X, 0) e ag, 0 will be non-singular! In fact, Andreotti and Mayer prove: 36 A heuristic argument for "proving" this is to count the dimension of the space S of coverings С of P1, of degree g — 1, simple branching, and genus g. Simple topology shows that there must be 4g — 4 branch points, so we get dim S = 4g — 4. Looking at the curve C, we get a morphism p : S —> fflg, hence almost all fibres of p have dim «7 — 1, i.e., almost all curves С admit of g — 1-dimensional family of maps тг : С —> P1 . Allowing for the 3-dimensional automorphism group of P1, this gives a g — 4-dimensional family of 1-dimensional linear systems L С S9~1C, hence dim Sing © > g — 4.
282 A. Curves and Their Jacobians Theorem. Let Agn) = {{X, в) G Ад | dim Sing 0>n}. Then t(fflg) is a component of Ag9~ . The proof of the former fact and one ingredient of the proof of the theorem is the heat equation that i9 satisfies: if we describe a principally polarized abelian variety X as above X = then in the classical normalization: and it is immediate that considering д as a function of z and Q: 1 d2id _ dti 2i7ri oZiOZj dfiij If a G 0 is a double point, and Ha is the hyperplane ]Ci j=i э"Г§1~(а) '^*i — 0 in the tangent space to Лд at (X, 0), then this shows that the singularity a "disappears" if you move {X,0) in a direction transversal to Ha. The idea of Andreotti and Mayer's proof is to show that for almost all curves C, correspond- corresponding to points 7 G 3g, t{3g) is non-singular at £G) and its tangent space is the intersection of these Ha's, hence upstairs in Sjg, tCg) and the inverse image of Ag9~ are both non-singular with the same tangent space at £G). This will prove their theorem. Now if X = C9/Z9 + ft ■ Ъ9 ** Jac (C), let Ui be the differential on С gotten by restricting dz{ to C; then one shows that if С is not hyperelliptic, t{3g) is non-singular at £G) and its tangent space is defined by equations ]T XijdOij = 0 for all {Xij} such that the quadratic differential ]T XijWiWj vanishes identically on С More canonically, the point here is that the cotangent spaces to 3g and ¥)g can be identified as follows: Ту O = Rz(C), quadratic differentials on С m* ~ о 2 ( space of transl.-inv. \ T* n = Symm \ л c v "я'М J у 1-forms on Л J and when X = Jac, t* is multiplication taking quadratic expressions in the и G R\(C) to the corresponding quadratic differentials; the kernel is thus the quadratic forms in R\(C) which vanish identically on C: call this Ker(Symm2i?iC -> R2C) or /2- So what Adreotti and Mayer need is that for almost all curves C, /2 is spanned by the forms:
A. Lecture IV 283 (a) - [u>i] - [ljj], a = double pt. of 0. Looking back at Lecture III, we can see what these special quadratic forms are: we take 21 of degree g — 1 such that dim |2l| = 1, i.e., £B1) has a basis {1, /}. By Riemann-Roch, i?i(-2l) is 2-dimensional: let u>i,u>2 be a basis. It follows that щ = fui, i = 1,2, have no poles, hence are in R\(C). Then the 2 quadratic differentials 771 u>2 and 772^1 are equal, i.e., Я% — [Vi] * [^2] - [щ] • [wi] is a quadratic form in R\{C) which vanishes on С (equivalently represents a quadric in P^-1 vanishing on Ф{С)). According to the results of Lecture III, if a = 0E—!)B1), then qa = constant • q^. It appears to be an open question whether or not for every non-hyperelliptic C, these q^s span /2 = Ker (Symm2i?iC -> R^C). However, Andreotti and Mayer were able to check this for С which were triple covers of P1, hence it does hold for almost all С and their theorem is proven. This approach does not establish Torelli's theorem for all curves C, but it does show that for almost all C's, t~l (t(C)) = {C}. In fact, for almost all C, 2 good things happen - i) in the canonical embedding Ф : С -> Pfl-1, ф(С) is the intersection of the quadrics containing it, and ii) the space /2 of quadrics through Ф(С) is generated by the tangent cones qa to & at its double points. Thus we have a simple prescription for recovering С from (Jac, 0) when С is "good": i) take the tangent cone to 0 at all its double points, ii) translate to the origin of Jac and projectivize to get a quadric in P5, iii) intersect all these quadrics: this is generally the С you started with! These ideas, though very elegant, do not seem to work without exception - e.g., Ag9 has other components besides t(9Rg). Approach IV: Prym varieties The final approach to the Schottky problem is due to Schottky himself, in collab- collaboration with Jung. One may start like this: since the curve С has a non-abelian 7Ti, can one use the non-abelian coverings of С to derive additional invariants of С which will be related by certain identities to the natural invariants of the "abelian part of C", i.e., to the theta-nulls of the Jacobian? And then, per- perhaps, use this whole set of identities to show that the theta-nulls of the Jacobian alone satisfy non-trivial identities? Now the simplest non-abelian groups are the dihedral groups, and this leads us to consider unramified covering spaces:
284 A. Curves and Their Jacobians C2 I degree n, abelian covering, group A Ci | degree 2 С where the involution i : C\ -> C\ of Ci over С lifts to an involution on Ci and tcu = — a, all o; G A These, in turn, may be constructed by starting with the degree 2 covering C\, taking its Jacobian Jaci, and taking the "odd" part of Jac, when it is decomposed into a product of even and odd pieces under i. More precisely, we define: т., (^ i^\ ( subabelian variety of Jaci of all \ Prym (Ci/C) = . , /ггч t J v ' ' \ points x - i(X), xe Jaci J (connected component of the set of x G Jaci such that t(x) = —x. There is a natural map ф- : C\ —У Prym given by ф- (х) = ф\ (х) — i @i (x)), with 0i : C\ —Y Jaci the canonical map. Then the coverings Ci in question are pull-backs of abelian coverings of Prym via ф-. Now Jaci is very nearly the product of Jac and Prym: in fact there are homomorphisms a Jac x Prym i^ Jaci such that ao /3, /3 oa are multiplication by 2, and kero;,ker /3 are finite abelian groups isomorphic to (Z/2ZJfl-1 (g = genus of C). The genus of C\ will be 2# — 1, hence dim Jaci = 2g - 1 dim Jac = g dim Prym = g — 1. The beautiful and surprising fact is that the new abelian variety Prym carries a canonical principal polarization too. In fact, if 0 С Jac, &\ С Jaci and E С Prym are the 3 theta-divisors, S is characterized by either of the properties:
A. Lecture IV 285 '1 (в) ~20 + 2S а (where ~ means the fundamental classes of the divisors are cohomologous; or equivalently that suitable translates of the divisors are linearly equivalent). So far, these facts tie Jac, Jaci and Prym into a tight but quite elementary config- configuration of abelian varieties, but one that does not impose any restriction on Jac itself. Thus if i9, d\ and £ are the theta functions of these three abelian varieties, one can calculate $1 from д and £ and vice-versa, but i9 and £ can be arbitrary theta-functions of g and g — 1 variables respectively. But now the underlying configuration of curves comes in and tells us: (*) (Jac x @)) Па~1в1=в + 0V where 77 € Jac is the one non-trivial point of order 2 such that 01G7) = 0 (i.e., the original double cover C\/C "corresponds" to 77). This follows in fact directly from the interpretations О = Wg-\ С Jac and &\ = W^g С Jaci. Now д and £ cannot be arbitrary any longer: (*) turns out to be equivalent to asserting that the squares of the theta-nulls of Prym: 0), «,/Jez are proportional to certain monomials in the theta-nulls of Jac: (after one makes the correct simultaneous coordinatization of Prym and Jac). A third way to interpret (*) is via the Kummer variety: embed Jac/ ± 1 and Prym/ ± 1 in projective spaces. Ф : Jac / ± 1 <->• P29 = Pj Tp. Via suitable normalized coordinate systems as in Lecture III, there is a canonical way to identify Pp with a linear subspace of Pj. Then (*) says: Since Im Ф, Im Ф have such large codimension, one certainly expects that for most g and (g — l)-dimensional principally polarized abelian varieties X, Y, and Ф(Х) would be disjoint. с The case when g = 4 is the first one where t(Wlg) Ф Ag and in this case: dim A\ = 10 dim£(9K4) = 9.
286 A. Curves and Their Jacobians Schottky was able to show that the above identities on д and £ implied one iden- tity on $ alone, of degree 8, and Igusa has asserted that this identity holds only on £(9DT4)! Moreover, when g > 4, no efficient method of eliminating £ from the above identities is known and the ultimate problem of characterizing t(9Rg) by simple identities in the theta-nulls remains open. I am confident that Schottky's approach has not been exhausted, however, and a full theta-function theoretic analysis of the dihedral (or even higher non-abelian) coverings of С remains to be carried out. I hope these lectures have perhaps convinced the patient reader that nature's secrets in this corner of existence are fascinating and subtle and worthy of his time!
Survey of Work on the Schottky Problem up to 1996 Enrico Arbarello Approach I. The reducibility condition for the theta divisor on a p.p.a.v. X can be written as 0a П 06 С 0C U 0_c (here ±c ф a, b). If а ф —b, this condition expresses the fact that ip(a), ip(b),ip(c) belong to a trisecant of the Kummer variety К = K(X). Gunning [Gu] defines the locus Va,b,c = {2ГЕ | 0ж+а П 0Ж+Ь С 0X+C U 0-ж-с} С Х {Уа,ь,с has a natural scheme structure) and shows that if Va,b,c is positive dimen- dimensional, and if some mild condition is satisfied, then Va^,c is a smooth curve and X is its Jacobian. So, in this sense, "infinitely many trisecants" =$■ "X is a Jacobian". In [W3], Welters defines a subscheme Vy С X, for any length 3 artinian sub- scheme Y с X (except for the case Y = Spec С [61,62] / (бьбгJ) by letting Vy = {2x I 3 a line 1 С P2*, such that ^(l) Э Y + 2x} , and proves, without extra conditions, that " Vy is positive dimensional" =£► " Vy is a smooth curve and X is its Jacobian". It is easy to see that Vy always contains Y — s, where SuppY = {a, b,c}, and s = a + b + с (with a,b,c possibly coincident). Welters asks the question: is the condition Vy ~£.Y — s sufficient to insure that Vy is a smooth curve and X is its Jacobian? This is the so-called trisecant conjecture. Particular cases of this question are: is the existence of a trisecant or of an inflectionary tangent to the Kummer variety sufficient to conclude that X is a Jacobian? These questions are analyzed and partially answered in [D2] and [M2], but they remain open. Another particular case of this question is the one in which Y = {a + tD\ + 62£>2}, 2x eY -s ,2x + eD1 + e2D2 + e3D3 G Vy. This translates into the K.P. equation
288 A. Curves and Their Jacobians D\9-e- AD\9 ■ Dxe J where с? is a suitable constant. This case is Novikov's conjecture solved by T. Sh- iota [S]. The relationship between the K.P. hierarchy of differential equations and the scheme Vy is established in [A-D2]. Following the point of view of [A], [A-D2], a geometrical proof of Shiota's Theorem is given in [Ml]. There, to overcome the difficulties coming from the singularities of the theta divisor, an important role is played by the following fundamental result of Ein and Lazarsfeld [E-L]: If X is a g-dimensional, indecomposable p.p.a.v. and & its theta divisor, then dim@sing) < g - 2. Approach II. One can read a complete survey of the classical point of view in the papers by J. Little [Lil], [Li2]. There, one can find the developments of the ideas in Saint- Donat's paper [S-D]. A discussion of Darboux's criterion [ D] for the algebraicity of local analytic arcs can be found in [Gr]. In [Li3] it is proved that the existence of a curve of flexes of the Kummer variety is equivalent to the double translation property for the ©-divisor, thus relating Approaches I and II. Approach III. The loci Ag ' (often denoted by Afk) have been studied especially in the case k<g-A. In [B 1], Beauville proves that A\ is the union of t(M.4) and the locus 0nuii of 4-dimensional p.p.a.v. with a vanishing theta-null. A5 has five components that are described in [Del] and [Do4]. In [Del] a list of components for Ag9 is described. In [B-Dl], a bridge between Approaches I and III is established by showing that the locus of ^-dimensional p.p.a.v. whose associated Kummer has a trise- cant line is contained in Ag . As it was mentioned above, the case к = g — 2 has been settled by Ein and Lazarsfeld: дО?-2) _ decomposable p.p.a.v. The study of tangent cones at singular points of the theta-divisor has been carried out by M. Green in [G] where he proves the following theorem: For
Survey of Work on the Schottky Problem up to 1996 289 any non-hyperelliptic curve С of genusg > A, the space of quadrics through the canonical curve is spanned by tangent cones at double points of its theta-divisor. When the curve in question is non-trigonal, and not a smooth plane quintic, this gives a new proof of Torelli's theorem. Green's proof relies on a subtle theorem by Kempf [K] which asserts that, when С is not hyperrelliptic, the Kodaira-Spencer map: is an isomorphism. A detailed analysis of the local Torelli theorem including the case of hyper- elliptic curves is contained in [O-S]. The heat equation satisfied by the theta function of an abelian variety, which plays a central role in the paper of Andreotti and Mayer, has been studied by Welters in [Wl]. This paper was instrumental in Hitchin's laying the foundation of a parallel theory in the case of higher rank vector bundles on curves [H]. Approach IV. Using Kummer varieties and suitable normalized coordinate systems one can define the Schottky loci Sg and Sg' to be = {x G A9 | 3 n G X\{0}of order2,andP G Ag-U s.t. S = {X eAg\VrieX\{0}of order 2,3 P е Ag-i, аЛ. ф Zi By virtue of Prym varieties one has t(Mg) CSgC Sg*1** . In [vGl], van Geemen proves that t(Mg) is an irreducible component of Sg and Donagi [Dol] proves that t{Mg) is an irreducible component of Sg . An important tool in the proofs is Welter's characterization (for g > 5) of the difference С - С С J{C) as the support of the base locus of the linear system Too = \{s G OxB0) | multo(s) > 4}|. This result was conjectured in [vG-vG]. There it is also conjectured that this equality is yet another way to characterize Jacobians, to the point that what should happen is: if X is not a Jacobian, the support of /oo is {0} G X. In the same paper an infinitesimal version of this conjecture is also discussed. In [B-D3] it is proved that for a generic p.p.a.v. of genus g > 4 the base locus of /oo is finite, and that the infinitesimal version of the conjecture holds. Moreover, the conjecture itself is related with the trisecant conjecture. In [Iz] the conjecture is proved for g — 4. Analogous to the Torelli map t is the Prym map P : ftp+i -> A
290 A. Curves and Their Jacobians where Hg+i is the moduli space of unramified 2-sheeted coverings of genus g + 1 curves, or equivalently, of pairs (C, 77) where С is a smooth curve and 77 G Я1 (C, Z2) a non-zero half period. The Prym map is generically injective for g > 6 [F-S] but, unlike the Torelli map, it is never injective because of Donagi's tetragonal construction [Do 4]. Turning the attention to low genera, Igusa [I] and Freitag [Fr] prove that t{Mt) = S, = <S|big). Also, the Prym map is generically surjective for g < 5 and its structure has been extensively analyzed in [D-S] and [Do5]. Donagi, in an unpublished work, establishes the equality t(Ms) = «S5. On the other hand, «S5 Ф S\ , as the latter contains intermediate Jacobians of cubic threefolds [Do2]. The conjecture t{Mg) = Sg is still open. The Schottky problem for Prym varieties is studied in [B-D2] where it is proved that the moduli space of Prym varieties of dimension > 7 is an irre- irreducible components of the locus of p.p.a.v. whose associated Kummer varieties admit a quadrisecant plane. Finally, two words about new ideas. Let (X = С9/Л, 0) be p.p.a.v.; Buser and Sarnak defined a metric invariant m(X) = m(X,0) by setting it equal to the minimum square length, with respect to the Hermitian form, of a non-zero lattice vector. Then, on the one hand, they show that for a general p.p.a.v. X , m(X) > -(WI/9 « —, 7Г 7Гб while on the other hand, they show that for a Jacobian J(C), m(J(C))<-\ogDg + 3). 7Г As a consequence, for g > 0, the Jacobian locus t(Mg) lies in a small neigh- neighbourhood of the boundary of Ag. Lazarsfeld [L] takes a different point of view. He considers the Seshadri con- constant e{X) = e{X, 0) = sup {e > 0 I /*ci@) - e[E] is nef} , where / is the blow-up of X at one of its points x, and E is the exceptional divisor of / (by homogeneity the definition does not depend on x). He proves that e(X) > - In particular, for a general p.p.a.v. X , (Je!) я . He then gets upper bounds for Jacobians of curves e(J(C)) < g1^,
Survey of Work on the Schottky Problem up to 1996 291 and, even more precisely, for Jacobians of d-gonal curves e(J(Q) < ^. In conclusion, the minimal length of lattice vectors and the Seshadri constant behave in a similar way and both tend to distinguish Jacobians of curves among all principally polarized abelian varieties.
References The Red Book of Varieties and Schemes 1. Auslander, L., Mackenzie, R.E.: Introduction to Differentiable Manifolds. McGraw-Hill: New York 1963 2. Atiyah, M.: K-Theory. W.A. Benjamin: New York 1967 3. Bourbaki, N.: Commutative Algebra 1-7. Springer: Heidelberg 1988. Orig. publ. by Addison-Wesley, Reading, MA, 1972 4. Bourbaki, N.: Algebre Ch. 8: Modules et Anneaux Semi-Simples. Hermann: Paris 1958 5. Bourbaki, N.: Algebre Commutative 1-4. Reprint Masson: Paris 1985 6. Grothendieck, A., Dieudonne, J.A.: Elements de Geometrie Algebrique I. Springer: Berlin-Heidelberg 1971 7. Grothendieck, A., Dieudonne, J.A.: Elements de Geometrie Algebrique III/l. Publ. Math. IHES 11: Paris 1961 8. Gunning, R.C., Rossi, H.: Analytic Functions of Several Complex Variables. Prentice Hall 1965 9. Klein, F.: Vorlesungen uber die Entwicklung der Mathematik im 19. Jahrhun- dert. Springer: Berlin 1926 10. Lang, S.: Introduction to Algebraic Geometry. Wiley-Interscience: New York 1958 11. Semple, J.G., Roth, L.: Introduction to Algebraic Geometry. Clarendon Press: Oxford 1985 12. Serre, J.-P.: Algebre Locale. Multiplicites. Lect. Notes in Math. vol. 11. Springer: Berlin-Heidelberg 1975 13. Zariski, 0., Samuel, P.: Commutative Algebra 1. Springer New York 1975 14. Zariski, 0., Samuel, P.: Commutative Algebra 2. Springer New York 1976 D. Mumford: LNM 1358, p. 293, 1999. © Springer-Verlag Berlin Heidelberg 1999
Guide to the Literature and References Curves and Their Jacobians Lecture I: The best beginning book on the theory of curves is: R. Walker, Algebraic Curves, (Dover reprint 1962). With a little more background, the best modern book is: J.-P. Serre, Groupes algebriques et corps de classes, (Hermann, 1959). A more analytical treatment can be found in: R.C. Gunning, Lectures on Riemann Surfaces, (Math. Notes, Numbers 2, 6, 12, Princeton Univ. Press). C.L. Siegel, Topics in Complex Function Theory C volumes, Wiley- Interscience, 1969-1973). There is a huge classical literature, of which the following are best known to me: K. Hensel and G. Landsberg, Theorie der Algebraischen Funktionen einer Variablen, (Chelsea reprint, 1965). F. Severi, Vorlesungen u'ber Algebraische Geometrie (Johnson reprint, 1968). J.L. Coolidge, A treatise on algebraic plane curves (Oxford, 1931). The most famous book which drew together clearly the topological, analytical and algebraic strands is: H. Weyl, Die Idee der Riemannschen Flache (Teubner, 3rd ed., 1955). For elliptic curves specifically, one can look for instance in: Ch. 7 of L. Ahlfors, Complex Analysis Bnd ed., McGraw-Hill, 1966): a good quick introduction to the analysis. A. Hurwitz, R. Courant, Allgemeine Funktionentheorie und Elliptische Funktionen (Springer, 1964). S. Lang, Elliptic Functions, (Addison-Wesley 1973). J. Tate, Notes from Phillips Lectures at Haverford, to be published J.W.S. Cassels, Diophantine equations with special reference to elliptic curves (Survey article: J. London, Math. Soc, 41 A966), pp. 193-291). The Gelfand-Schneider theorem quoted in Lecture I can be found in: S. Lang, Introduction to transcendental numbers (Addison-Wesley, 1966), p. 22. D. Mumford: LNM 1358, pp. 294-300, 1999. © Springer-Verlag Berlin Heidelberg 1999
Curves and Their Jacobians 295 For higher Weierstrass points, see: B. Olsen, On higher Weierstrass points (Annals of Math., 95 A972), pp. 357-364). J. Hubbard, Sur les sections analytique de la courbe universelle de Teichmuller, (to appear in Memoirs of AMS). For the theory of theta-characteristics, some references are: D. Mumford, Theta characteristics of an algebraic curve (Annales Ecole Norm. Sup., 4 A971), p. 181). Ch. 13, Vol. 2 of H. Weber, Lehrbuch der Algebra, (Chelsea reprint) - for the case of g = 3. The details of the hyper elliptic vs. non-hyperelliptic story can be found in most of the books cited above, e.g., Walker, Hensel-Landsberg or Severi. Concerning the explicit description of the inequalities on generation of Fuchsian groups, see: R. Fricke, F. Klein, Vorlesungen iiber die Theorie der Automorphen Funktionen (Johnson reprint 1965), esp. Vol. 1, Part 2, Ch. 2. N. Purzitsky, 2 generator discrete free products, (Math. Zeits., 126, p. 209 and other papers in Math. Zeit., 111. J.) L. Keen, On Fricke Moduli, in Annals of Math. Studies 66, 1971. Kazhdan's ideas on the metrics of varieties D/Г are contained in: D. Kazhdan, Arithmetic varieties and their fields of quasi-definition, (Actes du Cong. Int., Nice, Vol. 2). Proofs of the facts concerning representation of a general curve С of genus g as a plane curve of lowest possible degree or as a covering of P1 of lowest possible degree can be found in: S. Kleiman, D. Laksov, Another proof of the existence of special divisors, (Acta Math., 132 A974)). Petri's work can be found in: K. Petri, Uber die invariante Darstellung algebraischer Funktionen einer Verdnderlichen, (Math. Ann. 88 A922)). B. Saint-Donat, On Petri's Analysis of the Linear System of Quadrics through a Canonical Curve, (Math. Ann., 206 A973)). K. Petri, Uber Specialkurven I, (Math. Ann., 93 A924)). Lecture II: I gave some introductory talks on moduli problems in general and on ЗЯ^о, i.e., the elliptic curve case, in particular, in:
296 References D. Mumford, K. Suominen, Introduction to the Theory of Moduli, in "Algebraic Geometry, Oslo, 1970", (Wolters-Noordhoff, 1971). For g = 1, cf. also references given above for elliptic curves. The genus 2 case is in: J.-I. Igusa, Arithmetic variety of moduli for genus two, (Annals of Math. 72 (I960)). The precise definition of fflg and proof that it is a quasi-projective variety are in: W. Baily, On the theory of 0-functions, the moduli of abelian varieties and the moduli of curves, (Annals of Math., 75 A962)). D. Mumford, Geometric Invariant Theory, (Springer 1965). In particular, the first reference coordinatizes 9Rg essentially by theta-nulls; the second coordinatizes fflg both by a variant of theta-nulls, using cross-ratios of points of finite order on the Jacobian, and also by invariants of the Chow form as in the text of the Lecture. The final method of using cross-ratios of higher Weierstrass points has been suggested by Lipman Bers. The method as described in the text has never been published, but details can be filled in as follows: A) one must first assign multiplicities to the higher Weierstrass points so that the divisor 2Ц of all of them varies algebraically with С - cf. Hubbard (op. cit.), B) one proves that noieC occurs with multiplicity > g2 in Щ; C) using this one deduces that $k(^k) is stable in this sense of "Geometric Invariant Theory", Ch. 3; D) check that if a quadric contains Фк(Щ), then it contains Фк{С), and since Фк(С) is an intersection of quadrics, Фк(^к) determines С up to isomorphism; E) apply the results of "Geometric Invariant Theory", Ch. 3. Concerning Teichmuller space, the best reference is the survey article of Bers: L. Bers, Uniformization, Moduli and Kleinian groups, (Bull. Lond. Math. Soc. 4 A972)). For the compactification 9Rg of 9Rg, see: P. Deligne, D. Mumford, The irreducibility of the space of curves of given genus, (Publ. IHES, 36 A969)). L. Bers, Spaces of degenerating Riemann Surfaces (in Annals of Math. Studies 79 A974)), as well as forthcoming articles by F. Knudsen and myself. The references for the "positive curvature" assertions on 9Rg are as follows: E. Arbarello, Weierstrass points and moduli of curves, (thesis; to appear in Cpmp. Math.). D. Mumford, Abelian quotients of the Teichmuller modular group, (J. d'Anal. Math., 18 A967)).
Curves and Their Jacobians 297 H. Rauch, The singularities of the modulus space, (Bull. AMS, 68 A962)). B. Segre, Sui moduli delle curve algebriche, (Annali di Mat. 7 A930)). Concerning the Teichmuller metric and Petersson-Weil metrics on 3p,n, see: L. Ahlfors, Curvature properties of Teichmuller's space, (J. d'Analyze, 9 A961)). L. Ahlfors, Some remarks on Teichmuller's space of Riemann surfaces (Annals of Math., 74 A961)). H. Masur, On a class of geodesies in Teichmuller space (to appear) H. Royden, Automorphisms and isometries in Teichmuller space (in An- Annals of Math. Studies 66 A971)). H. Royden, Metrics on Teichmuller space (in Springer Lecture Notes, 400). For the Ahlfors-Pick lemma, cf. M. Cornalba, P. Griffiths, Some transcendental aspects of algebraic ge- geometry, §6 (in Proc. Summer Institute on Alg. Geom. 1974, AMS). The rigidity theorem of A—P—M—G has usually been considered separately in 2 parts: one on the finiteness of the set of sections of a fixed family тг : X -> С — S; the other on the finiteness of the set of families тт. Both have much deeper, still unsolved number-theoretic analogs - viz. given a number field K, then one conjectures that 1) given a curve D defined over К of genus g > 2, D has only a finite set of X-rational points and 2) given a finite set S of primes of К and g > 2, there are only finitely many curves D defined over К with "good reduction outside 5"' and of genus g. The first is called Mordell's conjecture and the second is called Safarevitch's conjecture (cf. his talk at the Stockholm International Congress of 1962). If one replaces К by the field of rational functions on a curve С over C, these conjectures are equivalent to the Rigidity Theorem of the lecture. The Мог dell part was proven first, independently by Manin and Grauert: H. Grauert, Mordells Vermutung uber Punkte auf algebraischen Kurven und Funktionenkorper (Publ. IHES 25 A965)). Y. Manin, Rational points on algebraic curves over function fields, (Izvestija Akad. Nauk, 27 A963)). P. Samuel, La conjecture de Mordell pour les corps de fonctions, (Sem. Bourbaki, exp. 287, 1964-65). The Safarevitch part was proven by Arakelov using earlier partial results of Parsin: S. Ju. Arakelov, Families of algebraic curves with fixed degeneracies, (Izvest, Akad. Nauk 35 A971)).
298 References A.N. Parsin, Algebraic curves over function fields, (Izvest. Akad. Nauk, 32 A968)). References for recent results on Шд added in proof are: E. Arbarello, E. Sernesi, The equation of a plane curve, Duke Math. J. 46 A979). E. Sernesi, Unirationality of moduli of curves of genus 12, Ann. Sc. Norm. Pisa 8 A981). M. Chang, Z. Ran, Unirationality of moduli of curves of genus 11,13, Inv. Math. 76 A984). M. Chang, Z. Ran, The Kodaira dimension of moduli of curves of genus 15, J. Diff. Geom. 24 A986). J. Harris, D. Mumford, On the Kodaira dimension of moduli of curves, A982). D. Eisenbud, J. Harris, The Kodaira dimension of moduli of curves, g > 23, Inv. Math. 90 A987) Lecture III: The Jacobian is introduced in the standard books referred to in our notes to Lecture I: esp. Serre (op. cit.) shows that all rational differentials ui on curves С are pull-backs of translation-invariant differentials rj on algebraic groups J via some rational map ф : С -> J. Moreover Gunning (op. cit., esp. Math. Notes 12, subtitled "Jacobi Varieties") treats in detail many of the topics of this and the next Lecture. For Weil's original algebraic construction of the Jacobian and its application to the Riemann hypothesis, see: A. Weil, Varietes abeliennes et courbes algebriques, (Hermann, 1948). The result that I called the theorem of Riemann and Kempf on the singu- singularities of Wk was proved for к = g — 1 by Riemann. Kempf's results appear in: G. Kempf, On the geometry of a theorem of Riemann (Annals of Math., 98 A973)). G. Kempf, Schubert methods with an application to algebraic curves (Stichting Math. Centrom, Amsterdam, 1971). The elegant proof of Riemann-Roch using the differential of ф^ is worked out in detail in: A. Mattuck, A. Mayer, The Riemann-Roch theorem of algebraic curves, (Annali Sc. Norm. Pisa, 17 A963)).
Curves and Their Jacobians 299 Here are references for the theory of theta functions; and more generally, function theory on abelian varieties: W. Baily, Classical theory of theta functions (In AMS Proc. of Symp. in pure Math., vol. 9). F. Conforto, Abelsche Funktionen und algebraische Geometrie (Springer- Verlag, 1956). J. Fay, Theta functions on Riemann Surfaces (Springer Lecture Notes 352). J.-I. Igusa, Theta functions (Springer-Verlag, 1972). A. Krazer, Lehrbuch der Thetafunktionen (Teubner, 1903). D. Mumford, On the equations defining abelian varieties (Inv. Math., 1 and 3 A966-67)). D. Mumford, Abelian varieties (Tata studies in Math. 5, Oxford, 2nd ed. 1974). H. Rauch, H. Farkas, Theta functions with applications to Riemann Sur- Surfaces, (Wiliams and Wilkins, 1974). My treatment in the lecture follows more or less my book Abelian Varieties; e.g., the 2 embedding theorems above are proven in Ch. 1 of this book. The group-theoretic aspects discussed in the lecture are in my book, §23, as well as in my paper and Igusa's book. The prime form Ee and its applications are due to Riemann and are discussed at length with many more applications in Fay. Lecture IV: Besides the references given above on abelian varieties and moduli problems, one can find further material in: J.-I. Igusa, On the graded ring of theta-constants (Am. J. Math., 86 A964) and 88 A966)). D. Mumford, The structure of the moduli spaces of curves and abelian varieties (in Actes du Cong. Int., Nice, 1971). G. Shimura, Moduli of abelian varieties and number theory, (in AMS Proc. of Symp. in Pure Math., vol. 9). Because of their arithmetic and representation-theoretic importance, there is a huge literature on various types of modular forms on S)g and on other bounded symmetric domains. I am not competent even to begin to list references on these topics, but I would like to emphasize here as in the Lecture one big gap: the lack of a moduli-theoretic interpretation of the Eisenstein series.
300 References Concerning Torelli and Schottky, good general references are the books of Gunning (op. cit., notes to Lecture I, part III), Fay (op. cit.) and Rauch-Farkas (op. cit.). In more detail, for Approach I, see: A. Andreotti, On a theorem of Torelli (Am. J. Math., 80 A958)). H. Martens, A new proof of Torelli's theorem (Annals of Math., 78 A963)). T. Matsusaka, On a theorem of Torelli, (Am. J. Math., 80 A958)). A. Weil, Zum Beweis des Torellischen Satzes, (Nachr. Akad. Wiss. Gottingen A957)). Fay's formula is in his Springer Lecture Notes (op. cit.), p. 34 (formula D5); the interpretation via trisecants follows from the Proposition on p. 335 of my paper: D. Mumford, Prym Varieties /(in: Contributions to Analysis, Academic Press, 1974). Approach II can be found in: W. Wirtinger, Lie's Translationsmannigfaltigkeiten und Abelsche Inte- Integrate, (Monatshefte fur Math, und Physik, 46 A938)). S. Lie, Werke, Band 2, Abt. 2, Teil 2, p. 481. Aproach III is in: A. Andreotti, A. Mayer, On period relations for Abelian integrals on algebraic curves (Ann. Scu. Norm. Sup. Pisa A967)). My discussion of Approach IV follows my paper "Prym Varieties I" mentioned above. A more analytic approach is in Fay (op. cit.) or Rauch-Farkas (op. cit.) as well as in the many papers of Rauch and Farkas cited in their book. The original works of Schottky and Jung are: F. Schottky, Uber die Moduln der Thetafunktionen (Acta Math. 27 A903)). F. Schottky, H. Jung, Neue Sdtze uber Symmetralfunktionen und die Abel'schen Funktionen B parts, Sitzungsber. Berlin Akad. Wiss., 1 A909)).
Supplementary Bibliography on the Schottky Problem Enrico Arbarello Note: - The papers [A-D 2], [B3] and [Do3] are survey papers and contain more bibliographical references. - The paper [B3] is in the Proceedings of the Summer Research Institute on Theta Functions held at Bowdoin College in 1987 which is altogether another source of references. - Also the Proceedings of an AMS-IMS-SIAM Joint Summer Research Con- Conference on the Schottky Problem 1992, is a good source of references. This is where [Do5] is. - [B2] is a Seminaire Bourbaki and [De3] provides a continuation of it. [A] E. Arbarello, Fay's trisecant Formula and a Characterisation of Jaco- bian Varieties, Proceedings of Symposia in Pure Mathematics Vol. 46, A987) 49-61. [A-D 1] E. Arbarello and C. De Concini, On a set of equations characterizing Riemann matrices, Annals of Math. 120, A984), 119-140. [A-D 2] E. Arbarello and C. De Concini, Geometrical aspects of the Kadomtsev- Petviashvili equation, L.N.M. 1451, 95-137, Springer 1990. [Bl] A. Beauville, Prym varieties and the Schottky problem, Inv. Math. 41, A977) 149-196. [B2] A. Beauville, Le probleme de Schottky et la conjecture de Novikov, Seminaire Bourbaki no. 675, Asterisque 1988. [B3] A. Beauville, Prym varieties: a survey, Proceedings of Symposia in Pure Mathematics Vol. 49 A989), 607-620. [B-D 1] A. Beauville and 0. Debarre, Une relation entre deux approches du probleme de Schottky, Inv. Math. 86, 195-207, A986). [B-D 2] A. Beauville and 0. Debarre, Sur le probleme de Schottky pour les varietes de Prym, Annali Scuola Norm. Sup. Pisa, Cl. Sci. D) 14 , A987), 613-623. [B-D 3] A. Beauville and 0. Debarre, Sur les function theta du second ordre, Arithmetic of Complex manifolds, Proceedings, Erlangen 1988, Lecture Notes in Mathematics 1399 Springer 1989. [B-S] P. Buser and P. Sarnak, On the period matrix of a Riemann surface of large genus, Invent. Math. 117, A994), 27-56. D. Mumford: LNM 1358, pp. 301-304, 1999. © Springer-Verlag Berlin Heidelberg 1999
302 References [C] A. Collino, A new proof of the Ran-Matsusaka criterion for Jacobians, Proc. Amer. Math. Soc. 92, A984), 329-331. [D] G. Darboux, Legons sur la Theorie Generate des Surfaces, 2nd Ed., Paris Gauthier-Villars, 1914. [Del] 0. Debarre, Sur les varietes abeliennes dont le diviseur theta est sin- gulier en codimension 3, Duke Math. Journ. 56 A988), 221-273. [De2] 0. Debarre, Trisecant lines and Jacobians, J. Alg. Geom. 1, A992), 5-14. [De3] 0. Debarre, The Schottky Problem: an Update, Complex Algebraic Ge- Geometry, MSRI Publications, Volume 28, 1995. [De4] 0. Debarre, The trisecant conjecture for Pryms, Proceedings of Sym- Symposia in Pure Mathematics Vol. 49, A989). [Dol] R. Donagi, Big Schottky, Inv. Math. 89 A987), 569-599. [Do2] R. Donagi, Non-Jacobians in the Schottky locus, Annals of Math. 126 A987), 193-217. [Do3] R. Donagi, The Schottky Problem, in "Theory of Moduli" (E. Sernesi ed.), L.N.M. 1337, 82-137, Springer 1988. [Do4] R. Donagi, The tetragonal connstruction, Bull. Amer. Math. Soc. 4 A981), 181-185. [Do5] R. Donagi, The fibers of the Prym map, Proceedings of an AMS-IMS- SIAM Joint Summer Research Conference on the Schottky Problem 1992. [D-S] R. Donagi and R. Smith, The structure of the Prym map, Acta Math 146, A981), 25-102. [Du] B.A. Dubrovin, Theta functions and non-linear equations, Russian Math. Surveys 36, 2, A981), 11-92. [E-L] L. Ein and R. Lazarsfeld, Singularities of Theta Divisors and the Bira- tional Geometry of Irregular Varieties, Journal of the AMS 10, A997) pp. 243-258. [F] H.M. Farkas, On Fay's trisecant formula, Journal d'Analyse Mathema- tique D4), A984-85), 205-217. [Fr] E. Freitag, Die Irreduzibilitdt der Schottkyrelation, Arch. Math 40, A983), 255-259. [F-S] R. Friedman and R. Smith, The generic Torelli Theorem for the Prym map, Inv. Math. A982), 473-490. [vGl] B. van Geemen, Siegel modular forms vanishing on the moduli space of curves, Inv. Math. 78 1984, 329-349. [vG2] B. van Geemen, Schotty-Jung relations and vector bundles on hyperel- liptic curves, Math Ann. 281, A988), 431-449. [vG-vG] B. van Geemen and G. van der Geer, Kummer Varieties and the Moduli Spaces of Abelian Varieties, American Journal of Mathematics 108 A986), 615-642. [vG-P] B. van Geemen and E. Previato, Prym varieties and the Verlinde for- formula, Math. Annalen 294, 1992, 741-754.
Supplementary Bibliography on the by Schottky Problem 303 [G] M. Green, Quadrics of rank four in the ideal of the canonical curve, Invent. Math. 75, A984), 85-104. [Gr] P.A. Griffiths, Variations on a Theorem of Abel, Inventiones Math. 35, A976), 321-390. [Gu] R.C. Gunning, Some Curves in Abelian Varieties, Invent. Math. 66 A982) 285-296. [H] N. Hitchin, Flat connections and geometric quantization, Comm. Math. Physics 131, A990), 347-380. [I] J. Igusa, On the irreducibility of Schottky's divisor, Journal of the Fac- Faculty of Science Tokyo 28, A981), 531-545. [Iz] E. Izadi, The geometric structure of A±, the structure of the Prym map, double solids and Гоо-divisors, Journal fur die Reine und Angewandte Mathematik, A995). [K] V. Kanev, The global Torelli theorem for Prym varieties at a generic point, Math. U.S.S.R. Izvestija 20 A983), 235-258. [Ke] G. Kempf, Deformations of symmetric products, Riemann Surfaces and related topics: Proceedings of the 1978 Stony Brook Conference. Prince- Princeton A981), 319-341. [L] R. Lazarsfeld, Length of periods and Seshadri constants of abelian va- varieties, Math Research Letters 3, A996), 439-447. [Lil] J. Little, Translation manifolds and the converse of Abel's theorem, Compositio Math. 49 A983), 147-171. [Li2] J. Little, Translation manifolds and the Schottky problem, Proceedings of Symposia in Pure Mathematics Vol. 49 A989), 517-529. [Li3] J. Little, Another Relation between approaches to the Schottky Problem. [Ml] G. Marini, A geometrical proof of Shiota's theorem on a conjecture of S.P. Novikov, Compositio Mathematica 111: A998), 305-322. [M2] G. Marini, An inflectionary tangent to the Kummer variety and the Jacobian condition, Math. Ann. 309, A997), 483-490. [Mu] M. Mulase, Cohomological structure in soliton equations and Jacobian varieties, J. Diff. Geo. 19, A984), 403-430. [O-S] F. Oort and J. Steenbrink, The local Torelli problem for algebraic curves, Journees de Geometrie Algebrique d'Angers (A. Beauville ed.) Stithoff к Noordhoff, 1980, 157-204. [R] Z. Ran, On subvarieties of abelian varieties, Invent. Math. 62, A981), 459-479. [Re] S. Recillas, Jacobians of curves with g\ 's are Pryms of trigonal curves, Bol. Soc. Mat. Mexicana 19 A974), 9-13. [S] T. Shiota, Characterization of Jacobian varieties in terms of soliton equations, Inv. Math. 83, A986), 333-382. [S-V] R. Smith and R. Varley, Components of the locus of singular theta divisors in genus 5, LNM 1124, Springer-Verlag, A983) 338-416. [Wl] G.E. Welters, Polarized Abelian Varieties and the Heat Equations, Compositio Mathematica 49 A983) 173-194.
304 References [W2] G.E. Welters, On flexes of the Kummer variety, Indagationes Math. 45, A983), 501-520. [W3] G.E. Welters, A Criterion for Jacobian varieties, Annals of Math. 120, A984), 497-504. [W4] G.E. Welters, Recovering the curve data from a general Prym variety, Amer. J. of Math. 109 A987), 165-182. [W5] G.E. Welters, The surface С — С on Jacobi varieties and 2nd order theta functions, Acta math. 109 A987), 165-182.