/
Текст
Gauge Theory and
Variational Principles
GLOBAL ANALYSIS
Pure and Applied
Modem Methods for the Study of Nonlinear Phenomena in
Engineering, the Sciences, and Mathematics
In Two Series
— Advanced, graduate-level texts; monographs, and reference works
— Basic, advanced undergraduate - level texts
Ralph Abraham, Jerrold E. Marsden, Philip J. Holmes, Editors
NUMBER SERIES
1 David Bleecker, Gauge Theory and Advanced
Variational Principles. 1981 Graduate-level
Text
Other Numbers in preparation
Gauge Theory and
Variational Principles
DAVID BLEECKER
Department of Mathematics
University of Hawaii
▼▼
1981
ADDISON-WESLEY PUBLISHING COMPANY, INC.
Advanced Book Program/World Science Division
Reading, Massachusetts
London • Amsterdam • Don Mills, Ontario • Sydney • Tokyo
Library of Congress Cataloging in Publication Data
Bleecker, David.
Gauge theory and variational principles.
(Global analysis, pure and applied; no. 1)
Bibliography: p.
Includes index.
1. Gauge fields (Physics) 2. Variational principles.
I. Title. II. Series.
QC793.3.F5B55 530.1'43 81-17570
ISBN 0-201 -10096-7 AACR2
American Mathematical Society (MOS) Subject Classification Scheme A980)
53A55, 53C05, 55F10, 70G05, 81A30, 81A33, 81A36, 81A39, 81A54, 81A60
83E15
Copyright © 1981 by Addison-Wesley Publishing Company, Inc.
Published simultaneously in Canada.
All rights reserved. No part of this publication may be reproduced, stored in
a retrieval system, or transmitted, in any form or by any means, electronic,
mechanical, photocopying, recording, or otherwise, without the prior written
permission of the publisher, Addison-Wesley Publishing Company, Inc.,
Advanced Book Prog ram/World Science Division Reading, Massachusetts
01867, U.S.A.
Manufactured in the United States of America
ABCDEFGHIJ-HA-8987654321
Contents
Series Editors' Foreword ix
Preface xiii
Chapter 0. Preliminaries 1
0.1. Multilinear Algebra and Forms 2
0.2. Manifolds and Tensor Analysis 6
0.3. Lie Groups and Lie Algebras 18
Chapter 1. Principal Fiber Bundles and Connections ... .23
1.1. Principal Fiber Bundles 26
1.2. Connections 29
Chapter 2. Curvature and g-Valued Differential Forms . . .34
2.1. Graded Lie Algebra of §-Valued Forms 35
2.2. Curvature 37
vi CONTENTS
Chapter 3. Particle Fields, Lagrangians, Gauge
Invariance 42
3.1. Particle Fields 43
3.2. Gauge Transformations 46
3.3. Lagrangians and Gauge Invariance 50
Chapter 4. Lagrange's Equation for Particle Fields 54
4.1. The Principle of Least Action 55
4.2. Some Machinery 56
4.3. Lagrange's Equation 60
Chapter 5. The Inhomogeneous Field Equation 64
5.1. The Current 65
5.2. Inhomogeneous Field Equation 68
Chapter 6. Free Dirac Electron Fields 71
6.1. Covering the Lorentz Group 73
6.2. The Levi-Cevita Connection 77
6.3. Spin Structures and the Lagrangian 81
6.4. Dirac's Equation 84
Chapter 7. Interactions 89
7.1. Bundle Splicing 90
7.2. The (Nonfree) Dirac Electron Field 95
7.3. The Nucleon in a Yang-Mills Potential 99
Chapter 8. Calculus on Frame Bundle 106
8.1. Tensor Fields on L{M) 107
8.2. Pseudo-Riemannian Geometry 109
8.3. Metric Variations 116
CONTENTS vii
Chapter 9. Unification of Gauge Fields and
Gravitation 120
9.1. Gradients of Metric-Dependent Functionals .... 122
9.2. Conservation Laws from External Symmetry .... 127
9.3. The Einstein-Yang-Mills Action Principle 133
Chapter 10. Additional Topics 141
10.1. Geodesies and Forces on Classical Particles 141
10.2. Utiyama's Theorem 147
10.3. Spontaneous Symmetry Breaking 154
10.4. Characteristic Classes, Monopoles, and
Instantons 161
References 171
Selected Bibliography 173
Index of Notation 175
Index 177
Foreword
AIM AND SCOPE OF THE SERIES
What is Global Analysis?
From ancient times till Newton, mathematics meant geometry
and algebra. Then analysis (now called classical) was born, along
with the foundations of physics, engineering, and modern science.
Among the outstanding events of modern mathematics are the
syntheses of these fields, along common frontiers. The synthesis of
classical analysis and geometry is now called global analysis.
The History of Global Analysis and its Applications:
Important pioneers in the synthesis of global analysis were Henri
Poincare A880s), George Birkhoff A920s), Marston Morse A930s),
and Hassler Whitney A940s). The technical tools of differential
topology A950s) made the final synthesis possible A960s). Through
the efforts of Solomon Lefshetz A950s), the work of the Russian
school (Liapounov, Andronov, Pontriagin) on dynamics became
widely known in the west, and included in this synthesis. A veritable
explosion of new results and applications followed in the 1970s.
/x
X FOREWORD
From the earliest work of Poincare and Liapounov onward, the
applications of geometry and analysis to astronomy, physics, and
engineering provided the explicit motivation for much of this work.
The current form of the theory reflects this pervasive influence in its
direct applicability to these fields. It has already created new and
powerful methods of applied mathematics, which complement exist-
existing tools such as pertubation methods, asymptotics, and numerical
techniques.
Far from being the exclusive preserve of pure mathematicians,
global analysis has its roots in physical problems, and can be
redirected to these problems once again, often with startling results.
Target of the Series: The Accessibility of Global Analysis
There is a great contrast between the potential importance of
global analysis and the great difficulty of learning about it. A
growing number of scientists of all disciplines have discovered that
the techniques of global analysis have important applications in their
own fields, and are looking seriously for keys to these techniques.
These Series will attempt to provide these keys.
Needed are books that introduce the basic concepts and their
applications, texts that develop the prerequisites for more serious
study accessibly and compactly, and advanced monographs which
make the research frontier available to a wide audience of scientists
and engineers who have acquired these prerequisites. To these ends,
these Series will deal with such subjects as:
Theory
Linear algebra and representation theory
Calculus on manifolds and bundles
Differential geometery and Lie theory
Manifolds of mappings and sections
Transversal approximations
Calculus of variations in the large
Dynamical systems theory and nonlinear oscillations
Nonlinear actions of Lie groups
Applications
Classical mechanics and field theory
Geometric quantization
Hydrodynamics
Elastomechanics
FOREWORD xi
Econometrics
Social theory
Morphogenesis
Network theory
and other topics of pure and applied global analysis.
AUDIENCES
Series A. The advanced texts will provide reports on theory or
applications from the research frontier in expository style for special-
specialists, or for nonspecialists who have the prerequisite mathematical
background. For example, graduate students of science or engineer-
engineering, as well as mathematics, will find them manageable.
Series B. The basic texts will provide a complete curriculum of
essential prerequisites, starting with advanced linear algebra and
calculus, for the advanced texts of Series A. These texts will be
suitable for advanced undergraduate courses in pure and applied
mathematics, or as reference works for research in engineering, the
sciences, or mathematics.
UNIQUE FEATURES
Through the basic texts (Series B) covering all the prerequisites in
a uniform style, and the advanced texts (Series A) building on this
foundation it will be possible for anyone to study the detailed
applications of global analysis to their own fields (as they appear in
the series), to form their independent evaluation of the new methods,
and to master the techniques for their own use if justified.
Series В starts from the post-calculus level, in textbook format
with worked examples, exercises with answers, and adequate illustra-
illustrations. The two Series will give a complete library of prerequisites,
together with new contributions to global analysis, and some out-
outstanding examples of its applications, illustrating the new methods in
applied mathematics. All the texts will be in English, and conform as
far as possible to a common notational scheme.
Ralph Abraham
Jerrold E. Marsden
Philip J. Holmes
Preface
By a very fortunate combination of events, I. M. Singer visited
Hawaii in the spring of 1978 and delivered two sterling lectures on
differential geometry and gauge theory. This was an inestimable
boon to my enthusiasm for this subject area. At his suggestion, I
conducted an interdepartmental seminar that was attended by a
handful of diverse mathematicians and a physicist. This book grew
out of the seminar.
Owing to the preliminary chapter, the text can be understood by
advanced undergraduate mathematics and/or physics majors. The
proofs are detailed enough so that nobody will sweat blood between
the lines. Because of this, I must ask for the patience of those
enthusiasts who are already familiar with the jargon and find the
details distracting. On a first reading, you should skip those details
that you find uninteresting.
To the uninitiated, it would seem that the use of fiber bundles
and connections to describe the basic forces of nature is a half-baked
scheme devised by some clique of mathematicians bent on producing
an application for their work. However, physicists themselves found
these notions forced upon them by their own perception of nature. In
xiii
XIV PREFACE
the balance of this Preface, I hope to suggest how this could happen,
and to provide a philosophical matrix into which you can pour the
olla podrida of theorems found in the main text.
Let M be the space-time continuum. In semiclassical (first-
quantized) physics, a particle is described in terms of a particle field
(or wave function) \p: M -»V where V is some vector space (typically
over the complex numbers). Some fixed basis of V is chosen with the
basis corresponding to certain states of the particle. Implicit in the
determination of ip(x) is the choice of a reference frame at x. By
"reference frame" we do not necessarily mean a choice of space-time
axes, but we could mean a choice of zero-phase angle or a choice of
axes in isospin space. Let Px denote the space of all possible reference
frames at x. Any two reference frames are uniquely related by an
element of some group G of transformations (e.g., rotations). If
p G Px and gtEG, thenpg denotes the transformed frame. Now g also
produces a transformation of V (say w\-+g-w, for we V). If \p(p) is
the value of \p relative topE. Px, then \p(pg) — g~l">P(P) *s tne value
relative to pg. A smooth concatenation P of the various Px as x
ranges over M is called a principal fiber bundle with group G; Px is
the fiber above x. If pE Px is chosen, then g^pg gives a topological
equivalence of G with Px, but in general P is not equivalent to
M X G, since P may be twisted.
If U is a subregion of M, then a function au: £/-» P, such that
au(y) E Pv for all >>e U, is called a gauge (i.e., a continuous choice of
reference frame). A wave function ip should be regarded as a function
on P, \p: P-^V such that \p(pg) = g~]-\p(p). However, given the
gauge au: U -»P, we can pull \p down to U С М to obtain a local
wave function \pu: U-^V given by $u(y)= 4/(au(y)) f°r У^ U. If ow:
W-^V is another gauge, then we can write ow(y)= ou(y)gUH.(y)
where guw: UDW-^G. Consequently, we have >pw(y)= ^{ow{y)) =
'P(°u(y)gUK(y))=gu*(y)~l-'Pu(y)> which shows how the local wave
functions change under a change of gauge.
Presumably, physically meaningful quantities should be indepen-
independent of the choice of gauge (unlike the local wave functions). Hence
the physicist attempts to construct real-valued functions on M that
depend on the wave function (and its differential), so that the result
is independent of the choice of gauge. Such a function is called an
Action density. (We use a capital "A" to distinguish this "Action"
from the mathematical group "action.") In Theorem 3.3.5 we show
PREFACE XV
that it is not possible to construct an Action density (solely out of
the wave function) that depends nontrivially on the differential of
the wave function. Indeed, what is needed is an additional object
(called a gauge potential or connection) that lives on P and trans-
transforms in a way so that when it is incorporated into a hypothetical
Action density, it leads to a truly gauge-invariant Action density.
Theorem 3.3.6 shows precisely how connections solve the gauge-
invariance problems of defective Action densities.
There is a standard way to compute the field strength (or curva-
curvature) of a gauge potential (or connection). In the case where G = U(l)
(or equivalently, the group of rotations in the plane), the gauge
potential is essentially the 4-vector potential of electromagnetism and
the field strength is the electromagnetic field. Although these terms
go back (at least a century) to the days of Maxwell, it appears to
have been H. Weyl [1918] who introduced the concepts of gauge
transformation and gauge invariance. With the possible exception of
general relativity, the gauge concept in physics was mostly limited to
the study of electromagnetic interactions, until 1954 with the paper
of C. N. Yang and R. L. Mills [1954]. (Actually, O. Klein [1939] had
considered a non-Abelian gauge theory 15 years before.) Yang and
Mills introduced gauges prescribing a point-dependent choice of
isotopic spin axes. In this case, the group is 5GB) (or equivalently,
the unit quaternions). There was no immediate application for the
associated gauge potentials and field strengths, because these fields
were found to be massless, whereas nuclear forces were known to be
mediated by massive particles (e.g., 77 mesons). Nevertheless, the
Yang-Mills model was a precursor to the apparently successful
model of Weinberg and Salam [1967] for weak interactions. Indeed,
we shall see in Section 10.3 that the mechanism of spontaneous sym-
symmetry breaking (developed by Higgs [1966]) allows gauge fields to
acquire mass (consider, e.g., the massive "intermediate vector bosons"
in the Weinberg-Salam model). In spite of these refinements, the
basic fact remains that the existence of gauge fields is a consequence
of the existence of gauge-invariant Action densities for particle fields.
In the beginning there was a gauge-invariant Action density, and
then there was radiation (photons, pions, intermediate vector bosons,
gluons, etc.).
The Action density is a measure of the superfluous manifesta-
manifestations of the fields involved. Nature obeys the principle of least
XVi PREFACE
Action. This means that the only fields allowed by nature are those
that leave the integral of the Action density fixed (to first order) with
respect to all suitable variations of the field. This condition is
equivalent to requiring that the field satisfy a certain differential
equation, known as Lagrange's equation. In Chapter 4, we derive this
equation and find that it is most natural to regard it as an equation
on the bundle space P. Physicists are used to pulling the equation
down to M (by a choice of gauge), typically at the expense of
elegance. We derive in great detail the following special cases of
Lagrange's equation: (A) the Klein-Gordon equation for a charged
spin-0 particle (e.g., a 77" meson); (B) the Dirac equation for the
relativistic spin-5 electron; (C) the equation for the spin-5 nucleon
(proton-neutron doublet) subject to a Yang-Mills gauge potential.
Physicists are as familiar with these equations as differential geome-
geometers are with connections and principal fiber bundles. However, some
residual novelty is gained by considering these equations as based on
a principal fiber bundle over a curved space-time. Also, it is clearly
demonstrated that Dirac's equation follows from Lagrange's equa-
equation for the usual Action density provided that the Levi-Cevita
connection for space-time is used.
The gauge field strengths obey two differential equations. The
first equation, known as the homogeneous field equation or Bianchi
identity, is a consequence of the way in which the field strength is
defined. The second field equation is generally inhomogeneous,
because of a term that is the current of the particle field. The
inhomogeneous equation results from insisting that the gauge poten-
potential obey the principle of least Action for the sum of Action density
of the particle field and the so-called self-Action of the gauge
potential itself. The current is then the first variation (of the first
term in this sum) with respect to a change of potential. In the case
where G = (/(l), the two field equations together are equivalent to
Maxwell's four equations. In general, one can prove that the current
is conserved either by using the inhomogeneous equation or by using
the gauge invariance of the Action density. We do it both ways.
Although no explicit reference to Noether's theorem is made, the
idea that conservation laws follow from symmetries (in this case,
gauge transformations) is largely her own (Noether [1918]).
So far, we have considered only transformations that change the
reference frames at the various points xE.M, but leave the points
PREFACE xvii
themselves fixed. For this reason, the gauge theories and transforma-
transformations discussed above are called internal. The Einstein field equation
of general relativity is based on a principle of least Action that is
invariant under transformations of space-time itself. In this sense,
general relativity is an external gauge theory of sorts. Here the gauge
transformations can be regarded as coordinate changes, which leave
the geometry of space-time unchanged. Thus the integral of the
geometrically expressed Action density (scalar curvature) of general
relativity is invariant under these transformations. The conservation
law arising from this external symmetry is interpreted (via the
Einstein field equation) as conservation of the nongravitational en-
energy and momentum that shape space-time.
There is an interesting and unexpected harmony that arises from
a certain blending of the internal and external gauge theories through
the natural imposition of a geometry on the bundle space P over
space-time. Given a geometry (or metric) on M and a gauge potential
on P, there is a natural way to define a bundle metric on P. The
Action density is defined to be the scalar curvature of this bundle
metric. By demanding that the integral of the Action density be
stationary under variations of the metric on M, we obtain the
Einstein field equation, with the energy-momentum source arising
from the field strength of the gauge potential. Similarly, variation of
the gauge potential leads to the second field equation (or Yang-Mills
equation) for the field strength of the potential. Actually, the idea
here (which we study in Chapter 9) has its roots in the classical
Kaluza-Klein unified field theory (Klein [1926]). Although they
consider only the case in which G = U(l), the situation for an
arbitrary Lie group is only technically more difficult. There are many
accounts of this generalized Kaluza-Klein theory (e.g., Cho [1975]),
Herman [1978], Tabensky [1976]), but I am not aware of any that
bother to go through the complete calculation of Chapter 9. In
Section 10.1 it is proved that the geodesies (i.e., straightest possible
paths) relative to the bundle metric on P project to the (generally
nongeodesic) paths of "charged" particles under the force of the
gauge field strength; the charge is essentially the fiber component of
the geodesic in P. Kaluza and/or Klein proved this in the case where
A number of interesting physical consequences arise when it
happens that P is twisted. The mathematical consequences (e.g.,
XViii PREFACE
nonvanishing characteristic classes) were well known by 1950, but
the physical interpretations (e.g., magnetic monopoles, solitons, in-
stantons) have been explored only recently in this context (Belavin,
Polyakov, Schwarz, and Tyupkin [1976], Jackiw and Rebbi [1976],
't Hooft [1974], etc.)- A brief encounter with such notions is found in
Section 10.4. It is perhaps regrettable that the relatively recent work
of Atiyah, Hitchin, Singer, Ward, and many others in this area could
not be given adequate coverage. Actually, these works speak well for
themselves, and are best received by those with some background in
algebraic geometry or with the analytical tools needed to understand
the Atiyah-Singer index theorem (see Booss [1977], Mayer [1981],
Palais [1965]).
It must be conceded that this book does not by any means bring
you up to the level of many of the current research areas in gauge
theory. For example, there is nothing in the text about the second
quantization of gauge potentials or particle fields. Without this, very
little of real physical significance (scattering cross sections, particle
lifetimes, etc.) can be properly discussed or computed. However, the
mathematical foundations of interacting second-quantized field the-
theories are plagued by multifarious divergent expressions. You may
wish to consult the book of Faddeev and Slavnov [1980] and the
references therein. Should you wish to broaden a general background
in mathematical physics, you will find the books by Abraham and
Marsden [1978], Arnol'd [1978], Thirring [1978], and Sachs and Wu
[1977] particularly helpful.
There are many people who have helped bring this book to its
present form. For helpful comments, encouragement, and willing
ears, I thank Chris Allday, Joe Gerver, Dave Johnson, Bob Little,
Ernest Ma, Jerry Marsden, M. E. Mayer, Dick Palais, I. M. Singer,
Joel Weiner, and Les Wilson. I owe much to the typists, Kathleen
Kikuta and Joanne Nakamura, for their patience, accuracy, and
punctuality in the face of unabated notational monstrosities. I apolo-
apologize and give thanks to those who have indirectly influenced this
volume or who figure prominently in the field and yet do not appear
among the references or in the selected bibliography (which scarcely
represents the vast and growing literature).
David Bleecker
Gauge Theory and
Variational Principles
CHAPTER
О
Preliminaries
Roughly speaking, you should be familiar with the basic concepts
and notation of elementary set theory (e.g., unions, intersections,
Cartesian products, and functions); you must know some linear
algebra (abstract vector spaces, bases, linear transformations, dual
spaces, diagonalization of symmetric matrices, etc.); a little group
theory (homomorphisms, kernels, direct products, etc.); and the
basics of analysis on R" (open sets, continuous functions, the funda-
fundamental theorem of calculus, partial derivatives, integration of smooth
functions of several variables, etc.). You should be familiar with the
use of indices, and realize that sums are taken over repeated indices.
If you lack motivation, you should read the introduction to Chapter
1, as well as the Preface.
David D. Bleecker, Gauge Theory and Variational Principles ISBN 0-201-10096-7
Copyright © 1981 by Addison-Wesley Publishing Company, Inc., Advanced Book
Program/World Science Division. All rights reserved. No part of this publication
maybe reproduced, stored in a retrieval system, or transmitted, in any form or by
any means, electronic, mechanical, photocopying, recording, or otherwise, without
the prior permission of the publisher.
1
2 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Nearly everything in this chapter plays some role in mathematical
physics and will be used later in the book. In Section 0.1, we
introduce concepts of multilinear algebra such as tensors, metrics,
orientations, and star operators. In Section 0.2, we define manifolds
and extend the notions of 0.1 to objects on manifolds. Exterior
differentiation and codifferentiation of forms on manifolds are de-
defined, as well as the integration of «-forms on «-manifolds. Stokes'
theorem is proved for compactly supported forms on manifolds
without boundary. Maxwell's equations are explicitly written in
terms of differential forms in 0.2.22. In Section 0.3, we define Lie
groups, Lie algebras, and related notions (e.g., exponential maps and
structure constants). The case of the special unitary group SU(n),
which is important in physical gauge theories, is discussed in 0.3.12.
0.1 MULTILINEAR ALGEBRA AND FORMS
0.1.1 Notation. Let E and F be vector spaces over U. Let E be the
dual space of E, and let T°'°(E,F)=F. For p,q>0,Tp'q(E,F) is
the space of multilinear functions (F-valued tensors) f: Ё X • — X
EXEX---XE^F. We denote Tp-q(E,U) by Tp-\E). If и,,...,
up(EE and v\...,vq(EE, then ux® ■ ■ ■ ®up®tf® • • • ®vq<ETp-q{E)
is defined by (м, <8> • • • ®vq){x\ ..., xp, yx,..., yq)= х\щ) ■ ■ ■
xp(up)v\yx) ■ ■ ■ vq(yc/). Let ex,...,en be a basis for E, with e\...,e"
the dual basis. Any/E Tp~ q{E) can be uniquely written as
f=
for components fj*.:'? (EM. Let Ak(E,F) be the subspace of T°-k(E,F)
consisting off such that f(uu...,uk) is antisymmetric in uu...,uk^E;
A°(E,F) = F and Ak(E) = Ak(E,R). For k
where w, ..., E.U is antisymmetric in the indices il,...,ik.
0 PRELIMINARIES 3
0.1.2 Definition. For абЛ'(£) and j8eAJ(£), we define аЛ/?е
= 7777 2 (-1)а«(МаA)."-."а(/))^(«а(,Ч-1).•••.«»(,•+;))
/Ле ямт м сюег /Ле set of all permutations о of {1,...,/ + j), and
(— 1 )a = ± 1 is the sign of a. The components of (а Л /?) are given by
where
For a£A°(£), we je/
0.1.3 Definition. Л /mtfnc o« E is a g^T°'2(E) such that g is
symmetric and nondegenerate (i.e., if g(u,v) — 0 for all vE.E, then
м = 0). An orthonormal (o.n.) basis of E relative to g is a basis
ex,...,en such that g(ej,ej)= ±8tj where S:J is the Kronecker delta
(Sjj = 1 if i=j; 5,7 = 0 if i4=j). A volume element of E relative to g is a
juGA"(£') given by ё] Л • • • Аё" for an o.n. basis ex,...,en; note that
ё' ЕЛ'(£) and 1х(е^,...,еп)=\. Actually ju depends on the choice of
o.n. basis, but only by a factor of ±1. A choice of the two possible
volume elements is.called an orientation of E. If ц is an orientation for
E, then a basis vx,...,vn of E is positively oriented if [x(vi,...,vn)>0.
0.1.4 Definition. If g is a metric on E, then there is an induced
metric g^T°-2(Ak(E)) defined as follows. Let vx,...,vnbe any basis of
E, and let gtj —g(vf, vs). Take g'J to be the (i, j)th entry of the inverse
of the matrix (g,7). For a, fi^Ak(E), we define g(a, /3) in terms of
components (relative to vx,...,vn) by
p(a B)= У ?'l>lg'272 . . . o'kJ'kn R
8\aP) ZS 5 5 (xiiP-
4 GAUGE THEORY AND VARIATIONAL PRINCIPLES
We can verify that g(a, ft) is independent of the choice of basis. For
a,P<EA°(E), we let g(a,P) = aP.
0.1.5 Definition. Suppose that we are given a metric g on E and h
on F. Then there is a metric (gh) on Ak(E, F) defined as follows. Let
/i,-..,/„, be a basis for F and write a^Ak(E, F) as a = 1a"fa,a" E
Ak(E,R). Then, for a,/3(EAk(E, F), let (gh)(a,/3) = lhahg(a«,ph)
where hah=h(fa, fh). Again, (gh)(a, ft) is independent of the basis.
0.1.6 Theorem. Let g be a metric on E, and suppose that \x is a
volume element of E relative to g. For n = dim E, there is a unique
linear isomorphism *: Ak(E)-^>A"~k(E), such that aA*fi=g(a,fi)[x
for alia,
Proof. For уеЛ""^), define (py: Ak(E)^R by <py(a)ix=aAy.
We can prove that if <py(a) = 0 for all обЛк(£), then y = 0. Thus,
yi-^<p defines a one-to-one linear map А"~к(Е)-^Ак(Е)л. Since
this map is an isomorphism. Thus, for each fiE:Ak(E), there is a
unique y<EA"~k(E) such that q>y(a)-g(a,/S) for all a<EAk(E). We
take */? to be this y. Then аА*@=аАу = (ру(а)ц=£(а,>8)/х. From
this equation, it is clear that * is one-to-one (and hence an isomor-
isomorphism). ■
0.1.7 Notation. Let g be a metric on E, and let g'1 be as in 0.1.4
{relative to the arbitrary basis vx,...,vn of E). For io^Ak(E) with
components w, ...,■, we define
,,J\ ■■•/*= V ohj\ . . . aidk,.,_
We write \g\ = \det(gIJ)\. If eu...,en is an o.n. basis for E, let tj,7 =
g(ei,ej)=±8ij, and set (-l)g^=det(r]IJ). If «y-2al7<?,-, then gtj =
g(vi,vJ) = '2akiamj7}km or G—ATr\A in terms of matrices. Thus, \det A\
= |g|1/2, and it follows that ц=ё^ A ■■■ Ae" = |g|1/2t31 Л • • • Av" if
0 PRELIMINARIES 5
v),..., vn is positively oriented relative to jjl. Finally, we let e, =
8j2h::"j, and note that n = leit... teh® ■ ■ ■ <8>e'". '' '"
0.1.8 Theorem. Let g be a metric on E with orientation iueA"(£'),
and let vx,...,vnbe a positively oriented basis of E. For w = 2 w, . t5'>
Л • • • Ли'* еЛ'(£), we have (in the notation of 0.1.6 and 0.1.7)
Proof. Let *w denote the right-hand side of A). By 0.1.6, we need
only verify that aA*/S = g(a,/S)^ for all a,/S^Ak(E). Now
k\{n-k)\ k\
a Bjl '"Jke
k\
where we have used the antisymmetry of a and /? in the upper and
lower indices. ■
0.1.9 Theorem. For weA*(£), we have
Proof. In terms of components relative to some positively oriented
6 GAUGE THEORY AND VARIATIONAL PRINCIPLES
basis, we have
1 \\g\1<»mi""Hkemi...m?r"J-kt
(n-k)l k\
1 1
(n-k)\ k\
g g
| ■■■mkji---j^kej, ■■■л-t'l ■■■'*
' em\ ■ ■ ■ m'kj, ■ ■ /„-A, ■ ■ ■/„_*/, ■■■!'*
(n-k)l k!
■ ■ ■ Jn-km'\ ■ ■ ■ m'k£jl ■ ■ -Jn-kh ■■'*
0.1.10 Remark. There is a star operator *: Ak(E,F)-^A" k(E,F)
defined componentwise relative to a basis of F using * on Ak(E).
0.2 MANIFOLDS AND TENSOR ANALYSIS
0.2.1 Definition (C°° л-Manifold). Let M be a set, and suppose M
is the union of a number of subsets Ut where i ranges over some
(possibly infinite) index set I (i.e., M= U;e/(/;). In Figure 1, R" —
{(x\...,x")\x' ED?}, and let yt: U^U" be a one-to-one function such
that <Pj(Ut) is open. Assume that for all i,jE.I, we have q^o^r1;
(py((/, П Ц-)-»ф,-Щ П Uj) is C°° (i.e., has continuous partial derivatives
of all orders). A subset VdM is open г/(р;((/; П V) is open for all /E/.
The collection of open sets is called the topology of M relative to
0 PRELIMINARIES 7
{ф(.|/Е/}. We assume that the topology of M is Hausdorff (i.e., for
x, y^M with хфу, there are disjoint open sets Vx and Vy with xEF
and y^Vy). We add the technical assumption that M=DkUi where
ik E/, A: = 1,2,3,... . Under these assumptions, {ф(.|/Е/} is called an
atlas of M. Two atlases are equivalent if their union is an atlas. An
equivalence class of atlases is called a differentiable structure on M,
and M together with a differentiable structure is called a (Cx) n-
manifold, n being the dimension of M. Any tpf. Ц^>М" in some
representative atlas is called a chart or coordinate system.
0.2.2 Definition. // M and N are manifolds, then a function f:
M->N is called a (C°°) map, provided that (for all charts (p;: Ul■ -»IR"
onMandxPy. V^R1" onN)\Pj°f°<p7l: «p.((/. n/"'(^.))-»R" is C00.
If f has an inverse that is a map, then f is called a diffeomorphism. IfR
is given its usual differentiable structure, then the collection of maps f:
M->R is denoted by CX(M).
M
Figure 1
8 GAUGE THEORY AND VARIATIONAL PRINCIPLES
0.2.3 Definition. A curve through a point x EM is a map y: (a, £>)-»
M (a<0<b) such that y@) = x. Curves у, and y2 through x are called
equivalent if (y°Yi)'@) = (<p°Уг)'(О) for some (and hence any) chart
(p: £/-»R" with xEU. An equivalence class of curves through x is
called a tangent vector at x; the set of all tangent vectors at x is denoted
by TXM. We write y\Q) or
(=0
for the vector in TXM determined by y. Note that TXM has a natural
vector space structure. If YXETXM (say Yx=y'@)) and /EC°°(M),
then (/o y)'(O) EIR is called the derivative off along Yx, and is denoted
by Yx[f].
0.2.4 Definition. Let TM=UxeMTxM. A vector field on M is a
function Y:M-^TM such that YXETXM and (for all /E CX(M)) the
function x\->Yx[f] is in CX(M); we denote this function by Y[f]. The
set of all vector fields on M is denoted by T(TM). If Y, ZET(TM),
then [Y,Z] is that vector field such that [Y,Z]x[f]=Yx[Z[f]]-
Zx[Y[f]\. We omit the proof of the existence and uniqueness of[Y, Z]
(see Kobayashi and Nomizu [1963]). Observe that [Y,Z]=-[Z,Y],
and [Y,[Z,W]] + [W,[Y,Z]] + [Z,[W,Y]] = 0. The latter relation is
called the Jacobi identity.
0.2.5 Definition. ///: M-^ N is a map andxEM, then Дх: TXM->
Tf(x)N is the linear function (differential of f at x) f*x(y'(O)) =
(/° Y)'@) where у is a curve through x. If YE T(TM) and_ YE T(TN)
are such that f^x(Yx)= Yf(x), then we write Y1.Y. If YUY and ZL,Z,
then [Y, Z]1,[Y, Z] can be proved. In the event that f is onto, we write
this conclusion as fJ,Y, Z] — [fifY, f^Z].
0.2.6 Definition. Let <p: U-^U" be a chart. The coordinate vector
fields 3,,..., Зи on UCM are defined by
dt
(=0
0 PRELIMINARIES 9
where et is the ith standard unit vector in U". Any YET(TM), when
restricted to U, can be expressed as_Y=1a' 3, where a' E CX(U). If y:
U-+ R" is another chart with UD U¥=4>, then (for xE Un U) you may
verify that
<p(x)
where У—2а7Эу and у — (у',..., q>") on U. By an abuse of notation,
we can write
, dxj
Эх'
Vector fields can be regarded systems of functions (a1,..., a"), defined
on coordinate domains, that transform according to the rule above
under a change of coordinates.
0.2.7 Definition. Let YET(TM) be such that (for each xEM)
there is a curve yx: R->M through x with y^(t)= У (() for all t£R.
Such a Y is called complete. For tGU, define yt: M-^M by yt(x) =
yx{t). We can prove that q>, is a diffeomorphism, and that <ps°<p, = <ps+,
for all s,tGU. The set {<pt\tEM} is called the one-parameter group
generated by Y. If ZET(TM), then the Lie derivative of Z along Y is
the vector field LYZ defined by
(=0
{I.e.,
TXM).
( = 0
We can prove that LYZ = [Y, Z]. For a chart (p: £/->R" with Y=la'di
and Z=2bjdj (see 0.2.6) we have [Y, Z] = 2(aldi[bJ]-b'dl[aJ])dJ =
LYZ on U.
10 GAUGE THEORY AND VARIATIONAL PRINCIPLES
0.2.8 Definition. Let TP'C>(M) = UxeMTp-c>(TxM). A 1-form on M
is a function a: M->T°-\M) such that axET0A(TxM) and (for any
YET(TM)) the function a(Y) given by a(Y)(x) = ax(Yx) is in CX(M).
A tensor field of type (p, q) on M is a function S: A/-» Tp-q(M) such
that Sx ETp-q(TxM) and (for any l-forms a,,..., ap and vector fields
YX,...,Y on M) the function S(ax,...,ap,Yx,...,Yq) given by
S(ax,...,ap,Yx,...,Y4)(x) = S(aXx,...,apx,YXx,...,Yqx) is in C"(M).
The space of all tensor fields of type (p,q) on M is denoted by
0.2.9 Definition. A k-form on M is a tensor field w E 5" од( М) such
that wx E.Ak(TxM). The space of k-forms on M is denoted by Ak(M).
For aeA'(M) and /3eAJ(M), we define aA/3eAi+J(M) by (aA/3)x
= ax/\fix. Ify: U-^R" is a chart <p = (x\...,x") (x' ECX(U)), then
dx\...,dx" are defined to be those l-forms on V with dx'(dJ) = 8j. Any
w&Ak(M) can be written on U as
"=
'л '■• Adx'k
where uir..ik = u(dli,...,dik)eC'x'(U).
0.2.10 Definition. If feCx(M), then dfEA\M) is defined by
df(Y) = Y[f]for arbitrary' YET(TM). For wEAk(M), we define dec
to be the (k+\)-form that when restricted to U (in the notation of
0.2.9) is given by
We can prove that dw, as defined, is independent of the choice of
coordinates. In fact, die can be defined (without reference to coordi-
coordinates) as that (k+X)-form such that for any Xx,..., Xk+X ET(TM) we
0 PRELIMINARIES 11
have
A +1
die(X^,..., Xk + ]) = 2j v 1
1 = 1
К i < у < и
where the circumflex means that the symbol beneath it is to be omitted.
The operation d: Ak(M)-^Ak+\M) is called exterior differentiation.
If aEA'(M) and /3&AJ(M), then (from the coordinate definition) we
easily obtain d(aAfi) = daAfi + (— \)'af\df5; and d2 =d° d=0.
0.2.11 Definition. ///: M-^N is a map and ic&Ak(N), then
the pull-back /*uEA'(M) is defined by (/*ш)А.(У,,..., Yk) =
A)* f
C°°(M). It can be proved that df*w=f*dw, /*(аЛ/?)=/*аЛ/*(/3),
and (/og)*w = g*/*w.
0.2.12 Definition. In order to integrate forms, we introduce some
topological notions. A subset WCM is closed if its complement Wc =
{xE.M\x£ W) is open. The closure A of an arbitrary subset ACM is
the intersection of all closed subsets of M that contain A. Note that A is
the smallest closed set containing A. An open covering of A CM is a
collection % of open subsets whose union contains A. A subcover %' of
% is a subcollection (i.e., %' C%) such that %' is an open covering. A
subset К of M is compact if every open cover of К has a finite subcover.
The Hausdorff property of M ensures that a compact subset is closed. If
S is a tensor field on M, then the support of S, denoted supp S, is the
closure of {x<EM\S(x)J=0}.
0.2.13 Definition (Integration). A nowhere zero n-form v on an
n-manifold M is called an orientation for M. The pair (M, v) is called
an oriented manifold. Let a be an n-form on the oriented manifold
(M, v) such that K=supp a is compact. The compactness of К ensures
that there is a finite number of charts (p,: Ut->M,i=l,...,N, such that
\ U ■ ■ • U UN, and (p,((/,)ClR" is bounded, and (see Kobayashi
12 GAUGE THEORY AND VARIATIONAL PRINCIPLES
and Nomizu [1963]) there exist functions p, EC°°(M) such that supp p,
С(/„ 0<р,-<1, and 2fp,-(x)=l /or all xEK. If /3 is an n-form
defined on a bounded open subset D of R" such that supp/SCZD is a
closed subset ofR", then we define
?= / b
where b is the real-valued function defined by /3 = bdxx Л • • ■ Adx".
By an interchange of the components of <pt (if necessary), we can
assume that (on Ц) <p*(dxl Л ■ ■ • Adx") is a positive multiple of the
orientation v for all i — 1,..., N. Then, we define
N
fa=2f <РГ*(Р,«).
JM i=\J<p(U,)
We omit the proof that JMa is independent of the choice of coordinate
covering, and so on (see Spivak [1971]). However, the underlying
reason is that the component of an n-form (in local coordinates)
changes under a change of coordinates by a factor equal to the Jacobian
of the change. Thus, the change of variable formula for integrals
applies.
0.2.14 Theorem (Stokes' Theorem). Let M be an oriented n-
manifold and suppose that а£Л"~'(М) has compact support. Then we
have
da = 0.
M
Proof. Let iC=suppa and let (p,: Ц-+Ш" and p,EC°°(M) (i =
1,..., N) be as in 0.2.13. Then a = '2pla, and it follows that
la)=2 f
i J<p(
ff f
M j JM i \(U,) i J<p(U,)
Each of the integrals in this sum vanishes by the following argument.
Let fi be a compactly supported (n — l)-form onR", say fi = '2bJdxl
0 PRELIMINARIES 13
Л • • • AdxJ Л ■ • • Adx". Then
f
since b- vanishes outside of a compact subset of R". Now, set
<p7**p.a on (рЩ) and /3 = 0 on уЩУ. Then, 0 = /
0.2.15 Remark. Г/геге w a notion of an n-manifold M with an
(n — \)-manifold ЬМ for a boundary. The usual version of Stokes'
theorem that says jMda = JdMoc for compactly supported (и— \)-forms
a (and where dM has an orientation induced from that on M). We use
this result only once (in 10.4.13), and therefore we omit the proof (it can
be found in Spivak [1971], Sternberg [1963], etc.).
0.2.16 Theorem. If f: N->M is an orientation-preserving diffeo-
morphism of n-manifolds, then for any compactly supported aE.A"(M)
wehave j j
Proof. Let K= supp a, and let (p,: Ut,-> R" and p, E C°°(M) be as in
0.2.13. Then we have corresponding charts ^,-=<p,-°/: /1
and ai=plof^Cca(N), needed to define jNf*a. Then
2/
= 2/ «pr'IU0/0/-
= 2/ «P7U(P,«) = / a
14 GAUGE THEORY AND VARIATIONAL PRINCIPLES
0.2.17 Definition. A metric on M is a gE?T °i2(M) such that gx is a
metric on TXM. If M is oriented by с£Л"(М), then the volume element
of (M, g) is the juEA"(Af) such that jxx is a volume element of TxM
relative to gx and [xx is a positive multiple of vx. Let *x: Ak(TxM)-^
A"~k(TxM) be as in 0.1.6 or 0.1.8 relative to gx and цх. Define *:
Л*(М)-»Л"-*(М) by (*a)x = *xax. The relation aA*j8=g(a,j8)/i
(which follows from 0.1.6) implies that *f=fn and *(/м) = (— l)gf for
/EC°°(M) where (~l)g: M->{1,-1} is given by (- l)g(.x) = (- l)g*
(see 0.1.7). We define the codifferential 8: Ak(M)->Ak~\M) by
8a=-(-l)8(-l)"{k+v'>*d*a. ///EC°°(M) = A0(M), we set 5/=0.
Since **= ±1, we have 82 = ±*d2* = 0.
0.2.18 Theorem. Let M be an oriented n-manifold with metric g
and volume element ju. Let aEAk(M) and fi&Ak + ](M) such that
аЛ*/3 has compact support. Then we have
f g(a,
where g(a, 5/3) E C°°(M) is defined by g(a, 8j8)(jc)= gx(ax, 8/Sx).
Proof. Note that g(a, 8/S)tx = a A*8/3 = ~(-l)s(-l)n(k + 2)a
/\**d*fi= (using 0.1.9) -(-l)g(-iykaA(-l)g(-lY"-k)kd*IS =
-(-l)kaAd*IS. Thus, g(da,P)ix- g(a,8/3)ii = daA*p+(-l)kaA
d*[5 = d(aA*f5). Integrating this relation over M and applying
Stokes' theorem @.2.14), we obtain the result. ■
0.2.19 Theorem. Let M=Un with the usual coordinates x\...,xn
and volume element jj. = dxi A ■ ■ ■ Adx", with metric g such that
g,J = ±8IJ.Let
У R Л1' Л
where Bt , =^8C,,...,3, ). Then, we have
W=(-l)k+1jjlPil...,;,ldxl>A..-Adx^ A)
where we use the notation f t■ = ЗД/].
0 PRELIMINARIES 15
Proof. Let S;;;;;;* = (<&'■ Л • • • лЛс'*)(Э,.,..., dJk). For a =
,|...;4й?х'|Л ••• Лй?х''*ЕЛ*Aй"), we have
dald. ,...,3, ) = ут2а, ,, dxik+1 Adx'1 A - • ■
C,,...,3, )
V 7i' ' 7*+!-'
= —а ( — 11*8''•■■'■*+!
Thus, if a has compact support, we have
f g(a,Sp)n= f g(da,
2 «y,.. .л.л+, вл-.-.-^,^'1 ■ ■ • '*
k\ {k+\)\
where S/3 is the right-hand side of A). Thus, jR.g(a, 8^-Щц=0 for
all compactly supported аЕЛ*AЙ"). It follows that 8)8=8)8. ■
0.2.20 Definition. A form шбЛ'(М) is closed if Jw = 0, and is
exact if w=da for some aEAk~\M). Since d2 =0, every exact form
is closed. Let Zk(M) and Bk(M) be the vector spaces of closed and
exact k- forms, respectively. Then the quotient Hk(M) =
Zk(M)/Bk(M) is called the kth de Rham cohomology space of M. If
aEZk(M), then [a]EHk(M) denotes the class determined by a. If
16 GAUGE THEORY AND VARIATIONAL PRINCIPLES
[a]EH'(M) and [fi]EHJ(M), then we define a multiplication [a] A
[/?] E H'+ '(M) by [а]Л[/?] = [аЛ /3]. We must check that d(aAfi) =
0; and if [a\ = [a'] and [)8] = [j8'], we need [аЛ/?] = [а'Л/?'].
However, d(aA fi)=daA p+(-l)'aAdf}=0, and (for a'- a = da
and P'-/3 = db) we have a'A ft'-aA fi=(a+da)A(fi +db)~
aA/3 = daAl3 + aAdb + da A db = d(a A /8) + (- 1)'d(a A b) +
d(aAdb) = d(aAfi+(-\)'aAb + aAdb) whence [а'Л/?'] = [аЛ
P]. Note that [P]A[a] = (— l)tJ[a]A[fi]and the multiplication extends
(by linearity) to the entire space H*(M) — @k Hk(M), making H*(M)
an algebra.
0.2.21 Definition. A k-dimensional submanifold of an n-manifold M
is a k-manifold N that is a subset of M such that V is an open subset of
N iff V=Nr\V for some open subset V of M. Furthermore, it is
required that the inclusion i: N-+M is a map such that i^x: TXN'-»TXM
is one-to-one for all x EN. Thus, TXN can be identified with a subspace
of TXM.
0.2.22 Maxwell's Equations and Differential Forms. Let М-Шл
with coordinates (x°, x\ x2, x3) — (t, x, y, z) and metric g such that
g(d0,30)= 1, gC,., 3,.)= - 1 for i= 1,2,3, and g(8,, bj) = O for i*j (i.e.,
(M, g) isMinkowskispace). Consider the 2-form F—E]dxAdt+E2dy
Adt + E3dzAdt + Bx dyAdz + B2 dzAdx + B3 dx A dy. For dr =
(dx, dy, dz) and da — (dyAdz,dzAdx,dxAdy), we employ the
shorthandF=E'drAdt + B'da. By a simple computation, we obtain
VX£+y •daAdt+(V'B)dT
where dr=dxAdyAdz. Thus, dF=0 iff V XE + dB/dt = 0 and V 'B
= 0, which are two of Maxwell's four equations (where E is the electric
field and В is the magnetic field). Now, *F—E'da—B'drAdt, and so
d*F= (v -E) dr- Iv-B- -^
-duAdt.
Now 8=-(-l)g(-l)*ik+])*d* = *d* on A*(IR4). Thus, 8F=*d*F=
(VE)dt-(VXB-dE/dt)-dr. Let the maps p: R4-^IR andJ: R4->
R3 be the charge density and the current density, respectively. The
source 1-form jEA\R4) is defined by j= pdt—J'dr. Then 8F—j is
0 PRELIMINARIES 17
equivalent to the other two (inhomogeneous) Maxwell equations, V 'E
= p and \?XB — dE/dt=J. Thus, the four Maxwell equations are
summarized by dF—O and 8F=j. Applying 8 to 8F=j, we obtain
0 = 82F=8j=*d*j=*d(PdT-J-daAdt)
= * f -£-dtAdr- V 'JdrAdt) = - (^r + V •/) .
\ ot ■ I \ dt I
Thus, we obtain the so-called "continuity equation"
which says that charge is conserved.
0.2.23 Vector-Valued Forms. Let V be a vector space with a basis
u,,..., i5m. // a1,.. ., am E ЛА(М), then the function x н> a\vx
+ ■■■ +a™vm &Ak(TxM, V) is called a V-valued k-form, the space of
V-valued k-forms is denoted by Ak(M, V). For a = a1u1 + • • ■ +amvm
<EAk(M,V), we define da = {dax)vx + ■ ■ ■ +{dam)vm EAk(M, V),
and {in the case in which M is oriented and has a metric g) *a =
{*av)vx+ ■■■ +(*am)vm and 8 = (-l)g(-l)"(k+l)*d*. Let h be a
metric on V, and suppose that a, fiEAk(M,V). Then (gh)(a, /S)E
C°°(M) is defined by (gh)(a,P)(x) = (gxh)(ax,/3x) (see 0.1.5). //
a = 2aava and P = l£bvb and hah = h(va,vh), then (gh)(a,/3) =
1habg(a",fib). We have the following analogue of 0.2.18.
0.2.24 Theorem. Let M be as in 0.2.18, and let V be a vector
space with metric h. Suppose that aEAk(M, V) and @EAk + \M, V)
are such that suppBhaha" A*fib) is compact. Then, we have
f
Proof. For a = Iaava and P = lfibvb, we have fM(gh)(a,8f})ii
^hf8^2fHhfd^ Ш
0.2.25 Definition. Let M be an n-manifold and let N be an m-
manifold. We define the product manifold MXN of dimension m + n as
follows. Let {(p,: U,-+Rn} be an atlas for M and let {^: F; ->Rm} be
an atlas for N. Define <p,X^.: U,X V}->R"+m by (
18 GAUGE THEORY AND VARIATIONAL PRINCIPLES
^XlRm=IR"+m. Then {(р,Х-ф;: (^ X F;-^R"+m} will
be an atlas (for MXN), making MXN an (n + m)-manifold.
0.2.26 Definition. Let fECx(M) and suppose that df = 0 at some
xEM. then x is called a critical point of f. The Hessian of f
at such an x is a symmetric bilinear function (D2f)x: TXM XTXM~->
R defined by (D2f)x(Yx, Zx)=Yx[Z[f]\, where ZET(TM) is such
that ZX = ZX. Suppose YET(TM) with YX = YX. Then Zx[Y[f]] =
[Z,Y]x[f]+Yx[Z[f]] = Yx[Z[f]], since df = 0 at x. Thus, (D2f)x is
not only independent of the extension Z, but also symmetric.
0.3 LIE GROUPS AND LIE ALGEBRAS
0.3.1 Definition. Let G be an n-manifold and a group such that the
groups operation GXG-+G given by (g\, g2)^>gxg2 and the function
G-> G given by gh->g~] are (C°°) maps. Then G is called a Lie group.
0.3.2 Definition. Let Lg: G-^ G be defined by Lg(g')=gg'; Lg is a
diffeomorphism. Let e be the identity element of G, and let A ETeG.
Define A ET(TG) by Ag = Lgif(A)\ A is called the left-invariant vector
field determined by A.
0.3.3 Definition. Let §=TeG, and (for A,BE§) define [A,B]E§
by [A,B] = [A,B]e (see 0.2.4). Note that [A,B]=-[B,A] and
[A,[B,C]] + [C,[A,B]] + [B,[C,A]] = 0 (the Jacobi identity). Then g
(together with the bracket operation [ , ]) is called the Lie algebra
ofG.
0.3.4 Definition. For AE§, we can prove that A is a complete
vector field (see 0.2.7). Let {<pt} be the one-parameter group of diffeo-
morphisms generated by AE§. Let y: R->G be the curve through e
defined by y(t) — (pt(e). We prove that y(s+ t)=y(s)y(t) (group multi-
multiplication). Let sEU. be fixed and let yx(t) = y(s+ t), while y2(t) =
y(s)y(t). Then у1A) = у'^ + 1)=Ау{5 + 1) and y£t)= Ly{s)*(y'(t))
= Ly(s)*(AyU))=Ly(s)*(Ly(,)*A)=Ay(s)y(,v Th^s, Yi and y2 are in-
integral curves of the same vector field A, and (since y]@) = y(s) = y2@))
it follows thatyx(t)=y2(t) (i.e., y(s+ t) = y(s)y(t)). Thus, y: R-^G
is a homomorphism. Conversely, given a curve and homomorphism a:
0 PRELIMINARIES 19
U-^G, then \pt: G-^G (defined by \pt(g) — ga(t)) is a one-parameter
group of diffeomorphisms of G such that
defines the left-invariant vector field В determined by B=Be. Thus,
there is a one-to-one correspondence A<-*y. We define the exponential
map exp: §->G by exp(A) = y(l). Note that y(t) = exp(tA), and <pt(g)
0.3.5 Example. Let V be a vector space with dim V—m<oo, and let
GL(V) be the group of invertible linear functions F: V-*V. By
regarding GL(V) as a group of matrices, it is simple to see
that GL(V) (an open subset ofW) is a Lie group. Let IEGL(V) be
the identity, and denote T,(GL(V)) by §i(V). Note that §l(V) can be
identified with the vector space of all linear functions A: F-> V, the
correspondence being
l=0
ForAE§t(V), let
Exp(A)=I+A + ^A2 + ^A3 +
It is not hard to prove that the sum converges, and that Exp((t + s)A)
— ExptA Exp sA. Thus, Exp(A) Exp(—A)=I and so Exp(A)E
GL(V). Note that tn>Exp(tA) is a curve and a homomorphism with
ftExp(tA)
=A.
= 0
It follows from the discussion in 0.3.4 that Exp is the exponential map
for GL(V). In 0.3.10, we will prove that (for A,BE§t(V)) [A,B] =
AB-BA.
0.3.6 Definition. A Lie subgroup of a Lie group G is a submanifold
(of G) that is also a subgroup of G. A Lie subgroup H of G is itself a
Lie group. Since the homomorphisms y: f£ —*// are also homomor-
20 GAUGE THEORY AND VARIATIONAL PRINCIPLES
phisms into G, we have that exp: %-^H is just exp: §-> G restricted to
%. The next theorem implies that [ , ] on % is just [ , ] on § restricted to
0.3.7 Theorem. Let G and G' be Lie groups, and let F: G-> G' be a
C°° homomorphism. Then F^e: §->§' is a linear function such that
F^e([A, В]) = [Р^еА, F^eB] (i.e., Feif is a homomorphism of Lie alge-
algebras).
Proof. Note that F<> Lg(g') = F(gg') = Fjg)F(g') = (LF(s) о F)
(g'). Thus, F^Ag)=F^Lg^)=LF(g)t,AF^A) = (F^A)ng), and
so ^(Л) = (^еЛ). Using 0.2.5, we obtain F^([A,B]) = [
0.3.8 Definition. For gEG, let Adg: G->G be the C00 adjoint
isomorphism given by Adg(g')= gg'g~ '■ We let &bg: § -> % be the
g g
induced isomorphism of' § provided by 0.3.7 (i.e., $bg = &bgife). Let
gife)
&b: G-> GL(§) be the homomorphism gn> &bg. Then 0.3.7 gives us an
induced homomorphism ab: §-> §t(§) (i.e., ab = &b ).
0.3.9 Theorem. For A, В Е §, we have
Proof. Let {(pr} be the one-parameter group generated by A. By the
end of 0.3.4, wehave(p,(g) = gexpL4. Using LjB = [A, B] (see 0.2.7),
we have (at s = t — O)
di
——
= ~&b(cxp(
= ab(A)(B).
0 PRELIMINARIES 21
О.З.Ю Corollary. // G is any Lie subgroup of GL(V), then the
bracket operation on §C§i(V) is given by [A,B]=AB — BA.
Proof. By 0.3.6, it suffices to consider the case in which G = GL( V).
Using 0.3.9 with exp = Exp (see 0.3.5), we have
=AB-BA.
s,t = O
0.3.11 Definition. Let <?,,...,<?„ be a basis for the Lie algebra § of
G. The structure constants c,*- 6R are defined by [е„ eJ] — '2cfek. Note
that [ej,et]= — [ene}\ implies с^ = —с^. The Jacobi identity yields
2 l
p kTJ
>;mcZ)eh. Thus, 2mc*m<#
#
0.3.12 SU(n), the Special Unitary Group. The computation of the
Lie algebra of a Lie group of matrices is illustrated here for the group
SU(n), which is frequently used in elementary particle physics. Let
Ш(п,С) be the space of all nXn matrices with complex entries. For
A E§t(n,C), let A* denote the conjugate of the transpose of A. Recall
that the unitary group is U{n)-{A &§t(n,C)\AA*=I) and SU(n) =
{A E U(n)\detA-l}. If t^A(t) is a curve in U(n) with A@) = I, then
(at t = 0) we have
=A'@)A@)*+A@)A'@)*=A'@)+A'@)*.
Thus, for S= {Be§t(n,C)\B + B* = 0}, we have ^>D%(n) = the Lie
algebra of U(n). Conversely, if 5E§, then (ExpB)(ExpB)* =
(ExpB)(ExpB*) = Exp(B)Exp(~B)=I, and so ExpBEU(n). At
t = 0,
B= J-
whence %(«) = §. The Lie algebra S%(«) of SU(n) is the subalgebra
<?/%(«) consisting of matrices with trace 0 (i.e., §%(«)= {ВЕ%(п)\
22 GAUGE THEORY AND VARIATIONAL PRINCIPLES
tvB = O}). This follows immediately from the formula det(Exp B) = e'rb,
which is valid for any nXn matrix. We can prove this formula as
follows. Let f(t) = det(Exp tB). At h = 0, we have
= det(ExptB)-^det(l+hB)
= det(ExptB)tvB=(tvB)f(t).
Thus, f(t)=f(O)e{lrB)' = ei"'B)', and setting t- 1 yields the result.
CHAPTER
1
Principal Fiber Bundles and Connections
In the introductions to this and the following chapters, the topics
and results to be covered will be outlined, and some motivation will
be supplied to whet the grindstone, but no miracles are promised.
You may rest assured that you need not comprehend or agree with
the introductions in order to understand and accept the proper parts
of the chapters.
In this chapter, principal fiber bundles (PFBs) will be defined
and some nontrivial examples will be given (i.e., the double covering
of the circle and the frame bundle of a manifold). Three ways of
defining connections (ie., gauge potentials) will be proved to be
equivalent. The connection of a PFB with group U(l) over space-time
will be physically identified as the four-dimensional vector potential
of electromagnetism.
David D. Bleecker, Gauge Theory and Variational Principles ISBN 0-201-10096-7
Copyright © 1981 by Addison-Wesley Publishing Company, Inc., Advanced Book
Program/World Science Division. All rights reserved. No part of this publication
may be reproduced, stored in a retrieval system, or transmitted, in any form or by
any means, electronic, mechanical, photocopying, recording, or otherwise, without
the prior permission of the publisher.
23
24 GAUGE THEORY AND VARIATIONAL PRINCIPLES
In order to motivate the introduction of PFBs into physics, the
following considerations are offered in addition to the Prologue. In
the Prologue it was stated that a PFB could be regarded as a smooth
concatenation of reference frames. Since all measurements are made
relative to a choice of frame, and the measurement process can never
be completely divorced from the aspect of the universe being mea-
measured, we are led to the conclusion that the bundle of reference
frames should play a part in the very structure of the universe as we
perceive it.
In a different'vein, a PFB with a gauge potential provides a
natural way of geometrizing the forces of that potential. Indeed, we
can extend the following lesson from general relativity to show this.
Recall that two geodesies (straightest possible paths) in a possibly
curved space coincide if they are tangent at a point of contact. The
paths of massive objects in the three-dimensional space in which we
live can be tangent at a point without coinciding. In Figure 2, for
example, bullet 1 might escape the pull of the earth's gravity, while
bullet 2 (shot in the same direction, but with less speed) has fallen
back. We conclude that massive objects do not travel along geodesies
of some three-space geometry. In four-dimensional space-time, the
two bullets have paths that are not initially tangent, since the speeds
of the two bullets differ. Thus, the possibility that massive objects do
trace out geodesies in space-time (instead of in space alone) cannot
be ruled out by the example. Indeed, general relativity tells us that
the paths of objects that are subject to only the force of gravity are
geodesies relative to some geometry imposed on space-time. In other
words, adjoining the time dimension permits the geometrization of
the force of gravity.
Suppose that the earth carries a large positive electric charge,
bullet 1 has a negative charge, and bullet 2 is neutral. Even if the
paths of the bullets are initially tangent in space-time (i.e., shot in the
same direction with the same speed), the paths will not be the same.
Thus, the geometrization of electrical forces requires another dimen-
dimension, the "charge dimension." The resulting five-dimensional space
(the invention of Kaluza [1921] and Klein [1926]) is actually a PFB
with group [/A), the circle. A connection (or gauge potential) en-
endows the five-space with a geometry and allows us to speak of the
charge component of a geodesic in five-space. If the charge compo-
component of a certain geodesic is q, then the projection of this geodesic
1 PRINCIPAL FIBER BUNDLES AND CONNECTIONS 25
Figure 2. Here we have reduced space by one dimension. In deference to the
Flat Earth Society, the earth's surface at t = 0 is the x axis and the interior of the
earth traces out the half-space y<0. Curves 1 and 2 are the paths (or world
lines) of bullets 1 and 2 in space-time. Paths Г and 2' are the spatial projections
of paths 1 and 2 into the x-y plane. Paths Г and 2' cannot be geodesies relative
to some geometry imposed on the x-y plane, since they are tangent at @,0) and
yet are distinct. However, paths 1 and 2 are geodesies in a curved space-time.
They are still distinct, but they are not tangent at @,0,0) in space-time, since
bullet 2 was shot with less speed.
onto space-time is the nongeodesic path of an object of charge q
subject to the force of the gauge potential. The invariance of the
gauge potential under the U{\) action on the PFB implies the
conservation of charge. We will not verify these claims explicitly
until Section 10.1 (which can be read after studying Chapters 1 and 2
and Section 6.2). The purpose of bringing them up now is to
motivate the adjunction of extra dimensions to space-time (i.e., to
26 GAUGE THEORY AND VARIATIONAL PRINCIPLES
geometrize nongravitational forces via connections on PFBs in about
the same way that the adjunction of time to space permitted the
geometrization of the force of gravity). There are many other types of
charge that respond to various other kinds of forces (e.g., isospin;
hypercharge; red, blue, and green charge; weak charge). The associ-
associated geometrization of these forces requires PFBs with larger non-
Abelian groups, and so five dimensions are not nearly enough.
1.1 PRINCIPAL FIBER BUNDLES
1.1.1 Definition. A principal fiber bundle (PFB) consists of a mani-
manifold P {called the total space), a Lie group G, a base manifold M, and
a projection map it: P->M such that (A), (B), and (C) following hold.
(A) For each gEG there is a diffeomorphism Rg: P->P (we write
Rg(P)=Pg) s^ch that p(gxg2) = {pg\)g2 for al1 &\>&г eG and
pGP; and if eEG is the identity element, then pe=p for all
pEP. We require the function PXG->P given by (p, g)\-> pg to
be a map. We suppose that if pg—p for some p E.P and gEG,
then g — e. We summarize this paragraph by saying that G acts
freely (and differentiably) on P to the right.
(B) The map it: P->M is onto, andir^x(ir(p)) = {pg: gEG} (which
is, by definition, the orbit of G throughp). If x&M, then тт~\х)
is called the fiber above x. Note that for each p Елт~\х) there is
a map G-^ir^\x) given by gv+pg. This map is a diffeomor-
diffeomorphism by (A), but it depends on p. Thus, all the fibers it ~ \x) are
diffeomorphic to G, but there is no canonical identification of
тт~\х) with G, and hence no natural group structure on тт~\х).
(C) For each xEM there is an open set U with xEU and a
diffeomorphism Tu: тг "'([/)-> UXG of the form Tu(p) = (ir(p),
su(P)) where su- ir~\U)-^G has the property su(pg)=su(p)g
for all g£G,pEtt~~](U). The map Tu is called a local trivializa-
tion (LT), or (in physics language) a choice of gauge.
1.1.2 Remark. We abbreviate the foregoing by saying that it: P-^M
is a PFB with group G. If N is a manifold and G is a Lie group, we can
form a PFB it': NXG->N with group G by setting iT'(n,g) — n and
(n, g)g' = (n, gg'). This is called the product PFB of N with G. Note
that (C) states that for every xEM there is a neighborhood Uofx such
1 PRINCIPAL FIBER BUNDLES AND CONNECTIONS 27
that the restricted PFB it: tt~\U)-> U can be identified (via Tu) with
the product PFB of U with G. The requirement on su is necessary in
order that Tu respect the action of G (i.e., Tu(pg) = Tu(p)g).
1.1.3 Definition. Let Tu: tt~\U)->U XG and Tv: ir~\V)-+V XG
be two LTs of a PFB it: P -» M with group G. The transition function
from Tu to Tv is the map guv: UnV->G defined, for x = ir(p)EUr\V,
by guv(x) = su(P)sv(P)~]- Note that Suv(x) is independent of the
choice ofpEir-\x) because su(pg)sv(pg)~1 = su(p)g(sv(p)gyl =
su(P)gg ^v(p)~]=su(p)sv(p)'\ We have
@ g
(ii) g
(Ш) guv(y)gvw(y)gwu(y) = e № all y
The transition functions describe how the various products UX
G,VXG,... glue together to form the total space P. Indeed P may
be considered as the space obtained from the disjoint union (UXG)
U(FXG)U • • • by identifying the point (x, g)EUXG with (x, g')
EVXG if g—guv(x)g'. Because of (i), (ii), and (iii), this identifica-
identification is an equivalence relation. Thus, we see that a PFB can be
essentially recaptured by its transition functions.
1.1.4 Definition. We define a local section of a PFB it: P^M with
group G to be a map a: U^>P(UdM, Uopen) such that "n°o= \u =the
identity function on U(x\->x).
1.1.5 Theorem. There is a natural correspondence between local
sections and local trivializations.
Proof. If a: U^P is a local section, then define Tu: -u~\U)^UXG
by Tu(a(x)g) = (x,g). Conversely, given an LT Tu: it-\U)^UXG,
define a local section a: U-^P by o(x) = Tu~\x, e). ■
1.1.6 Remark. // Tu is a local trivialization with U—M (i.e., TM:
P-+MXG), then TM is called a global trivialization, and the PFB is
called trivial if such a TM exists. A local section a: U^P is called a
global section if U=M. In 1.1.5, global sections correspond to global
trivializations.
28 GAUGE THEORY AND VARIATIONAL PRINCIPLES
1.1.7 Example. Let 5'={zEC| |z| = l) be the unit circle in the
complex plane. Let it:S]^S] be given by тт(г)= z2; so it~\z2) =
{z,~ z). Let G = {e, g} = the two-element group, and let ze = z while
zg = - z for zES\ Set 17=5'-{1} and V=Sl-{~l), and let a:
U -» S1 be the local section defined by taking a( w) to be the square root
of w with Im(a(w))>0 while т. F-» S] is defined by taking t(w) to be
the square root of w with Re(T(w))>0. By 1.1.5, we have local
\ de-
detrivializations
termined by
\e
Thus, for w = z
т ■
if
if
2
ir-\U)
Im(
Im(
-*ux
>o,
<0;
G
y(z) = |*
77-1
if
if
Re(
Re(
F
XG
>o,
<0.
вт(»)=ФК(*) =[g if Im(w)<0.
Note that this PFB is not trivial, since the total space is S1 instead of
S1 X G, which is the disjoint union of two copies of Sx. Consequently,
there is no section defined on all of Sx; that is, there is no way to define
a continuous square root function on all of S1.
1.1.8 Example. Let M be an n-manifold. We will define a PFB it:
L(M)^>M with group GL(n,R), called the frame bundle of M. A
frame at xEM is a linear isomorphism u: U" -* TXM. Note that such a
frame determines a basis u{ex),..., u(en) of TXM where ex,...,en is the
usual basis ofR". Let L(M)X be the set of all frames at x, and set
L{M)= U L(M)x.
x<EM
For uEL(M)x,T7(u) = x. For AEGL(n,R), we define RA: L(M)-*
L(M) by RA(u) = u°A. This is a free, right action of GL(n,R) on
L(M), but we need to put a differentiable structure on L(M) before we
can speak of it: L{M)-*M or L(M)XGL(n,R)-*L(M) as being C°°.
Let W<ZM be a coordinate neighborhood with coordinates x\...,x"
and associated coordinate fields 3,,...,3n on W. Define a map a:
W-^L(M) by letting a(y): R"-*T M be the isomorphism such that
1 PRINCIPAL FIBER BUNDLES AND CONNECTIONS 29
a(y)(el) = (di)v. Let sw: ir"\W)^GL(n,R) be defined by sw(u) =
о(тт(и)) ]°u. Note that sw(u°A)=sw(u)° A. Define Tw: ir~\W)-^
WXGL(n,R) by Tw(u) = (tt(u),s^u)). Since WXGL(n,R) has a
differentiable structure, the maps Tw for various W<ZM define a
differentiable structure on L(M) provided that Tw,oT^]: (Wn W')X
GL(n,RH is C°° where W is another coordinate neighborhood {on
M) with coordinates x'\...,x'" and fields 3;,...,3^. From the equa-
equations Tw(u) = (tt(u), sw(u)) and Tw,(u) = (tt(u),sw,(u)), we obtain (for
y = iT(u)eWn W) Tw, о T~\y, A) = (y, sw,(u)sw(u)~ lA) =
(y,gW'W(y)A)- Hence, it suffices to prove gw,w: Wn W'-*GL(n,R) to
be C°°. For y = ir(u)^WDW', gw.w(y) = sw.(u)sw(u)~l =
(a'(y) ' о u)(o(y)~l о и) ' =о'(уухо(у), but this is just theJacobian
matrix with entires ЭД x"'] that are C°° on Wn W. The maps Tw, Tw,,...
are now diffeomorphisms, and hence are LTs. The maps it: L(M)-*M
andL(M)XGL(n,R)-*L(M) locally correspond (via the LTs) to the
C°° maps WXGL(n,R)-*W and (WXGL(n,R))XGL(n,R)-^WX
GL(n,R), and hence are C°°. By 1.1.6, the PFB it: L(M)-*M is
trivial iff there is a section M-*L(M), or in other words, a sequence of
vector fields Xx,...,Xn on M that are independent at each point. This
condition cannot be met on any compact surface, except for a torus; we
omit the proof (see Spivak [1971], Vol. 1).
1.2 CONNECTIONS
Connections can be defined in at least three ways. We will prove
that Definitions 1.2.1, 1.2.2, and 1.2.3 are equivalent. In the follow-
following we assume that it: P-*M is a PFB with group G; dim Л/=и.
1.2.1 Definition. A connection assigns to each pEP a subspace
HpCTpP such that for Ур={ХеТрР\тт^(Х)=О} we have TpP=Hp@
Vp. We require that Rgij!(Hp)—Hpg. Moreover, we assume that Hp
depends smoothly on p, in the sense that there are n vector fields
(defined on a neighborhood Uof p) that span Hq at each qEU. We call
Vp the vertical subspace of TpP, while Hp is the horizontal subspace
(see Figure 3).
1.2.2 Definition. Let § be the Lie algebra of G. A connection is a
§-valued \-form w defined on P such that the properties (a) and (b)
hold.
30 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Vr
Figure 3
(a) Let AE§ and let A* be the vector field on P defined by
d
Then ьо(Л*)= A. A* is called a fundamental field.
(b) For gEG, let &bg: g-^g be defined as in 0.3.8. We require
"„*(**•*)= &bg-lUp(X) for all gEG, pEP, and XE TpP. In
other words, /?*w = &b -iw.
We call w a connection 1- form.
1.2.3 Definition. A connection assigns to each LT Tu: ir
X G {i.e., choice of gauge) a %-valued I-form wu on U. If Tv is another
LT and guv: U DV-^G is the transition function from Tu to Tv, then we
require iov(Yx)= Ll\x)Jf{guvJf{Yx)) + <$,Ъgm{x)-i{aJJx)) for all YXE
TXM and xEUГ) V. If G is a group of matrices, we can rewrite this
condition as follows. In matrix notation and with у a curve with
1 PRINCIPAL FIBER BUNDLES AND CONNECTIONS 31
= Yx, we have (at t =0)
— о (х) ' di (Y\
where dguv is d of the matrix-valued function guv (see 0.2.23, where V is
a vector space of matrices). For a group of matrices we have
d_
dt
dt
(see 0.3.8). Thus, 0bgMj(JC)-.o>u(Yx) = g;v\x)wu(Yx)guv(x). Conse-
quently, the transformation rule from ыи to u>v can be expressed as
1.2.4 Theorem. Definitions 1.2.1 and 1.2.2 are equivalent.
Proof. Suppose w is the connection 1-form of 1.2.2. Let Л/) =
TpP\icp(X) = 0}. We prove thatp^ Hp is a connection in the sense of
1.2.1. From 1.2.2(a) it follows that Hp®Vp = TpP. Also Rg^(Hp) =
Hpg, because (from (b) of 1.2.2) u(R X) = &bg-iu(X) = 0 for Xe
Hp. Conversely, suppose that p н> Нp is a connection in the sense of
1.2.1. For A* as in (a) of 1.2.2 and Xp EHp, define wp: TpP-^§ by
up(A* +Xp) = A. Condition (a) of 1.2.2 then holds. For (b) of 1.2.2,
we need to prove that upg(Rg^Y) = &bg-l(up(Y)) for all Г£Г/. If
YEHp, then Rgij!YeHpg, and so both sides vanish. If Y=A* for
some A Eg, then (at f = 0)
32 GAUGE THEORY AND VARIATIONAL PRINCIPLES
By linearity, we then have (b) of 1.2.2, and so w is a connection
1-form. ■
1.2.5 Theorem. Definitions 1.2.2 and 1.2.3 are equivalent.
Proof. Let w be a connection 1-form as in 1.2.2. If Tu: tt~\U)^>U
X G is an LT with associated local section au: U-^P given by 1.1.5,
then set wu=a*w. We prove that the assignment Tu^>u>u is then a
connection as in 1.2.3. Let Tv be another LT with local section av.
We need to check that the transformation equation of 1.2.3 holds
with uu = a*u and wv=a*w. Writing Tu(p) = (ir(p), su(p)), we
see that (for x = ir(p)eU) Tu(ou(x)su(p)) = (x,su(ou(x)su(p))) =
(x, su(ou(x))su(p)) = (x, esu(p)) = (x, su(p))=Tu(p). Thus, p =
ou(x)su(p), and similarly, p = ov(x)sv(p). Consequently, we have
av(x) = a«(x)su(p)sv(p)-l=ou(x)guv(x). Let YETXM, and suppose
that y: R -^M is a curve with y'@) = Y. Then (at f = 0)
dt °
— г -1
Now
by inserting the expression for ov^(Y) and applying (a) and (b) of
1.2.2. Thus, the assignment Ги^ьоц=а* is a connection as in 1.2.3.
Conversely, suppose that Tu\^>iou is a connection as in 1.2.3. Let
au: U^>P be the local section associated with Tu. Forp = a(x), xEU,
YETXM, and Л Eg, we define w": TpP^§ by tou{o^
p
+A. We extend w" to all of 77~\U) via the formula (for Xpg E TpgP)
tc"(Xpg) = &bg^u"(Rg-uXpg). It is left to the reader to verify that
w" is a connection 1-form on the restricted PFB 77: tt~\U)^>U. If Tv
1 PRINCIPAL FIBER BUNDLES AND CONNECTIONS 33
is another LT, then we can define w" on tt~\V) similarly. Once we
prove that u>" = wv on ir~\UnV), then the various w", w°,... piece
together to define a connection 1-form w as in 1.2.2. If w" and w"
agree on the set ov(UC\ V), then they must agree on all ofiT~l(Uf) V)
by 1.2.2(b). Now iou(A*)=A = icv(A*), and so we need only check
that o?(ovj) = tou(ovj) for YETxM,xEUnV. But wv(a^Y) =
uo(Y), while
which is woG), because of the transformation rule of 1.2.3. Thus
wu,wv,..- do piece together to define a connection w as in 1.2.2;
moreover, we see that icu = o*u>, tov = ao*w, etc. ■
1.2.6 Remark. Any PFB admits a connection. You can find a proof
of this fact in Kobayashi and Nomizu [1963].
1.2.7 Physical Interpretation. Physicists refer to the l-forms wu (in
1.2.3) as gauge potentials. In Chapter 2, we discuss (see 2.2.10) how the
field strength is computed from wu. For now, we consider the special
case of electromagnetism. Suppose that (M, g) is Minkowski space (as
in 0.2.22), and let it: P^M be a PFB with group U(l)= {eie\e<=R}
whose Lie algebra is %A) = {ia\ aEIR}. Suppose that w is a connection
1 -form on P, and let оu: U^> P be a local section. Thenwu=a*u>=—iAu,
where Au EA'((/,IR) is called the potential 1-form (or "vector poten-
potential"). The electromagnetic field strength relative to a: U-^P is then
Fu = —dAu EA2((/,IR). // av: V^P is another local section, then we
can prove that FV=FU. Indeed, according to 1.2.3, ">v=g~Jdguv +
8uvUuguv=guvi dguv + itu since U(\) is Abelian (i.e., commutative).
N™juvguv=l implies that d(g-v1)gUv+g^J dguv = 0 or d(g~v]) =
~SUJ dguvgj = ~guv2dguv. Thus, -d(gj dguv) = guv2 dguv/\dguv
= 0, and it follows that du>v =du>u or Fv =FU. Hence the Fu (for various
o: U->P) piece together to yield a well-defined FEA2(M,R), which is
interpreted as in 0.2.22. We will see that for nuclear forces, where the
group (typically SU(n),n>2) is non-Abelian, the field strength on
U<ZM is a nonlinear function of ыи, and it depends on the choice of
gauge ou: U-^P.
CHAPTER
Curvature and ^-Valued Differential Forms
The curvature of a connection corresponds to the physical notion
of the field strength of a gauge potential. While the local gauge
potentials transform in a fairly complicated way under a change of
gauge (see 1.2.3), the local field strengths obey a simpler transforma-
transformation rule (see 2.2.14). In the case of an Abelian group, the local field
strengths are invariant under a change of gauge. Thus, in the case of
electromagnetism, the local field strengths piece together to yield a
well-defined field strength on the base (see 1.2.7). For non-Abelian
groups the field strength is a well-defined ^-valued 2-form on the
total space P, but it does not give rise to a well-defined form on the
base. However, because of the simple transformation rule for local
field strengths, we can produce scalars from them that are gauge-
independent functions on the base. One such function is the self-
David D. Bleecker, Gauge Theory and Variational Principles ISBN 0-201-10096-7
Copyright © 1981 by Addison-Wesley Publishing Company, Inc., Advanced Book
Program/World Science Division. All rights reserved. No part of this publication
may be reproduced, stored in a retrieval system, or transmitted, in any form or by
any means, electronic, mechanical, photocopying, recording, or otherwise, without
the prior permission of the publisher.
34
2 CURVATURE AND ^-VALUED DIFFERENTIAL FORMS 35
Action, which is introduced in 5.2. For the electromagnetic field, the
KH2Hlll2
Both the gauge potential and the field strength are examples of
Lie algebra-valued forms. The algebra of all such forms is an
infinite-dimensional graded Lie algebra. Graded Lie algebras play an
essential role in physical theories of supersymmetry (i.e., Bose-Fermi
symmetry). Whether the graded Lie algebra of g-valued forms will
ever play such a role is uncertain. The reason for introducing this
algebra here is that calculations involving gauge potentials and field
strengths proceed with greater ease, and the use of structure con-
constants and indices can be avoided. The proof of the homogeneous
field equation (or Bianchi identity) is especially simplified (see 2.2.8).
The operation of exterior covariant differentiation of g-valued
differential forms on P, defined in 2.2.2, can easily be extended to
any vector-valued form on P. This is done in 3.1.3, but physicists
may not recognize this operation as that which replaces the ordinary
derivative via the principal of minimal coupling until we reach 3.1.5.
2.1 THE GRADED LIE ALGEBRA OF S-VALUED FORMS
Let TV be a manifold and § a Lie algebra. We denote the set of all
g-valued A:-forms on N by Ak(N, g).
2.1.1 Definition. Let (pEA'(N,§) and ^EA^iV, g). We define
where a ranges over the permutations of {1,2,..., i +j); (— l)a = ±\,
depending on whether a is even or odd; [, ] on the right-hand side is the
bracket of g; and Xx,..., Xi+J are arbitrary vector fields on N.
2.1.2 Component Formulation. Suppose that AE§ and <p is a
U-valued к-form on N; then we define <р®Л EAk(N, g) by (cp®
A)(Xl,...,Xk)=y(Xl,...,Xk)A for Xx,...,XkETyN. It is a simple
matter to check that {for another R- valued form \p on N and BE§)
36 GAUGE THEORY AND VARIATIONAL PRINCIPLES
,5]. Let Ex,...,Efbe a basis for g, with
structure constants c%p defined by [Ea, Ep] = '2cl^Ey. For q>E A'(N, g)
and \pE A'(N, g) there are unique R-valued forms y" and ^{a, /3 =
1,..., /) such that y = I(pa®Ea and ^ = 2^®£^. Then, we have
[]
а,/? ct.fi,у
2.1.3 Theorem. For (pEA!GV, g), ^eAJ(N, g)
we
(i) [^,«p]=-A)i7[«p,^];
B) (- iy*[[<p, ^], p] + (- l)^[[p, <p],^] + (- 1)>![[^ p], T] = 0.
In other words, the algebra of g-valued differential forms on TV is
a graded Lie algebra.
Proof. Relation A) follows (in the notation of 2.1.2) from у"
= (- l)'7^ Лф" and [Ea, Ep]= -[Ep, Ea]. For relation B), observe
that the left-hand side is
«,/S, у
which vanishes by the Jacobi identity for g. ■
2.1.4 Theorem. For <pEA'(N,§) and xpeAJ(N,§), we have
Proof. With the notation of 2.1.2, the result follows from d(ya Л
2 CURVATURE AND g-VALUED DIFFERENTIAL FORMS 37
2.2 CURVATURE
Given a connection 1-form w on a PFB тт: P^> М with group G,
we can write any XETpP as X=XV + XH where Xv is vertical (i.e.,
it (Xv) = 0) and XH is horizontal (i.e., w(XH) = 0).
2.2.1 Definition. If <pEAk(P,§), then we define q>H EAk(P, g) fey
2.2.2 Definition. The exterior covariant derivative of <p&Ak(P,§)
is Dw(p = (dup)H EAk+\P, g) where dip is the usual exterior derivative
of (p. Although the operator D" depends on w, it is customary to omit
the superscript w. Because we will consider functionals on the space of
connections and other situations where more than one connection is
involved, we will usually not observe this custom.
2.2.3 Definition. The curvature of the connection wEA'(P,g) is
Я" =D"wEA2(P, g). When u is regarded as a potential, Я" is called
the field strength of w.
The following structural equation will permit us to write an
expression for the field strength that looks more familiar to physi-
physicists.
2.2.4 Theorem (The Structural Equation). The curvature form is
given by fi"=dw + ^[w, w] (i.e., D"w=dw + ^[w, w]).
2.2.5 Lemma. Given a vector field X on M, there is a unique vector
field X on P such that u>(X) = 0 and чт^(Хр) = Х„1р) for all peP.
Necessarily, Rgij!X=Xfor all gEG. The field X is called the horizontal
lift of X.
Proof. The existence and uniqueness of X follow from the fact that
"■„,: H -*Tn, ^M is an isomorphism. We omit the proof that X is
smooth, but this is really clear from the smoothness of w. Observe
that TT*(RgJp) = (ir°Rg)il!(Xp) = iT^Xp) = X7T(p). Thus, RgJp =Xpg.
38 GAUGE THEORY AND VARIATIONAL PRINCIPLES
2.2.6 Lemma. IfA,BE§, then [A,B]* = [A*,B*] as vector fields
on P {see 1.2.2(a)).
Proof. Let (pt: P-*P be given by (p,(p)=pexp(tA). Then <pt is the
one-parameter group of diffeomorphisms generated by A*. Evaluat-
Evaluating all derivatives at zero, we have
dt ds
d d i л\ ( л\ ( л^~] d d
= — -rpexp{s&b tAB)
2.2.7 Lemma. If A Eg and X is a vector field on M, then [A*, X]-0
where X is the horizontal lift of X (see 2.2.5).
Proof. Defining yt as in the proof of 2.2.6, we have q>t~\X) = X by
2.2.5. Thus,
Proof of Theorem 2.2.4. Note that ^[w, w]G, Z) = \{[a{Y\ w(Z)]
-[w(Z), ш(У)]} = [ш(У), w(Z)]. Thus, we need to prove (for all
У, Z E Гр P) the equ ation
By linearity, we need only consider the following three cases.
Case 1 (У and Z horizontal). Equation (*) follows, since w(Y) =
u(Z) = 0, and УН = У and ZH=Z.
Case 2 (У and Z vertical). We may suppose that Y=A* and Z = B*
for Л,ЯЕ§. Then du(Y,Z)=A*[u(B*)]-B*[u(A*)]-
w([^*,5*]) = (since w(B*)=B = constant, etc.)= -w([A*,
В*\) = (Ъу2.2.6)-а([А,В]*)=-[А,В]=-[а(А*),а(В*)]
— — [u>(Y), w(Z)]. Hence, both sides of equation (*) vanish.
2 CURVATURE AND S-VALUED DIFFERENTIAL FORMS 39
Case 3 (Y vertical and Z horizontal). We may assume that Zp=X
where X is the horizontal lift of some vector field X on M,
and Y=A* for some A Eg. Now da{Y,Z)=A*[u(X)]-
Х[ы(А*)]-ы([А*,Х]) = 0, since ы(Х) = 0, и>(А*)=А =
constant, and 2.2.7 holds. Both sides of equation (*) then
vanish. ■
2.2.8 Theorem (Bianchi Identity or Homogeneous Field
Equation). If u> is a connection l-form on P with curvature Я", then
2ГЯ" =0. In fact, we have dQa=[Qa, w].
Proof. Observe that since w vanishes on horizontal vectors, D"u"
= 0 follows from dQa=[Qa,u]. Now dti" =d(du> + ^[w, «])
— d2w + \[du, w] — ^[w, dw] = (since u?2w = 0 and [w, с?ш]= — [die, w]
by 2.1.3 (l)) = [fi?u, u] = (since [[w, w], ш] = 0 by 2.1.3
2.2.9 Theorem. For allgEG, R*tt"=&bg-&u.
Proof. From the definition of [ , ] on ^-valued forms, it is evident
that [ , ] is preserved under pull-back (i.e., F*[y,\p] = [F*(p, F*\p]).
Thus, /?*fl"=/?*(Jw+i[w, u\) = d(R*gu+b[R*a, R*gu]) = d&bg-,tc
2.2.10 Local Expressions. Recall from the proof of 1.2.5 г/гаг ?/г<?
gauge potential icu of 1.2.3 и related to the connection l-form w o/1.2.2
fry ши =а*ш ЕЛ1 ([/,§). The field strength associated to wu w Яи =а*Я".
Of course, it is desirable to have a direct way of computing Яц in terms
2.2.11 Theorem. In terms of wu,Slu=dwu + ±[wu,wu].
Proof. We have Яи = а*(Я") = a*(du + i[«, ы]) = rf(o*«)
i
Since the groups used in physics are almost always groups of
matrices, it is convenient to have the following characterization
of [ , ].
40 GAUGE THEORY AND VARIATIONAL PRINCIPLES
2.2.12 Theorem. If N is a manifold and G is a matrix Lie group
(with matrix Lie algebra §), then for yEA'(N, <3) andipEhj(N, Q) we
have [<p, ф] = (рЛ-ф—(— 1)'Л//Л<р. Here <p and \p are regarded as
matrices of U-valued forms, and <рЛ^ is matrix multiplication where
the entries are multiplied via wedge.
Proof. Since [A,B]=AB-BA for A, В Eg, we have
1
X(p{xo0+V),...,xo0+J))
2.2.13 Corollary. // G is a matrix group, then
and пи = dcou + "„ Л сои.
Proof. Since coEA\P,§), we have JK(o] =
) Ao, etc. ■
The transformation rule for the local field strengths under a
change of gauge is relatively simple compared with the corresponding
rule (in 1.2.3) for the potentials.
2.2.14 Theorem. Let Tu and Tv be two LTs with transition function
guv: UHV^G. Then, on UnV,uv=&bg-v,Uu. In the case of a
matrix group, this becomes ttv =g~vl &uguv-
Proof. If ou and av are the local sections associated with Tu and Tv,
then (from the proof of 1.2.5) we have, for YETXM,
2 CURVATURE AND g-VALUED DIFFERENTIAL FORMS 41
y the definition of fi" in 2.2.3, we have that S2"(Z, Ж) = 0 if either
or И^ is vertical. Thus,
2.2.15 Theorem. Relative to an LT Tu: v~\U)^UXG, we have
the local form of the homogeneous field equation (or Bianchi identity)
</й„=[йц, wj. For a matrix group, we have duu = uu/\oou — o/u
Proof. This is evident from 2.2.8, because [ , ] is preserved under
pull-back. Also, [Йц,соц] = Й„ЛЫц-(-1J1ЫцЛЙц by 2.2.12. ■
2.2.16 Remark. At this point, one difference between Abelian and
non-Abelian gauge theory is manifest. For an Abelian group G, such as
U(\) in the theory of electromagnetism, we see that uv = gu~'^ugut) =fiM,
whence the local field strength is independent of the LT. Moreover,
since [fiM, wM] = 0, the homogeneous field equation is duu = 0 (or d2cou
= 0). However, for G non-Abelian, such as G = SUB) in classical
Yang-Mills theory, we have ^ =gj^'flugut) ^=ЯЦ in general. Instead,
we are moved to consider the field strength as being the well-defined
curvature form Й" that lives on P. Also, note that Й" =dco + j[co, со] is
no longer a linear function of со.
2.2.17 Remark. Let A be an arbitrary left-invariant vector field on a
Lie_group G (see 0.3.2). Define the %-valued one-form (p on G by
<p(A)=A =Ae ElTjG. Using the same computation as in Case 2 of the
proof of 2.2.4, we can prove that dy+ ^[у, <р] = 0. This is called the
Maurer-Cartan equation. The field strength fi" = dco+ j[co, со] mea-
measures the extent to which со fails to satisfy the Maurer-Cartan equation.
The assumption is sometimes made that a gauge potential be Maurer-
Cartan-like at infinity of U4. This just means that the field strength
tends to zero in some prescribed sense as we approach infinity.
CHAPTER
Particle Fields, Lagrangians,
and Gauge Invariance
A particle field can be regarded as a section of a vector bundle
associated to some PFB, or equivalently, as a vector-valued function
on P with certain transformation properties. Examples include the
Schrodinger wave function, the Klein-Gordon field, and the Dirac
electron field. Typically, a real-valued function (called the Action
density) on the base is assigned to each particle field according to
some fixed formula. The particle field obeys a differential equation
(Lagrange's equation) obtained by setting the first variation of the
integral of the Action density equal to zero. In other words, particle
fields obey the principle of least (or stationary) Action. Physicists use
the terms "Action density" and "Lagrangian" interchangeably for
the most part. However, we define (in 3.3.2) the Lagrangian as a
function (with certain invariance properties) on the finite-dimensional
David D. Bleecker, Gauge Theory and Variational Principles ISBN 0-201-10096-7
Copyright © 1981 by Addison-Wesley Publishing Company, Inc., Advanced Book
Program/World Science Division. All rights reserved. No part of this publication
may be reproduced, stored in a retrieval system, or transmitted, in any form or by
any means, electronic, mechanical, photocopying, recording, or otherwise, without
the prior permission of the publisher.
42
3 PARTICLE FIELDS, LAGRANGIANS, GAUGE INVARIANCE 43
space of 1-jets of vector-valued functions on P. The Action density
of a particle field is then defined by evaluating the Lagrangian on the
particle field and its differential (i.e., on its 1-jet).
Gauge transformations are defined in 3.2.1 as base-preserving
automorphisms of the PFB. Locally, they amount to a variable (i.e.,
space-time-dependent) change of "internal" reference frame or gauge.
There is little hope that the Action density is physically meaningful
unless it is invariant under gauge transformations of the particle
field. However, we will find (see 3.3.5) that this invariance is not
possible without introducing gauge potentials into the Action density
by replacing the ordinary differential of the particle field by its
covariant derivative. Physicists call this the principle of minimal
coupling (or minimal replacement). The gauge potentials (particu-
(particularly after quantization) can be interpreted as various forms of
radiation (photons, pions, gluons, intermediate vector bosons, etc.).
Thus, we can say that there can be no gauge-invariant Action
densities without "light."
3.1 PARTICLE FIELDS
Let it: P^M be a PFB with group G. Suppose that G acts on
some manifold F to the left. That is, for each gEG, there is a map
and the map GXF^F, given by (g,f)^ g-f, is C00. If F is a vector
space V and Lg: V^> V is linear, then the homomorphism G^GL( V)
given by g\-> Lg is called a representation of G. Two representations
G-*GL(V) and G^GL(V'), say g>->Lg and gh-> L'g, respectively, are
called equivalent if there is a linear isomorphism T: V-> V such that
L'=ToLoTl for all gin G.
5 5
3.1.1 Definition. // G acts on F as just stated, then we define
C(P, F) to be the space of all maps r. P^F such that T(pg)=g~l-
т(р). Readers who are familiar with associated fiber bundles will note
that C(P,F) is naturally isomorphic to the space of sections of the
associated bundle PXGF^>M with fiber F. Much of what follows can
be rephrased in terms of associated bundles, but we will not bother to do
so, since this point of view is foreign to most physicists and is also
notationally more difficult.
In the case where the action of G defines a representation
G^GL(V), the elements of C(P,V) are called particle fields.
44 GAUGE THEORY AND VARIATIONAL PRINCIPLES
3.1.2 Definition. Let Ak(P,V) be the space of V-valued differential
k-forms <p on P such that (for a given representation G-*GL(V))
(a) For Xx,...,XkETpP we have
<p(RgJfX],...,RgJfXk)=g-]-<p(Xu...,Xk)
(b) If one ofXl,...,Xk is vertical, then <p(Xv..., Xk) = 0.
Note that in the special case where V=§ and G-*GL(§) is the
adjoint representation gi-> &bg, we have (given a connection со on P)
Q" EA2(P,<3). Also observe that C(P,V) is the same as A°(P,V).
3.1.3 Definition. For a connection со on P we define Du: Ak(P,V)
-*~Ak + \P,V) by D"<p=(d<p)" (see 2.2.1). Note that R*Duq> =
R*g(dy)H = (R*gd<p)H_=(dR*<p)H = (dg~l-<p)H = g-l-(d<p)" = g~x-
Du(p, and so Du(pE Ak + l(P,V).
3.1.4 Definition. Let §-*§t(V) be the Lie algebra homomorphism
induced by the representation (/-> GL(V) (i.e., for AE% and vE V, let
A -v = — (exptA)-v
at
r=0
// <pEAk(P,V) and pEAJ~(P,<3), then we can define рЛсрЕ
AJ + k(P,V) by the formula
where a ranges over permutations of {1,..., j+k].
3.1.5 Theorem. For rEAk(P,V), we have D'V^
Proof. For vectors Xu..., Xk+, E TpP, we must verify that
PARTICLE FIELDS, LAGRANGIANS, GAUGE INVARIANCE 45
If X,,..., %k+i are aU horizontal, then X" = X,- and w(X,.) = 0, and so
both sides are the same. Suppose that two or more of X,,..., Xk+X
are vertical. Then, since т vanishes on vertical vectors and X,H =0 for
some i, equation (*) becomes 0 = u?t(X,,..., Хк+х). If we extend the
two or more vertical vectors to fundamental fields on P and extend
the other vectors arbitrarily, we obtain (see 0.2.10)
k+\
c/t(X],..., Xk+X) — 2j \~
+ У, (-\Y+J
The first sum vanishes, since at least one of Xx,..., Xl,...,Xk+x is
vertical, and (noting that if Xt and Xj are fundamental fields, then
[X,-, Xj] is also, by 2.2.6) the second sum vanishes for the same
reason. The remaining case is that for which all but one of Xx,..., Xk+x
are horizontal and the remaining vector, say Xx, is vertical. Extend Xx
to a fundamental field and X2,..., Xk+X to horizontal R^-invariant
fields by performing horizontal lifts on extensions of чт^Х2,...,
•п*Хк+х (see 2.2.5). From 2.2.7 we have 0 = [Xx, X2] = [XX, X3]
= ■ ■ ■ = [Xx, Xk+X]. It follows that dT(Xx,...,Xk+x) =
X\[t(X2,. .., Xk+X)].' Equation (*) therefore reduces to 0 =
Xx[T(X2,...,Xk+x)] + o,(Xx)-T(X2,...,Xk+x).LelXx=A*,AE§,Sind
write gt — exp tA. Then
ХХр[т(Х2,...,Хк + х)}
= -А-т{Х2,...,Хк + 1)
= -оз{Хх)-т{Х2,...,Хк +
Thus, equation (*) holds in this remaining case.
3.1.6 Corollary. Relative to the adjoint representation G -> GL(§)
j
given by g^&b wehave(forTEAk(P,<3)) D"t - di +[w, т].
46 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Proof. Note that the Lie algebra homomorphism §-»§(!(§) induced
by the adjoint representation is given by A B = [A, B]. Thus, со/\т =
3.1.7 Remark. This corollary does not apply to со, since со & Л'( P, %).
It does apply to ЙШ6Л2(Р, §). Thus, D°1u01 = du01 + [со,п"], which
vanishes by 2.2.8.
3.2 GAUGE TRANSFORMATIONS
3.2.1 Definition. An automorphism of a PFB it: P^M is a diffeo-
morphism f: P^P such that f(pg)=f(p)g for all gEG, pEP. Note
that f induces a well-defined diffeomorphism /: M^>M given by
/G7-( p )) = 77-( /(/>))■ A SauSe transformation of a PFB is an automor-
automorphism f: P^P such that f=\M (i.e., тт(p) = тт(/(p))). We set GA(P)
= the group of gauge transformations.
3.2.2 Theorem. Let C(P,G) be the space defined in 3.1.1 where G
acts on itself via g-g' = gg'g~x {i-e., the adjoint action). There is a
natural (anti-) isomorphism C(P,G) = GA(P).
Proof. If tEC(P,G), then define /: P^P by f(p)=pT(p). Since
f(Pg)=PgT(Pg)=Pgg~lT(P)g=PT(p)g=f(p)g, it follows that/E
GA(P). Conversely, if fEGA(P), define т: P^G by the relation
f(P)=PT(p). Note that pgT(pg)=f(pg)=f(p)g=pT(p)g, whence
T(pg)=g~lT(p)g, and it follows that t6C(?,G). Finally, if/,/'E
GA(P) with f(p)=pT(p) and f'(p)=pr'(p), then (/»/')(/?) =
3.2.3 Caution, /и general, the maps Rg: P^P (p^pg) are not
gauge transformations because Rg(pg') = pg'g, while Rg(p)g'= pgg'.
Thus, unless G is Abelian (or g is in the center of G), we do not have
Rg(pg')=Rg(p)g'forallg'EG.
3.2.4 Misleading Misnomers. If -л: MXG^M is a product bun-
bundle, then the maps Lg. MXG^MXG given by Lg(x, g') = (x, gg') are
in GA(MX G). Physicists call such transformations global since g does
not depend on x. Local gauge transformations are those of the form
3 PARTICLE FIELDS, LAGRANGIANS, GAUGE INVARIANCE 47
(x, g')i-+(x' Hx)g') where h: M^G is not necessarily constant. We
emphasize that such gauge transformations make sense only for product
bundles (or for local trivializations), because in general there is no
natural way to define an action by G on P that commutes with the given
right action. Moreover, a gauge transformation that is global relative to
one LT may be local relative to another. We choose not to use these
terms, although they abound in the physics literature.
3.2.5 Theorem. If fEGA(P) and w is a connection \-form, then
f*co is a connection \-form.
Proof. Let A E§ and let A* be the corresponding fundamental field
on P. We have 1.2.2(a) for/*w, because
Since Rgof=foRg, we have Rgf*co = (foRg)*co = (Rgof)*oo =
f*R*a=f*&bg-ia = &bg-i f*a, as required by 1.2.2(b). ■
3.2.6 Theorem. Given a representation G-*GL(V) andfEGA(P),
the pull-back f* yields isomorphism /*: Ak(P,V)^ Ak(P, V), k =
0,1,2,....
Proof. For тЕАк(Р, V), we have Rgf*T=f*R*r=f*g~KT=g~l-
f*r. If A* is a fundamental field on P, then from the proof of 3.2.5
we have fjl*=A*. Thus-, (/*т)(Л*) = т(/„[4*) = т(Л*) = 0, and so
f*TEAk(P,V). Ш
3.2.7 The Space of Connections. _Let G be the space of all
connection \-forms on P. Note that G¥=A\P, §), but these spaces are
closely related.
3.2.8 Theorem. For a given coEG, the map Л'(Р, @)->в, given by
тн> t + 03, is one-to-one and onto.
48 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Proof. li
while Rg(T + a) = R*gT + R*ga = &bg-iT + &bg-4a = &bg-i(T + a),
whence т + ибб. Conversely, ш'бб implies that w' — wEA\P,@).
The theorem then follows. ■
Note that for tEA\P, §) the curve y'- U^G, given_by y(t) =
It, through со has у'@) = т. In this way, we can regard Л'(Р, Q) as the
"tangent space" TjG to the "manifold" G at w.
3.2.9 Definition. The gauge algebra of a PFB with group G is the
space C(P,§), where the representation G^>GL(§) is the adjoint
representation g\-+&bg. The theorems to follow indicate the sense in
which C(P,§) can be considered to be the Lie algebra of C(P,G) =
GA(P).
3.2.10 Theorem. IfH,H'EC(P,§), then the map [H,H'\. P^G,
defined by [H,H'](p) = [H(p),H'(p)], is also in C(P,<3). Conse-
Consequently, C(P, §) has a Lie algebra structure.
Proof. Note that [H, H'](pg) = [H(pg), H'(pg)] = [&bg-,H(p),
&bg-< H'(p)] = &br<[H(p), H'(p)] = &bg-,[H, H'lp). Thus,
C(P,§) inherits a Lie algebra structure from %. ■
3.2.11 Theorem. There is a map Exp: C(P,§)^C(P,G) defined
by Exp(H)(p) = exp(H(p)) for HEC(P,§) such that t^Exp(tH) is
a one-parameter subgroup of C(P,G) with
dt
Moreover, if H, H' E C( P, §), then
s,t = L
Proof. Note that Exp(H)(pg) = exp(H(pg)) = exp(&bg-<H(p)) =
Ad g~>exp(H( p)) = Adg<Exp(H)( p). The other statements are also
easy to verify. ■
3 PARTICLE FIELDS, LAGRANGIANS, GAUGE INVARIANCE 49
3.2.12 Definition. We define exp: C(P, %)^GA(P) by exp(H)(p)
=:pexp(H(p)). Note that this is Exp of 3.2.11. followed by the
isomorphism of 3.2.2.
Theorems 3.2.5 and 3.2.6 tell us that GA(P) acts on the spaces &
and Ak(P,V), respectively. Now that we have a "Lie algebra"
C(P,§) for GA(P), we consider the effects on these spaces of
infinitesimal motions: For this, we first reformulate the action GA(P)
in terms of C(P,G).
3.2.13 Lemma. Let fEGA(P) and tEC(P,G) be related by f(p)
= pr(p). For XETpP, we have Д(X) = (^
Proof. Let у'- i^Pbea curve with y'@) = X. At t = 0,
3.2.14 Theorem. For wE6, feGA(P), and tEC(P,G) (with
f(p)=pT(p)),we have (^1
Proof. Simply apply w to each side of the equation of 3.2.13. ■
3.2.15 Theorem. For <pEAk(P,V),f(EGA(P), and tEC(P,G)
with Ар)=рт(р)), we have f*<p = T~l-<p.
Proof. Use the equation in 3.2.13, and recall that <p vanishes on
vertical vectors. ■
We are now ready for the infinitesimal versions of the actions in
3.2.5 and 3.2.6.
50 GAUGE THEORY AND VARIATIONAL PRINCIPLES
3.2.16 Theorem. Let соЕв andHEC(P, §). Then,
— (exp/tf)*w
r=o
= dH+[a,H]=DuHEAl(P,§)
dt
where exp tH EGA(P) was defined in 3.2.12.
Proof. Replacing/in 3.2.14 by exptH and applying
-0
dt{)
r=o
to the equation in 3.2.14, we get
3.2.17 Theorem. Let <pEAk(P,V) andHEC(P,§). Then,
— (exp?tf)*<p
r=o
Proof. Use 3.2.15 with /= exp tH and differentiate with respect to t.
3.3 LAGRANGIANS AND GAUGE INVARIANCE
Let 77-. P^M be a PFB with group G and let G^GL(V) be a
representation.
3.3.1 Definition. The space of 1-jets of maps from P to V is
J(P,V) = {(p,v,O)\pEP, vEV, and 0: TpP^V is linear). You can
verify that J(P,V) can be made into a manifold in a natural way.
3.3.2 Definition. A Lagrangian is a map L: J(P,V)^U such that
for all (p,v,0)EJ(P,V) and gEG, we have L(pg, g~Kv, g~x-
e°Rg-^) = L(p,v,6). As a result of this requirement, we have
3 PARTICLE FIELDS, LAGRANGIANS, GAUGE INVARIANCE 51
3.3.3 Theorem. Given a Lagrangian L: J(P,V)^>U, there is a
well-defined function £0: C(P,V)^>C°°(M) given (for xEM,->$/E
C(P,V), andpEP with чт(р) = х) by to(xp)(x) = L(p,^(p), d^p).
Proof. We must show that L(p,^(p),d^p) is independent of the
choice oipEir~l(x). Since ^ojR?=g~1-^5 we have d^pgoRgif=g^x
•d4>r, or d4/n=g~x-diioR, Thus,
3.3.4 Definition. A Lagrangian L:J(P,V)^>U is called G-invariant
if L(p, g-v, g-e) = L(p,v,e). Nearly all Lagrangians that arise in
practice have this kind of invariance.
Recall that the gauge group GA(P) acts on C(P,F) = A°(P,F)
via pull-back (i.e., (f*^)(P) = ^if(P)) for/E(J^(P),^EC(P,F),
pEP.)
3.3.5 Theorem. Let L: J(P,V)^R be a G-invariant Lagrangian.
Then it is not necessarily the case that £0(^) = £0(/~'*^) (i.e., £0 is
not necessarily gauge invariant).
Proof. Let т E C( P, G) be related to /E GA (P) by /(p) =p т( р). By
3.2.15, we havef*\p=T~l-\p. We compute d(T~l-\l>)p as follows: Let
XETpP and y. R^P be such that y'@) = X. Then (at / = 0),
Thus,
52 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Now
Z0(f*+)(x)=L(p,{f*+){p),d(r+)p)
but because of the second term in the third slot, we cannot use the
(j-invariance of L to obtain L(p,\p(p), d\p ) or £0(^)(x). ■
In order to remedy the gauge noninvariance of £0, the physicists
sought an object that (when incorporated into £0 as a new variable)
produces a term (under a gauge transformation) that cancels with the
troublesome term /^(^„.(т)^-^/?)-^/?) in the proof of 3.3.5.
In this way, the concept of a connection was forced upon them. The
next theorem shows how nicely connections solve the gauge invari-
ance problem.
3.3.6 Theorem. Let L: J(P,V)^R be a G-invariant Lagrangian,
and let G be the space of connections on P. Define a function £:
С(Р,У)Хе^Сж(М) by e,(rf,,a)(x) = L(p,+(P),Duil>P) for xEM,
рЕтг~\х), \pEC(P,V), and шбб. Then t is not only well defined,
but also gauge invariant, in the sense that for fEGA(P), £(/*»/>, /*")
Proof. From 3.1.3, we know that Dw\pE A\P,V). Thus,
Then,
f 3 PARTICLE FIELDS, LAGRANGIANS, GAUGE INVARIANCE 53
and so £ is well defined. Now
3.3.7 Remark. By writing out d(f*\p)p+(f*<^)p-(f*^)(p) in terms
of т (using 3.2.14 and 3.2.15), you can directly see the cancellation of
the troublesome term in the proof of 3.3.5.
3.3.8 Definition. If L: J(P,V)^Uisa Lagrangian and t: C(P, V)
Xe^C°°(M) is defined as in 3.3.6, then £(^,co)EC°°(M) is called
the Action density of the pair (»//, со). The proof of 3.3.6 shows that L
need not be G-invariant in order that £(«/>, со) be well defined.
CHAPTER
Lagrange's Equation for Particle Fields
In this chapter, we formulate the principle of least (or stationary)
Action for particle fields under the influence of a gauge potential.
The particle field is then shown to obey this principle if and only if it
satisfies Lagrange's equation. Modulo some refinements in Chapter 7
for particles with spin, our approach is general enough to enable us
to obtain (as special cases of Lagrange's equation) the field equations
for quarks and their hadronic aggregates, as well as those for leptons.
The physicist may wonder why the derivation of Lagrange's
equation is more involved than usual. Often in physics books,
equations are established for free fields (without gauge potentials),
and then the ordinary derivatives are replaced by covariant deriva-
derivatives, according to the principle of minimal replacement, in order to
get the nonfree field equations. While this recipe always seems to
work, here we include the gauge potential throughout the derivation
David D. Bleecker, Gauge Theory and Variational Principles ISBN 0-201-10096-7
Copyright © 1981 by Addison-Wesley Publishing Company, Inc., Advanced Book
Program/World Science Division. All rights reserved. No part of this publication
may be reproduced, stored in a retrieval system, or transmitted, in any form or by
any means, electronic, mechanical, photocopying, recording, or otherwise, without
the prior permission of the publisher.
54
4 LAGRANGE'S EQUATION FOR PARTICLE FIELDS 55
of Lagrange's equation, just to make sure. Moreover, the base space
is not assumed to be flat Minkowski space. Thus, the integration-by-
parts step should be replaced by a coordinate-free version such as in
4.2.9. This requires some additional machinery. In spite of the fact
that gravity is so weak compared to other forces, its inclusion is
necessary in a complete theory, and it may play a crucial role in
some way even for small-scale phenomena. Another reason for the
length of the derivation is that our approach places Lagrange's
equation on the total space, instead of the base. In physics literature,
field equations live on space-time, and they explicitly involve local
gauge potentials. Thus, in general, the equations are not manifestly
gauge invariant, even though the equations are satisfied for all
choices of gauge, if they are satisfied for one choice. Here, we take
Lagrange's equation to be on the total space, where it is manifestly
gauge invariant and has a more natural appearance. Example 4.3.7
bears this out for charged scalar fields.
Finally, you should be aware that physicists sometimes add
non-gauge-invariant terms to the Action density. Since these terms
are always chosen to be divergences (i.e., codifferentials of 1-forms),
the new Action density still leads to the same field equation. In this
sense, the practice is harmless, and it is often convenient for the
purpose of applying canonical quantization in quantum field theory
and for handling certain boundary conditions in classical settings as
welL
4.1 THE PRINCIPLE OF LEAST ACTION
Let ir: P^M be a PFB with group G, and let G^GL(V) be a
representation. Let h be a metric tensor defined on M. For simplicity
we assume that M is oriented, so that there is a well-defined volume
form fi on M associated with h. Suppose that L: J(P,V)^R is a
Lagrangian, and w is a fixed connection on P. Recall that we defined
a function £: C{P,V)Xe^Cx(M) in 3.3.6. We write £"
C°°(M), where ta{^) = t^,a). Ideally, the Action of ^()
would be fM£"(ip)ii. However, there is no guarantee that this in-
integral exists, since M might be noncompact.
4.1.1 Definition. We use the notation UCCM to mean that U is
open with compact closure. For U(Z CM, we define the Action of
U to Ье£ /
56 GAUGE THEORY AND VARIATIONAL PRINCIPLES
4.1.2 Definition. For f£C(P,F), we define the projected support
of -^ to be the closure of the set {ir(
4.1.3 Definition. We say that feC(P,F) is stationary relative to
t01 if for all UCCM and also oEC(P,V) with projected support
contained in U, we have
Equivalently, we say that ty obeys the principle of least Action.
The remainder of this chapter is devoted to showing that
C(P,V) is stationary iff \p satisfies a certain differential equation
(Lagrange's equation). This amounts to mimicking the usual ap-
approach to calculus of variation problems, but the general setting here
forces some additional notions on us.
4.2 SOME MACHINERY
We continue to use the notation of Section 4.1. The metric hx on
TXM induces a metric h on the horizontal subspace H CTP
(pE-n~\x)) via the isomorphism m^. Hp^TxM (i.e., hp(X,Y) =
h^ir^X^^Y) for X, YETpP). In the same way a volume element p.
is induced on Hp from that on TXM, and thus we may define a star
operator *p: Ak(Hp)^A"~k(Hp) (« = dim M) such that *р(тт*т) =
■7т*(*хт) where it*: Ak(TxM)^Ak(Hp) is pull-back induced by тг,:
4.2.1 Definition. We define *: Hk(P,V)^A"~k(P,V) by setting
(for (pEAk(P,V)) (*(p)p equal to the unique extension of*p(<p\Hp) to a
V-valued (n~k)-form (on T P) vanishing on vertical vectors. In other
words, *<p is the unique form in A"~k(P,V) such that (*(p)\Hp=*p
4.2.2 Theorem. // aEAk(P,V) and a: U^P is a local section,
then o*(*a) — *o*(a).
Proof. If XEHp, then (о^Х)н =X since_тт„((о^*Х)н) =
^*(°*'n*x) = 'IT*(X). It follows that for any /3EA'(P,F), we have
(тт*а*Р)\Нр =Р\Нр. It suffices to prove that ir*(a*(*a)) agrees with
4 LAGRANGE'S EQUATION FOR PARTICLE FIELDS 57
(«)) on H Now ir*(a*(*a))\Hp =(*a)\Hp =%(a\Hp), while
({**\H) (\H) ■
4.2.3 Assumption. We will need to assume that the vector space V
has a metric h (not necessarily positive definite, but at least nondegen-
erate) such that the representation G -> GL(V) is orthogonal relative to
ft (i.e., h(g-v, g-w) = h(v,w) for all v, wEV). This assumption is
satisfied in all the physical examples we will consider. Moreover, h
always exists if G is compact, for we may set h(v,w)= jGh0(g~l-
v, g~]-w)dg, where h0 is an arbitrary positive definite metric on V and
dg is a volume element on G equal to the wedge of all components of the
Maurer-Cartan form B.2.17) for some basis of %.
4.2.4 Definition. Since we have a metric hp on Hp and a metric h on
V, we can define a metric (hph) on the space of V-valued k-forms
on Hp as in 0.1.5. For a, /3E~Ak(P,V), we define (hh)p(.ap,fSp) =
(hph)(ap\Hp,Cp\Hp).
4.2.5 Theorem. There is a well-defined function
(hh):Ak(P,V)XAk(P,V)^Cco(M)
given by(hh)(a,P) (x) = (hh)p(ap, flp) where тт(р) = хEM.
Proof. Note that apg=g~x-apoRg-ljf, and so on^Thus, (hh)pg(apg,
^) = (^Wg~4°Vv g-l-PpoRg~u) = (hh)p(ap,Pp), since
Rg^: Hpg^Hp is an isometry, and G^GL(V) is orthogonal rela-
relative to h. Ш
4.2.6 Note. Even more obviously, there is a function (hh):Ak(M, V)
k
4.2.7 Theorem. Let a, fi<E~Ak(P,V) and let a: U^P be a local
section. Then we have (hh)(a*a,a*P) = (hh)(a,/3).
Proof. Recall from the proof of 4.2.2 that {тг*о*а)\Нр =сс\Нр, and
also 77^: Hp -> TXM is an isometry relative to h and h. Thus, we have
(*^ h/**\H^
58 GAUGE THEORY AND VARIATIONAL PRINCIPLES
4.2.8 Definition. The covariant codifferential 5": Ak(P,V)-*
Ak~\P,V) is defined, for <p E Ak(P,V), by 8"(<p) =
-(- 1)A(- \yik+l)*D"(*<p) where (-1)A is the sign of the determi-
determinant of(h(dI,dj)), andn = dim M. Observe that when M is a space-time,
(—1)A = — 1 andn = 4; so then 8" = *DU*. In general, the factor ±1 is
necessary for the next theorem.
4.2.9 Theorem. Let UC CM {i.e., U is an open subset with com-
compact closure), and suppose that aEAk(P,V), while flEAk+\P,V).
Assume that the projected support of a is contained in U. Then
Jи Jи
Proof. We first assume that there is a local section a: U^P. Now
{hh){D"a,fi)n= [ {hh)(o*{Dua),o*P)n,
J и
by 4.2.7. From 3.1.5 and the preservation of d and Л under pull-back,
/ o*co)Ao*(a). Thus,
{hh){D»a,P)n= [
и J и
+ J (hh)(o*a>Ao*a,o*P)p.
By 0.2.24, the first term is
(hh)(o*a,8(o*C))ii, 8=-(-l)h{-\)k"*d*.
We need to do some algebraic work on the second term. Let
vx,...,vm be a basis for V and let eu...,efbe a basis for %. Using the
summation convention, we write a*co = co"elt, a*a = aava, o*/3 = /3hvh,
where со", аа, fih are real-valued forms on UCM. Then we have
(hh)(o*coAo*a, o*fi) = (hh)((uv f\aa)(ev-va), jihvh) = h{^ Л a",
jih)h{ev-va,vh). At /=0, we have
d
4 LAGRANGE'S EQUATION FOR PARTICLE FIELDS 59
Also, h(uv Aaa,y8fo)/x = (w" Лаа)Л*(Зь =(- 1) V Л (и" A*flh) =
(-_1)к(-1)л(-1)к<и-*>ао Л *(*(w" Л *flh)) = (-\)h(-\)"kh(aa,
*(w' Л *£V Then й( м' Л a", flh) = (-\)h(-\)nkh(aa, *( w" Л *£fo))
and h(ev-va,vh)=-h(va,ev-vh) together imply that
(hh)(o*co/\o*a,o*P)
= -{l)h(-l)nkh(a°,*(a>A*lil>))U{va,ev-vb)
= -(-\)h(-\yk(hh)(o*a,*{o*(a>)A*o*(C))).
We now have
X
f (
J и
= f {hh){o*a,o*{8"l3))ii= f' {hfi)(a,8uP)n.
J и J и
If there is no local section a: U^>P, then we may use the following
partition-of-unity argument (see Kobayaski and Nomizu [1963]). Let
К be the (compact) projected support of a. Let Ux,..., UN be an open
covering of К such that there are local sections a,: Ut -^P and Ц С U.
There are C00 functions/: M^R such that {xEM\ £(х)=^0}
has closure in U, and 2lNfl(x)=\ for xEK. Define ftaEAk(P,V),
where /,=чг*/„ by (fia)p=fi(p)ap, pEP. Since /a has projected
support in Ц-CU (and we have the local section а,: Ц^Р), our
previous result yields
f (hh){D»{f,a),C)n= f (h
Jи Jи
Noting that 2f /a = «, we then sum over i and obtain
60 GAUGE THEORY AND VARIATIONAL PRINCIPLES
4.3 LAGRANGE'S EQUATION
4.3.1 Definition. Let L: J(P,V)^R be a Lagrangian and let
A\P,V)p denote the spa
vertical vectors. For (i
A(P, V) by the equation
A\P,V) denote the space of linear maps T P->V that vanish on
vertical vectors. For (/>, t>,0) E/(P, F), define V 3Д/>, v, в) Е
For \f,EC(P,V), define a V-valued \-form dL/d(D>) on P by
4.3.2 Theorem. We have bL/b{D"^) ЕЛ'(Р, V).
Proof. Note that 9L/9(Da'^) vanishes on vertical vectors by defini-
definition^ We must show that R*dL/d(Duxp)=g~l-dL/d(DuxP). Let
PEA\P,V). Then
3L
R
3L
dt
\ >hp
I p )
Thus, g-dL/d(D"t)p oR =dL/d(D«t)p, etc
i 4 LAGRANGE'S EQUATION FOR PARTICLE FIELDS 61
4.3.3 Definition. For (p,v,0)EJ(P,V) we define v2L(P,v,0)E
У by the equation
d
T / Д \ \ j t
dt
If \pEC(P,V), we define a V-valued function дЬ/д\р on P by
4.3.4 Theorem. We have 3L/3^EC(P,F).
Proof. The idea here is the same as in the proof of Theorem 4.3.2,
and the execution (which is easier) is left to you. ■
4.3.5 Theorem. Suppose that Ud CM, and let tEC(P,V) have
projected support in U. Then at t = 0,
Proof. At t = 0,
£"(t+t)(())
Integrating both sides over U, applying 4.2.9, and noting that {hh) — h
on Л°(Р, V) = C(P, V), we obtain the result. ■
4.3.6 Theorem (Lagrange's Equation). The particle field ^e
C(P,V) is stationary (for a Lagrangian L: J{P,V)^>U and a fixed
connection со on P) iff the Lagrange equation holds: 5a>[3L/3(Da>^)}
+ 3L/3^O
Proof. Suppose that Lagrange's equation does not hold at some
pEP. Then, we can find a feC(P,V) such that /iEa'[3L/3(JDa'^)]
'"I, f)>0 at 7т(р). By continuity, this inequality persists in an
62 GAUGE THEORY AND VARIATIONAL PRINCIPLES
open set U (with compact closure) containing тт(р). Multiplying т by
a nonnegative C00 function/: P^U (constant on fibers) with pro-
projected support in U and with f(p)>0, we obtain r=ffEC(P,V).
Theorem 4.3.5 applies to т, and yet (by construction) the right-hand
side of the equation in 4.3.5 is nonzero. Thus,
and »// is not stationary. The converse is clear from 4.3.5. ■
4.3.7 Example (Spin-Zero Electrodynamics). Let тт: P-*M be a
PFB with group U(\) = {e'e\6EM} and fixed connection со: We sup-
suppose that M is a {general) space-time with Lorentz metric h. Let F=C
(regarded as a two-dimensional vector space over U), and suppose that
f/(l)-»GL(C) is the representation given by e'e-z = e'ez (complex
multiplication). The Lie algebra of U(l) is the space GlL(\) = {i6\6 EM}
of pure-imaginary numbers. Since
£«"•«1,-.=/*,
we see that %A) operates on С via complex multiplication. Let h be the
R-valued metric on С given by h(z, w) = \(zw + wz) (i.e., the ordinary
Euclidean inner product on R2sC). Recall D.2) that h induces
a metric hp on the horizontal subspaces Hp = {XETpP: cc(X) = 0}.
We_ define a Lagrangian L: J(P,V)^M by L(p,z,d) =
^(Щ^в^в")- \m2zz. //£■,,..., E4 is a_ basis of HP, в, = 0(E,),
and hij = hp(Et, Ej), then (hh)pFH, вн) = {h^Ofij + Ofr), which may
be more familiar to physicists. For any в' Е A\P, V) ,
whence V 3L(p, z, в) = вн. Similarly, V 2L(p, z,Q)— —m2z. Thus,
в в
{
and dL/d\p= —m2\p. Lagrange's equation is then 8"D"^ — m2^ = 0.
While this equation is perfectly elegant, physicists are used to thinking
of -^ as a function on M. This requires a local section a: U-+P (i.e., a
choice of gauge) to pull the equation back to UCM. We write \p' = \p°o:
4 LAGRANGE'S EQUATION FOR PARTICLE FIELDS 63
U-> P and — iA = а* со. Note that as со takes on pure - imaginary
values, A is a real-valued \-form on U. (A can be identified with the
- vector potential" up to a constant multiple depending on the charge.)
Using 4.2.2, we get
or
This is the equation for the "first-quantized " wave function of a spin-0
charged particle (e.g., а тт~ meson) of mass m under the influence of
an electromagnetic potential A (e.g., see Schiff [1968], p. 468, for the
special case where M is Minkowski space). Aside from the homoge-
homogeneous Maxwell equation d2A = 0, there is the inhqmogeneous Maxwell
equationS(-dA) =j' wherej' = h(d^', ixp')-A^'xP' = i(xP'd^'-dxp'xp')
—А-^/'-ф' is the current. This equation is derived from a principle of
least Action with respect to variations in со. Indeed, this is the subject of
the next chapter.
CHAPTER
The Inhomogeneous Field Equation
Given a particle field and a gauge potential to which it responds,
we will define a Lie algebra-valued 1-form on the total space. This
form is called the current. We give a number of equivalent char-
characterizations of the current. Conceptually, the best is 5.1.4, which
states that the current is the first variation of the Action with respect
to the gauge potential. The current can be pulled down to the base
by a choice of gauge, but the resulting local currents (like the local
field strengths) do not piece together to make a well-defined 1-form
on the base unless the group is Abelian. In the case of electromag-
netism, where we have the Abelian group U(l), the well-defined
pulled-down current is the usual 4-current occurring in Maxwell's
equations (see 0.2.22). Eventually we will consider the current of a
charged scalar field E.2.9), a charged Dirac electron field G.2), and a
David D. Bleecker, Gauge Theory and Variational Principles ISBN 0-201-10096-7
Copyright © 1981 by Addison-Wesley Publishing Company, Inc., Advanced Book
Program/World Science Division. All rights reserved. No part of this publication
may be reproduced, stored in a retrieval system, or transmitted, in any form or by
any means, electronic, mechanical, photocopying, recording, or otherwise, without
the prior permission of the publisher.
64
5 THE INHOMOGENEOUS FIELD EQUATION 65
oucleon field in the presence of a Yang-Mills potential G.3). Other
currents (e.g., various weak and hadronic currents) that are found in
the physics literature are also essentially special cases.
In the event that the Action density is gauge invariant, we prove
E.1.5) that the current is covariantly conserved whether or not the
inhomogeneous field equation (described later) holds. If the Action
density is not gauge invariant, we can use the inhomogeneous field
equation to obtain current conservation.
Gauge potentials are not particle fields, but in 5.2.1 we construct
from the gauge potential, a gauge-invariant function on the base.
This function is called the self-Action density of the gauge potential,
because it does not involve any particle fields. The total Action
density is the sum of the Action density and the self-Action density.
The particle field and gauge potential are stationary for the total
Action if and only if Lagrange's equation and the inhomogeneous
field equation hold (see 5.2.3). In the case of electromagnetism, the
inhomogeneous field equation is equivalent to the two Maxwell
equations that involve the charge density and current density (see
0.2.22 and 5.2.9). Returning to the general case, in the event that the
current is zero (e.g., consider a gauge potential free of particle fields),
then the inhomogeneous field equation is in fact homogeneous and is
then commonly called the Yang-Mills equation. If the group is
non-Abelian, the Yang-Mills equation is nonlinear in the gauge
potential, just like the homogeneous field equation (or Bianchi
identity of 2.2.8). However, unlike the Bianchi identity, which holds
for any gauge potential, the Yang-Mills equation holds only if the
gauge potential is stationary for the self-Action.
5.1 THE CURRENT
"We adopt_the notation used in Chapter 4." In order to define
the current/ЕЛ'(Р, g), we need a metric h on Vsuch that G-*GL(V)
is orthogonal, and we need a metric к on % such that &b: G-* GL(§)
is orthogonal. We then have a function
(hk):AJ'(P,$)XAJ(P,®)^C°°(M)
as a special case of 4.2.5.
66 GAUGE THEORY AND VARIATIONAL PRINCIPLES
5.1.1 Definition. For шЕв,^ EC(P,V), and pEP, the current Je
Л'(Р, §) is defined at p by the equation
= (hk)(jp,r),
required to hold for all тЕ_Л'(Р, §)p. Note that Jp exists and is unique
by the nondegeneracy of{hk)p. In 5.1.3, we show that JEA\P, §).
5.1.2 Theorem. Let eu...,ef be a basis for % and suppose that
(kafi) is the inverse of the matrix (kafj), kap=k(ea, ep). Then J(X)~
ка^ддХ
Proof. We calculate {hk)p{kalih\bL/b{Da^), ea -^)ep, туеу) =
kapkpyhp{h\bL/b{D^), еа'П Ti) = hp(h(dL/d(D»n ey ^), т*) =
(hh)p{dL/d(D"xP),T-xP) = (hk)p(Jp,Tyey), and the result follows. ■
5.1.3 Theorem. WehaveJEA\P,§).
Proof. For tEA\P,§), we have {hk)gg(&bg-,Jp°Rg-,.,Tpg) =
(hk)p(Jp, &bgWRg,) hkJ hhdd
p p_ p
Moreover, dL/d(Dy) E A\P, V) by 4.3.2. Thus, (hh)
d dd
ppp p pppg pgpgpg
and so &b-->JnoR ,. = J or R*J=&bp * J- Note that/ vanishes
on vertical vectors, by definition. ■
5.1.4 Theorem. Let £: C(P,V)X6-*CX(M) be as in 3.3.6. Let
Jw{4/) ЕЛ'(Р, §) be the current associated to the pair (\p, со). Then, for
all tETJ3=A\P, g), we have
Proof. For p E it ~ Xx), we have (at t=0)
) L{
= (hk)(J,r)(x).
I 5 THE INHOMOGENEOUS FIELD EQUATION 67
'k
5.1.5 Theorem (Conservation of Charge). Let L: J{P,V)-*R
be a G-invariant Lagrangian. For fixed со EG, suppose that \pEC(P, V)
is stationary relative to L in the sense of 4.1.3. Then the current JwD/)
obeys the "generalized continuity equation" 8"(J"(\p)) = O.
Proof. By 3.3.6, the G-invariance of L implies the gauge invariance
of t: С(Р,К)Хе^С°°(М). Thus, for any FEC(P,§) we have (in
the notation of 3.2.12) £((exp/F)*i//,(exp/F)*w) = £(i//, со) for all
t&U. Differentiating with respect to t and using 3.2.16 and 3.2.17,
we have (at / = 0)
(A) ftt^-tF^,a) + ft
Let UCM be an open set with compact closure, and assume that F
has projected support in U. Integrating (A) over U, we obtain (using
5.1.4)
(B) j-l°u^-
The first term vanishes, since ^ is assumed stationary. Applying 4.2.9
to the second term, we have
for all FEC(P, §) with projected support in U. It follows, as in the
proof of 4.3.6, that Sw(/w(i//)) = 0. ■
5.1.6 Theorem. // we do not assume that \p is stationary in 5.1.5,
then we still have kafih{8w[dL/d(Dw\l>)] + dL/d4>, ea -\p)ep =5W(^)
where ex,...,efis a basis for % and (kaP) is the inverse of (kap), ka/3 =
Proof. If \p is not necessarily stationary, then the first term of
equation (B) in the proof of 5.1.5 is
68 GAUGE THEORY AND VARIATIONAL PRINCIPLES
by 4.3.5. The integrand can be expressed in the form k(kaPh(Sw[dL/
d(Da\p)] + dL/d\p, ea-ip)efi, F), and the result follows by combining
the two transformed integrals in (B). ■
5.2 DERIVATION OF THE INHOMOGENEOUS FIELD EQUATION
In order to deduce the inhomogeneous Maxwell equation(s) from
an Action principle, it is necessary to add (to the Action density
£•(»//, w) of 3.3.8) the Action of the electromagnetic field itself.
Classically, this new term is i(\\E\\2-\\B||2) or - %g(F, F) where F
is the 2-form of 0.2.22 and g is the metric on 2-forms (see 0.1.4) on
Minkowski space. The next definition is the natural generalization.
5.2.1 Definition. Suppose that there is a metric к on % such that &b:
G->GL(g) is orthogonal. Define §: 6-* CX{M) by S(w) =
- i(A/c)(fiw, fiw). §>(w) is called the self-Action density of со. Then we
have (£+§>): C{P,V)Xe^C'x{M) where (£+§>)(,//, <o) = £(»//, w) +
§( w) is the combined Action density of the pair (\p, to).
5.2.2 Definition. We say that the pair (-ty, to) is stationary relative to
£+e> if, for_all open sets UCM with compact closure and crE
C(P,V), тЕЛ'(Р, §) with projected supports in U, we have
at t=0.
5.2.3 Theorem. The pair (\l>,co) is stationary relative to (£+§) iff
conditions (A) and (B) hold:
(B) 8"fl"=/"(^) (inhomogeneous field equation).
Proof. Let oEC(P,V) and тЕЛ'(Р, g) each have projected sup-
support in the open subset UCM. For flr = ua+tT = d(co + 1т) + ^[
tT, co +tr] we have
5 THE INHOMOGENEOUS FIELD EQUATION 69
at Ja
f
J и
Since a and т can be chosen independently, (A) and (B) follow if
(»/>, со) is stationary relative to (£•+§>). The converse is evident. ■
5.2.4 Remark. In 5.2.3 we do not assume that the Lagrangian L:
J{P,V)^R is G-invariant or that t: C{P,V)XQ^CX{M) is gauge
invariant. This assumption was crucial in 5.1.5. We devote the re-
remainder of this chapter to proving that 8"(/"(^)) = 0 is a consequence
of the field equation 8aua=Ja(\p). However, this does not give us an
alternate proof of 5.1.5, in which no assumption on со was made (in
contrast to 5.2.3).
5.2.5 Theorem. Let it: P-+M be a PFB with group G, and suppose
that G^GL(W) is a representation. IfTEA\P,W) and со Ев, then
D
Proof. We have £ы(_£ыт)
— соЛй?т + соЛй?т+соЛ(соЛт) = й?соЛт+соЛ(соЛт). By a simple
computation (using Definitions 2.1.1 and 3.1.4), we have со Л (со Лт) =
2 [со, со] Л т. The result then follows. ■
70 GAUGE THEORY AND VARIATIONAL PRINCIPLES
5.2.6 Theorem. For <p,p6AJ(P, §), we have [p,*<p] = — [<p,*p]. In
particular, [<p, *<p] = 0.
Proof. Write <p = (paea and p = ppep where ex,...,efis a basis for g.
Then [p, *<p] = p« Л *</К, ^] = /Г(ря, <р")/1[ев, ер] =
5.2.7 Theorem. We have 5"Eий") = 0.
Proo/. Note that
S"(S"ft") = ±*D"**D"Sft" = ±*D"(D"Sfi") = (by 5.2.5)
snw]) = 0 6y 5.2.6. ■
5.2.8 Corollary. //8ЫПЫ =/"(>//),/Леи 8ЫGЫ(>//)) = 0.
5.2.9 The Spin-Zero Case. Же can now justify the inhomogeneous
Maxwell equation stated in 4.3.7. In the notation of 4.3.7, we take
i = / — 1 /o fee a fray's vector of %A) аий? we choose the metric к on
%A) 50 that k(i,i) = \. According to 5.1.2, the current is J=
h{dL/d{Duip),i\f/)i. We found that дЬ/д(Па-ф) = Оа-ф in 4.3.7, and
so J = h(D"\p, i\p)i=h(d\p + com\p, i^)i. For a local section a, we have
o*J = h(d\l>'-iA\l>',i\l>')i = h\d4>',i\l>')i-Axl>'^'i. Applying_o* to the
equation 8"fl"=/, we obtain 8(-dA) = h\dip', м//)-Л»//»//=/ after
dividing by i.
CHAPTER
Free Dirac Electron Fields
The main goal of this chapter is to define properly the notion of a
free Dirac electron field over a curved space-time, and to exhibit
Dirac's equation for this particle field as a special case of Lagrange's
equation of Chapter 3. To do this in complete generality, starting
with the basics, requires the introduction of several concepts that are
of independent interest. These include the Lorentz group (and its
universal cover SXB,C)), the Levi-Cevita connection, spin struc-
structures, and the algebra of Dirac (or gamma) matrices (i.e., Clifford
algebra).
Physicists will probably be quite familiar with the results of
Section 6.1. However, they may find the bundle-theoretic determina-
determination of the topology of the Lorentz group and the related group
5LB,C) refreshing compared to the awkward discussions (if any)
David D. Bleecker, Gauge Theory and Variational Principles ISBN 0-201-10096-7
Copyright © 1981 by Addison-Wesley Publishing Company, Inc., Advanced Book
Program/World Science Division. All rights reserved. No part of this publication
may be reproduced, stored in a retrieval system, or transmitted, in any form or by
any means, electronic, mechanical, photocopying, recording, or otherwise, without
the prior permission of the publisher.
71
72 GAUGE THEORY AND VARIATIONAL PRINCIPLES
found in many physics books. Some facts about the irreducible
spinor representations of SLB,C) are provided (without proof) in
6.1.11 for your information. Also, we give an explicit explanation
why spinors with noninteger spin (such as the electron bispinor) are
changed to their negatives under a spatial rotation of 360°.
The Levi-Cevita connection (defined in Section 6.2) is a standard
connection on the PFB of orthonormal frames over a (pseudo)-
Riemannian manifold (e.g., curved space-time). This connection is
not usually introduced in physics books in conjunction with Dirac's
equation, since that equation is typically considered only on flat
Minkowski space. On a general space-time, the Levi-Cevita connec-
connection is used to replace coordinate differentiation by covariant differ-
differentiation, which has meaning independent of coordinates. The Levi-
Cevita connection also plays a crucial role in Chapters 8 and 9 (and
Section 10.1), which involve general relativity and its extensions.
A spin structure is essentially a PFB (with group SLB, C)) that is
a twofold cover of the oriented o. n. frame bundle, whose group is
the proper Lorentz group, over an oriented space-time. A spin
structure is needed because electron fields and other noninteger
spinor fields are particle fields associated to representations of
SXB,C) instead of the proper Lorentz group.
Physics students must eventually master the algebra of Dirac (or
gamma) matrices. Only the bare essentials are introduced in Section
6.3, since our primary use is only in the construction of the Lagrangian
for electron fields and in the expression of Dirac's equation. They
will also appear in the nucleon field constructions of Chapter 7.
Mathematicians may be surprised by the widespread use of Clifford
algebra in physics.
In Section 6.4 Lagrange's equation for electron fields is revealed
to be Dirac's equation in disguise. The unmasking requires some
work when carried out over a curved space-time. Essential use of
properties of the Levi-Cevita connection is made in the reduction.
Dirac's equation also appears more natural on the spin bundle than
on space-time itself because Christoffel symbols (or local connection
forms of the Levi-Cevita connection) arise when the equation is
pulled down to space-time by a choice of gauge (i.e., a choice of local
spinor frame field).
Finally, a brief account of the evolution of Dirac's equation, in its
simplest form, is given in 6.4.10. You are urged to consider this
account before immersing yourself in the details of this chapter.
6 FREE DIRAC ELECTRON FIELDS 73
6.1 COVERING THE LORENTZ GROUP
6.1.1 Definition. For x=(xo,...,x3) and y = (yo,...,y3) in R4, we
define (x, y) = xQyQ-ххУх-х-2у2-хъуу
The Lorentz group L is the group of all linear transformations B:
r4^R4 such that (Bx, By) = (x, y). Let tj be the 4X4 diagonal
matrix with entries 1, — 1, — 1, — 1 on the diagonal. Regarding x, yE
R4 as column matrices, we have хт<цу— (х, у); while taking BEL to
be a 4X4 matrix, we have (Вх)тт)Ву = хтт)у or хтВтт)Ву = хтт\у for
4. Thus, 5ELiff
6.1.2 Theorem. The Lorentz group L has four connected compo-
components. Writing В EL as (Я^.),0< i, j<3, they are
Ll = {BEL\detB=-l,B00>\),
Li+ = {BEL\detB=\,B00<-l),
Ll_ = {BEL\detB=-l,B00<-l}.
Proof. Let BEL. Then - 1 = det tj = det(BTr)B) = det(BT)
det(T))detE) = -(det BJ. Thus, det5 = ±l. Also (for e0 -
(l,O,O,O)),l = (eo,eo) = (Beo,Beo) = B^-B2o-B2o-Blo. Thus, B&
>\ (i.e., Доо>1 ог5ш<-1). It follows that L=L\uV_ULl+ULi_
(a disjoint union of open subsets). Moreover, /=diag( 1,1,1,1)EL'+,
Is = t,= diag(l, - 1, -1, - 1) ELI, I, = diag(- 1,1,1,1) ELi, Iu =
ISI = -IEL1+, and L\ =IsV+,Ll =ItV+,Ll+=IstL]+. Thus, it
suffices to show that L\ is connected. Let
Certainly (хо,...,х3)^(х1,х2,х3) defines a diffeomorphism of H
with R3. If v°EH, then (t>°,t>°)= 1, and we can complete v° to an
o.n. basis of R4, say t>°,...,t>3 with the matrix of column vectors
[t)°,...,D3]6Lf+ (note that v§>\, since v°EH; and we can always
change u3 to -t>3 to get det[t>0,..., t>3]= 1). If eo=(l,O,O,O)E#, then
[vo,...,v3]eo=v°. Thus, the map -л: L\^>H given_by_77E)=Be0 is
onto. We see that -7T~\eo) = {BEL\\Be0=e0} = SOC), and also
■!T~\vo) = [v0,..., v3]SOC). Indeed, 5OC) acts on L\ to the right in
74 GAUGE THEORY AND VARIATIONAL PRINCIPLES
such a way that -n: L\^>H is a PFB over H=U3 with group
549CJ= S0C). Any bundle over U" is trivial (see Steenrod [1951], p.
53), and so L\ is topologically U3XSOC), which is connected. ■
6.1.3 Notation. Let H{2,C) be the space of 2X2 Hermitian matrices
A(AT=A). A basis for HB,C) is given by
o = [l 0] ,_[0 Г
7 [о ij' T~[i oj'
2_[0 ~i] з Г 1 0
T "I,- oj' T-[o -i
There is an isomorphism U4 -» HB,C) given by x\^x = хот° + х,т' +
х2т2 + х3т3. Later we will have need for another isomorphism R4->
7/B,C), namely, хн>x = хот° — х,т' — х2т2 — х3т3.
6.1.4 Theorem. The following hold for x6R4:
(A) detx = det x = (x,x>, (B) xx = xx = (x, x)I.
Proof. Compute with
co + x3 xx ix2
x =
X j i IX 2 -Л-q .X^
^Q "^3 "^ 1 2
X1 IX 2 Xq 1 X^
Recall that SXB,C) is the group of 2X2 complex matrices A
with det^ = l.
6.1.5 Theorem. There is a homomorphism A: SLB,C)-*L^+ given
by (A(A)(x))^=AxA* where A*= AT, A E5LB,C), and xER4.
Moreover, A is onto with A~\l)—±I.
Proof. Certainly AxA* is linear in x, and so A(A) is linear. By
6.1.4(A) we have (A(A)x,A(A)x) = det[{A(A)(x))^] = det(AxA*) =
(detA)(detx)(detA*) = detx = (x,x). Thus, A(A)EL. We can prove
that SXB,C) is connected.'lndeed, let -n: SXB,C)^C2-{@,0)} be
defined by
6 FREE DIRAC ELECTRON FIELDS 75
that is,
Note that
•
-
([*]) =
1 b
0 1
tec HCs
We see that чг. SXB,C)^C2 - {@,0)} is a PFB with group C. There
is a (global) section a: C2 - {@,0)} ^5LB,C) defined by
a ~
J
b a/y
where y = aa+bb. By 1.1.6 we have SLB,C) diffeomorphic to
(C2 - {@,0)} XC or (since C2-{@,0)} = R4-{0} = 53XIRM'3XIR3,
which certainly connected. Thus, A(SLB,C))CL]+. Suppose that
A{A)= I. Then x = AxA* for allxER4. Writing
and setting х = т° =/, we obtain ad+bb= 1,cc + dd= 1. Then setting
х=т3, we get aa — bb=\,cc — dd= — \. Hence, we must have b = 0
and c = 0, whence \ = detA=ad and aa= 1. Thus й?=а, and so
Settingx = T' yields a2= 1, whence a2-ad= 1, and so Л~'(/) = {±/}.
Note that Л(Л5) = Л(Л)ЛE), because (Л(Л5)(х))^=Л5х(Л5)*
=ЛEх5*)Л*=/4(ЛE)(х))^* = (Л(Л)(ЛE)(х)))^. Thus" Л is a
homomorphism, and in particular, Л sends the curves /н>ехр(/Л) of
5LB,C) to the curves /^ехр^Л^ДЛ)) (without degeneracy, since
Л~'(/)={±/}). Hence, Л#/: SfB,C)^£ is one-to-one. Moreover,
Л , is onto, since dim(SeB,C)) = dim(S'3X|R3) = 6 = dim(S'0C)X
R ) = dim(Lt+) = dim£. Since AHc-4=LA(-4))|coA)|c/oL-4-i)|c, it follows
that Л is a local diffeomorphism onto an open subgroup Lo of V+.
Now V+ is the disjoint union of open cosets of Lo in V+, whence the
connectedness of V+ implies that V+ =L0=AELB,C)), and so Л is
onto. ■
76 GAUGE THEORY AND VARIATIONAL PRINCIPLES
6.1.6 Remark. In the course of the preceding proof we showed that
SLB,C) is diffeomorphic to the simply connected space S3X|R3. [t
follows that A: SLB,C)-*V+ is the so-called universal covering homo-
morphism.
6.1.7 Definition. We define a representation p: SZB,C)-> GLD,C)
by
a*-
Clearly p is the direct sum of two irreducible representations, commonly
denoted by Z^1/2-0»: SLB,C)->GLB,C) and D^/2): SLB,C)->
GLB,C), given by D°/2'°\A)=A and D^x/2\A)=A*-\ In Section
6.2 we will introduce a PFB P^M with group SLB,C). The Dirac
electron fields will be the particle fields in C(P,C4) = {\p: P->C
^А А'}
4|
6.1.8 Theorem. The representations £)A/20) and D@A/2) are not
equivalent.
Proof. We must show that there is no 5EGLB,C) such that
BAB'' =A*~X for all Л ESXB,C). Taking the trace of both sides, we
have tv A=tv A*~\ but this does not hold for
-2i 0
0 //
6.1.9 Theorem. There is no representation p': L^GLD,C) such
that p'(A(A)) = p{A),A ESXB,C).
Proof. If p' exists, then we have p'(/)=/, and yet p\I) —
р'(Л(-/)) = р(-/)= -/, since Л(-/)=/. ■
6.1.10 Remark. Here we examine the physicist's somewhat para-
paradoxical statement that the Dirac electron field is transformed into its
negative when space undergoes one complete rotation. Let
6 FREE DIRAC ELECTRON FIELDS 77
show that A(A): R4^IR4 is a rotation about the x3 axis by 26
oing x0 fixed). Indeed,
Thus, we see that A(Ae) leaves x0 and x3 fixed, while xx and x2
change according to x, + bc2i-» е2в'(хх + ix2) (a rotation by 26 in the
x,-jc2 plane' or equivalently, about the x3 axis). If 6 — m, then Л(А„)
= Iisa rotation by 2тт, butp(A7r) = — I.Accordingly, for фЕ C(P, С4)
asm6.1.1,4>{pA,)= *
6.1.11 Remark. The irreducible representations of SLB,C) (with
complex representation space) form a doubly indexed family D(*'v)
where /x and v run independently over the set @,\, \,\,2, ... }. We
have already defined £>(I/2-0) and D{0A/2). The representation D^'p) is
just the subrepresentation of
£H/2'0)B) .2/\ (g)/)(l/2,0)^)£)@,l/2)^) ,2.". g,2)@,1/2)
with representation space being the space S(^'p) of tensors (referred to
as spinors) symmetric in the first 2/x slots and in the last 2v slots. The
dimension of this representation space is B/x+ 1)B^+ 1). When Z?(|i•"'
/5 restricted to SUB)CSLB,C), it decomposes into a direct sum of
irreducible representations of SUB), the largest of which has dimension
2(v + ijl)+1. Consequently, the number v + ц is called the spin of the
representation D(>l-V). The representation p=D°'/2'0)®D@'l/2) in 6.1.7
« the direct sum of spin-^ representations, and so the electron is said to
have spin ^. Finally, note that the number of slots of a spinor in S^1"'
is 2y.+2v, and so D^-'X-1) = (- lJ^+l/)I, which is I if the spin ц + v
is an integer and —I if \i+v is half-integral. Thus, the property
explored in 6.1.10 is characteristic of particles with half-integral spin.
6.2 THE LEVI-CEVITA CONNECTION
Let M be an и-manifold with metric h of signature (r, s). For the
most part, we will be concerned with the case in which r— 1,5 = 3,
but the general case is no more difficult and will be of use to us later.
Let ( , ) denote the standard scalar product on R" with signature
78 GAUGE THEORY AND VARIATIONAL PRINCIPLES
(r,s)(i.e.,(v,w)=v[wl + ■■■ +vrwr-vr+lwr+] vr+swr+s). Let
O(r, s) denote the subgroup of GL(n,R) preserving ( , ),
in the sense that A EO(r, s) iff (Av, Aw) = (v, w) for all v,wEU".
If т) is the nX n diagonal matrix with diagonal entries
A, ■ ■ ■, 1, — 1, •••,— !), then for v,wER" regarded as column vec-
vectors, we have (v, w) ~vTi]w and A EO(r, s) iff ATy\A —т\. If t\-> A(t)
is a curve in O(r, s) with A@) = I, then (A'@)v, w) + (v, A'@)w)=0
follows from differentiating (A(t)v, A(t)w) =(v,w). Thus, the Lie
algebra of O(r,s) is 6(r, s) = {BE§f(n,R)\(Bv, w) + (t>, Bw) =0}
r
6.2.1 The Orthonormal Frame Bundle. An o.n. frame at xEM is
a frame uEL(M)x,u: W->TXM, such that h(u(v),u(w)) = (v,w).
The set of all o.n. frames at x is denoted by F(M)X. We set F(M) =
UxeMF(M)x and define -n: F(M)->M by tt(u) = x ifuEF(M)x. Note
that ifuEF(M)x and AEO(r,s), then uA=u°A EF(M)X. Indeed,
using the same ideas as in 1.1.8, we can prove that it: F(M)->M is a
PFB with group O(r,s); the essential difference is that o.n. vector
fields must be used in place of coordinate fields in proving local
triviality.
6.2.2 Definition. The canonical 1-form on F(M) is the U"-valued
form yEk\F(M),Un) defined {for XUETUF(M)) by <p(Xu) =
u~\tt,(Xu)). Note that this l-form is the restriction of a form фЕ
A\L(M),R") defined by the same equation (for XUETUL(M)).
6.2.3 Theorem. Relative to the usual representations O(r,s)^>
GL(R") (resp. GL(n,R)^GL(R")), we have (in the notation of 3.1.2)
<pEA\F(M),R") (resp. <pEA\L(M),R"))-
Proof. For XUE TUF(M) (resp. XUE TUL(M)) and A E O(r, s) (resp.
AEGL(n,R)), we have y(RAif Xu) = (uA)-\iTJf(RAjfXll))
^ p. ф
tnitXu =0, both <p and ф vanish on Xu.
(resp. <p(RAjfXu) = А~1-ф(Хи)). When
6.2.4 Definition. // w /5 a connection on F(M), then the torsion
form of со is the 2-form 0" =D"<peA2(F(M),R"). Analogously, we
may define the torsion form of a connection on L(M).
6 FREE DIRAC ELECTRON FIELDS 79
The following is known as the "fundamental theorem of pseudo-
Riemannian geometry."
6.2.5 Theorem. There is a unique connection в {called the Levi-
Cevita connection) on F(M) with vanishing torsion form (i.e., Dey = 0).
Proof. Let со be any connection on F(M) (see 1.2.6). By 3.2.8, we
need to prove that there is a unique oEA\F(M), 0(r, s)) such that
D"~°<p=0. By 3.1.5, Dw~~a(p = d(p+(oj-o)/\(p = @w -o/\<p. Thus, it
suffices to prove that there is a unique oEA\F(M), 0(r, s)) such
that @ы = оЛ<р. Note that (o/\<p)(X, Y) = o(X)<p(Y)-o(Y)<p{X),
according to 3.1.4. Let us assume that there is a a such that
0ы=аЛф. We have
(A)
for all X,Y,Z<ETUF(M) and all u<EF(M). Let 2(*, У, Z) =
. By (A),
(<p(Y),o(Z)<p(X))-(<p(Y),o(X)<p(Z))
Since o(X)EG(r,s), we have <«p(Z),a(^)«p(Y)) = -(
<P(Y)), and so on. The resulting cancellations then yield
(B) 2(*,Y,Z) = 2(<p(Z),a(J0«p(Y)>.
Since <p(Z) and a( Y) range independently over U" as Y and Z range
over TUF(M), we see that a(X) is uniquely determined by (B). Thus,
assuming a exists, we have that a is unique. Now suppose we define
о by (B). Then a(X) is linear in X, since 2(X Y, Z) is linear in
X For ЛЕО(г, 5) we have ( q>(RA#Z), @(RA*X, RA^ Y)> =
(y /4~'0(Х, Y)> = (<p(Z), @(X, Y)). Thus, SCi?^^,
80 GAUGE THEORY AND VARIATIONAL PRINCIPLES
RAtY,RAtZ)=~2(X,Y,Z), and so
Thus, we have a(RA,X)=A'la(X)A = &bA-,a(X). Since 2(*, Y, Z)
vanishes for X, Y, or Z vertical, we see that a(X) = 0 for X vertical
(and a is well defined). Thus, oEA\F(M), 6(r,s)). Finally, from (B)
we have (<p(Z), o(X)<p(Y))-(<p(Z),a(Y)<p(X)) = $B(X, Y, Z)-
2(Y, X, Z)) = («p(Z),0"(X Y)>, and so 0м = аЛ<р. Thus, the re-
required a exists. ■
There is a "local" version of 6.2.5, which will be of use to us later.
Let £,,..., En be o.n. vector fields defined on some open UCM, and
let <p',...,<p" be the 1-forms dual to Ex,...,En (i.e., <p'(Ej) = 8j).
6.2.6 Theorem. There is a unique matrix в —{в') of real-valued
l-forms 6' EA'(f/,IR) such that
(A) втг1 + т]в = 0, Tj = diag(l, • • • , 1, — 1, •••, — !);
(B) dip' = ~2-6^ Лф7 (or ш matrix form dy= —^Лф).
Proof. Let e,,..., en be the standard basis of W. We can define a
local section T: U-*F(M) by letting т(х): IR"^7XM be given
by j(x)(ej) = Eix for г = 1,...,и. We prove т*(ф) = ф where ф is
the canonical 1-form of 6.2.2. Indeed, T*(y>)(Ei) = ц>(т*Е() =
T(x)-\ir.a,Ei) = T(xr\Ei) = e,=(^(Ei),...,^(El)) = ^(El) for i
= 1,...,и. Let в be the Levi-Cevita connection on F(M). Then
d(p= —в/\(р, and applying т*, we have й?ф~= — т*@)Л<р. Thus, set-
setting в = т*(в), we have (B); and since в is ©(/•,s) valued, we have
(A). Let в' satisfy (A) and (B). Then 6' induces aconnection в' on
ir-\U) (as in the proof of 1.2.5) such that т*в' = в'. Now T*@e') =
т*(</ф + в'Лф) = </ф + в'Лф = О. So в6*' ЕЛ2(т7~\U), 0(r, 5))
vanishes on т(Г/), and hence throughout чт~\U\ Thus_6' must be
the Levi-Cevita connection, and в' = т*в' = т*в = в (i.e., в is unique).
6 FREE DIRAC ELECTRON FIELDS 81
6.3 SPIN STRUCTURES AND THE LAGRANGIAN
We specialize to the case where M is a 4-manifold with metric of
lature A,3) (i.e., a space-time).
!$.3-1 Orientability. Let it: F(M)^>M be the o.n. frame bundle.
)f]ote that tt~\x) = F(M)x has four components, corresponding (in an
iinspecified way) to the four components of L=O(\,3). It can happen
that F(M) has fewer than four components, because it is conceivable
that a point in one component of F(M)X could be joined to another
component by a curve in F(M) that leaves F(M)X and then returns.
Upon return, an observer who had traveled along the projection of such
a curve might find that the world is a mirror image of what it was, or
that time is running backward, or bothl You may contemplate these
possibilities, but for the sake of simplicity, we will assume that F(M)
has four components. In this case M is called space and time orientable.
A choice of one component of F(M) is called a space and time
orientation. Let F0(M) be such a choice (i.e., F0(M) is some compo-
component of F(M)). Note that тт: F0(M)-*M is a PFB with group Lr+,
instead of L = 0A,3).
6.3.2 Definition. A spin structure for F0(M) consists of PFB tts:
S(M)-*M with group SLB,C) and a map X: S(M)-*F0(M) such that
ir(\(p)) = TTs(p) for all p^S(M), and \(pA) = \(p)A(A) for all
pES(M),A<ESLB,C), where A: SZB,C)-*LT+ is the homomor-
phism 0/6.1.5. If tt: Fo(M)-*M is trivial, a spin structure for F0(M)
clearly exists. In the case where M is noncompact, Geroch [1968]
proved the converse. For compact M, the converse may not hold (see
Geroch [1968]). We assume throughout the rest of this chapter that there
is a spin structure X: S(M)-*F0(M).
6.3.3 Dirac matrices. In the notation 0/6Л.З, we define a linear
map y: U4 -*§£D,C) by
о
The matrices Y/ —y(et), i = 0,..., 3, are referred to as Dirac matrices,
although sometimes {у/} is replaced by Ey,5"'} for some matrix B,
depending on the book used.
82 GAUGE THEORY AND VARIATIONAL PRINCIPLES
6.3.4 Theorem. For all x, y£R4 we have y(x)y(y) + y(y)y(x) =
2(x,y)I.
Proof. Since both sides are symmetric and linear in x and y, it
suffices to verify the result in the case in which у—х, but this case is
clear from 6.1.4(B). ■
6.3.5 Remark. The Clifford algebra C(R4)®C, relative to < , ), is
the 16-dimensional algebra over С generated by the unit vectors
eo,ei,e2,ei with relations e^j+e]el■=2{ei, Sj)\, where 1 is the unit.
Note that у extends to a representation of C(R4)®C. Up to equiva-
equivalence, this is the only irreducible representation of C(R4)®C.
6.3.6 Theorem. Let X: S(M)-*F0(M) be a spin structure; let Л,:
S£B,C)-> £ be the isomorphism of Lie algebras induced by A: SLB,C)
-*L\; and suppose that в is the Levi-Civita connection on FQ(M).
Then §=Avl °\*в is a connection on S(M).
Proof. For p<ES(M) and А&ЫB,С), we have X(pexptA)-
X(p)A(exp tA) = \(p)exp(tA*(A)). It follows that X*(A*) -
(A*(A))t(p), and so_ в(А*) = А-14в(Х^(А*р))] = A-^(A^(A))= A.
For g£ SLB,C), R*e = 6oRgjf = A~ lX*6oRgjf = Л^ ' ° во Л* ° r^ =
(since ЛоЛ, = ЛЛ(г?)°А) = Л,1о0оЛл(^оЛ,= Л,1овЬЛ(;?) ->0<>\,
6.3.7 Theorem. Let ф — Х*(р where <p is the canonical 1- form on
F0(M) (see 6.2.2). Forg£SLB,C) we haveR*<p = A(g)~'-f, and Ф
vanishes on vertical vectors. Equivalently, ф£Е A\S(M),R4) relative
to the representation SXB,C)-* GL(R4) given by g-v — A(g)(v).
Proof. Since <p vanishes on vertical vectors and Л^ restricts to an
isomorphism of vertical subspaces, it follows that ф — q>° X^ vanishes
on vertical vectors. Moreover, R *ф = ф ° R я# = ф ° Л^ ° R gif =
6.3.8 Definition. Let H: C4XC4-*C be the "twisted" Hermitian
form given by H((zl,...,z4),(wl,...,w4))=z{w3+z2w4+z3wl+z4w2.
Regarding z,w£=C4 as column matrices, we have H(z,w) = zTyow,
where
6 FREE DIRAC ELECTRON FIELDS 83
las in 6.3.3. The reason for the "twist" is that otherwise the next
theorem is false.
■ 6.3-9 Theorem. For A eGLB,C) (in particular, for A G5LB,C))
and
\A 0
.0 A*'1.
we have H(p(A)z, p(A)w) = H(z,w) for all z,w (EC4.
Proof. Since Я(р(Л)г,р(Л)уу) = ггр(Л)гуор(Л)уу, it suffices to
check that p(A)Tyop(A)=yo; this is a simple computation. ■
6.3.10 Definition. We define a real-valued metric h on C4 by
h(z,w) = j(H(z,w) + H(w,z)). From 6.3.9, it is evident that p:
SLB,C)-*GL(C4) is orthogonal relative to h (i.e., h(p(A)z, p(A)w)
= h(z,w)).
6.3.11 The Lagrangian for Free Electron Fields. A free electron
field is a particle field ^eC(S(M),C4) where the representation is p:
SLB,C)-*GL(C4) 0/6.1.7. We define a Lagrangian L: J(S(M),C4)
-*R as follows. Let p<ES(M), and suppose that a^A\S(M),C4)p-
Let Eo, ...,E3(=T S(M) be horizontal vectors (relative to the connec-
connection в of 6.3.6) such that ф(Е1)=е1 =ith standard basis vector of R4,
where <jp = \*(p(see 6.3.7) or (equivalently)
TpS(M). Define yXoeC4 by
YXa= 2 1чу(е,)
where T]-diag(\, -\, - 1, - 1). Then, for deC4, we let L(p,v,a) =
h(i(yXo),v)-mh(v,v). In order to prove that L: J(S(M),C4)-*R
is a Lagrangian, we need some lemmas.
6.3.12 Lemma. For all A&SLB,C) and x&R4, we have
(A) = p(A)y(x)p(A)-1.
Proof. By continuity, we need only prove this for (x,x)jk0.
Using 6.1.4(B), we have xx =(x, x)I, or (assuming that (x,x)
¥=0)x = (x, x)x~l. Since < A(A)x, A(A)x ) = < x, x ) ^0, we
have (A(A)x)~ ~= (x, x)(A(A)x)Zl = (x, x)(AxA*)~l = A* ~ (x, x)
84
GAUGE THEORY AND VARIATIONAL PRINCIPLES
, (A(A)x)~ =
and so
p(A)y(x)p(A) ' =
О
А*
0
(A(A)x)
0 х
х 0
АхА*
0
(А(А
0
А '
0
0
A*
(A
6.3.13 Lemma. Let Eq,..., E^TpS(M) be horizontal vectors
such that <р(Ец),...,ф(Еъ) are o.n. relative to ( , ). For all crG
A\S(M),C%, we then have yX а =1т]чу{ф(Е'1))а(Е'1).
Proof. There is an A&L = O(\,3) such that ei = '2.Akifp{E'k), and
hence E=^AmjE'm. Then
6.3.14 Theorem. 77ге junction L: /E(M),C4)
L(/>, и, а) = /г(гухог, и) —т/г(и, и), is a Lagrangian.
g/uen
Proof. We must prove L(pg, g~lv, g~l ■ o° Rg-^) = L(p, v, a).
By 6.3.9 and 6.3.10, it suffices to prove that yX.(g~KooRg-li>t) =
g
g~K(yXo). Let E0,...,E3(ETpS(M) be as in 6.3.11. According to
6.3.7, we have <p(/?^) = A(g)-1-<p(£,) = A(g)-1e,- Thus R^E,
may serve as £/6E TpgS(M) as in Lemma 6.3.13, and it follows that
l[°(Rg-'*(Rg*EJ))] =
a) = g-l-(yXa), as
J
required.
6.4 DIRAC'S EQUATION
Here we find Lagrange's equation for the Lagrangian L of 6.3.11
and prove that it reduces to Dirac's equation for a free electron field.
6 FREE DIRAC ELECTRON FIELDS 85
0,4.1 Lemma. For all xeR4, y(x) is self-adjoint relative to h (i.e.,
h{y(x)v,w)=h(v,y(x)w)).
Proof. Note that h(y(x)v,w) = vTy(x)Tyow, while h(v,y(x)w) =
vTyoy(x)w. Thus, we must prove Y(*)rYo =YoY(*)- Since y(x) is
linear in x, we need only check this for x = eo,el,e2,ei. For x =
еоМео) = Уо> and so Y(eo)rYo=YoYo=YoY(eo)- For x = et,i¥=0, we
havey(e,)Yo+YoY(e,)=O by 6.3.4. Thus, Y(^)rYo+YoY(^)r = O, and
it suffices to check that y(ei)T— —y(ei) for i^O. For z — zxex +z2e2
+ z3e3, note that z= —z, and
f
while
l
Since z is Hermitian, zT~z=— z and zT — z=— z, we see that
y(z)r= —y(z), as needed. ■
6.4.2 Theorem. For L as in 6.3.11, we have that (in the notation of
4.3.1) V3L(/>,t>,a): TpS(M)-*C4 is given by V 3L(p, v, a)(X) =
Proof. For /? G Л'( S(M))p we have
(hh){p,v3L(p,v,o))
etc.
6.4.3. Theorem. For L^as in 6.3.14, Lagrange's equation is
-i8e[(yoy)(^)\ + i(y<<De^)-2m^=Q where ^<EC(S(M),C4) and
в is as in 6.3.6.
86 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Proof. From 6.4.2, it follows that 9L/9(#fy) = - i(y°q>)(\py
From the definition of L in 6.3.11, we see that 9L/9^ = /(yX £>fy)
— 2m\l>. Noting that 8e commutes with any fixed linear operator on
C4 (e.g., multiplication by /), we find that the desired result follows
from 4.3.6. ■
The reduction of the equation in 6.4.3 to Dirac's equation F.4.8)
is done with some lemmas.
6.4.4 Lemma. For В<еЫB,С), x<EU4, and pEC4, we have
Proof. Set A =exp(tB) in the equation of 6.3.12. The result follows
from applying both sides of that equation to v, and then differentiat-
differentiating with respect to t at / = 0. ■
6.4.5 Lemma. Let ^<EC(S(M),C4) and recall that X: S(M)->
F0(M) is the spin structure of 6.3.2. We have
where
Proof According to 3.1.5 and 4.2.8, we have 8в[(уоф)(ф)] =
*{d[(yo *<р)(\р)] + вЛ[(уо *ф) (»//)]}. Using 6.4.4, we can rewrite the
second term:
Also, d[(y о *ф)(ф)] = [y ° (d*<p)](\p) — (у ° *ф) Л di>. Combining
6 FREE DIRAC ELECTRON FIELDS 87
uese results yields 8в[(уоф)(ф)] = *{[уоX*(d*<p + 0Л
6.4.6 Lemma. For the Levi-Cevita connection в and canonical form
Pwo/. Since 5*<p= ±*.D9*<p, it suffices to prove that De*<p = 0. Now
*<р = (£<р',..., *<p") where *«р' = (-1)'"'т),|(р'Л ••• Лф'Л •••Л<р".
Applying й? to *<p (componentwise) and using d(pJ — —ieJkA(pk, we
see that d(*(p) vanishes on horizontal subspaces, because of the
factors 0Jk. Thus, De*<p=(d*(p)H = 0. Ш
6.4.7 Lemma. For ^GCE(M),C4), we have *[(y°*q>)/\(De\p)]
Dep
Proof. Let ф — (ф°,..., ф3) = 2 ф'е,. Suppose that Eo,..., E3
TpS(M) are as in 6.3.11. Then
6.4.8 Theorem. The Lagrange equation of 6.4.3 reduces to the
Dirac equation yXDe\p+im\p = 0.
Proof We need to prove that_86'[(Y°<p)(^)] = -yXDe\p. Now
8<>[(УОФ)D>)]= -*[(Y°(*9))A(Z>fy)]> by 6.4.5 and 6.4.6. The result
follows from 6.4.7. ■
6.4.9 Remark. Let М—Ш4 be Minkowski space with the usual
coordinates (x0, x,, x2, x3) and coordinate fields 30,3,, 32,33. We may
then take S(M) to be the product MXSLB,C). The horizontal sub-
space at (x, A) (relative to the connection в) is just the tangent space of
88 GAUGE THEORY AND VARIATIONAL PRINCIPLES
the submanifoldMX{A} at (x, A). If a: M-*MXSLB,C) is defined
by a(x) = (x,I), then аД=£, (defined in 6.3.11). Let \p' = o*4> for
^eC(S(M),C4). Then
3>', and so yXDV 7^ yy
2t)j■■y(ei)d-\p'. Writing y' =1,r]jky(ek), we //zen see //га/ ?/ге Z);>ac
equation on MXSLB,C) pulls down (via a) to the equation 2y/3/^'
+ im\p' = O found in physics books.
6.4.10 Historical Note. The relativistically invariant Klein-Gordon
equation 2tj jkd<dk\p+m2\p=0 is unlike Schrodinger's equation because
of the second time derivative. In order to remedy this (and other related
problems) Dirac sought a first-order equation of the form 2a*3^=
-/mf which upon iteration yields the Klein-Gordon equation:
— im^) = { — im) \p= —m2\p
or
This is the Klein-Gordon if j(aJak + akaJ) — f],kI. The smallest (nec-
(necessarily square) matrices ak(k = 0,1,2,3) satisfying these relations are
4X4 (e.g., the Dirac matrices yk). Thus, Dirac found his equation, and
deduced that \p must have at least four components. If m =0 (unlike the
electron), then a simpler possibility exists. Multiplying 2 a* 3^=0 by
(a0), we obtain Э0^ = 2-/?7Э7^ where /?7=(a°)~ la'(j = 1,2,3).
Thus, 3O2 4> = 2 {(Pkfij + PJ(lk)dk д/4>, which is the Klein-Gordon equa-
equation (for m=0) provided that {(fikfi> + (lJPk) = SkjI(\^ j, к <3).
The 2X2 matrices т1, т2, т3 of 6.1.3 (/.e., the Pauli matrices) will meet
this condition (use 6.1.4E)). The corresponding two-component equa-
equation 30^ + 2t'3,^=0 was noticed by Weyl (see Bjorken and Drell
[1964]), but was dismissed because of the lack of a so-called parity
symmetry. The Weyl equation was vindicated in 1956 with the discovery
of parity violation by neutrinos in experiments proposed by T. D. Lee
and C. N. Yang et al. Indeed, the Weyl equation is satisfied by neutrino
fields in C(S(M),C2) where the representation is Z)A/20) or D@A/2),
depending on the "handedness." Naturally, the ambidextrous electron
arises from Z)('/2-°>©Z)(°.'/2).
CHAPTER
Interactions
In Chapter 6 we developed the basic theory of the free electron
field. In reality, however, the electron field is always attended by the
electromagnetic gauge potential (i.e., four-dimensional vector poten-
potential of 1.2.7) with which it interacts. This gauge potential is defined
on the total space of a PFB with group U(\), while the free electron
field is defined on a different PFB, namely, the bundle of spinor
frames, with group SXB,C). However, in Chapters 3-5 the particle
field and the gauge potential to which it responds were always
defined on the total space of the same PFB. Hence, the results of
these chapters do not apply directly to the case of electron fields and
electromagnetic gauge potentials. To remedy the problem, in Section
7.1 we describe a general, straightforward procedure for splicing two
PFBs together to form a new PFB whose group is the product of the
groups of the original PFBs (see the diagram in 7.1.1). Then a free
David D. Bleecker, Gauge Theory and Variational Principles ISBN 0-201-10096-7
Copyright © 1981 by Addison-Wesley Publishing Company, Inc., Advanced Book
Program/World Science Division. All rights reserved. No part of this publication
may be reproduced, stored in a retrieval system, or transmitted, in any form or by
any means, electronic, mechanical, photocopying, recording, or otherwise, without
the prior permission of the publisher.
89
90 GAUGE THEORY AND VARIATIONAL PRINCIPLES
particle field that lives on P, and a gauge potential that lives on p
can both be lifted to the total space P, о Р2 of the spliced bundle,
where they may interact essentially as in Chapters 3-5, as shown in
7.1.15.
In Section 7.2, the general procedure of 7.1 is applied to the case
of the interaction of an electron field with an electromagnetic gauge
potential. In Section 7.3 we apply it to the case of a nucleon field
interacting with a classical Yang-Mills gauge potential (i.e., a con-
connection on a PFB with non-Abelian group SUB)). The procedure of
7.1 generalizes to the case where more than two PFBs are spliced, as
when a particle field responds to several different gauge potentials
(e.g., electro-weak interactions). It is noteworthy that while Lagrange's
equation for nonfree particle fields resides on the total space of the
spliced bundle, it is shown in 7.1.15 that the inhomogeneous field
equation and the current can be pushed down to the total space of
the PFB on which the gauge potential was originally defined. Finally,
in 7.3.13-7.3.22 we make a detailed effort to show that our formula-
formulation of the field equations for nucleon fields and Yang-Mills poten-
potentials is equivalent to that in the original Yang-Mills paper [1954].
7.1 BUNDLE SPLICING
The constructions of this section will be used in both 7.2 and 7.3.
7.1.1 Definition. Let -nf. P^M be a PFB with group G,, / = 1,2.
We define P, о P2 ={(;,„ Pl) eP, XP2 h(/>,)=*2(/>2)}. Let ira:
P,oP2-^M be given by тт12(Р[, />2) = *,(/> ,)=w2(/>2). For (g,,g2)E
G^XGj and (/>,,/>2)EP,oP2, define (p,,p2)(g1,g2) = (p1g1,/?2g2).
You can verify that тт[2: Р, oP2 -*M is a PFB with group G\~XG2. We
say that the bundles irf. Pt-*M have been spliced. There are also
projections it': P\°P2-*Pl given by ir'ip^ Р2)=Р>, / = 1,2. Certainly
' 2
77': P, о p2 -*p, is a PFB with group {1}XG2 =G2, and it2: P, о Р2 -*P
is a PFB with group Gx X{1} =GV We have the diagram
2
7 INTERACTIONS 91
JM.2 Theorem, //w, is a connection for <n{. Pt->M (/=1,2), then
_i*tf| is a connection for к2: P, »Р2 -*P2,77-2*w2 /s a connection for тт1:
p op2 -*Pi, and tt1*co[®tt2*co2 is a connection for чт12: P, о p2 ->M.
proof. This is routine. Note that we have identified g, with g, X {0}
§ ^ SO ОП- ^
7.1.3 Theorem. Let pf. G{-*GL{V) (/=1,2) fee representations
such that Pl(gl)°p2(g2) = P2(g2HPi(gi) M all (gl,g2)BGlXG2.
Then (p^Xpj)- GXXG2-*GL(V), given by (g,, g2)^ P,(g,) °P2(g2),
/s a representation, and so C(PX° P2,V) makes sense.
Proof. We have
= (by assumption) =P,(g,) °P2(g2) °p,(gO °p2(g2)
= (p,xp2)(g1,g2)o(p,xp2)(g;,g2). ■
7_.1.4 Definition. Let Ak(Pl о Р2, g.) fee //ге subspace of
Л/с(Р| оР2, §,Ф§2) consisting of those forms with values in g. С§,Ф§2,
/=1,2. 5/nce g, anc/ §2 are invariant subspaces_of the representation
(% G, X (J2 -> GL(g,©g2), ?/геге are projections §,: Л*(Р, ° P2, §,©§2)
-* Л/с(Р, о p2, g.) induced by the projections §,©g2 -> g,.
7.1_.5 Theorem. For anj^ aGA*(P,oP2, g,®g2)_we /гаие a = g,(a)
+ S2(a)._ Moreover, there is a unique form a. GA*(P,-, g,-) smc/z
-Proo/. The first statement is obvious, by definition. For
T(P),P2)Pi°P2, define aXp^X) = (Qxa){X). We check that alpi is
well defined (i.e., independent of the choice of X and p2). If X'&
T(P> p,)P^Pi with <*=*;*', then wl2l»( A--A") = w,X(^-^") = 0.
whence X-X' is vertical for mx2\ P,oP2-^M. Hence a(X-X')=0,
and §,(«)(X) = §,(«)( X'). For any g2G{l}XG2, wehave77-^(«g2!(cX)
= <(X), and (S,a)(/?ft#A') = abg2-(9la)(X) = (9la)(^)' since
6Eb -i acts trivially on §,. Thus, a, is well defined. If ii\X is vertical
92 GAUGE THEORY AND VARIATIONAL PRINCIPLES
for 77,: P,^M, then 0 = 77|!(с77^=7г|2!(сХ, and so al;,iG7^) = (§l
= 0. Finally, for g, EG,, we have a,(/?giX^) = «i(<^gl*^) =
(S1a)(/?Jfi^) = abJfi-.(S1a)(^)=SbJfi-.a1(^). The proof is the
same for г = 2. ■
7.1.6 Theorem. Let u, fee a connection for <n{. Pf^>M, and let
ш, =77-'*w, vw//z w,+w2 the connection for_тт[2: PX°P2-+ М provided by
7.1.2. 77геп £)"| + sen* /orm5 m Л*(Р, о P2, §.) /0 /orms in
Л* + 1(Р, oP2, §.), f =1,2. Moreover, in the notation of 7.1.5,
(Z> *■ + *>( a)),=D "■(«,).
Лчю/. For aE Л*(Р, о Р2, §,), we have £>"' + a = rfa + (w, + ш2)Л
a=c/a + w,Aa, since the Lie algebra action of 0®§2 on §,©0 is
trivial. Both da and ш,Аа have values in §,®0, and so £>"l + aG
Л* + 1(Р, °P2' ^i)' and so on- To prove the second statement, we need
771*(Z)a1))=§1(£)"l + U2(a)), but TTl*(Da'(al))=TLl*(da[+ool/\al)
+ iT^co^AiT1*^ = d(§la)+ w, A(S,a) = §,( rfa + ( w, + a2)
7.1.7 Theorem. Assume that M is an oriented Riemannian n-
manifold in order that star operators * , * , and * can fee defined on
the spaces Л(Р„§,), Л(Р2,§2), апс? Л(Р,о.Р2Д©§2). Г/геп *
respects the decomposition Ak(Pl ° P2, §, © §2) = ^(P, ° P2,
ФЛ*(Р, о Р2, §2) (f.e., *l2(§,-a) = §,-(*l2a)). Moreover, for aE
Ak(P[oP2, §,©§2), we /гаие (*|2а), =*.(а,), ш the notation of 7.1.5.
Proof. By definition of * in 4.2.1, it is evident that §, commutes
with * . The last statement follows from the fact that 77^: H{p^Pi) -»
/f is an isometry of horizontal subspaces where the metrics on these
subspaces are induced by that on T (Pi)M via ■77-12He and чты. ■
|2
7.1.8 Theorem. In the notation ofl.X.6 and 1Л.1, we have (for
aEAk(P]oP2^§j)) that 6i|+b2(«NAbl(?,oP2,§.). Moreover, in the
notation 0/7.1.5, (S"' + (a)). =8ы'(а,.).
Proof. This is clear from 7.1.6 and 7.1.7. ■
7.1.9 Note, /n order /0 motivate the rest of Section 7.1, we mention
that in Sections 7.2 and 7.3 тт,: P, ->M wi7/ fee the PFB S(M)^M with
connection в of 6.3.6. The bundle tt2\ P2^>M will carry the potential
I 7 INTERACTIONS 93
i{i.e., connection) that interacts with the free Dirac-type fields on P,
when everything is lifted to the arena P, о P2. The remaining considera-
considerations ofl.\ also apply to other situations of physical interest.
7.1.10 Definition. Referring to 3.3.1, we define J(P, V) = {(p, v, a)
&J(P, V)\oE:K\P,V)p}. In other words, we require a to vanish on
vertical vectors. We could have required this in 3.3.1, but then the
expression L(p,^(p),d^p) in 3.3.3 would not be defined. Note that
L(p,\p(p)^ D"\pp) does make sense, even if L is only defined on
J(P, V). Indeed, with the exception of 3.3.3 and 3.3.5, all that we have
done so far makes sense for Lagrangians defined only on J(P, V).
7.1.11 Definition. Suppose that p^. Gt^>GL(V) are commuting rep-
representations as in 7.1.3, and let L: J(Pl,V)^>U be a Lagrangian (see
7Л.10). For aeA'(P,°P2,F)(;,|i;>2), define ^o ЕЛ'(Р„ V)Pi by
(tt1o)('7t\.X)=o(X) (well defined, since a vanishes on vertical vectors).
Define L: 7(P, oP2, F)->R by L((px, p2),v,o)=L(pvv,Tflo).
7.1.12 Theorem. The function L: 7(P, о Р2, F)->R is a Lagrangian,
provided L is G2-invariant in the sense that L(p{, g2-v, g2-a) =
L(px, v,a) for all(px, v, a)E/(P,, F) and g2 EG2.
Proof. First note that for аЕЛ'(Р, °P2,V)(pt pi) and (g,,g2)EG,
XG2, we have ?(a» R^^ ) = (^a) 0 R , since wl((p
:p,g, =77-'(Yp,, P,))g,. Then we have
= L((pl,p2),v,a). Ш
7.1.13 Notation. Suppose that §, has a metric /c, such that &b:
Gi^>GL(§i) is orthogonal (г = 1,2). From /c, and k2 we can form a
metric kl2 on §]®§2 (such that §^±§2) in the obvious way. Assuming
that L o/7.1.11 /5 Gj-invariant as in 7.1.12, we have a Lagrangian L:
J(P{ oP25J/)->R. Let ш,+ш2 be cis in 7.1.6, and suppose that \[>E
C(P,oP2,V). Let/=/"|+(^/)ЕЛ1(Р|оP2,§,©§2) be the current, as
94 GAUGE THEORY AND VARIATIONAL PRINCIPLES
defined in 5.1.1 or 5.1.2. Accordingjo 7.1.5, J=QXJ+\J. In applica-
applications we will not have much use for §, /, but (in the notation of 7.1.5) we
7.1.14 More Notation. Let в(Р2) be the space of connections for
тт2: P2^M, and let §2: e(P2)^Cx(M) be the self-Action {i.e.,
§2(Ы2)= ~ 2(h2k2)(®Wl,QWl)). For a fixed connection w, on P,, we
define {t+%2): C{P,o P2,V)X6(P2)^ CX(M) by (£+S2)(,//, ^)(x)
^^ %2{^2){x) where x =
7.1.15 Theorem. Let /(^) be as in 7.1.13, and (£+§2) as in
7.1.14. The pair (»//, w2)E C(P, »P2, F) X6(P2) /5 stationary relative
to (£+§2) (z'n a sense similar to 5.2.2) iff conditions (A) and (B) hold:
(A)
(В)
Proof. The Lagrange equation (A) holds if \p is stationary for £. (or
£+S2) when w2 is fixed (i.e., w, +co2 is fixed). Thus, (A) is necessary.
To prove the necessity of (B), we follow the proof of 5.2.3. Let §|2:
6(P, oP2) ^ C°°(M) be the self-Action (i.e., §|2(ш,+ш2) =
~ 2(hnk\2)(®"' + ,®"'+))- Note that every connection onP,oP2
is of the form ш, +ш2. The proof of 5.2.3 almost applies to £+§|2.
Since we are keeping w, fixed, in the proof we must assume that
тЕЛ'(Р, op2, g2). We are then led to
0= /
Since т has values in §2, we can replace the forms in the inte-
integrands by their %2 projections. Using 7.1.8 and the identity
(hakl2)(<32a,§2(l) = (h2k2)(a2,(}2), we then have
0= f
Ju
It follows that (B) is necessary. The sufficiency of (A) with (B) is
clear, as in the proof of 5.2.3. ■
7.1.16 Remark. As a consequence of 8fl =J(\p), we have
8J(\p) = 0 (see 5.2.8). However, this continuity equation can be
7 INTERACTIONS 95
Meduced if we assume only that \p is stationary for £ relative to a fixed
^connection cc2. We prove this next, by a slight modification of the proof
О/5.1.5.
7.1.17 Theorem, //^e C(P[ о P2, V) is stationary relative to tfor a
fixed connection w,+w2 (see 4.1.3), then 8(/^) = 0.
Proof. Implicitly we assume that L is G2-invariant so that L is a
Lagrangian (see 7.1.12). H fEGA(Pl °P2) is of the form f(p,, p2) =
(pvf(p2)), then (from the proof of 3.3.6) fc(f*4>, /*w) = £(»//, со). In
the notation of 3.2.12, exp(F) is such a gauge transformation iff
FEC(Pl°P2,Q2). Proceeding as in the proof of 5.1.5 with FE
Р2Д)> we find
'и
Thus, §2E"'+Ч/)=0; and so (using 7.1.5 and 7.1.8) 8(/2) = 0, but
/(^)=/2 (See 7.1.13). ■
7.1.18 Remark. 77ze results of this section generalize to the situation
where more than two PFBs are spliced. This is needed when a particle
field interacts with several potentials.
7.2 THE (NONFREE) DIRAC ELECTRON FIELD
We derive Lagrange's (or Dirac's) equation and the inhomoge-
neous Maxwell equation, which govern the interaction of the Dirac
electron field with an electromagnetic potential. Much of this amounts
to specializing the results of Section 7.1.
7.2.1 The Preliminary Setup. Let M be a space- and time-oriented
Lorentz manifold with oriented frame bundle F0(M)^>M and spin
structure X: S(M)^>F0(M). Let в be the Levi-Civita connection on
F0(M), and suppose that в is the related connection on S(M) defined in
6.3.6. The PFB S(M)^M with group SXB,C) and connection в will
serve as the PFB чт^.Р^М with group Gx and connection со, of
Section 7.1. We suppose that it2: P2^>M is a PFB with group U(\) =
{e'e\6ER} and co2 is a variable connection (i.e., electromagnetic
potential) on P2. With F=C4, we define the representation p,: Gx ->
96 GAUGE THEORY AND VARIATIONAL PRINCIPLES
GL(V) to be p: SLB,C)^GL(C4) of 6.1.7 and we suppose that p2-
G2^GL(V) is p2: U(\)^GL(C4), given by p2(ei$)(v) = eiev. Note
that for AESLB,C), we have pl(A)(p2(eie)(v)) = pl(A)(ewV) =
ewpl(A)(v) = p2(e'e)(pl(A)(v)), as required in 7.1.3.
7.2.2 Definition. A Dime electron field (nonfree) is a particle field
in C(P\ °P2, V) (see 7.1.3) where Pv P2, p,, andp2 are given in 7.2.1.
7.2.3 Theorem. Let L: J(S(M),C4)->U be the Lagrangian of
6.3.11. Then L is G2-invariant in the sense o/7.1.12. Moreover, the
Lagrangian L: J(S(M)<>P2,C4)->R (supplied by 7.1.12) is explicitly
given by
L((pl,p2),v,o) = ti(iyX(;iflo),v)-mh(v,v)
where -n^a is defined in 7.1.11.
Proof. We have L( p{, e'ev, ei6a) = h(i(y X e'ea), e'ev)-
mh(e'%,e'ev). From the definition of yXa in 6.3.11, we see that
уХешд=ешуХд. Thus, it remains to prove that h(eww,ewv) =
h(w, v), but this follows from the definition of H in 6.3.8 and h in
6.3.10. The rest is clear. ■
7.2.4 Definition. For аЕЛ'(Р, о P2,C4)(Pi PiV we define yXa
= 2 Чуу(е,)(а(£у)) G C4 where Ej E T^pi)P, о Р2 satisfies
MPi) X'n\2*Ej) = ej=thejth standard basis vector in R4, X: S(M)^>
F0(M) being the spin structure. Note that E- is not unique, but if a
connection w2 on P2 is given, then E- is unique if (в + <Ь2)(Е1)=0 is
imposed. Also, if E-(ET S(M) is defined as in 6.3.11 and 6(E ) — 0 is
imposed, then it^E ) = E . Then we have у X W]a =
2vijY(ei)[(vlo)(Ej)] = 2r,ijy(ei)(o(Ej))=yXo. The use of yXa
rather than yXir]a is a matter of aesthetics.
7.2.5 Theorem. For ^ECE(M)oP2,C4), Lagrange's equation
relative to the Lagrangian L of1.2.3 and connection в + &2 is (for ф as
in 6.3.7) -i
Proof. Noting that чг1*ф(Е-) = e- is an equivalent characterization
for Ej as in 7.2.4, we see that V3L((pvp2)v,a)= — /(уо^'*)()
7 INTERACTIONS 97
a computation as in the proof of 6.4.2. Thus, we have
/d(De+4')= -/(y°77-'*9)(^). From the equation for L in 7.2.3,
see directly that dL/d4>=iyX(De+D>))-2m4>. The result fol-
follows. ■
7.2.6 Theorem. The Lagrange equation of 125 reduces to the
Dirac equation for an electron field \p influenced by the electromagnetic
potential oo2, namely, уХ(й?»^+@+<Ь2)-^) + гт^ = О.
Proof. We must prove that 8в + [(уотт1*<р)D/)] = -уХ De + (\p).
This is done by using 6.4.6 and analogues of 6.4.5 and 6.4.7. The
analogue of 6.4.5 is S +W2[(y ° тт1*ф)(\р)] = [у ° тт1*Х*(8 <р)](»^)
— * [(y°(* ■7Tl*<p))A(De + W2\p)]. The proof of this equation is simi-
similar; note that co2 A[(y отт1*ф)(\р)]= —(уотт]*ф)Аоо2-\р since the rep-
resentation p2: [/A)^ (jL(C4) commutes with the gamma matrices
(i.e., y(x)(e'ev) = e'ey(x)(v)). The analogue of 6.4.7 is
*
12
in the notation of 7.2.4. Again the proof is similar. The result then
follows, since 8e<p = 0 by 6.4.6. ■
♦
7.2.7 Theorem. Let §2 =%A) = {/a|aElRJ have the metric k2 with
k2(i,i) = \. Then the current component %2J {defined in 7.1.13),
relative to the Lagrangian L of 7.2.3, of the pair (\p, со) in
CE(M)oP2,C4)Xe(P2) is given by §2J- -й((уочг1*ф)(\р), \p)i.
Proof. In 5.1.2, let ex — i be the o.n. basis for §2, and note that §2J
is independent of the remaining o.n. basis vectors of §,Ф§2 that^span
S, (relative^ to some metric on §,). Then we see that %2J—
h(dL/d{De+"i\l>), i\p)i, which (from the proof of 7.2.5) is
h{ — г(у°77-'*ф)(^), i\p)i, and so on. ■
7.2.8 Remark. In 7.1.13 we found that_ there is a unique form
/^)EA'(P2,g2) such that tt2*JW2{^) = §2J. In the case at hand,
%2J {and hence /(^)) is independent of cc2 by 7.2.7. Let a: U^P2 be
a local section. Since U{\) is Abelian, the next theorem shows that
98 GAUGE THEORY AND VARIATIONAL PRINCIPLES
a*Jai(^)EA\U,6H(\)) yields a well-defined form (on M) that is
independent of the choice of local section a. We denote this form bv
7(^еЛ'(М%A)) (iej(t)i\U=o*J"^)
7.2.9 Theorem. Let it: P-^M be a PFB with group G and let
Gh>GL(V) be a representation. Suppose that au: U^P and av: V~>p
are local sections related by av(x) = au(x)guv(x) for some guv: UC\ V~>
G. Then for any aE~Ak(P,V) we have (ov*a)x=guv(x)-l-(ou*a)x. ln
particular, if V-% and G -> GL(§) is ®b, then (ov*a) =
Proof. The proof is a clear generalization of the proof of 2.2.14. ■
7.2.10 Theorem. Let a: U-^S(M)oP2 be a local section of the
bundle щ2: S(M)oP2^M. For ^ECE(M)oP2, C4), let ^' = a*^
(i.e., \p'(x) = \p(o(x))). Note that X ° ттх ° a /5 an oriented frame field on
U, say XQ,..., Хъ. Let ф=(ф0,..., ф3) be the coframe (i.e., <p'(Xj) =
S/). Then the form j(\l>)eAl(M,U), introduced in 7.2.8, is given by
Proof. Note that 77^00: U-*P2 is a local section. Thus jD>)i\U=
G72oCT)*/(^) = a*(g2/) = -/г([уо(а*о77-1*ф)](^/), ^')i = -h(y°
[(Лотт-1 оа)*<р]D,'),-фу = (Ъу the proof of 6.2.6)= -h(y[v]{\f/f),ty.
7.2.11 Theorem. Let a: U^> S(M)° P2be a local section, and write
co2 = -iA for co2 Ев(Р2). Define the "vectorpotential" A'EA\U,R)
byA' = (iT2oa)*A. Then the pair (ф, co2) in C(S(M)o Р2,С4)Хв(Р2)
is stationary for £+§2 (see 7.1.15, where L is given in 7.2.3) iff
(a) yX(d4> + F-iA)-4>) + imxp = O (Dirac equation);
(b) 8( — dA') —j(\p) (inhomogeneous Maxwell equation).
Proof. We show that this theorem is a special case of 7.1.15. By
virtue of 7.2.6, we have that (a) is equivalent to (A) of 7.1.15 in the
case at hand. Applying G72oa)* to both sides of (B) in 7.1.15, we
obtain (b). ■
7 INTERACTIONS 99
I7.3 THE NUCLEON IN A YANG-MILLS POTENTIAL
fflere we find Lagrange's equation and the inhomogeneous field
ation for a nucleon (proton-neutron doublet) subject to a Yang-
Is potential (i.e., connection on a PFB with group SUB)). The
d physicist will be quick to point out that this section is only
first step toward a physically realistic model (see 10.3).
^7.3.1 The Setup. We proceed as in 7.2.1. Now, however, тт2: Р2^>М
is a PFB with group G2=SUB). We set F=C4®C4 and define p,:
SLB,C)^GL(V) by Pi(A)(vl®v2) = p(A)(v1)®p(A)(v2) where p is
as defined in 6.1.7. The representation p2. SUB)^>GL(V) is defined,
for
by
Writing
we can express this definition in terms of matrix multiplication. The
linearity of p(A) gives us P\(A) op2(B)= p2(B) °P\{A) {for AE
SLB,C), BESUB)), as required in 7.1.3.
7.3.2 Definition. A nucleon field is a particle field in
C(S(M)oP2,C4®C4) (see 7.1.3 where p, and p2 are defined in 7.3.1).
We may write a nucleon field \p in terms of its C4 components
D' = -ф1®4/2) with \pl and \p2 being the proton and neutron components,
respectively. Note that SLB,C) respects these components, but SUB)
scrambles them.
7.3.3 Definition. Let h be the metric on C4®C4 given by h{V\®
v2,wx®w2) = h~(v[,wl) + h~(v2,w2) where h was defined in 6.3.10. We
100 GAUGE THEORY AND VARIATIONAL PRINCIPLES
defin_ef-U4 ^§i(C4®C4) byy(x)(v]®v2) = y(x)(v,)®y(x)(v2). For
44
p we set у Xa = (yXa,)©(y Xa2)
where a = ox®a2 and yXa, is defined in 6.3.11. Finally, we define a
function
)^R byL(p,v,o)=h(iyXo,v)-mh\v,v).
With the proof of 6.3.14, it is clear that L is a Lagrangian.
7.3.4 Theorem. The Lagrangian L is fi 2-invariant (G2 = SUB)) in
the sense of 7.1.12. The Lagrangian L: J(S(M)o P2,C4®C4)^R
(supplied by 7.1.12) is explicitly given by
Proof. The G2 -invariance of L follows from the next two lemmas,
and the rest is immediate from the definitions. ■
7.3.5 Lemma. For x£R4, AeSUB), u£C4®C4, and a£
A\S(M), C4©C4)p we have y(x)(A-v)=A-(y(x)(v)), and conse-
consequently yX(A-o)=A-(yXa), where we have used our customary
shorthand A -v = p2(A)(v) and A -o = p2(A) «a.
Proof. Let
'a b
с d
and write v=vl®v2. Then y(x)(A ■v) = y(x)(av] +bv2)®y(x)(cv] +
dv2)=A -(y(x)(v)) by the linearity of y(x). ■
7.3.6 Lemma. For A<=SUB) and u,w£C4©C4, we have h(A-
v, A -w) = h(v,w).
Proof. From the symmetry of h, it is enough to check the case in
which v = w. Let
7 INTERACTIONS
write v=vx®v2. Then
Г h(A-v, A-v)=h(avx +bv2, avx +bv2) +h(cvx +dv2,
=H(avx +bv2, avx +bv2)+H(cvx +dv2,cvx +dv2)
in the notation of 6.3.8 and 6.3.10. This equals (aa + cc)H(vx,vx) +
(ab+cd)H(v \,v2) + (ba+dc)H(v2,v,) + (bb + dd)H(v2, v2) — H(vx,
vx)+H(v2,v2) = h(v,v). ■
7.3.7 Definition. For теЛ1E(М)оР2,С4ФС4)(рь(,2) feryXrGC4
ФС4 be defined in the same manner as у Ха was defined in 7.2.4 (i.e.,
replace у by у).
7.3.8 Theorem. For ф£СE(М)оР2,С4©С4) and в the connec-
connection on S(M) induced by the Levi~Civita connection on F0(M) (see
6.3.6) we have that Lagrange's equation (relative to the Lagrangian L
of 13 A and the connection в + ы2оп S(M) ° P2) reduces to у X(d\j/ +
Proof. The proof is just a matter of checking that the proofs of
7.2.5 and 7.2.6 carry over to the case at hand. Note that one step in
7.2.6 requires that p2 commute withy (i.e., y(x)оp2(A) = p2(A)°y(x)
for xeR4, A £S£/B)), but we have this from 7.3.5. ■
7.3.9 Definition. We define the metric k2 on S2=S%B) to be that
positive definite metric such that a1 = — \irx, a2= —jir2, and
аъ= — \1тъ form an o.n. basis (see 6.1.3). Note that [a',aJ] = 1,ejjkak.
We can check that k2 is then invariant under &b: S£/B)->GL(S%B)).
In fact, k2 is determined up to a constant multiple by this invariance
property.
7.3.10 Theorem. The current component S2/G A](S(M) о Р2,
S%B)) (defined generally in 1.1.13) for the pair (\p,w2), relative to the
Lagrangian L of 7.3.4 and metric k2 of 7.3.9, is given by
k=\
which is independent of co2.
102 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Proof. We obtain
as in the proof of 7.2.5. The rest follows from 5.1.2 and the definition
of §2J. ■
7.3.11 Remark. 5у_7Л.13, there is a unique J(xP) eA'(P2, g,)
such that tt2*JuD>) = §2J. In the case at hand, J(xp) is independent
of co2. // o2: U2 ->P, ° P2 is alocal section of тт2: P, °P2 ->P2, then
JU2D>)\U2=o2*tt2*JU2D>) = o2*§2J, which is independent of a* since
J(xp) is independent. However, because SUB) is non-Abelian, the
form JU2D>) does not pull back (via a local section of 7т2: P2^>M) to a
form independent of that gauge. The next theorem clarifies the situa-
situation .
7.3.12 Theorem. Let au: U^S(M)°P2 be a local section. For
^GC(S(M)oP2,C4©C4), let ^и=о*ф (i.e., ^H(x) = ^(aH(x))). Note
that Ao7r'oau: U^>F0(M) is a frame field on U. Let qpu be the
corresponding dual coframe field on U. Note that ■7T2°ou: U^>P2 is a
local section. We then have (for
-! 2 л(у(
If av: V-*S(M)°P2 is another local section with GT2°ai;)(x)
(it2 °ou)(x)guv(x) for some guv: UHV^ G2=SUB), then
Proof. The expression for the pulled-back current follows from the
proof of 7.2.10. The transformation equation is a special case of
7.2.9. ■
7.3.13 Remark. Physicists sometimes call (it2 ° au)*/(v//) e
A'(M, S%B)) (after dividing by i~^J— 1 ) the isospin 4-current. It
depends on the choice of gauge au, but tranforms nicely under a change
7 INTERACTIONS
103
g (as in 7.3.12). Perhaps it is preferable to think of the isospin
rent as being the well-defined (gauge-independent) form J(xp) on
$ At this point, we could merely state that the Lagrange equation and
homogeneous field equation for the nucleon in a Yang-Mills potential
given (as a special case of 7.1.15) by yX(dxp+(e+co2)-xp) + imxp
JteO and5fl =J(\p). In the case where M is Minkowski space, we
twill proceed to check that these equations are equivalent to the corre-
Isponding equations in the Yang-Mills paper [1954].
7.3.14 Notation. For w2 G6(P2), we write
where У &A\P2,U) (or b<=A\P2,U3)). We define a form
A2(P2,№y) by the equation (bXb)(X,Y) = b(X)Xb(Y) where the
right-hand side is the cross product of vectors in R3.
7.3.15 Theorem. In the notation of 7.3.14, we have
Proof. By definition,
т, —ib-т]. Now
jk
Я = do>2 + \[u2, co2] = -/ db • т+ {[-ib •
jkm
= -'■2
jk
= -iBbXb)-r.
7.3.16 Notation. The equations in the Yang-Mills paper are defined
on Minkowski space M=R4. Since any PFB over U4 is trivial, we can
pull our equations down to M by means of a (global) section a:
, M-*S(M)°P2. Moreover, we can assume that Х°тт^°а is the usual
t coordinate vector frame field on U4. Using 6.2.6, we find that 0 = в —
104 GAUGE THEORY AND VARIATIONAL PRINCIPLES
^). We write o*(b) = (тт2о o)*(b) = b', and
7.3.17 Theorem. In the notation above, Lagrange's equation (when
pulled down to M via a) is
where ^'^=Ц'/Ъх^Ь'^Ь'^), and у^ = 2т)^у(е,).
Proof. Pull the equation of 7.3.8 down to M via a, noting that
a*@*) = O and £
7.3.18 Notation. We define the form f £ Л2( P2, U3) by fi = if • т.
By 7.3.15, f=-(db+2bXb). We write /' = (тг2°а)*( /), /;„ =
7.3.19 Theorem. /« the notation above, the inhomogeneous Yang-
Mills equation 8fl =/(ф) (when pulled down to M via о°тт2)
becomes
a,13
where
Proof. By 4.2.8 and 3.1.5, we have 8fi = *2(d*pf- r) + [-ib-
T,* (if'7)])- The result follows from the next two lemmas along with
7.3212. ■
7.3.20 Lemma. We have (*
Proof. See 0.2.19. ■
7.3.21 Lemma. We have *[*'*т,*/'-т](Эм) = 2/2а/3 r^(b'a X/^)'
Proof. Note that
7 INTERACTIONS 105
HIS,
*[А'-т,*/'-т]=2*(б'УА(*/'*))[т/,т*]
Jk
= 2i2^m*F'M(*/'*))T-,
and it remains to prove that *{b'JЛ ^)(Эм) = 2аДт)а^/;*. Now
This is the p,p2p3-component of the star of the 1-form with ju-
component ff^b'Jf^. Since **= 1 on 1-forms, the result follows. ■
7.3.22 Remark. The equations in 13Л1 and 7.3.19 differ slightly
from those in the Yang-Mills paper. One difference can be attributed to
a different choice of k2 on §2 =S%B). In the Yang-Mills paper, т1,
t2, t3 are orthonormal. Moreover, we have absorbed the constant e into
b. Finally, the gammajnatrices in the Yang-Mills paper are (~i) times
ours, and the metric h is expressed differently.
CHAPTER
8
Calculus on the Frame Bundle
In this chapter, the essentials of tensor analysis on a (pseudo-)
Riemannian manifold are developed. These tools will enable us to
formulate a principle of stationary Action that leads to the Einstein
field equation for a gravitational field under the influence of the field
strength of a gauge potential. The Action density involves a certain
tensor called the Riemann-Christoffel curvature tensor. In Section
8.2 we show how this tensor is essentially the curvature of the
Levi-Civita connection of some given metric. In Section 8.3 we find
formulas describing how the Riemann-Christoffel curvature tensor
changes under an infinitesimal change of metric. These formulas are
used in Chapter 9 to compute first variations of functionals defined
on the space of metrics on a given manifold (e.g., to derive the
Einstein field equation from an Action principle).
David D. Bleecker, Gauge Theory and Variational Principles ISBN 0-201-10096-7
Copyright © 1981 by Addison-Wesley Publishing Company, Inc., Advanced Book
Program/World Science Division. All rights reserved. No part of this publication
may be reproduced, stored in a retrieval system, or transmitted, in any form or by
any means, electronic, mechanical, photocopying, recording, or otherwise, without
the prior permission of the publisher.
106
? 8 CALCULUS ON THE FRAME BUNDLE 107
In Section 8.1 we show that tensor fields on a manifold can be
regarded as certain vector-valued functions on the frame bundle of
the manifold. One purpose for doing this is that physicists are used
to working with components of tensors, whereas "modern" differen-
differential geometers prefer to work with invariant notation (e.g., indepen-
independent of coordinate systems). On the frame bundle, components of
tensors have invariant meaning; hence there is some possibility of
pleasing everyone equally. Also, complicated computations involving
many contractions, index raisings, covariant differentiations, and so
on are much easier to carry out by using components of tensors. We
can take advantage of this greater facility, without sacrificing invari-
ance, by working with components on the frame bundle.
8.1 TENSOR FIELDS ON ЦМ)
Let M be an «-manifold with m: L(M)^> M the frame bundle of
1.1.8 with group GL(nM).
8.1.1 Definition. Let K" be the dual space of R", and for integers
p,q&*0, let Tp-q be the vector space of all multilinear functions f:
R"X -P■ XR"XR"X •?• XR"-*R. Define a representation
GL(n,U)^GL(Tp-q) (for A<=GL(n,U), f^Tp-q, e,,..., t^ER",
f, w?eR") by (A-f)(vb...,vp, wl,...,wq)=f(v]oA,...,vpoA,
A~]wl,...,A~]wq). Note that A (B ■ f) — (AB) ■ ffollows because of the
use of A~xWj instead of Awr The space of L-tensors of type (p, q) is
C(L(M),Tp-q) = {f: L(M)^Tp-q\f(uA)=A~x -f(u) for all
L(M) andA<=GL(n,[
8.1.2 Theorem. There is a natural one-to-one correspondence be-
between the space of L-tensors of type (p, q) and the space of tensor fields
oftype(p,q) on M (i.e., C(L(M), Tp-q)s$p-q(M)).
Proof. For f<=C(L(M),Tp-q) we define f<=6Sp'q(M) by
l(ll)(Xi,...,Xp,Y,,...,Yq)=f(u)(XioU,.._.,u-\Yq)) where X, E
Т„(и)М, y.Grw(u)M, and u^L(M). Now/is well defined, since
108 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Conversely, given f<=$p-q(M), we define f<=C(L(M),Tp-q) by
f(u)(vi,...,wq)=f7l(u)(v]°u-\...,u(wq)); note that it follows that
f{uA)—A~x-f(u). These correspondences are inverses. ■
8.1.3 Definition. Let еи...,е„ be the standard basis for U" with
dual basis ё\...,ё". For fE:Tp-q we define the components of f by
8.1.4 Theorem. Forf<=Tp-qandA<=GL(n,U),wehave
(A-f))'"i!=Ail\---A's(A-iy}--- (Л~хУ'Н\У"
^ J ' J\ Jq >\ lpX ' J\ V ' JqJ 7l ' ' 'J
where A'j=e'{Aej) are the matrix entries of A.
Proof. We have
8.1.5 Theorem. Let Ql(n,U)^Ql(Tp'q) be the homomorphism of
Lie algebras derived from GL(n,U)^GL(Tp-q). For В<=Ш(п,П) and
-q, the derived representation is given by
- BJ[ f'x '"'p.— ■ ■ ■
Proof. In 8.1.4, set A=e'B, and apply d/dt\r=0. ■
8.1.6 Definition. The components of f<=C(L(M),Tp'q) are the
functions fjx\\':Ip: L(M)^U given by fj[l.'.'.'jp(u)=f(u)j\'-'.'-'jpll- We write
f=(fj'r.'.'j'p). rfote that the components of j are defined without refer-
reference to any coordinate system. Observe that f/r.'.'j'p(u) are the compo-
I 8 CALCULUS ON THE FRAME BUNDLE 109
nents offv(u) (in the proofof"8.1.2) relative to the basis w(e,),..., u(en)
8.1.7 Theorem. Let f/'.'.'.'Jp- L(M)-*U be a collection of functions
!</,,..., jq<n. There is an L-tensor f^C(L(M),Tp-q) such that
f^(fh;;;jp) if and only if we have (for all и ^L(M) and A
the rule
Proof. For f<=C(L(M), Tp-q) we have fj?.'::}"(,uA) =/(мЛ)''' ;;>=
{A~^-f{u))l}\"'p, and then the rule follows from 8.1.4. The converse
is now clear. ■
8.1.8 Theorem. Let /=(/V;.^)eC(L(M),P'«). // we set an
upper index of fj'^.'.'J" equal to some lower index and sum over that
index, we obtain the components of a new L-tensor of type (p—l, q— 1),
called a contraction of f; there are pq different ways of contracting.
Proof. In the transformation rule of 8.1.7, note that if we contract
with respect to, say, z, andy',, then we hae (A~l)^AJJ = 8^ as a factor
on the right-hand side. Hence we obtain the required rule for the
collection of contracted components. ■
8.1.9 Theorem. Iff<=C(L(M),TP-i)andh<EC(L(M),Tr-s),then
there is an L-tensor fh £C(L(M),Tp + r-q + s) with components
fi''.'.'.''ph'p + i' V + '. The new L-tensor fh is called the tensor product off
and h.
Proof. Upon multiplying the transformation rules for / and h, we
obtain that for fh. ■
8.2 PSEUDO-RIEMANNIAN GEOMETRY
Here we describe the covariant differentiation process and
Riemann-Christoffel curvature tensor of pseudo-Riemannian geom-
geometry in terms of the frame bundle formalism of Section 8.1. We work
on L(M) instead of F(M) because (in the next section) we consider
110 GAUGE THEORY AND VARIATIONAL PRINCIPLES
what happens to the curvature tensor under a variation of the metric.
By using L(M), we need not cope with the problem of F(M)
changing with the metric.
8.2.1 Theorem. Let g be a metric on M and let g(EC(L(M), T0-2)
be the corresponding L-tensor given by 8.1.2. Let <p£A'(L(M),[R") be
the canonical \-form of 6.2.2. Then there is a unique connection §
on L{M) such that Deg=0 and £>"ф=0.
Proof. Let в be the Levi-Civita connection on F(M) (see 6.2.5). We
define в at points uGF(M)CL(M) by ви(А*)=А for A<=GL(n,U)
and ви(Х) = Ои(Х) for X<=TUF(M). This uniquely determines §u on
TUL(M), for uGF(M). AnypGL(M) is of the lormp = uA for some
u<=F(M) and A<=GL(n,U) (not unique). We define 0p on TpL(M),
for X<=TUL(M), by 9p(RAitLX) = &bA-^u(X). A simple argument
shows that §p is independent of the choice of и and A. Observe that
the horizontal subspaces of в at points u^F(M) are tangent to
F{M). Also, the components gtJ of g^C(L(M), Г02) are constant
on F(M) (i.e., gij = r\ij °n F(M)). Thus, Deg=dg"-0 on F(M).
Since z/g£A'(L(M), Г0-2), it follows thati/g = 0 on all of L{M).
Also note that D*qp = 0 implies that De<p = 0 on F(M), and hence
8.2.2 Remark. 77z/s proof clearly generalizes to give the fact that
any connection on a "sub-PFB" extends uniquely to a connection on
the ambient PFB. Also, there is no harm in calling в (as well as в) the
Levi-Civita connection associated to g.
8.2.3 Definition. For v GU" and u£L(M),the standard horizontal
vector in TUL(M) associated to v is the unique horizontal vector
vu &TUL(M) (relative to 9) such that <p(uu) = u (i.e., u~\'iTijivu) = v).
The assignment ut-*vu defines the standard horizontal vector field on
L(M) associated to v. This definition makes sense relative to any
connection on L(M), but we do not need this generality.
8.2.4 Theorem. For v^W andA£GL(n,W), we have RA^(v) =
8 CALCULUS ON THE FRAME BUNDLE 111
Proof. Note that RAJ[vu) is a horizontal vector in TuAL(M), and
8.2.5 Theorem. There is a natural isomorphism A'(L(M), Tp-q)ss
C{L{M),Tpq+x) (depending on в) which we denote by /н> / where
f{u){vx,...,vp,wx,...,wq+x) = fu{wq+x){vx,...,vp,wx,...,wq) ("w? + 1E
TUL(M), as in 8.2.3).
Proof- We check that f(uA) = A'l-f(u). Using 8.2.4, we have
f(uA){vx,...,wq + x) = fuA(wq + x)(vx,...,wq)
Since / vanishes on vertical vectors, the map /W / is one-to-one, and
this calculation reveals surjectivity. ■
8.2.6 Notation. Given f <= C{L{M), Tp'q), the form Def^
~ti{L(M),Tp<q) gives rise {via 8.2.5) to an L-tensor Def<=
C(L(M), Tp-q+{). The components of this new tensor are denoted by
Jj\
:L(M)
For those who know of covariant differentiation of tensor fields on M,
we mention that the tensor field on M corresponding to Def is the
covariant derivative of that corresponding to f. Since everyone has a
different definition of covariant differentiation, we leave it to you to
show that this is correct for your definition. Finally, if the operation
ft-*Def is repeated к times, we write
Jj\-Jq |У„+1 •••/, + !
for the components of the resulting L-tensor.
112 GAUGE THEORY AND VARIATIONAL PRINCIPLES
8.2.7 Theorem. For f<=C(L(M),Tp-q) and ek the standard hori-
horizontal vector field on L(M) corresponding to the vector ekE:R", we
have
which is the directional derivative of fj ['.'.'jp: L(M)->R along the
field ek.
Proof. We have
8.2.8 Raising and Lowering Indices. The matrix of components
gtj of the metric tensor gE:C(L(M),T0-2) can be inverted to give a
matrix of functions gIJ. Using 8.1.7, we can easily check that the g'J are
components of some tensor, say g~'GC(L(M), Г2'0). For /G
C(L(M),Tp'q), we can form the tensor product gf (or g~xf) as in
8.1.9, and then we can contract an index of g (or g~x) with an upper
(or lower) index of f as in 8.1.8. This process changes f to an L-tensor
of type (p — \,q+ 1) (or (p+ \,q— 1)). When performing such opera-
operations, it is a good idea to write the components of f in the form
f'x 'Oi •••>' t>ecause we WM use the following shorthand (where the
symbol ik means that ik is to be erased, etc.)
and
f'\---ip " =s»Jkfh-'p
8.2.9 Theorem. For L-tensors f and h we have De (fh) = De (f)h
+ fDe (h). In terms of components,
8 CALCULUS ON THE FRAME BUNDLE 113
Proof. This follows from 8.1.9 and 8.2.7, using the product rule for
differentiation. ■
8.2.10 Note. The symbols fr-i...im...j4{ii and р-^.^...^
seem ambiguous because it is impossible to tell whether the raising or
lowering of indices occurred before or after the covariant differentiation.
Fortunately, it does not matter, since glJ\ll = 0 and g'J^ = 0, as a
consequence of Deg=Q {see 8.2.1); that is, the gtJ and g'j behave like
constants under covariant differentiation.
8.2.11A Theorem. Let /н> jf be the isomorphism of 8.2.5, and suppose
that Se-. A\L(M),Tp'q)^C(L(M),Tp'q) is the covariant codifferen-
fia/o/4.2.8. Then for /G A\L(M),Tp'q) we have
(«•>)r-i=-/''-"v ■■>>
Proof. Let ф=(ф',...,фл)еЛ1(ЦМ),[йл) be the canonical 1-form.
For the standard horizontal fields e,,..., e~n we have ф'(ё7) = Ц- Thus,
the iormf<=A\L(M), Tp-q) can be written as/ = 2/(e,)<p,. Recall
that g is the metric on M, and note that (for u^L(M))
(^Ш^,, ej) = g(nji, v^j)=g(u(et), u{ej))=g(u)(el, ey)= gu(u).
Thus, we have */ = 2g'V(e,)|g|1/2ei,-i...1.n_^'>® • • • ®ф''»->. Note
that Z)V = 0- Also< since ёт[8*А= gij\m'=°"™e have em[g;7t] = 0 and
] = o. It follows that
We see that r>'(*/) = *Bg'*ej/(e,-)])- Thus, 8e/=-(-l)**Z)V=
-Bg'^J/(e,)]) EC(L(M), Г'-»), since *2 = (-\y on 0-forms. The
result now follows. ■
8.2.12 Corollary. For s£C(L(M),T0]) with compact projected
support, we have
>
m l
Proof. Since roo=IR with GL(/!,R)-> GL(T°'°) being trivial (i.e.,
ix = i VxGR), we have я1',, GC(L(M), Гао) is constant on the
114 GAUGE THEORY AND VARIATIONAL PRINCIPLES
fibers and hence JMs',,/л makes sense. The obvious metric on Г°-° is
g(x^y) = xy. Applying 8.2.11 and 4.2.9 and writing s=f for some
/GA'(L(M), Г00), we obtain
м
8.2.13 Definition. Let fl = De§£ A2(L(M),GL(n,U) be the curva-
curvature of в, and let e,,..., en be the usual basis ofU" with el,...,en being
standard horizontal vector fields as in 8.2.3. Note that we have a
function Щёрек): L(M)-+ §Цп,П)- Define functions Rhljk: L(M)^U
by the relation u(ej,ek){ei) — ldhRhiJkeh or equivalently RhiJk —
eh(u(eJ,ek)(ej)) where ёх,...,ёп is dual to ex,...,en. The functions
Rhijk are the components of an L-tensor (see 8.2.14) called the
Riemann-Christoffel curvature tensor.
8.2.14 Theorem. The Rhijk are components of an L-tensor in
C(L(M),TU3).
Proof. We use 8.1.7. For u<=L(M) and A eGL(n,R), Rhljk(uA) =
h
eh(Q(uA)(en ek)(el))=eh{(A-] AJj AUh
Now Aei = Ai'iei,,ehoA-x=(A-xfh.eh', and (by 8.2.4) RA-i
AJ'jer. Thus, we see that Rhljk(uA) = (A-])hh,Aii'Aj'AkkRh'I,J,k,(u).
8.2.15 Theorem. For the standard horizontal vector fields e,,..., en
(relative to the Levi-Civita connection в), the Lie brackets [ej,ek]
satisfy -eh(9([ejrek])(eI)) = Rhljk and [e,,ej"=0.
Proof. We have Щёр ёк) = йЩ, ёк) = ё^в(ёк)} - ёк[Щ)} -
в([еу, ек])= — e([ej,ek]). The first equation then follows from 8.2.13.
Now 0=&в(ёрёк)=О°ф(ё^ёк) = ё;.[ф(ёк)]-ёк[ф(е^]-ф([ё;;ёк])
= — ф[(еу, ek}) since ф(еу) and ф(ек) are constant. Thus, [ву,ек]н = 0.
8-2.16 Note. Although we will have no use for connections со (on
L(M)) other than Levi-Civita connections, Definition 8.2.13 still
makes sense for со and 8.2.14 is still true. From the proof of 8.2.15 we
8 CALCULUS ON THE FRAME BUNDLE 115
see that in general в"(еу, ёк)~ — ф([еу, ёк}). So the torsion form
measures [ё^,ёк]н, while the curvature form measures [ej,ek]v.
8.2.17 Theorem (First Bianchi Identity). For an arbitrary con-
connection со on L(M) with torsion form ®u=Dw<p, we have DU@U-2W
Лф. In the case in which в" = 0 (e.g., co = #), this identity is equivalent
Proof. Since Du@u=DuDu<p, the equation Ои&и=пи/\ф is a spe-
special case of 5.2.5. If в"=0, then й"Лф=0 implies that 0 = (fi"A
ргк ^ 1^}
Now ф(е,) = е,. So applying eh to both sides and using Rhijk=
eh(ttu(ej, ek)(et)) gives the result. ■
8.2.18 Theorem (Second Bianchi Identity). In the case where
в" = 0 (e.g., co=0), the identity DT = 0 is equivalent to Rhljkl,n +
h h0
Proof. Note that Dwuu = 0 as a special case of the general Bianchi
identity of 2.2.8. By the invariant definition of exterior derivative and
the fact [ё,.,ё/.]// = 0 of 8.2.15, we have (DaQa)(em,ej,ek) =
em[Qa(ej,ek)] + ej[Qa(ek,em)] + ek[Qa(em,ej)]: which at each point
u^L(M) is an element of GL(n,U). Letting it act on en and letting
the result be acted upon by eh, we obtain the result. ■
8.2.19 Theorem. Let Rhijk be the components of the curvature
tensor associated to the Levi-Civita connection of some Riemannian
metric g. Recall that Rhijk —ghmRmijk are the components of an
L-tensor in C(L(M), T0A). We have the identities:
A) RhiJk = -Rhik/, B) Rhljk+Rhkll+Rhjki = 0;
C) Rhuk = -RlhJk; _ D) Rhljk=RJkhl\
E) Rhijk\m+ И-ИШАк^ Rh,km\j -Q-
Proof. Identity A) follows from П(ёу, ёк)= ~Щёк, ёу), while B)
and E) follow from^.lAl and 8.2.18 (see also 8.2.10 for E)). For
C), note that 0= De"(D°g)= U-g by 5.2.5. Recalling (in 8.1.5) the
way Ш(п,М) acts on Г0-2, we then have 0 = (Щё ,ёк)-g)hi =
116 GAUGE THEORY AND VARIATIONAL PRINCIPLES
, ek)h'gh'i - fi(^' ek)khv = -Rihjk -Rhijk> whence we have C).
id h ri
For D),
consider the matrix
Rhtjk
Rkh,j
Rjkhi
Rijkh
R
R
R
R
hkij
kjhi
jikh
ihjk
R
R
R
R
hjki
kijh
jhik
ikhj
Note that the first column is obtained by cyclically permuting the
four indices, while the rows are obtained by cyclically permuting the
last three indices of the initial entries. Let p, be the sum of the entries
in the z'th row. Since p,=0 by B), we have 0 = p, +p2 — p3 — p4 =
2Rhijk -2Rjkhi, using only A) and C). Thus, D) holds. ■
8.2.20 Definition. The Ricci tensor is that tensor in C(L(M), Г0'2)
with components Rjk=Rjijk. The scalar curvature is the tensor in
C(L(M), T°'°) with component R=Rkk =gikRik. Note that Rik=Rki,
since (using 8.2.19 D)) we have Rik=RJljk=ghjRhijk=ghJRjkhi =
RJ'kjj—Rki- Using the identities of 8.2.19, we can verify that the Ricci
tensor and scalar curvature are the only tensors (up to sign) that can be
obtained by contracting the "full" curvature tensor (Rhijk) after possi-
possible index raising or lowering.
8.3 METRIC VARIATIONS
For the next chapter, we derive formulas in this section that give
the change of the Riemann-Christoffel curvature tensor under an
infinitesimal change of metric.
8.3.1 Notation. Let g,<=C(L(M),T0-2) be a (smooth) family of
metric tensors. Let 9t be the Levi-Civita connection (for gr) on L(M).
We set
= 0
a £, d
and 9—-J-
dt
= 0
8.3.2 Theorem. We have в' E A'(L(M), Ql(n, U)) a A'(L(M),
Ги). Hence 9' determines (via 8.2.5) a tensor в'EC(L(M), Г1'2),
which we denote by 9' for the sake of simplicity.
8 CALCULUS ON THE FRAME BUNDLE 117
Proof. We omit the full proof that 0' exists. Let it suffice to say
that 0, is uniquely determined by the equations dф + 0rAф = O and
dg, + $ ^g, = 0; thus, since g, depends smoothly on ?, it is reasonable
to expect that 0, does, too. The isomorphism in 8.3.2 follows, once we
prove that the representations
@,b:GL(n,R)-+GL(§l(n,RJ) and GL(n,U)^GL(TXA)
are equivalent. Define an isomorphism $£(h,IR)->7'u by B^>B
where B(v,w)~v(B(w)). For A£GL(n,U), we have Г ~
which proves that B->B is an equivalence of representations. The
rest is clear. ■
8.3.3 Theorem. // 0 is the Levi-Civita connection of g=g0 (i.e.,
? = 0), then we have (in the notation of 8.3.1) Deg'=~0'Ag and
0'Лф О
Proof. Since 0, is Levi-Civita for gn we have Q = dgl+0,/\gl. Dif-
Differentiating, at t = 0, we obtain O = dg' + 0'Ag+0Ag' or Deg'=~0'
Ag. Similarly, the equation dy + 0t Лф = 0 yields 0'Аф = О. ■
8.3.4 Theorem. Let 0'kl} be the components of 0'EC(L(M), 7"-2)
fl^ ш 8.3.2; /e? g=g0; fl«cf /e? g' be as in 8.3.1. 77ге« we /zaue
/. We write the equation Deg'=—0'Ag of 8.3.3 in terms of
components by using the formula for the action of Ql(nM) on Тол.
The result is equation A); B) and C) are obtained by cyclic permuta-
permutation of m, i,j.
8>ns, 0)
(з)
118 GAUGE THEORY AND VARIATIONAL PRINCIPLES
The equation 9'Лф=0 of 8.3.3 yields
or 9"mk = 9"km. Thus, subtracting C) from the sum of A) and B)
yields
g' -f«' —а' = 79'S V
&mi\j &jm\i &ij\m ij&ms'
Multiplying by gmk and summing over m yields the result. ■
8.3.5 Caution. The symbols 0'*. are not Christoffel symbols. How-
However, they can be related {via a local section of L(M) induced by a
local coordinate system on M) to the t-derivative of the Christoffel
symbols.
8.3.6 Theorem. Let Rh,jk(t) be the components of the curvature
tensor ofgt, and set Rhjjk' = Rhijk'@). We have
Proof. Let ej(t) be the standard horizontal vector field relative
to в,. Then Rhijk(t) = eh(tt(t)(eJ(t),ek(t))(ei)), where Й(Г)= fif. For
ej = ej@), we have q>(ej(t) — eJ-) = ej—ej=Q (i.e., eJ(t) — eJ is vertical).
Since fi(f) vanishes on vertical vectors, RhiJ.k(t) = eh(Q(t)(eJ,ek)(ei)),
and so RhIjk' = ё"(й'@)(ё^ё-к)(е()). Now Q(t) = d6t+\[§tJt], whence
fi'(O) = dO' + [9,9'] = D59'. Thus, ti'(O)(ej,ek) = (d6')(ej,ek)
= ё\9\ёк)]-ёк[9'(ё^], since [ёу, eJH =0 by 8.2.15. From 8.2.7 and
8.3.2, it then follows that R^'^ ёи(Щ0)(ё^ёк)(е,))=9^ки-9%lk.
By 8.3.4, this last quantity is
28 r\8'rk\ijJt~g',r\kj~8'ik\rj~8'rj\ik~8',r\jkJr8ij\rk) •
By the next theorem, g'ir]kj-8',r\jk= ~RS,jk8'Sr-Kjk8',s, and the
result follows. ■
8 CALCULUS ON THE FRAME BUNDLE 119
8.3.7 Theorem. Using the notation of 8.2.6, we have (for /G
C{L{M),TP-q)) the "Ricci identity"
if'p —fif-'p —f'\--'pj?s + . . . +/■'! ■■ ■ <p Ds
J\--Jq\km Jj\---Jq\mk Jsjf-jq )\km Jj\--Jq-\S J
km
Jjr-Jq skm Jjr-Jq ^ skm-
Proofs By 5.2.5, we^have D\Def) = пвЛ/. Now De\D'f )(ёк,ёт) =
ё^вУ(ё„)]-ёт[^вУ(ё^] = ёк[ёт[П]-ёт[ё^Г]]. Thus, the com-
components of De(Def)(EA2(L(M),TP-ci), regarded as an element of
C(L(M), Tp-«+1) (iterate 8.2.5), are /^//^-/Д.:^. Using 8.1.5
with В = пв(ёк,ёт) and Bhi=Qe{ek,em)hi = Rhikm, we see that the
components of tie A/(regarded as in C(L{M),TP-q+2)) are given by
the right-hand side of the desired equation. ■
8.3.8 Theorem. Let Rhijk(t)= g'm(t)Rhmjk(t), and set Rh'jk' =
Rhijk@). Then we have
where the indices on the right-hand side have been raised using the
initial metric g = g0.
Proof. Note that gij{t)gJS{t)= 8/, and so gIJ'@)gJS@) + g'7@)gy/@)
= 0, whence g""'@)= - g%sgsm. Hence, Rh'jk'= g'm'@)Rhmjk +
g"nRh,nJkW=-g%sgsmRhmjk+g"nRhmjk'- The first term is
— RhsJkg"s, while the second is computed by using 8.3.6. Note that
there is a term within the second term that would cancel with the
first term were it not for the factor of j in 8.3.6. ■
CHAPTER
Unification of Gauge Fields and Gravitation
The Einstein field equation for empty space-time (Л, — jRgt ■• = 0)
arises from setting the first variation (with respect to the space-time
metric) of the integral of the scalar curvature equal to zero. In
Section 9.1, a coordinate-free method is given for computing first
variations of integrals of scalars formed from the curvature tensor by
means of tensor product, index raising or lowering, and contraction.
This method not only yields the Einstein field equation from a
variational principle, but also gives a nice proof of the Gauss-Bonnet
theorem (see 9.1.10).
In Section 9.2 we investigate a consequence of the fact that
integrals of scalars, as above, are invariant under a change in metric
via pull-back (see 9.2.1) by a diffeomorphism. We already found in
Chapter 5 that the current of a particle field is covariantly conserved
David D. Bleecker, Gauge Theory and Variational Principles ISBN 0-201-10096-7
Copyright © 1981 by Addison-Wesley Publishing Company, Inc., Advanced Book
Program/World Science Division. All rights reserved. No part of this publication
may be reproduced, stored in a retrieval system, or transmitted, in any form or by
any means, electronic, mechanical, photocopying, recording, or otherwise, without
the prior permission of the publisher.
120
9 UNIFICATION OF GAUGE FIELDS AND GRAVITATION 121
because of the gauge invariance (or internal symmetry) of the Action
density. In the case of invariance of metric-dependent integrals under
pull-back by diffeomorphism (i.e., external symmetry), we arrive at
another conservation law. A special case of such a law is the
conservation of the energy-momentum tensor in general relativity. A
more explicit description of the basic idea is given in 9.2.1.
The main goal of this chapter is the natural unification of gauge
fields and gravitation. The first model of this type goes back to the
five-dimensional model of Kaluza [1921] and Klein [1926] for the
unification of gravity and electromagnetism. Their model extends in
a reasonably straightforward way to the case of gauge potentials on
PFBs with arbitrary (e.g., possibly non Abelian) groups. This exten-
extension is carried out in great detail in Section 9.3, but the basic idea is
outlined in the next paragraph.
Let t: P->M be a PFB with group G over a space-time M with
metric g. If со is a connection 1-form (i.e., gauge potential) on P and
к is some fixed (Jb-invariant metric on §, then we can construct a
metric h (depending on g, k, and со) on P. It happens that for all A in
G, R/. P->P is an isometry of (P, h). As a consequence, the scalar
curvature R: P->R of h is constant on the fibers, and hence yields a
function on the base M. Since this function depends on both g and
со, we denote it by R(g, со): M->U. If U is an open subset of M with
compact closure, let Iv(g, со) denote the integral of R(g, со) over U.
We will prove that if (for all U), Iyig, со) is stationary for variations
(with support in U) of g, then the Einstein field equation holds for g,
where the energy-momentum tensor (i.e., source of the gravitional
field) depends on the field strength fi". Moreover, /^(g, со) is sta-
stationary (for all U) for variations of со (with projected support in U)
if со satisfies the Yang-Mills equation. Thus, the Einstein field
equation and the Yang-Mills equation arise simultaneously from a
single variational principle that derives from the scalar curvature of
the metric h on P. In Section 10.1 it is shown that the geodesies of
(P,h) project down to generally nongeodesic paths of charged
particles on M, where the charge is essentially the vertical component
of a tangent vector of the geodesic in P. In view of this, we can
hardly deny the physical significance of the geometry of (P, h).
In spite of the foregoing, these Kaluza-Klein type models are not
regarded as solutions to the unified field theory problem. One reason
is that these models do not predict the values of certain universal
122 GAUGE THEORY AND VARIATIONAL PRINCIPLES
constants, such as the ratio of the strength of the gravitational to the
electrical force between the proton and electron. This ratio is around
jq-40 -р^е pOSSibility that such a small constant can ever be pre-
predicted on purely theoretical grounds seems remote. It is perhaps
more likely that such constants are just arbitrary parameters that
were frozen by some spontaneous symmetry-breaking mechanism at
some initial stage of the big bang. Since the relative abundances of
atomic elements and their chemical properties are sensitive to the
values of these constants, it is conceivable that their values might be
characterized as those at which life intelligent or curious enough to
contemplate them could evolve. Thus, we might conjecture that a
unified field theory of reasonable proportions does not exist.
9.1 GRADIENTS OF METRIC-DEPENDENT FUNCTIONALS
Here we formulate a calculus to compute gradients of functionals
defined on the space of all metrics on a manifold. The results we
obtain are of independent interest, and they exceed that which is
necessary to obtain the Einstein field equations.
9.1.1 Notation. We let M be an oriented n-manifold, and let Ш
denote the space of all (nondegenerate) metrics on M. The notation
Ud CM means that U is an open subset of M with compact closure.
For UC CM, we denote the space of all restrictions to U of metrics in
9li by 'Dli17. The space of all symmetric tensors on M is denoted by
S2(M), while the space of restrictions to U of such tensors is denoted by
S2(U).
9.1.2 Heuristic Motivation. For t/c CM, ge<9lu, ands<ES2(U)
we have g + ts€zc$\Lu for sufficiently small t. Thus, it is convenient to
think of S2( U) as the "tangent space of the manifold <Жи at g," and we
sometimes write S2(U) as T3\lu. Note that for M noncompact with
gtE^and s<ES2(M), it is possible that g + ?s£9H for all t¥=0. This is
one reason for introducing the spaces ty\lu.
9.1.3 Definition. For r, s G S 2( M) and gG9lL we let r-LJ, sip and g,7
be the components of r, s, and g considered as L-tensors in
C(L(M),T0-2) (see 8.1.2). The scalar gh'gJkrhJsik&C(L(M),T0-0)
induces a function on M that we denote by (r, s) : M->R. Let ц be the
9 UNIFICATION OF GAUGE FIELDS AND GRAVITATION 123
volume element on M determined by g and the orientation of M. For
f/CCM, we write {r, s)^= jи(г, s)gn . In the notation of 9.1.2, we
can regard ( , )L% as a "metric" on Tp\LL', and hence the assignment
gt-*( , )и„ defines a "pseudo-Riemannian metric on вЖи."
9.1.4 Definition. Let t: 9И^С°°(М) assign to each gG9H a scalar
formed from g and its curvature tensor by means of tensor product,
index raising or lowering, contraction, and/or covariant differentiation.
For example, £(g) might be the scalar curvature of g, or perhaps
something more exotic (e.g., Rh'jkRjk,in\hpR'"n)- For such £ we define
Lu: GJliu^U by Lu(g) = ]u£(g)txg. The gradient of Lu at gG9H is
that VgLu &Т^и = 82(и) such that for all s<ES2(U) with compact
support in U, we have (at ?=0)
We say that gG9H is stationary for L if VgLu = 0 for all L/C CM.
From the examples we will consider, it will be clear that in general
VgLu always exists and is unique. Moreover, the V%LVpiece together
to form a well-defined tensor vgLGS2(M).
9.1.5 Theorem. //ju.g is the volume element on M determined by g,
then for any s&S2(M) we have (on any f/C CM)
Proof. The volume element jug corresponds to an L-tensor
in C(L(M),T°-n) with components e,v..Jg|l/2 where \g\ is
the determinant of the matrix g,. of components of g^C(L(M),
T0-2) (i.e., Ы = 2(/^1у1--^я,Л1...л). Now ixg + ts has components
and (at ? = 0)
124 GAUGE THEORY AND VARIATIONAL PRINCIPLES
We have
Jt\g+tS\= 2 &\n'"Skh-"&nJ*h-J.
= 2
,• Е-
Thus, we have
as desired. ■
9.1.6 Definition. Let t: 9IL-^ C^M) be as in 9.1.4.
s&S2(M) with compact support, we define £4(j)eC°°(M)
For f/CCM, the partial gradient of Lu (see 9.1.4) a? ge^ll/7 м
(unique) dLu <ES2(U) such that
\U
/or all s£S2(M) with compact support in U. In our examples, we will
see that dgLu exists and that there is a dgLES2(M) such that
(dgL)\U=dgLu for all f/C CM. The next theorem reduces the compu-
computation of gradients to that of partial gradients.
9.1.7 Theorem. In the notation c/9.1.4 and 9.1.6, we have VgLu =
dgLu+\t(g)g.
9 UNIFICATION OF GAUGE FIELDS AND GRAVITATION 125
of. For s£.S2(U) with compact support in C/C CM we have (at
4=0)
g + 's
Since s is arbitrary and ( , ) is nondegenerate, we have v Lu =
3 Lu+^t(g)g by a standard argument (as in the proof of 4.3.6.).
■
9.1.8 Remark. Ift(g) is a scalar (as in 9.1.4) that does not involve
the covariant derivatives of the curvature tensor, then the formula of
8.3.8 reduces the computation of dgL to a mechanical procedure, which
we illustrate in Examples 9.1.10 and 9.1.11. // £(g) involves covariant
derivatives of the curvature tensor, then you can find formulas for
Rh'ijk\m', and so on (as in 8.3.6) in order to compute 3^L.
9.1.9 Notation. For /<=C(L(M), Г'-0 or T0J), the scalar /',, (com-
(computed relative to the Levi-Civita connection of some gG9H) is called
the divergence of f. Given UC CM and scalars a,fi£C(L(M), T°-°)
= CX(M), we write a— /? if a~p—f\, for some f with support in U.
Note that a —ft implies that /(yajug=/(y/8jug by 8.2.12. As an example,
note that for s&C(L(M),T0-1) and t&C(L(M),T20) (with s or t
having support in U), we have f4s,-, — —t'JySj, since (t''s:)y = t'JySi +
t'J'si{j by 8.2.9.
9.1.10 Example. Let £(g) be the scalar curvature o/ge9H (i.e.,
£(g)=Rhih,(g) = R(g))- We derive (VgLH- -Rlf(g)+ ^R(g)gij,
or equivalently, C L), = —Л, (g) = components of the Ricci tensor of
g. For U(Z CM andf£S2(M) with support in U, we have (using 8.3.8
126 GAUGE THEORY AND VARIATIONAL PRINCIPLES
with g'=f)g {lh
f'hlh-f\',)-l(-Rh\,fs-Rslh,fhs), since /% = (/*,.,% and so on
are divergences. Using 8.2.20 and 8.2.19 A), C), we obtain
Д,7/^ = (-(Д,7),Д, whence
s, 3gL(y= -(Д,7) or{\LL=-R,j. Note that g<E9H/s stationary
for t(see 9.1.4) *// «,/g)- i«(g)g,7 = 0. W^e« M w аи "empty"
space-time, this is the Einstein field equation. Also, it is worth mention-
mentioning that when M is a 2-manifold, this equation is automatically
satisfied. Indeed, R,j = RhihJ=RhkhJgik, and (for dim M = 2) Rhkh; = 0
unless k=j¥=h. In this case, Rhjhj = Rl2l2 = R2\x = h{Rn\i +
R2l2\)=2R- Thus> R,j=2Rg,j holds for any ge9H. It follows that if
M is a compact Riemannian 2-manifold, then JMRfig is independent of
the metric g. This is a qualitative statement of the Gauss-Bonnet
theorem.
9.1.11 Example. Here we take t(g) = RhiJkRJ'khl(g), and derive
(VgL)ah=-2RhhjkR^ha + 4R'J{ji + ^(g)gah. The computation is
more representative of the general case than that of 9.1.10. Let fE.
S2(M) have support in UC CM. We have
The covariant derivatives on f may be "transferred" to Rjkhl modulo
divergence {e.g., Г\^'\^- V\V
, we obtain ^
id j /_ d/ h i n к h _ n к i\— fab( — I R*l -, RJ,k +
bui\j bha\j b hci\k b ш\к ' J V ^7" "^
using 8.2.19 ш the final step. As in the previous example,
;sion within the parentheses is dgL, and the result follows.
9 UNIFICATION OF GAUGE FIELDS AND GRAVITATION 127
9.2 CONSERVATION LAWS FROM EXTERNAL SYMMETRY
9.2.1 Notation and Motivation. We denote the group of all diffeo-
morphisms of M by ty(M). If f^p-q(M) and X&%M), then we
define the pull-back of f by X, X*f^p-q(M), by (X*f)x(Xl,...,
Xp,Yl,...,Yq)=fMx)(Xlo\-\...,\^Yq) for Xx,...,Xp^fxM and
Yx,...,Yq<ETxM. Observe that X*: $°-2(M)^$°'2(M) leaves the set
911 of metrics on M invariant (i.e., X*: 911^911). In this way, ty(M)
acts on 9H. // gG^lt and X*g=g, then X is called an isometry of
(M,g). In this section, we will prove that for £: 91L-» C°°( M) any
scalar as in 9.1.4, we have
f
J
MU)
where AG ^(M) and U С С М. Hence we may say that the variational
problems of 9.1 possess the "external" symmetry group <Ф(М). In
contrast, we considered variational problems in earlier chapters that
possess an "internal'"' symmetry group (i.e., the group of gauge trans-
transformations). There we found that gauge invariance led to conservation
of charge. Here we will prove that external invariance implies the
conservation law (vgL)kJy = 0 where vg^£ S2(M) was defined in
9.1.4, the index was raised using g, and the covariant derivative is that
of the Levi-Civita connection of g. In general, for /GS2(M), the
I- form with components fL, is called the divergence of f relative to the
metric g.
In the special case where £(g) is the scalar curvature of g, we
obtain the well-known identity (RkJ — 2^Sk) \j ~0 (Ле Einstein tensor
Gjj=Rij — ^Rgij has divergence zero). Actually, GkJ{j=0 can be
derived directly from 8.2.19, but it is more satisfying to obtain it
from external symmetry. The Einstein field equation is G,7 = Erj
where (£,-.-) is the so-called energy-momentum tensor due to non-
gravitational sources. Consequently, EkJy-Gjy = 0, meaning that
energy-momentum is conserved.
9.2.2 Definition. For any AG6D(M) there is an induced diffeomor-
phism X: L(M)^L(M) defined by X(u) = X^(u)o u: R"->7\()r(H))M.
128 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Observe that for u<EL(M) and A £GL(R") we have X(uA) = X(u)A
and тт(Х(и)) = Х('гт(и)).
9.2.3 Theorem. Let f<E<5p-q(M) correspond to /(EC(L(M), Tp-«)
as in 8.1.2. Then X*fcorresponds to /° Л (i.e., /°Л = Л*/).
Proof. For «^....UpGR" and w w?eR", we have
(foX)(u)(vu...,Wq)=f(X^U)(vl,...,Wq)
as required (see proof of 8.1.2). ■
9.2.4 Theorem. Let <peA'(L(M),R") fee ?/ге canonical \-form
(ф(Хи) = и-\^(Хи))). For Ле^(М) a«rf Л: L(M)^L(M) as in
9.2.2, we have A*qp = qp.
Proo/. For le TUP we have
9.2.5 Theorem. // 9(g)<EA\L(M), §t(n,U)) is the Levi-Civita
connection for ge9It, then \*F(g)) is that for X*g (i.e., 6(X*g) =
Proof. For BGgP(«,R) and B* the fundamental vertical field on
L(M), we have (at t=0)
Thus, X*(e(g))(B*) = e(g)(B*) = B. Moreover, for A&GL(n,U),
ЩХ*9(8)) = Х*(К*9(8)) = Х*(&ЪА->в(8)) = &ЪА-1(Х*в(8)). Thus,
X*6(g) is a connection on L(M). Regarding gG9H as being in
C(L(M), T0-2), we have dg+0(g)Ag = O. Applying Л* to this equa-
9 UNIFICATION OF GAUGE FIELDS AND GRAVITATION 129
tion yields d(g°\)+\*e(g)/\(go\) = 0. Applying A* to the equa-
equation d<f> + ()(g)/\<f> = 0 and using 9.2.4 gives d<p+ \*(e(g))A<p=0.
In view of 9.2.3, it follows that A*@(g)) is the Levi-Cevita connec-
connection of A*gG 9lt. ■
9.2.6 Theorem. // fi(g)(EA2(L(M), Ш(п,Щ is the curvature of
9(g), then A*(fi(g)) is that of 9(\*g) (i.e., fi(A*g) = A*fi(g)).
Proof. We have Sl(g) = d$(g) + ±[$(g)J(g)\. Applying A* to this
equation (and noting that A* commutes with d and [ , ]), we obtain
A*fi(g) = fi(A*g) by 9.2.5. ■
9.2.7 Theorem. Let el(g),...,en(g) denote the standard horizontal
fields (relative to 0(g)) associated to the usual basis e,,..., en of R"
(see 8.2.3). Then, ^\(
Proof. Certainly A~' =(Л"'). Thus, by 9.2.4, we have
= <J>(e,(g))=e,.
Moreover,
and the result follows. ■
9.2.8 Theorem. Let Rhi]k(g)( u) be the components of the curvature
tensor of^<=9H at u<EL(M). For\<E%M) and\: L(M)^L(M) as
in 9.2.2, we have Rh,jk(X*g)(u) = Rhljk(g)(\(u)).
Proof. Using 9.2.6 and 9.2.7, we have fi(A*g)u(e/A*g),^(A*g)) =
Letting both sides act on et (and then be acted upon by eh), we find
that the result follows from 8.2.13. ■
9.2.9 Theorem. For f<EC(L(M),Tpq), let fj[\'.':jqp\k(g) denote the
components of the covariant derivative of f with respect to 6(g). For
130 GAUGE THEORY AND VARIATIONAL PRINCIPLES
AG^D(M), we havefl\:;;r[k(g)(Mu)) = (fo\yr/:'flk(\*g)(u) for all
u<EL(M). Note that fo\<=C(L(M),TP'i) by 9.2.3,
Proof. By 8.2.7, we have
9.2.10 Theorem. With notation as in 9.2.9, we have
Proof. This result follows from 9.2.8 and repeated application of
9.2.9. ■
9.2.11 Theorem. If £(g)<ECco(M) = C(L(M),T0-0) is a scalar as
in 9.1.4, then (for \ВЩМ)) £(g)(A(u)) = e(A*g)(u) for all u<E
L(M).
Proof. Recall that £(g) is a contraction of a product of components
of the form g,7, g4 and Rhljk^... m{g) (possibly s = 0). By 9.2.3 and
9.2.10, these components satisfy g,7(A(u)) = (A*g),7(u), g'^(A(u)) =
(A*g№), and Rhi]k^...m{g){ku)) = Rhl]klmx...m{\*g){u). Thus,
9.2.12 Theorem. //AG6D(M) is orientation preserving and jugG
Л"(М) is the volume element of'gG 9H, then fxx*g = A*ju?.
Proof. Let X\,..., Xn€=TxM be an oriented o.n. basis relative to g.
Since ('K*g)(^\Xi,X-\XJ)= g(X,, Xj), we know that \~\XX,...,
\~\Xn is an oriented o.n. basis of Tx-\(x)M relative to A*g.
Thus, \ = Н^{\-\ХХ,...,\-\ХП)=\-'*{11Х*Я){ХХ,...,ХП). Thus,
9 UNIFICATION OF GAUGE FIELDS AND GRAVITATION 131
9.2.13 Theorem. 7/AG6D(M) and £(g) is a scalar as in 9.1.4, then
JMU) Ju g
for any U(Z CM.
Proof. By 0.2.16, 9.2.11, and 9.2.12, we have
\(U)
9.2.14 Definition. Let X be a compactly supported vector field on
M, and let Xt €Е<ф(М) be the one-parameter group of diffeomorphisms
generated by X. The Lie derivative of f<E<T\ p'q{M) is L'^
defined by
9.2.15 Theorem. Supposef^p'q{M) corresponds to f<EC(L(M),
Tp'q). For X and A, as in 9.2.14, we let X be the vector field on L(M)
generating the group ArG6D(L(M)) induced by \t as in 9.2.2. Then,
Lxf<E$p-q(M) corresponds to X[f](EC(L(M),Tp'q).
Proof. According to the proof of 8.1.2, we must check that
Vu---,»>q) = {Lxf)ir(u){vxou~\...,u{wq)). AW = 0, we have
X[f\u{v{,...,wq)=j-tf(\t(u)){vx,...,wq)
132 GAUGE THEORY AND VARIATIONAL PRINCIPLES
9.2.16 Theorem. Let g£9H and let X be a vector field on M.
Regardingg andLxg as in C(L(M), Тол) andXas in C(L(M), 7м-0),
we have (Lxg)i]=Xk^gkj + Xkyg:k where the covariant derivatives
come from the Levi-Civita connection of g.
Proof. From 9.2.15 we have (Lxg),7=/[g,7] = afg,7_(X)- К 9 is the
Levi-Civita connection of g, then dg(X) = — 9(X)Ag. Using 8.1.5,
we have {-9{X)f\g)ij = 9{X)klgkj + 9{X)kjglk. Thus, it remains to
prove that 9(X)k=Xklr Let X be the horizontal vector field on
L(M) such that тт^(Х)—Х, от equivalently X=XH. Suppose that
?н> ut is the integral curve of X through m=m0GL(M). Define
At£GL(n,U) by Xt(u) = ^t*°u=utoAf Then at r~° we have
=9u{X+A>@)*)=A>@).
Note that A= u~' ° ЛГИс ° и. Thus, for e
9u(X)(e,)=A'@)(e,)=jtu;<{\Ju(e,)))
at t=0. Let У be a vector field on M that agrees with the field
ЛГИс(м(е,)) at XtGr(u)) for all t in some small neighborhood of OK
At x = 7t(u) and ?=0,
Let У be the horizontal field on L(M) such that n^(Y)= Y, and note
that u~\\ti):(u(ei))) = <f>u(Yu). Thus, from above, we have
at r = 0. Now Xu[<p(Y)] = (dy)u(X,Y)+Yu[<p(X)]+<pu([X,Y]): but
the first term vanishes since De<p = 0, and фи([Х, Y])= u~\irJ_X, Y])
= u-\[X, Y]x) = 0. Thus, 9u(X)k, = Хы[фк(¥)]=¥и[фк(Х)] =
9.2.17 Theorem (Conservation Law). Let £(g) be a scalar with
"gradient" v»LeS2(W) as in 9.1.4. Considering vxL as in
9 UNIFICATION OF GAUGE FIELDS AND GRAVITATION 133
C(L(M),T0-2), we have (vgL)^ = O where the index was raised
using g, and the covariant derivative is that of the Levi-Civita
connection of g.
Proof. Let U С С M and let X be a vector field on M supported in
U. Let ArG6D(M) be the one-parameter group generated by X. To
within first order in t we have X*g — g + tLxg. Since X has support
in U, we have Xt(U) = U; and so by 9.2.13 we have
Lu(g)=f £(g)ixg = f £(
Ju Ju
Hence at t=0,
but the integrand is (vgL)'\Xkligkj +Xkljglk) = 2(vgLyJXkug,k
since vgL is symmetric. Moreover, (s7gL)'JXk,jgjk ——\s7gL)kJ^Xk
Thus
and it follows that on U, (vgL)kJy = 0, since X is sufficiently arbi-
arbitrary. Since UC CM is arbitrary, (vgL)^|y = 0 on M. ■
9.3 THE EINSTEIN-YANG-MILLS ACTION PRINCIPLE
Here we pursue the program outlined in the introduction to this
chapter. We easily concoct an Action density that leads to both the
Einstein equation and the Yang-Mills equation. With some persever-
perseverance, we then show that this density is the scalar curvature of a
certain metric on P, as in the introduction.
9.3.1 Definition. Let к be an B b-invariant metric on §, and let Qw
be the curvature of some connection со on P, as in the introduction to
this chapter. There is a tensor field on M of type @,4) denoted by
134 GAUGE THEORY AND VARIATIONAL PRINCIPLES
) and defined as follows. Let x€LM, and suppose that чт(р)~
x,peP. Let X,Y,Z,WeTxM and let X,Y,Z,W be the horizontal
vectors in TpP projecting to X, Y, Z, W. We set к(п,п) (X, Y, Z, W) ~
k{uw(X,Y),uw(Z,W)). Observe that к(п,п) is well defined (inde-
(independent of the choice of p), since (for A&G) k(9,"(RAij!X,
RA^Y),Q"{RA^Z,RJmW)) = k(&bA-lQa(X,Y),&bA-lQa(Z,W)) =
k(u"(X,Y),u"(Z,W)). We write the components of the associated
element of C(L(M), T0-4) as k(uhj,uJm), but note that uhj makes no
sense by itself. Finally, note that the self-Action of со
(see 5.2.1) is then given by - ighJg'mk(Uhi,UJm) or -
9.3.2 Definition. Let G(P) denote the space of connections on P,
and 91L the space of metrics on M. Suppose that R(g) is the scalar
curvature function for (M,g), and let c£R. We define a function
Л + S + c: G3\LX6(P)^CX(M) by (R + S + c) (g,u)=R(g) + §(g,u)
+ c. We say that the pair (g, со) is stationary relative to /?+§> +с if(at
t=0) we have
jf f R(g+
for all UC CM, alls<ES2(M) with support in U, and all т<=Л'(Р, S)
with projected support in U.
9.3.3 Theorem. The pair (g, со)<е91х6(Р) is stationary for Л + S
+ ciff
(A) RtJ - ±RgiJ - ±g.j = $ghmk{Qhi, QmJ) + ^(g, a)giJ (Einstein
field equation);
(B) 8"fi" = 0 (Yang-Mills equation).
Proof. By 9.1.10, we have (at t=0)
9 UNIFICATION OF GAUGE FIELDS AND GRAVITATION 135
Also,
A
Now
A
dt
from which it follows that
Piecing these results together, we see that (A) holds iff g is stationary
relative to /? + §> +с for a fixed со. Since /{,§>( w, g)ju is the only part
of the integral J(y^(g) + S(co, g) + cju depending on со, we see that
(B) holds iff со is stationary for the self-Action (see the proof of
5.2.3). ■
9.3.4 Definition. With notation as in the introduction to this chapter,
we define a metric on P (depending on g, со, and к) as follows. There is
a symmetric tensor field к со of type @,2) on P defined by (ko>)( X,Y) =
k(u(X),u(Y)) for X, Y(ETpP. Another such tensor field is ir*g {i.e.,
Ti*g(X,Y) = g('!Til.X,tn^Y)). Adding these yields a (nondegenerate)
metric h = n*g+kcc called the bundle metric associated to g, со, and k.
9.3.5 Theorem. The maps RA: P^P(A^G) are isometries of P
with the bundle metric h = TT*g+kco.
Proof. For X, Y(ETpP, we have
= h(X,Y). ■
9.3.6 Notation and Remarks. As a consequence of 9.2.11 and
9.3.5, the scalar curvature R of (P, h) is constant on the fibers of P.
136 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Thus R determines a well-defined function R(g,u) on M. The next
theorem states that R(g, u) — R(g) + S(g, io) + RG where RG is the
scalar curvature of the fibers of P with the metric induced by h. Note
that all of the fibers _of P are isometric to the group G with the
(bi-invariant) metric к defined as follows. For A&G and X, FG TaG,
we set_k(X,Y) = k(LA^X, LA^Y). Certainly LA is an isometry
of(G,k). Since RAl X = &bA(LAl X), we have k(X,Y) =
k{R~lX, RAl Y), using the &b-invariance of k. Thus, RA: G-> G is
an isometry of (G, k). For any p €E P, the map gn> pg is an isometry of
(G,k) onto the fiber тт~ \тт(р)). It is remarkable that (as a conse-
consequence of 9.3.7 and 9.3.3) both the Einstein equation and the Yang-
Mills equation arise from a single Action principle based on the
geometry of the bundle metric. The significance of the bundle metric is
enhanced in the next chapter, where we show that the geodesies of the
bundle metric project to paths of "charged test particles" on M.
9.3.7 Theorem. The scalar curvature of P with the bundle metric
h = "ir*g + кы projects to the function R(g) + S(g, w) + RG on M,
where RG is the (constant) scalar curvature of (G, k) described in 9.3.6.
Proof. The calculation of the curvature tensor (at some fixed p&P)
is simplified if we use 6.2.6 with a judicious choice of o.n. vector
fields in a neighborhood of p. We suppose that Ex,...,En are o.n.
vector fields on (M, g) defined on a neighborhood U of x = it(p) in
such a way that the local section a: U->F(M) determined by
E{,..., En is tangent to the horizontal subspace of Ta(x)F(M) relative
to the Levi-Civita connection 9(g). This way, we have e(g) = a*6(g)
= 0 at x. Let £,,..., En (defined on w~\U)CP) be the horizontal
lifts of Ev...,En relative to the connection to (i.e., to(£,) = (), ^^(E,)
= Ei). Let e,,...,ey be an o.n. basis of § relative to k. Set En+a=e*
for a=l,...,/; these are fundamental vertical fields on P. Then
Ev...,En, £„+,,..., En+f are o.n. vector fields (defined on ir~\U))
relative to the metric h. In the expressions to follow, we use the
Einstein summation convention. Moreover, 1=5/, j,k,... *Sn, and
l<a, ft,y,... *£/, and Ijia, b,c,... =£и+/аге the ranges on indices.
We write gIJ=g(El,EJ) = h(El,EJ) = ±8ij and кар=к(еа,ер) =
h(En+a,E,l+fi)=±8afi. Let <p\..., <ря+/ be the 1-forms dual_ to
Eu...,En+f. If fl is the curvature of со, we can write со = 5Йа,7(<р'Л
9 UNIFICATION OF GAUGE FIELDS AND GRAVITATION 137
qiJ)ea for some functions паи on w~ l(U). The indices in fi",- can be
raised or lowered by using gjJ or kafj (e.g., Й„'у =я""^аДЙдту).
Finally, we define structure constants саду by [e^, ey] = capyea. In the
following lemmas, we compute the components of the curvature
tensor of (P, h).
9.3.8 Lemma. The matrix 9{h) = {e(h)uh)\^a,b^n+f provided
by 6.2.6 relative to the above choice of o.n. fields £,,..., En+f (and
dual forms ф„...,Фя+/_) is given by 0(A)^*@(g)I,)r£fie'.^'I+<\
ri^M^-ia^', and e(h)n+all+y
Proof. Let ф^,...,ф^ be the 1-forms on M dual to £,,...,£„.
According to 6.2.6, we have dy1 M = — 0( g)'■ Л <pJM. Since ir*4>M' = ф',
we obtain dip' = — ir*(j(g)'j ЛфЛ Writing со = соаеа, we see that
-я + а = ы«_ ThuSj ^« + «ea = ufw = -^[w,w]+fi = (-^ca/Syw/JAw1'
+ fia)ea. Hence, </фя + в = -^с%уфя + " Лфя + т + ^Пв17ф''ЛфЛ For
■qah = h{_Ea,Eh) \^a,b^n+_f, recall that the matrix e(h)\_must
satisfy 0(/г)гт) + т,0(/О = О or ^(/i)"ar,,c+ -qj(h)hr = 9{h)ca +
= 0. Note that
at r = 0. Thus, it is natural to try 9(h)n+an+y = icafiyyn+fi. The other
formulas are then forced upon us by the requirement 0(h)ac =
anc^ 1Ье expressions above for dip' and dip"+a. ■
9.3.9 Lemma. 77ze components (relative to Eu...,En+f) of the
curvature tensor forthe metric h on P are given by 2^"л^ф'' Aipd =
e
Proof. Let а: тт~ \U)->L(P) be the local section determined by the
o.n. fields £,,..., En+f. Note that R"bcdat q^ir' \U) is пв(к)\{ес, ed)
evaluated at 5(q) (see 8.2.13). Since ЙЙ(Л) vanishes on vertical vec-
e ^ e
138 GAUGE THEORY AND VARIATIONAL PRINCIPLES
9.3.10 Lemma. We have the formulas
nn + a — L~a ~Y
^ n+p n + S n+v 4C y/3c Si/)
n+a — ± a qy _ I / Qa Q A: _ Пи
P—2c « i«« W
and
Л « + /3я+у; Л
+p. Now (yn+/iyfiy
<p, and в(А)я+вя+уАв(_А)я+1'„+/,= 1сввус^фвЛф',апAв(А)я+вА Л
0{h)kn+p= — iuaklupkj(p'A(pJ. Evaluating these forms on (En+S,
En+V) yields
— _ X(ra „y
ljra ry _ ra у \ — _\ а у
Л\с SyC vp C vyC Sp)~ 4C ypc 8
_ ra у \
since cySv = — cyvS and the second parenthetical expression is cay/icySl/
by the Jacobi identity 0.3.11. Evaluation on (Et,E) yields the
expression for Rn + an + PlJ, and clearly Rn + an + p,n + y =0. ■
9.3.11 Lemma. For Я™,_а = £а[П™..], we have
pn + a —I/O" О" ^
and
and
n + a — A
9 UNIFICATION OF GAUGE FIELDS AND GRAVITATION 13
Proof. We have d9{h)n+aj = ^ji,u^u /\^', and 9{h)n + an + y /
Л<р"+Р. Using 9.3.9 and evaluating on appropriate pairs (E ,
we obtain the desired result. ■
9.3.12 Lemma. Let R'jkm be the components of the curvature tensi
of(M, g) relative to the o.n. fields El,...,En, and let flay, a = Ea[uaf
At p&P, R'jkm —R'Jkm +{tiyjtiymk -д(ЙДЯат — йа'тпа к), аг
Proof. Here we use the fact that Ex,...,En were chosen s
that 0(g)'j = O at x = ir(p); and so в{к)'"= **0(gYj =0 at p. W
calculate at p, to obtain rf0(A)'y=7r*(rf0(g)';)-^na'y аф"Лф'
t Л фт -
'"'. Also alp
and 9(h)'n + a A9~(h)" + a/ =-^а'^ау„,фАЛф"'. The 'result follow
from 9.3.9. ■
9.3.13 Lemma. The Ricci tensor of (P, h) at p, relative to £,,..
En+f, is given by RJm=R]m- ^„^V,, R,,,+y = - 2^y'y,„ at
Proof. From 9.3.12, we obtain R'j:m = R]m + lQy'jWmi, since Ua',
g'Juaij =0 because пш] = - uaJ1. From 9.3.11, we have R" + ajn + a
= 2®am п + а + А^ау®7m + ffikm^J'j- Observe that the first tv
terms are antisymmetric inj and m while R"+aj n+a m = Rjm — R'j,m
symmetric in j and m (see 8.2.20), whence the first two terms mu
sum to zero. Thus, Л"+" „+а „, = \пактпакг and we haveRJm = R'jim
R"+aj n+am=R.,m~ 2^akj^akm- From 9.3.11, we have Л"+ау„+а „+y=
while 9.3.12 gives R) , n+y= ~^у),г Thus, Rj „+y= 4 %.,■ No
R"+an+y n+a „+, = -\caPycpay by 9.3.10. Also R'n+y t „+v
e' >R
n+v
jn+ у i n + v й -■>■ n + у j n+ v i ft -yo"
^ n+у n+a n+v' К n + y i n+v 4C jiyC av ' 4*"yA " J'
Now we complete the proof of 9.3.7. The scalar curvature <
(P,h) is (using 9.3.13) R"a =RJ]-+Rn+\+y =R-^akj'Qak
140 GAUGE THEORY AND VARIATIONAL PRINCIPLES
~\ку"саРус^ау. We know that S(g, a) = -\uakjuakj. It remains to
prove that Rc = -\ку"саРусрау. Note that the restrictions of
<ря+',...,<ря+/ to a fiber 7г~'Gг(/?)) are dual to the o.n. fields
En+ u..., En + ,on tt~Xit(p)). Thus, the computation of the curvature
tensor RGapyS of the fiber is much like that of 9.3.10, noting also that
the connection forms on the fiber are the restrictions of 0(h)n+an+p
to the fiber (see the proof of 9.3.8). Indeed, we have RGapy8 =
follows that Rc = k^R"+\+y n+
by 9.3.10.
Rn+an+p n+y n+s, and it follows that Rc = k^R"\+y n+a n+v
As a by-product, we have a geometric interpretation of the
Yang-Mills equation fiwnw=0.
9.3.14 Theorem. The equation 8МЙМ =0 holds iff the vertical and
horizontal subspaces at each qE:P are orthogonal relative to the Ricci
tensor of the bundle metric h = ir*g+ kcc.
Proof. The horizontal and vertical subspaces at the point p&P are
orthogonal relative to the Ricci tensor for h iff 0 = Rjn+y= -jfiy'. ,=
$Qyjj. We have П<|>=Па1-|1-2Ф1'|®Ф1'2ев. Recall that the fields^,,...,' En
were chosen so that Oig)^ =0 at n (p). Thus, d' = тг*(- %)^)Л^
= 0 at p. By a computation similar to that in 8.2.11, we then have
-Гйи = й";;,ф^а, which vanishes iff ^y/, vanishes for all у and j.
CHAPTER
10
Additional Topics
10.1 GEODESICS AND FORCES ON CLASSICAL PARTICLES
Here we define the acceleration of a curve in a manifold with a
metric; geodesies are then curves of zero acceleration. Let n: P^>M
be a PFB with group G. For a metric g on M, a connection со on P,
and an (Jb-invariant metric к on §, we can form the bundle metric
h = 7r*g+ku on P (see 9.3.4). We will prove that the geodesies on
(P, h) project to curves on M that can be nicely interpreted as the
paths of "charged" particles that are accelerated by the field strength
аи.
10.1.1 Definition. Let тт: P^>M be a PFB with connection со.
Suppose that y: [a,b]^>M is a curve, and let рЕ:тт ~'(y(a)). The
horizontal lift of у through p is that curve u: [a, b]^P such that
David D. Bleecker, Gauge Theory and Variational Principles ISBN 0-201-10096-7
Copyright © 1981 by Addison-Wesley Publishing Company, Inc., Advanced Book
Program/World Science Division. All rights reserved. No part of this publication
may be reproduced, stored in a retrieval system, or transmitted, in any form or by
any means, electronic, mechanical, photocopying, recording, or otherwise, without
the prior permission of the publisher.
141
142 GAUGE THEORY AND VARIATIONAL PRINCIPLES
7тои=у, u(a)=p, andu'(s) is horizontal [u(u'(s)) = O] for alls£[a, b].
The existence and uniqueness of horizontal lifts is fairly clear; a proof
may be found in Kobayashi and Nomizu [1963]. In the special case
where M has a metric g and P = F(M) with the Levi-Civita connection
of g, the horizontal lift и of у determines a set of o.n. vectors
u(s)(e,),..., u(s)(en)€ETy{s)M for each sG[a, b]. This basis is said to
be the parallel translate of w( a)(e,),..., u( a)(en) along у from у (a) to
y(s). In Euclidean space, this is just ordinary translation, but in
general, parallel translation is path dependent.
10.1.2 Definition. Let M have a metric g, and let u: [a, b]^
be the horizontal lift of y: [a, b]->M, as in 10.1.1 Suppose that
$и X(s)€= Ty{s)M is a (smooth) vector field along y. Let f: [a, ft]->[R"
be given by f(s) = u(sy\X(s)) (i.e., X(s) = 2f>(s)u(s)(ej). We de-
define a new vector field DX/ds along у by DX/ds=u(s)(f'(s)); DX/ds
is called the covariant derivative of X along y. Note that DX/ds is
independent of the horizontal lift u; if A &g = O(r, s), then RA°u is the
horizontal lift through u(a)A and
as needed. If X(s) = y'(s), then Dy'/ds is called the acceleration of y,
and у is a geodesic if Dy'/ds^O.
10.1.3 Theorem. Let Ev...,En be o.n. vector fields defined on
some open UdM. Suppose that (#'.) is the pull-back of the Levi-Civita
connection 9 on F(M) via the section a'. U->F(M) determined by
Ev...,En (i.e., a(x)(e,) = Ej(x)). Let y: [a, b]->UCM be a curve,
and define functions y": [a, b]^>R by y'(s) = '2y"(s)Ej. For a vector
field $и X(s)€E Ty(s)M along y, we have (where terms are evaluated at
s or y(s))
where X(s) = 2X'(s)E, defines X': [a, b}^ R.
10 ADDITIONAL TOPICS 143
Proof. From 10.1.2, it is clear that
ds 2 ds
Thus, it suffices to prove that
у Q E yE
k.J
Select an arbitrary point s0E:[a, b], and let u: [a, b]->F(M) be the
horizontal lift of у such that M(so) = a(y(so)). Then, at s=s0,
Thus, we have
at s0. Letting both sides act upon ^ER", we obtain
at s = s0. Applying u(s0) to both sides yields
DF
Since both sides are now independent of u, we have
144 GAUGE THEORY AND VARIATIONAL PRINCIPLES
10.1.4 Corollary. With the notation of 10.1.3, we have
ds \
Proof. Set X=y' in 10.1.3.
10.1.5 Theorem. With notation as in the opening paragraph of
Section 10.1, let y: [a, b]->P be a geodesic relative to the bundle metric
h = 7T*g+kco. Then co(y'(t))&§ is independent of t.
Proof. We take £,,...,£„, En+V..., En+f to be the o.n. vector fields
on tt~\U) CP that were introduced in the proof of 9.3.7. We use the
conventions and other notations in that proof. As a consequence of
10.1.4, we have (with the summation convention):
'а)' + в (h)"b(Ec)y'by'c]Ea=0.
Applying со to both sides, we obtain (see 9.3.8)
Since cayfi and uaJt are antisymmetric in the lower indices, we obtain
(y'«+«)' = 0. Thus, W(y')' = W(Y'"+a£a)'=(Y'"+aR = 0. ■
10.1.6 Theorem. Let у be a geodesic in (P,h) and let Q = co(у') G§
be the (constant) element provided by 10.1.5. Continuing to use the
notation in the proof of 9.3.7 and setting у=7гоу, we have (relative to
the metric g on M)
where Q= Qaea and П" = Пв1.,ф/
10 ADDITIONAL TOPICS
145
Proof. Taking 7^ of equation (*), using 9.3.8, and noting that
y" = y", we have
10.1.7 A Physical Interpretation. Let M be a {possibly curved)
space-time. The curve у=7то у of 10.1.6 is the path of a particle (of rest
mass m0 and "charge" q=Q/mo&§) under the influence of the
velocity-dependent four-force with components qaua'Jy'J,0*^1*^3. (i.e.,
mo(Dy'/ds) = qaua'Jy'JEj). Note that the charge q depends on the
choice of the initial point p = y(a). Indeed, for A €E G, RA о у is another
geodesic such that 7roRAoy = y1 and yet a>((RAoy)') = (t}(RA^y') =
&ЬА-чс(у'). Thus for G a non-Abelian group, charge is a gauge-
dependent concept, just as current was found to be, when pulled down to
M (e.g., see 7.3.12).
10.1.8 Electrically charged particles. As a simple check on the
physical interpretation of 10.1.7, we consider the case of a charged
particle in an electromagnetic field in Minkowski space M = R4 with
the metric g = dt2 — dx2 — dy2 — dz2. Here m: P -> M is a PFB with
group U(\) = {e'e\0SU} (and Lie algebra %A) = {i0|0GR}) and
connection u> = — iA. Then the field F= — dA = d(— /co)= — ifl", when
pulled down to M via a section a: M -> P, becomes the (a-independent)
form E{ dxAdt + E2 dyAdt + E3 dzAdt + fi, dyAdz + B2 dzAdx +
B3 dx A dy where E and В are the electric and magnetic fields, respec-
respectively. With the convention (x0, x,, x2, x3) = (t, x, y, z), we use the
o.n. coordinate fields 30, 9,, 32, 33 (and their horizontal lifts to P) when
referring to components. The single o.n. basis vector for %A) will be
ex = i = /zzT. Thus, we have FJk = F(dj, dk)=- tt\k. In matrix nota-
notation, we have
о
-Е, -
0
в3
в.
-в
0
в
B2
-в,
146 GAUGE THEORY AND VARIATIONAL PRINCIPLES
whence
О Ех
Е[ О
Е2 -Въ
Е, В,
В3
О
-B2
0
Write y(s) = (t(s), x(s), y(s), z(s)), and suppose that g(y'(s), y'(s))-
1 E0 that s is proper time). Set
15 =
1
t'(s)
and note that t'(s) = (\ — \v|2) 1/2 = ^8,
a column matrix. Multiplying the matrices (й,"'л)
J E-v 1
[E+vXB\
Note that for r=(x, y, z), we have
and r"(
t"(s)
r"(s)
while
Thus, the equation
10 ADDITIONAL TOPICS 147
is equivalent to the pair
and ~
The first equation states that the rate of the particle's energy increase is
equal to the rate at which the field does work on the particle. The
second equation is the relativistic analogue of Newton's equation, where
the force is the so-called Lorentz force.
10.1.9 Remark. Given a geodesic y: [ — a, a]->P, consider the geo-
geodesic y:[ — a, a]->P defined by y(s) = y(~s). Note that y'(s) =
— y'( — s). If у is the trajectory in P of some classical particle x, then у
is the trajectory of a particle x that, relative to x, is "traveling" in the
opposite spatial direction, backward in time, and has opposite charge
(i.e., x is the antiparticle of x). Apparently, the correspondence y<->y is
the PTC (parity, time, charge) symmetry. The PTC principal in
physics says that if all the particles in a physically possible interaction
undergo the PTC symmetry, then the result is also a physically possible
interaction. That space, time, and charge should be related by such a
principle seems natural from the bundle viewpoint; note that PTC
leaves trajectories (in the total space P) setwise fixed, whereas P, T,
and С individually do not. Finally, note that conservation of energy-
momentum and charge in particle interactions can be considered as a
single conservation of'"momentum" in the total space.
10.2 UTIYAMA'S THEOREM
Not long after the appearance of the Yang-Mills paper [1954], R.
Utiyama [1956] generalized their work and provided a plausible
proof of an important theorem that now goes by his name. Essen-
Essentially Utiyama's theorem states that any gauge-invariant Action
density defined on the space of connections on a PFB (and con-
constructed from the -jet") must be an (Jb-invariant function of the
curvature of the connection (e.g., the self-Action w-> — \(gk)(uw, fi")
is such an Action density). In what follows, we develop the machin-
machinery needed to provide a precise statement and proof of Utiyama's
theorem.
148 GAUGE THEORY AND VARIATIONAL PRINCIPLES
10.2.1 Definition. Let it: P^M be a PFB with group G, and let в
denote the space of connection \-forms on P. For p E.P, let J°(G)p =
[up: TpP^§\u(Ee} = the set of all linear maps TpP^§ such that
A*» A for A eg. Set J°(G) = UpBPJ°F)p; /°(в) is called the space
of 0-jets of connections. It is a simple matter to show that J°(в) can be
made into a manifold in such a way that тт°: J°(&)^P (taking J°(Q)p
to p) is C°°. For each gGG, there is a map Rg: 0
defined by
Thus, G acts on J°(Q), and we write ccp-g = Rg(up). A connection can
then be regarded as a map со: Р-> J°(P) such that тт°оы = \р and
up-g = upgfor allp^P (i.e., R°gou = uoRg).
10.2.2 Definition. The differential of a connection со: P^J°(G) at
p£P is the linear map ump: TpP^Taj\e). Then Р(е)р = {шшр\ыЕ:
в}, and J\G)= UpePJl(G)p is called the space of 1-jets of connec-
connections. Again, /'(в) can be made into a manifold such that the
projection 7Г1: J\G)^P is C°°. Let R°gm: TwJ°(e)^TWp.gj\e) be the
differential of Rg at up^J°(Q). We define an action of Q on /'(в) by
K("*p)=K*o"*PoRg-'*&Jl(e)Pg> we write R\^*P)="*p-8- v
then u>v-g=u>vr
10.2.3 Definition. A (first-order) Lagrangian for connections is a
C°° function S: J\G)^U such that 5(co^-g) = S(co^) for all co^ <E
/'(в). Associated to S, there is a function §: &->Cco(M) given by
Note that § is well defined, since S(co^g) =
10.2.4 Definition. For k = 0,l,2,---, we define J°(Ak(P,§)) =
l-)pePAk(P,§)p where Ak(P,§)p is the space of all k-linear antisym-
antisymmetric functions тр: TpPX ■ ■ ■ XTpP->§ that vanish whenever one
argument is vertical. There is an action of G on J°(Ak(P, §)) given by
&b<R)£Ak(P§)
'PS'
10.2.5 Theorem. There is a well-defined map U: У
J°(A2(P,§}) given by^ U(u^p) = (du}p + !,[up,up} = U"p for
Moreover, й(сс^р-g) = U(coifp)-g, and U is onto.
10 ADDITIONAL TOPICS 149
Proof. Since fi"p depends only on u>p and u>^p, we have that U
is well defined. For to ев, we have &(.u>^p-g) = U(u>^ps) = uwpg =
&bg-iR*-&ap) = Qap-g = U(amp)-g. To prove that U is onto,
we need to show that for each тр еЛ2(Р, §)р we can find со ев such
that (du>)p + [u>p, и>р]=тр. Let UCM be a coordinate neighborhood
about q=7r(p) with coordinates x\...,x" vanishing at q. We can
choose Usuch that there is a local section a: U-*P with a(q)=p. Let
т =о*тр, and suppose that we can find a>uE.Al(U, §) such that
Then there is a unique connection to on тт '(£/) with a*to = cou, and
applying m* to equation (*) yields (du)p + j[up, ыр]= тт*о*тр = тр,
since тр vanishes on vertical vectors. Thus, it suffices to find an
appropriate u>u. Let t?C,, Зу) = 2ата;7еа where e{,...,efis a basis for
§ and 3,,...,3n are the coordinate vector fields of x\...,x". Let
0еЛ'(£/, §) with 0а1 = ва(д/) and 0а, у = ЭД0",]. For [еа,ед] =
2 cya/iey, the equation (*) then becomes (at g) Oa.J—eajj
+ 2с<хв (Q^fi^j ~ ^^j^yi)= T<Xij- In terms of coordinates, set
ва,(х1,..., xn)= -\Taikxk. Then 9a, =0 and 0a,,y= -^та,.у at 4, and
equation (*) is solved. ■
10.2.6 Definition. Let f: M^N be a map between manifolds. Then
f is called a submersion iff^x- TxM^Tf{x)N is onto for each хёМ. It
is a consequence of the implicit function theorem that for each у е
f(M), f~\y) is a submanifold of M of codimension equal to dim N
(i.e., dim f ~\y) = dim M~dimN).
10.2.7 Theorem. For each p <=P, the map &p: J\e)p^A2(P, §)p
is a submersion.
Proof. Let /'(Л'(Р, §))р be the vector space of all differentials (at
p) of maps о: Р^/°(Л'(Р, §)) such that o(p'g) = a(p')-g for all
p'GP; of course, such a a may be regarded as an element of
A'(P,g). Note that J\7L\P,§))p can be regarded as the tangent
space of J\Q-)p at any (owe/'F)f via the isomorphism
— (
150 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Under this isomorphism (/'(Л'(Р, §))p^T^ /'F)), the differential
of Up at «„ is a linear map J\A\P, §))p^ Щ(^ }Л2(Р, §)pSs
A2(P,§)P given by
■*,'
The equation (do + [u, a])p = 0 imposes 2n(n~ 1)/ (n = dimM, f=
dimG) independent linear conditions on a . Thus, the kernel of the
differential of Up at wv has codimension 2>i(n — l)f, which is the
same as dim(A2(P, §)p). Thus, this differential is onto. ■
10.2.8 Theorem. For each тр еЛ2(P, §)p we have that Up '(Tp) is a
connected submanifold of J\Q)p of codimension ^n(n— 1)/. Moreover,
the tangent space ofUp\rp){at w^ Е.й~\тр)), regarded as a subspace
ofJ\K\P, §))p, consists of all a^p such that (D"a)p-0.
Proof. All assertions, except for the connectedness of й~\тр),
follow from 10.2.7 and its proof. Recall the following equation (*) in
the proof of 10.2.5: ваА1-ваи + ^саРу(вр1в^-вр/ву1)=та1Г We
must show that for given components т",., the set of solutions for
9ai j,9yj is a connected subset of U{ + n)f. There is the one special
solution 0au = - ^та,,, ва,• = 0. A solution @вл/; ва,) such that в",=0
is called normal. Any normal solution (в", ■; 0",) can be deformed
into the special solution: set 9attJ(t) = (\ -[)ва1 j-h^ij and ^",@
= 0 for 0^ t ^ 1. Suppose now that (9a, /, 0™.) is any solution of (*).
By defining TatJ = TatJ - ^сару(в^9^ - вр/ву1), observe that (ва,-j =
9al: y, 0™,. =0) is a normal solution of (*) when та( is replaced by та,у.
(Call this new equation (*).) We can deform this normal solution of
(*) into the special solution of (*) (keeping в",=0). Having deformed
the first component of @а, у, в"^ in this way, we perform a second
deformation by setting 0a,(?) = (l — ?Ha, and letting 0a,/O be the
special solution of (*)(t), which is equation (*) when та,7 is replaced
^« 0
7 y^yy^
the solution obtained after the first deformation, while at t = 1 we
have the special solution of (*). Thus, we have exhibited a path of
10 ADDITIONAL TOPICS 151
solutions of (*) linking the arbitrary solution @я, /,ва.) to the
special solution (i.e., й~\тр) is connected.) ■
10.2.9 Definition. Recall that the gauge group GA(P) acts on в via
pull-back (see 3.2.5). This induces an action of GA(P) on /'(в),
defined (for f^GA(P)) by (umpyf=(f*^r'(PY For ^GP' let
GA(P)p be the subgroup of all f(EGA(P) such that f(p)=p. Then
GA(P)p acts on J\S)p, and (for /еСА(Р)р)ш^-/=(/*а) only
depends on the first and second differentials of f at p. By identifying /,
and f2E:GA(P) if the first and second derivatives of'/, and /2 are equal
at p,we arrive at a finite-dimensional Lie group denoted byJ2( GA(P) ).
By 3.2.9, the Lie algebra ofJ2(GA(P)p)is then seen to beJ2(C(P, §)p),
consisting of equivalence classes of elements A £C(?, §) with A(p) = 0
where A is equivalent to A' if their first and second derivatives agree
at p. The group J2(GA(P)p) acts on /'(в), and the set О(со^р) =
{<% -f\f^j2(GA(P)p)} will be called the gauge orbit through to As
a consequence of the smoothness of the action ofJ2(GA(P)p) on У (в),
we have that O(co^) is a submanifold of'/'(в).
10.2.10 Theorem. The tangent space of the gauge orbit O(w )
consists of all а^е/'(Л'(Р, §))р s^ /'F) such that а<=Л'(Р, S)
is of the form a = dA+[u,A]= DWA for A(EC(P,§) with A(p) = 0.
Moreover, dimO(coij!p) = Bn(n + 1) + n)f, where f= dimG and n —
dim M.
Proof. According to 3.2.12, for A (EC(P, §) with A(p) = 0, we have
exp(tA)£GA(P)p. Tangent vectors to O(co^) are of the form (at
r=0)
by 3.2.16. Let xx,..., xn be coordinates on some U С М such that
there is a local section a: U-> M with o(q)= p. For Au = a*A, the
components of a*DwA relative to the coordinate fields 3,,...,3n and
basis e,,..., ef of § are Aua,■, + c^ccf^A J. Thus, (DaA)mp&
JX(AX(P, §))p is determined by the parameters Aua (.. and Aau t at q.
Since А"и = Aua Jt, there are (^n(n + 1) + n)f independent parame-
parameters, and this number is consequently dimO(co^). ■
152 GAUGE THEORY AND VARIATIONAL PRINCIPLES
The crux of Utiyama's theorem is
10.2.11 Theorem. The gauge orbit O(u^p) is precisely п~ \®шр).
Proof. Uf(EGA(P)p, then f(p')=p'T(p') for some t<EC(P,G) with
T(P) = e=identity element of G. Ву^З.2.15, we then have f*uup =
&bT{p)-;U"p = U"p.^We also have Qf*a = df*a+$[f*u>, /*ы]=/*Яи.
Thus, il(co -f) = il((f*o}) ) = &' u' =/*йк' — йш =£1(сб ). Hence,
we have O(co )См!~'(йи' ). Moreover, dimO(co ) = B«(« + l) +
")/= (n2 + n)f— \n(n — \)f= dim Jl(G)p — codim Й^ '(й^) =
ШтЙ^^П^) (see 10.2.8). Thus O(co^) is an open subset of
U~\u"p). Suppose that п^рЕ:йр1(п"р) — О(ы^р). Then О(п^р) is an
open subset of й~1(пар) such that О(п^р)ПО(и^р)= 0, since orbits
coincide or are disjoint. Thus, O(co^) and йр\пир) — О(ш#р) are
open. The connectedness of Й1"'(йы ) (see 10.2.8) then implies that
10.2.12 Theorem. Ler S: e^C°°(M) fee associated to the
Lagrangian S as in 10.2.3. Then § /5 gauge invariant (i.e., S(/*co) =
S(co)/or allu(Eeandf(EGA(P)) iff S is invariant under GA(P) (i.e.,
Proof. According to 10.2.3,
Hu)(
and
Thus, the left-hand sides of these equations are equal iff the right-
hand sides are equal (note T(f~l(p)) = 7r(p)). Ш
10.2.13 Definition. A curvature Lagrangian is a function K:
J°(A2(P,§))^№ such that K(rp-g) = K(Tp) for all g<EG (see 10.2.4).
We say that К is &b-invariant if K(&bgTp) = K(rp) for all g(EG and
\2§
10.2.14 Theorem. //К is a curvature Lagrangian and S: j\6)-^U
is the composition Ко fl, then S is a Lagrangian as in 10.2.3. Moreover,
§>: в->С°°(М) is gauge invariant iff К is &b-invariant.
10 ADDITIONAL TOPICS 153
Proof. Using 10.2.5, we have S(«w-g) = K(u(u^,-g))= K(Ci(u )
• g)= K(U(o>^p))= S(o}^p), and so S is a Lagrangian. Suppose К is
&b-invanant.Let f&GA(P)withf(p)=p ■h(p).Then,S((f*cc)i):f-l{p))
= K(U((f*a).f-,{p))) = K((f*Qa)f-,(p)) = (see 3.2.15 and note that
p (p)
= h~\p))= K(&bh-4p)Q»p)= K(Q"p)=
4p)p p mp)
whence § is gauge invariant by 10.2.12. Conversely, if § is gauge
invariant, then from the foregoing we conclude that K(&bh(pr-iu" )
= K(Qap) for every w;<= 6 and/G G4(P). Since every т?еЛ2( Р, § ^
is of the formfi'Jp =Й(ш^) for some со G 6by 10.2.5 (and since /z(/?)
ranges over all of G as / ranges over GA(P)), we have that К is
B b -invariant. ■
10.2.15 Theorem (Utiyama). Let S: J\G)^U be a Lagrangian,
and let §: 6-> CX(M) be associated to S as in 10.2.3. Then § is gauge
invariant (i.e.,<i>(f*u) = §((o) for all f£:GA(P) and @^6) iff S = Kou
for some &b-invariant curvature Lagrangian K.
Proof. The "if part is 10.2.14. Conversely, if § is gauge invariant,
then S: Jl(G)^U must be invariant under the action of GA(P) on
/'F) by 10.2.12. Since S is then constant on the gauge orbits
O(co^) of 10.2.9, we deduce from 10.2.11 and 10.2.5 that S induces
(via й) a curvature Lagrangian К: /°(Л2(Р, $))->R such that
S = Ko&. Since o> is gauge invariant, we have that К is (Jb-invariant
by 10.2.14. ■
10.2.16 Construction of Curvature Lagrangians. We examine a
method for producing some (lib-invariant curvature Lagrangians. Let
F: §X - ■ XS^R be a multilinear function that is &b-invariant (i.e.,
F(&bgAl,...,&bgAn) = F(Al,...,Am)forallgSGandAl,...,Alf[E:
§). If F is symmetric, then F is called a (polarized) Weil polynomial.
An excellent treatment of the determination of the Weil polynomials
(for G=U(n), O(n), SO(n), and Sp(n)) is found in Section 2,
Chapter 12, of Kobayashi and Nomizu [1969]. In any case, given a
connection со on a PFB тт: Р^>М with group G, we can define a tensor
field f(fi)G?r@-2m)(P) (for Q = fl"w) by F(U)(X,,..., X2m) =
F(U(XvX2),...,U(X2m^l,X2ni)). Just as in 9.3.1, we can prove that
there is a unique tensor field F(u)^{0-2m)(M) such that n*F(u) =
). Using a fixed metric on M, we can raise m indices of F(U) and
154 GAUGE THEORY AND VARIATIONAL PRINCIPLES
contract those with the remaining indices to form a scalar, say CF(Q,).
The function J°(A2(P,§))^U given by up>-*CF(u)w(p) is then a
curvature Lagrangian. The self-Action is derived from a curvature
Lagrangian of this type where F: §X§->[R is an &b-invariant metric.
10.3 SPONTANEOUS SYMMETRY BREAKING
Here we study the so-called .Higgs mechanism, which has been
applied extensively by physicists in building more realistic models for
the behavior of elementary particles. For example, one problem with
the model of Section 7.3 is that the mass (a term we define later in
this section) of the gauge fields b\b2,b3 (where co= — ib-т, as in
7.3.14) is zero. Apparently it is fine if the gauge field of electromag-
netism has zero mass because there the force is mediated by photons,
which are massless. However, Yang-Mills type forces must arise
from the exchange of massive particles because of the observed short
range of these forces. The Higgs mechanism helps in two ways. First,
gauge fields can acquire mass by the symmetry breaking. Second, the
undesirable Goldstone bosons (which arise in the symmetry-breaking
process) can usually be gauged away. We will not apply the Higgs
mechanism in a detailed fashion to the model in Section 7.3; rather,
we describe the mechanism in general terms, and refer you to the
physics literature for the many specific applications.
10-3.1 Notation. Let M be an n-manifold with metric h. Suppose
that 7Г. P -» M is a PFB with compact group G. Let V be a vector space
with positive definite metric h, and let G-> O(V) be some orthogonal
representation. Assume that L: J(P,V)^>R is a Lagrangian of the
form L(p,v,9)={(hh)(9,9)-F(v) where F:V^U is (necessarily) a
G-invariant function (i.e., F(g- v) = F(v) for all gE:G,vE:V). We
think of F(v) as being the potential energy of v.
10.3.2 Definition. // uoe V is a local minimum for F, then we call
v0 a vacuum, and the set Gv0 = [gvo\ gfE G) is called a vacuum orbit.
The unbroken subgroup of G relative to v0 is Go = {g €E G \ g ■ v0 = v0}.
Note that Gv0 is a submanifold of the sphere of radius h(v0, uo)l/2 '" ^
and Gv0 can be identified with the quotient space G/Go. In^ the physics
papers, F is often given by — F(v) = ^m2h(v, v) — ^X2h(v, vJ. At
10 ADDITIONAL TOPICS 155
t=0,
—r-F(v + tw) = m2h(w, v)~ X2h(v,v)h(w, v)
= h(w,(m2 — X2h(v,v))v),
which is zero for all w when h(v, v)l/2 = m/X. In this case, the set of
vacuum points is the sphere of radius m/X, but the vacuum orbits may
be smaller, depending on the representation G->O(V). Often, the
unbroken subgroup is just some U(\)CG corresponding to electric
charge.
10.3.3 Theorem. Referring to 0.2.26, we let D2F: TVVXTVV^№
be the Hessian of F at the vacuum v0. For d=dim(Gv0), there is an
o.n. basis u{,...,ud,ud+v...,um of TVV {with metric induced by h)
such that uv...,ud spans the subspace Tv(Gv0) СTv V, and the mXm
matrix with entries Mah—D2F{ua,uh) is diagonal with Mu= ■ ■ ■ =
Proof. Let TVo(Gvo) be the subspace of TVV orthogonal to TVo(Gvo).
First, we prove that D2F(wl,w2) = 0 if w, &TVo(Gvo) and w2°(ETvV.
Select A eg such that
— (expL4)-«0=w,
at t = 0, and let y: [— \,\]->V he a curve with y'@) = w2. Define Я:
[ —1,1]X[ —l,l]^Fby ЯE, t) = (exptA)-y(s). Then at 5=r=0,
while
Hence,
since F is G-invariant. Let ux,...,ud be an arbitrary o.n. basis of
Tv£Gv0), and let ud + l,..., um be an o.n. basis of Tv^Gv0)± (provided
by a well-known result) such that the matrix D2F(ua, ub) d + \^a,b
«£ m is diagonal. Then, и,,..., um is the desired basis. ■
156 GAUGE THEORY AND VARIATIONAL PRINCIPLES
10.3.4 Definition. If\pE:C(P,V) is a particle field, then xP' = xP~v0
is called the shifted field of \p (relative to v0). Note that \p'&C(P,V),
since xP'(pg)=g-lxP(p)-v0^g~ixpr(p) ifg&G0. We think of V as
the deviation of \p from the chosen vacuum v0. Moreover, it is conve-
convenient to think of \p' as having values in TVV under the identification
F= TvY Siven ЪУ
d i
There are real-valued functions £,,..., £rf and t]d+ v...,T\mon P defined
byV(p) = UP)«i+ ■■■+Vm(p)um- We call iv...Лd the Goldstone
bosons of \p (or \p') and rjd+,,..., i\m the scalar mesons. Collectively,
£,,..., rjm are known as Higgs fields. Let ma^0 be defined by m2u~
D 2F( ua,uh), a=\,...,m. Note that to within second order in the Higgs
fields, we have F<>xp=Fo(v0 +\p') = F(vo)+ ^D2F(^', xP')+ ■ ■ ■ =
F( uo) + 2f гт1И +2^+1 Ki)!+ ' ' ' > and (in analogy with the
Lagrangian of 4.3.7) we call ma the mass of the Higgs field associated
to ua. By 10.3.3 the Goldstone bosons have mass 0, while the scalar
mesons may have positive mass.
10.3.5 Definition. Let H: P^M. be a C°° function. We define the
fiber derivative of H atp&P to be the linear functional (dGH)p: S->R
given by
(dcH)p(A)=~H(pcxptA)
at t=0. The next result is an observation of Weinberg [1973].
10.3.6 Theorem. Ler u0 be a vacuum and let \p£C(P,V). Define
H: P^U by H(p) = h(xP(p),v0). Then the Goldstone bosons of xp
vanish at p iff (dGH)p =0.
Proof. For A €E §, we have
—h
10 ADDITIONAL TOPICS 157
at t = 0. Since h{vo,A-vo) = 0 for all Л eg, we have {dGH)p{A) =
h{\p'{p),A-v0), and the result follows, because Tv{Gv0) is identified
with {A-vo\A<E§}. Ш
10.3.7 Definition. For uoeF and \P<EC{P,V) we call UD*,vo) =
[p €=P\{dcH)p = 0} the unitary set of \p relative to vQ. Since the fiber
tt~i{x) is compact, we know that Н\чт~х{х) has a critical point, whence
U{\p, ьо)Г\тг~*(х)ф-0. Goldstone bosons are considered to be spurious
{or unphysical) states. Thus, it is important to know when they can be
"gauged away," in the sense that there is a local gauge a: W^>P such
that the Goldstone bosons vanish on a{W) {or equivalently a{W)C
U{\p,v0) by 10.3.6). Such a a is called a unitary gauge. Some may
think that a {global) unitary gauge can be defined by taking o{x) to be
"the" point where Н\чт~х{х) achieves its maximum. This procedure is
doomed to fail somewhere if тт. Р->М is nontrivial, since there can
never be a global gauge {unitary or not) for a nontrivial bundle.
Indeed, it is not even certain that local unitary gauges exist about every
x E.M, although a sufficient condition will be given in 10.3.10.
10.3.8 Definition. Since G is compact, we can introduce a positive
definite &b-invariant metric к on g. Let §^ be the subspace of g
orthogonal to §0. We define another metric к on §^ by k{A, B) = h{A-
v0, B-v0). Note that к is positive definite, and we can find an o.n. basis
e{,...,ed of Q^ {relative to k) such that k\ea,ep) = 0 for афE, while
k{ea,ea) = M^>0, l«£a, /?=£<! The vectors ev...,ed are referred to as
broken generators.
10.3.9 Definition. For p&U{\p,v0), we define the broken Hessian of
H at p to be the bilinear form {D^H)p: §^ X g^ ->R given by
{D^H){AB)
at s = t = 0. A short computation yields {Dj;H)p{A, B)= h{\p{p),A-В
■ v0), which is symmetric in A andB, since h{\p{p),[A, B]-vo) = 0.
10.3.10 Theorem. For p&U{\p, v0), we have a local unitary gauge
a: W ^ U{\p, vo)CP {for some neighborhood W of х = тт{р)) such
that a{x) = p, provided that {D^H)p is nondegenerate.
158 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Proof. Let §^ be the dual space of g^. Define /: P-^ by
f(p)={bcH)p\^. Note that LW,«o)=/~'(O). Thus, £/(*,«<>) will
be a submanifold (of codimension d) in some neighborhood of p if
(see 10.2.6) Д: TpP^T0(§^)^§^ is surjective. Moreover, кегД is
the tangent space TpU(\p,v0) in this case. For A, B&§^, we have
{bH){B)
at s=t=O. Thus, the surjectivity of/^ is equivalent to the nondegen-
eracy of {DlH)p. Hence, we see that not only is U(\p,v0) a submani-
submanifold in a neighborhood oip, but also TpU(\p,v0) has a d-dimensional
complement in the vertical space of P alp. Thus, ir^(TU(\p,vo)) =
TXM, and this is enough to give us the unitary gauge a. ■
10.3.11 Remark. Observe that U(\p,v0) is invariant (as a set)
under the action of Go on P, as a consequence of the G0-invariance of
H. Thus, U(\p,vo)C\4r~\x) is typically a manifold of dimension
dim(G0). Aside from this, there is a compelling analogy between the
situation described in 10.3.10 and catastrophe theory. The points x^lM,
such that (D^H)p is singular for some p Е:тт~1(х), are catastrophe
points. About such points, the existence of a unitary gauge is not
ensured, even if a different vacuum in Gv0 is chosen. The locus of
catastrophe points in M should be of some physical interest. Finally,
differential geometers should note that for pE:U(\p,vo),(DJ;H) is
essentially the second fundamental form of Gv0 at v0 in the normal
direction \p( p).
10.3.12 Massive Vector Bosons. The physicists claim that those
gauge potentials corresponding to the broken generators acquire mass,
thereby deserving to be called "massive vector bosons." Here we make
some sense out of this claim. Let к be the (lib-invariant metric on § as
in 10.3.8, and suppose that (\p, со) GC(P, V)XG. Consider the Action
density (£+§)(xp, со) <EC°°(M) given by
10 ADDITIONAL TOPICS 159
Let \p' = \p — v0 be the shifted field of \p relative to the vacuum v0, and
suppose that a: W->P is an arbitrary local section. We introduce the
notation \|/ =a*\p' = \p'oo and ww = a*w. Let ev...,ed be the broken
generators of 10.3.8, and extend these to an o.n. basis ex,...,ef of §;
necessarily, ed+v...,ef is an o.n. basis of §0. We write tow=2w"ea,
where the to" are real-valued \-forms {or vector potentials) on WCM.
Take ux,..., um G TVV as in 10.3.3 and consider i|/ as having values in
T F= V. We write °
v0
b=\ c=d+\
Now,
where we have retained only those terms that are less than third order in
the "variables" i|/, d^'w, cow, and du>K. Note that none of the retained
terms, except the cross term {hh){d\p^, uw-v0), involve both ccw and i|/,.
160 GAUGE THEORY AND VARIATIONAL PRINCIPLES
If a: W^P is unitary, then this cross term vanishes. Indeed,
Id m f \
(hh)(d^,cow-vQ) = (hh) 2#X+ 2 difwuc, 2 <ea-v0\
\b=\ c=d+\ a=\ I
m d
= 2 2h(d^,uaJh(uc,Maua) = 0,
c=d+\ a=\
where we have used the facts £*=0, ea-v0=Maua (see 10.3.8), and
h(uc, ua) = 0 for c>d and a<d. Then, assuming that a: W^>P is a
unitary gauge, the expression for (£+§)(»//, со) (up to second order in
r\cw, drfw, u>l, du>l) becomes
c = d+\
d
- 2
a=\
2
This is essentially the Klein-Gordon type Lagrangian density for
m — d scalar mesons ifw of mass mc, together with d massive vector
bosons and f—d massless vector bosons. Moreover, without the
higher-order terms, we see that none of these fields is coupled to any
other in the unitary gauge. It is hoped that you have gained not only
an understanding of how gauge potentials acquire mass, but also a
feeling for the significance of the unitary gauge.
10.3.13 Remark. You may wonder how the foregoing applies to the
model in Section 7.3, because there the Lagrangian was of Dirac type
instead of Klein-Gordon type. The answer is that in addition to the
spin-\ nucleon field, we must introduce the scalar Higgs fields
(transforming according to some representation of SUB)). Then we add
the Dirac type Lagrangian involving the nucleon field to the Klein-
Gordon type Lagrangian for the Higgs fields. The rest of the Lagrangian
consists of the self-Action of the connection along with terms coupling
the nucleon field to the Higgs fields. For further details on such models
(e.g., the Weinberg-Salam model), we refer you to Abers and Lee
[1973], Salam [1968], and Weinberg [1973].
10 ADDITIONAL TOPICS 161
10.4 CHARACTERISTIC CLASSES, MONOPOLES,
AND INSTANTONS
Here we define certain real-valued forms /(fi") on M that are
constructed from the curvature form fi" of a connection со for a PFB
•n: P ^> M with group G. The forms /(fi") are closed and depend on
со. However, /(Я") —/(fi"') is exact. We say that two forms are in
the same class if their difference is exact. Thus, the form /(fi")
determines a class [/(fi")] that is independent of со and is called a
characteristic class of it: P -» M. If the PFB is nontrivial, then there
might be nontrivial characteristic classes, in which case Йш Ф0 for all
со. Thus, we see that a nonzero field strength can be forced upon us
by the topology of the PFB (i.e., solitons arise). As a simple example,
we consider magnetic monopoles. Finally, we make some elementary
comments about instantons.
10.4.1 Definition. Recall A0.2.16) that a Weil polynomial of degree
m is a symmetric multilinear map f: § X • • • X § -»R (or C) such that
f(&bgAl,...,&bgAJ = /(/!„..., AJ for all gGG and A,G§. IJ'we
set g = exp tB (BE§), then applying (d/dt)\l = Q to this equation yields
We denote the space of such Weil polynomials by lm(G). IffEI'(G)
andf'EIJ(G), thenff'Er+J(G) is defined by
where a ranges over all permutations of {1,...,/+_/}. In this way,
I(G)=®klk{G) is a graded algebra, the Weil algebra.
10.4.2 Definition. Let со be a connection for the PFB it: P^M with
group G. For fElm(G) and fl" = dcc+{[ic, со] we define f{U")E
A2m(P,IR) by
162 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Note that г/ Й" = 2 tt"ea (ex,...,efa basis for §), then
10.4.3 Theorem. For each /£/m(G) there is a unique closed 2m-
form f(Q") on M such that ir*f(Qu)= f(Qu). Moreover, we have
> /Я /Я), for fe'f(G), /' e P(G).
Proof. Since со is fixed, we write Я" = Я. If ЛГ,,..., X2k £TpP
are horizontal, then we set /(Щ„(Р)(^^Хи . . . , "п^Х2т) =
f(u)p(Xu...,X2m). Since R*u = &b'g-,u and / is £b-invariant, we
see that /(Я)х is independent of pE7T~\x). Since п vanishes on
vertical vectors, we have/(fi) = 7r*/(fi). In the notation of 10.4.2, we
have
Thus,
and so df(u)H =0, since JflH =0 by 2.2.8. Now •n*df{u)= dir*f(u)
= df(Q). Thus, df(U)=ir*df(tt) = (TT*df(U))H = df(U)H=0, "and
<//(Я) = 0 follows. For fel'(G) and feiJ(G), we have
Note that fi Л • • • ЛЙ"'+; is symmetric in a,,..., aj+J, since Я"* is а
2-form. Thus,
(i.e., the symmetrization of/®/' is effected by Я Л • • • Afi"-^). The
right-hand side is Шеп/(Я)Л/'(Я). ■
10.4.4 Definition. Леса// the definition of Hk{M) in 0.2.20. For a
fixed connection со on P we define a homomorphism of algebras W(u>):
®mIm{G)^@kHk{M) by W(u)(f) = [f(Qa)]. Note that W(u) dou-
doubles the degree, and preserves the multiplicative structure by 10.4.3.
Our immediate goal is to prove that W(oi) (the Chern-Weil homomor-
homomorphism) is actually independent of the choice of со.
10 ADDITIONAL TOPICS 163
10.4.5 Lemma. Let ф0 and ф, be any %-valued 1-forms on P (or any
manifold), and let fEl"'(G). Set а=ф,— <р0, ф,=фо + ?а, а«^/Ф,=^ф(
+ НфеФг]- ^е? /(ф() *е ?^e 2m-form defined as in 10.4.2, ал</ let
f(а, Ф,,..., Ф,) йе the analogously defined Bm— \)-form. Then we have
Proof. Note that
dt
Hence
-r<b=da+[<pt,a].
m dt
It suffices to prove that the right-hand side is <//(а,Ф,,..., Ф,). Now
d<bt—— [ф,,Фг] (see the proof of 2.2.8), we have df(a, Ф,,...,Ф,) =
Ф,,...,Ф,)}. The expression within the braces is (in the notation of
10.4.2)
by the в Ь-invariance of /(see 10.4.1). ■
10.4.6 Theorem. For any two connections со and со' о« Р we have
W(w)=W(u'). That is, [/(Йш)] = [/(Йш')]еЯ2т(Л/) for all /G
Proof. We apply 10.4.5 with Ф0 = со and ф,=со'. Then /(Йы)
) = mdp, where
164 GAUGE THEORY AND VARIATIONAL PRINCIPLES
Since со-со'£Л'(Р, §), Qu- EA2(P,§), and / is (£b-invariant, we
have /?*/? = /? and fi vanishes on vertical vectors. It follows that there
is a unique form /?'£Л2т~'(М) such that tt */?' = /? (as in the proof
of 10.4.3), an
10.4.7 Definition. The various classes [/(Йш)]еЯ2т(М) for fE
I"'(G) are called characteristic classes. Since they are independent of
со, they depend only on the PFB it: P^>M. If it': P'^>M is another
PFB with group G, then it': P'^M is equivalent to it: P^M if there is
a map Л: P^>P' such that \(pg)=\(p)g (for allpEP andgEG) and
10.4.8 Theorem. The Chem-Weil homomorphisms ®mI
©kHk(M) for equivalent bundles are the same.
Proof. For it: P^M and it': P'^M as in 10.4.7, let со' be a
connection on P'. Note that со = Л*со' is a connection on P, and
ЯШ = Л*ЯШ'. Thus (for fElm{G)) /(ПШ)=Л*/(ЙШ), and we have
), since Л induces \M: M^M. ■
10.4.9 Corollary. The characteristic classes for a trivial PFB all
vanish.
Proof. By 10.4.8, it suffices to show that the characteristic classes
for a product bundle {P — MXG) vanish. On MXG there is the
trivial connection со with horizontal subspaces tangent to the slices
MX{g). Since co|MX{g} vanishes, so does d<x>\MX{g), whence
) = 0 for
10.4.10 Remark. // G is a compact semisimple Lie group and r is
the dimension of a maximal Abelian subgroup (i.e., torus) of G, then it
is known (see Chern [1972]) that the ring ®mIm(G) is generated by r
elements, say/,,..., fr. The Chern-Weil homomorphism for a PFB with
group G is then determined by [/-(й")], i=\,...,r. When G~U(n),
then r — n, and for a certain choice of generators f EI'(U(n)), we
obtain the Chern classes [/.(Ои)]е#2'(М), i=\,...,/?. If G = SOBn
+ 1), then we obtain the Pontryagin classes, and if G = SOBn), then we
obtain not only the Pontryagin classes, but also the Euler class in
10 ADDITIONAL TOPICS 165
H2"(M). For further details, see the excellent treatment in Kobayashi
andNomizu [1969].
10.4.11 Theorem. Let [/(Йш)]еЯ2т(М) be a characteristic class
of it: P^> M. Suppose that N is any orientable compact submanifold (of
M) of dimension 1m. Then ^/(fi") is independent of со.
Proof. Note that jNf{u") is an abuse of notation for jNi*f(u")
where i: N^M is inclusion. Now, i*f(Qu)-i*f(Qu') = i*~dP' =
d(i*/3'), in the notation of the proof of 10.4.6. Integrating this
relation over N, and applying Stokes' theorem, we are done. ■
10.4.12 Definition. Let N and N' be oriented submanifolds of M.
We say that N can be deformed into N' if there is a map H:
NX@,3)^M such that (Я|ЛГХ{1}): NX{\}^N and (tf|JVX{2}):
NX{2}^Af' are orientation-preserving diffeomorphisms. We write N
<-+N' if N can be deformed into N'.
10.4.13 Theorem. // Л^ and N' are Im-dimensional orientable sub-
manifolds ofM and N<-*N', then (for any /G/m(G) and connections со
and со' on P) we have
/7(яи)=/дяи')-
JN~ JN'-
Proof. Since
/ /
JN'~ JN'~
by 10.4.11, we need only consider the case where co = co'. Let H:
@,3)XJV^M be as in J0.4.12. Then tf*(/(fi")) is a closed C°°
2m-form on NX[1,2] = N, and so
0= [_dH*(f(№))
JN
= Г_Я*(/(ПИ))= f Я*(/(ПИ))- f Я
JdN ' JNX{\} ' JNX{2)
= (№")-[ /(яи),
•'Л'" JN"
using the version of Stokes' theorem in 0.2.15 and 0.2.16.
166 GAUGE THEORY AND VARIATIONAL PRINCIPLES
10.4.14 Remark. Typically, the generators /,,..., fr of 10.4.10 are
normalized so that the various integrals JNfj(&"), for compact N with
dim N=deg ft(u"), are in tegers.
10.4.15 Magnetic Monopoles. Here we apply the foregoing to
show how magnetic monopoles arise from nontrivial PFBs with group
U(\). Let mF: F(R3)^IR3 be the bundle of oriented o.n. frames of R3
with the Euclidean metric and usual orientation. For any xER3, there
is an identification TXU3 = U3, and so we can write F(U3) = U3 X SOC).
Let PdF(U3) be the submanifold consisting of all (x, A)EU3 XSOC)
such that x=£0 and A(e3) = x/\\x\\. We can identify SOB) with
{BESOC)\B(e3) = e3}. If (x, A)EP and ВESOB), then(x,A)-B =
(x,AB)EP, and so we see that it: P^R3 = R3 -{0} (ir = iTF\P) is a
PFB with group SOB). For r>0 and Sr = {xEU3\\\x\\ = /■}, note that
it: it ~\Sr)->Sr can be identified with the oriented o.n. frame bundle of
Sr. Let e = (e'j) be the Levi-Civita connection on F(U3). Let
0 0\\P
[в\\Р 0
an §><QB)-valued \-form on P. It is not hard to see that со is a
connection for it: P^IR3. Let
'LI s])=-
and note that fEl\SO{2)). We have f(u")=f(du+ ^[u, u])=/(rfu)
= ^F>'2|Р)=-F'1з|Р)ЛF'32|Р), since de=-B/\B+Ue and fl"=0
for Euclidean IR3. Let <p = (<p\ ф2, ф3) be the canonical \-form
on F(U3). We prove that (в\\Р)= \\x\\ "'( ф1 \Р) at (x,A)EP. In-
Indeed, let y(t) = (x(t), A(t))EP be a curve in P with y(Q) = (x,A).
Then 0'3(Y'(O)) = ( ex, в(у'@))(е3)) = {ех,А~ U'@)(e3)> =
( ex, A -\\\x\\ -'jc)'(O)> = IIjcH -' < e,, A ~lx'@)) + (
е],А-\\\х\\~и@)х)) = \\х\\ "'(e,, Ф(у'@))> =ф'(у'@)), where we
have usedA(ei) = x/\\x\\. Similarly, в\\Р= \\x\\ ~ У \Р. Thus,f{U")
- \\xII ^2(ф'|^)Л(ф2\P) at (x, A)EP. Let ju. be the 2-form on R3 such
that n\Sr is the area element of Sr and ix(X, Y) = 0 if X is radial.
At (x,A)EP, we prove that ttf*ix = ф1 Л ф2. Indeed, for ex and e2
standard horizontal fields in T(x A)F(U3), we have (ттр1л)(ёх,ё2) =
10 ADDITIONAL TOPICS 167
lx(iTFJi,7TFJ2) = ix(A(el),A(e2))=\, since А(е^ = х/\\х\\ implies
A{e^),A(e2) is an oriented o.n. frame of TxSr. It is clear that
and we then have f{U")= ||дс1ГУ Note that
We consider the physical interpretation. Note that there is an
isomorphism U( 1) -> SOB),
,„ . cos a sin a 1
sin a cosaj
with induced Lie algebra isomorphism
0 fi
-fi or
Thus we see that it: P^>R3 can be regarded as a PFB with group
U(\), and /(fi") is the (time-independent) electromagnetic field
strength of the %(l)-valued connection corresponding to со. Also,
/Y О ш\ — II у (I ^2 — || v||~3/vl Jv2 д jv3 _j_ v2 Jv3 д Jv 1 _j_ v3 Jv! д
у V *& ^ v — |[a|[ jx — пли \Х их i \ их т x их i \ их i л йл / \
dx2) is (according to 0.2.22) a "purely magnetic" field due (presuma-
(presumably) to a magnetic monopole at 0. Actually, [(l/27r)/(£2t0)] is the
Chern class of it: P^IR3 (with group U(\)) and the number
f 1 _
/s, 27r"
is the (magnetic) charge of the monopole. By 10.4.13, we cannot
change the magnetic charge by deforming Sr or by changing the
potential со. The monopole is a manifestation of the topology of тт:
P^> M and not the choice of со.
In this example, the charge was 2. In order to change the charge
to 1, we must change the PFB. Let S(IR3) = IR3 XSUB), and let Л:
S(IR3)^F(IR3) = IR3X5'<9C) be given by X(x,A) = (x,A(A)) where
Л: SUB)^SOC) is the universal covering homomorphism of 6.1.5
restricted to SUB)CSLB,C). Let S(P) = \~\P), and note that tts:
5(P)^R3 is a PFB with group A~\SOB))=U(l). Since A~'(S0B))
covers SOB) twice, we have that со' = ^Л*(со) is a connection on
S{P). Also, f(u"') = \f{uu), and so tts: S(P)^IR3 gives us a mono-
168 GAUGE THEORY AND VARIATIONAL PRINCIPLES
pole of charge 1. Note that the cyclic group ZpС U(\) of order n acts
on S(P), and gives rise to a PFB S(P)/Zn->R3 with group U(\)/Zn
= GA). Since the fibers of S(P) cover those of S(P)/1n n times, the
charge of the monopole given by S(P)/Zn^> IR3 is n. By reversing the
direction of the U(\) action on S(P)/Zn, we can achieve charge —n.
10.4.16 Instantons. Let к be an &b-invariant metric on § (such
always exists if G is compact). Observe that kEl2(G). For a PFB
it:P^M with group G and connection со, we have k(u")EA4(P,R) as
in 10.4.2. Writing п" = ^паеа for a basis <?,,..., ey of §, we have
к(п") = 2карпаЛ пр, where ка/} = к(еа,ер). If a: U^P is a local
section, then k(U")\U=lka/iUauA Upu where Uau = o*Sla. Suppose
that M is an oriented Riemannian 4-manifold with (positive definite)
metric g. Then we have **—\ on 2-forms. Using the notation Яои =
2кар$1аи, we have
(see 0.2.17).
Since **= 1 on 2-forms and *: A2(M)x -*A2(M)x is self-adjoint in
the sense that gx(*a,/3) = gx(a, */3), it follows that A2(M)X decomposes
into orthogonal + 1 and — 1 eigenspaces of *, say A2(M)X = Л2(М)ХЬ
ФЛ2(M)x . For ctEA2(M) we then have a decomposition a~a + +a~,
and a is called self-dual if ct~ =0 and anti-self-dual if a+ =0. Simi-
Similarly, we have a decomposition A2(P, §) = Л2(Р, §) + ®A2(P, §)~, and
we can write ПШ=ПШ++ПШ'. Now, g(Upu, *Upu)n = g(U+pu +
Q-pu,Qp+u-Qp-u)^=g(Q+Pu,Qp+u)^-g(Q-pu,Qp-u)li. Summing
over p, we see that к(пш) = к(п"+)~к(п" ), where the right-hand
side can be defined independently of a, using the fact that the decom-
decomposition of A2(P, §) is preserved under R* (A EG), and so on. Define
the Action density of со to be &(o>)ii=(gk)(tt",tt")n=(k(tt"+) +
к(п"~)). If M is compact, then 10.4.13 says that jMk(u") is a
constant, independent of со. Calling this constant c, and defining
a = jMk(U"+) and b=jMk(U"~), we have a~b=c and a + b=
jM&(ic)ix or fM&(o>)n=2a-c=2b+c. Thus, /w(£(u)ju. will be mini-
minimized for those со such that fi" is self-dual (i.e., b = 0) or fi" is
anti-self-dual (i.e., a = 0). Such connections are called instantons or
10 ADDITIONAL TOPICS 169
anti-instantons, respectively. For arbitrary со, we have
where с depends only on the PFB.
Unlike 10.4.15, we will not consider the physical significance of
instantons, since this is currently beyond the author's scope. Let it
suffice to say that the case of most interest is that in which M is
the ordinary sphere in R5 and G = SU(n). There are a number of
nice results in this area (see Atiyah, Hitchin, and Singer [1978];
Bouguignon, Lawsoh, and Simons [1979]; Schwarz [1979]; etc.).
References
Abers, E., and B. W. Lee. 1973. Gauge theories, Phys. Repts. 9C 1-141.
Abraham, R., and J. E. Marsden. 1978. Foundations of Mechanics, 2nd ed. Addison-
Wesley, Reading, Mass.
Arnol'd, V. I. 1978. Mathematical Methods of Classical Mechanics, Graduate Texts in
Mathematics, Vol. 60. Springer-Verlag, New York.
Atiyah, M., N. Hitchin, and I. Singer. 1978. Self-duality in four dimensional
Riemannian geometry, Proc. Roy. Soc. London A362, 425-461.
Belavin, A., A. Polyakov, A. Schwarz, and S. Tyupkin. 1976. Pseudoparticle solu-
solutions of the Yang-Mills equations, Phys. Lett. 59B, 85-87.
Bjorken, J., and S. Drell. 1964. Relativisitic Quantum Mechanics. McGraw-Hill, New
York.
Bleuler, K., and A. Reetz, Eds. 1975. Differential Geometric Methods in Mathematical
Physics, Lecture Notes in Mathematics, Vol. 570. Springer-Verlag, Berlin.
Booss, B. 1977. Topologie undAnalysis, Hochschultext/Universitext. Springer-Verlag,
New York.
Bourguignon J., H. B. Lawson, and J. Simons. 1979. Stability and gap phenomena
for Yang-Mills fields, Proc. Natl. Acad. Sci. U.S.A. 76:4, 1550-1553.
Chern, S. S. 1972. Geometry of characteristic classes, Proc. 13th Biennial Seminar,
Canadian Math. Congr. pp. 1-40.
Cho, Y. M. 1975. Higher-dimensional unifications of gravitation and gauge theories,
J. Math. Phys. 16, 2029-2035.
Faddeev, L. D., and A. A. Slavnov. 1980. Gauge Fields, Introduction to the Quantum
Theory. Benjamin/Cummings, Advanced Book Program, Reading, Mass.
Geroch, R. 1968. Spinor structure of space-times in general relativity, J. Math. Phys.
9, 1739-1744.
Herman, R. 1978. Yang-Mills, Kaluza-Klein and the Einstein Program, Interdisci-
Interdisciplinary Mathematics Series 19. Math. Sci. Press. Brookline, Mass.
171
172 REFERENCES
Higgs, P. 1966. Spontaneous symmetry breakdown without massless bosons, Phys.
Rev. 145, 1156-1163.
Jackiw, R., and C. Rebbi. 1976. Conformal properties of a Yang-Mills pseudopar-
ticle, Phys. Rev. D14, 517-523.
Kaluza, T. 1921. Sitzber. Preuss. Akad. Wiss. p. 966.
Klein, O. 1926. Quantentheorie und funfdimensionale Relativitatstheorie, Z. Phys.
37, p. 895.
Klein, O. 1939. On the theory of charged fields, in Le Magnetisme (proceedings of
the conference organized at the University of Strasbourg by the International
Institute of Intellectual Cooperation, Paris).
Kobayashi, S., and K. Nomizu, 1963. Foundations of Differential Geometry, Vol. 1.
Wiley, New York.
Kobayashi, S., and K. Nomizu. 1969. Foundations of Differential Geometry, Vol. 2.
Wiley, New York.
Mayer, M. E. 1981. Gauge Theories, Vector Bundles, and the Index Theorem.
Springer-Verlag, New York.
Noether, E. 1918. Invariante Variationsprobleme, Nachr. Ges. Gottingen, 235-257.
Palais, R. S. 1965. Seminar on the Atiyah-Singer Index Theorem. Princeton Univer-
University Press, New Jersey.
Sachs, R., and H. Wu. 1977. General Relativity for Mathematicians. Springer-Verlag,
New York.
Salam, A. 1968. In Elementary Particle Theory (N. Svartholm, Ed.). Almquist and
Wiksells, Stockholm.
Schiff, L. 1968. Quantum Mechanics, 3d ed. McGraw-Hill, New York.
Schwarz, A. S. 1979. Instantons and fermions in the field of an instanton, Commun.
Math. Phys. 64, 233-268.
Spivak, M. 1971. A Comprehensive Introduction to Differential Geometry, 5 vols.
Publish or Perish, Waltham, Mass.
Steenrod, N. 1951. The Topology of Fiber Bundles. Princeton University Press,
Princeton, N. J.
Steraberg, S. 1963. Lectures in Differential Geometry. Prentice-Hall, Englewood
Cliffs, N. J.
Tabensky, R. 1976. Unification of gauge and gravitation theories, J. Math. Phys. 17,
1-5.
Thirring, W. 1978. A Course in Mathematical Physics, 4 vols. Springer-Verlag, New
York,
't Hooft, G. 1974. Magnetic monopoles in unified gauge theories, Nucl. Phys. B79,
276-284.
Utiyama, R. 1956. Invariant theoretical interpretation of interaction, Phys. Rev. 101
1597-1607.
Weinberg, S. 1967. A theory of leptons, Phys. Rev. Lett. 19, 1264-1266.
Weinberg, S. 1973. General theory of broken local symmetries, Phys. Rev. D7,
1068-1082.
Weyl, H. 1918. Gravitation und Elektrizitat, Sitzber. Preuss. Akad. Wiss. 465-480.
Yang, C. N., and R. L. Mills. 1954. Conservation of isotopic spin and isotopic gauge
invariance, Phys. Rev. 96, 191-195.
Selected Bibliography
Arms, J. M. 1979. Linearization stability of gravitational and gauge fields, J. Math.
Phys. 20, 443-453.
Arnowitt, R., and P. Nath. 1975. Gauge Theories and Modern Field Theory (proceed-
(proceedings of a conference held at Northeastern University, Boston). MIT Press,
Cambridge, Mass.
Bernstein, J. 1974. Spontaneous symmetry-breaking, gauge theories, Higgs mecha-
mechanism and all that, Rev. Mod. Phys. 46, 7-48.
Bjorken, J., and S. Drell. 1964. Relativistic Quantum Fields. McGraw-Hill, New
York.
Bourguignon, J., and H. B. Lawson. 1981. Stability and isolation phenomena for
Yang-Mills fields,.Commun. Math. Phys. 79, 189-230.
Drechsler, W., and M. E. Mayer. 1977. Fiber Bundle Techniques in Gauge Theories,
Lecture Notes in Physics, Vol. 67. Springer-Verlag, Berlin.
Garcia, P. 1977. Gauge algebras, curvature, and symplectic structure,/. Differential
Geometry 12, 209-227.
Georgi, H., and S. L. Glashow 1974. Unity of all elementary-particle forces, Phys.
Rev. Lett. 32, 438-441.
Greub, W., S. Halperin, and R. Vanstone. 1972-1974. Connections, Curvature, and
Cohomology, Vols. 1-3. Academic Press, New York.
Hawking, S. 1977. Gravitational instantons, Phys. Lett. 60A, 81-83.
Husemoller, D. 1975. Fiber Bundles, 2nd ed. Springer-Verlag, New York.
Jarlskog, C. 1979. in New Phenomena in Lepton-Hadron Physics, edited by D. Fries
and J. Wess, Plenum Press, New York, 1-54.
Milnor, J., and J. Stasheff. 1974. Characteristic Classes. Princeton University Press,
Princeton, NJ.
Salam, A. 1980. Gauge unification of fundamental forces, Rev. Mod. Phys. 52, No.
3, 525-538.
173
174 SELECTED BIBLIOGRAPHY
Singer, I. M. 1978. Some remarks on the Gribov ambiguity, Commun. Math. Phys.
64,7-12.
Taylor, J. С 1976. Gauge Theories of Weak Interactions, Cambridge University
Press, New York,
't Hooft, G. 1976. Symmetry-breaking through Bell-Jackiw anomalies, Phys. Rev.
Lett. 37, 8-11.
Trautman, A. 1970. Fiber bundles associated with space-time, Rept. Math. Phys. 1,
No. 1, 29-62.
Trautman, A. 1980. in General Relativity and Gravitation, I, edited by A. Held,
Plenum Press, New York, 287-307.
Uhlenbeck, K. 1979. Removable singularities in Yang-Mills fields, Bull. Amer.
Math. Soc. (New Ser.) 1, 579-581.
Weinstein, A. 1978. A universal phase space for particles in Yang-Mills fields, Lett.
Math. Phys. 2, 417-420.
Yang, С N. 1975. in Proceedings of the 6th Hawaii Topical Conference in Particle
Physics. University Press of Hawaii, Honolulu, Hawaii, 489-561.
Index of Notation
In the following, each notation is accompanied by the section(s)
in which it was introduced. The order of the index is by section
number.
aA/3
g
*
\g\
(-\у
тхм
Yx[f]
[Y,Z]
/..*
9,
Ak(M)
die
/*co
5a
Ak{M,V)
Ad_
0.1.2
0.1.4
0.1.5
0.1.6,0.2.17
0.1.7
0.1.7
0.2.3
0.2.3
0.2.4
0.2.5
0.2.6
0.2.8
0.2.9
0.2.10
0.2.11
0.2.17
0.2.23
0.3.8
ab
PFB
LT
§
A*
D"
п"
C(P,V)
Ak(P,V)
рЛф
GA(P)
J{P,V)
UCCM
hp
*
P
*
(hh)
0.3.8
0.3.8
1.1.1
1.1.1
1.2.2
1.2.2
2.2.2,3.1.3
2.2.3
3.1.1
3.1.2
3.1.4
3.2.1
3.3.1
4.1.1
4.2.0
4.2.0
4.2.1
4.2.4
175
176 Index of Notation
(hh)
8"
V гЬ
dL/d(D"t)
V 2L
9L/9i^
X, X
weA](F(M) U")
фел'сдмхш")
9"
в
FQ(M)
<ns: S(M)^M
A: S(M)^ F0(M)
y: R4^§CD,C)
«~
4.2.6
4.2.8
4.3.1
4.3.1
4.3.3
4.3.4
5.1.4
6.1.3
6.2.2
6.2.2
6.2.4
6.2.5
6.3.1
6.3.2
6.3.2
6.3.3
6.3.6
Ф
yXa
Р\°Рг
J(P,V)
•jfxa
yXo
fP.q
_
/
в
f
~nh
Kijk
Rik,R
a = /3
X*f
Л: L(M)^L(
6.3.7
6.3.11
7.1.1
7.1.6
7.1.10
7.1.11
7.2.4
8.1.1
8.1.2
8.2.1
8.2.5
8.2.13
8.2.20
9.1.9
9.2.1
M) 9.2.2
Index
Acceleration, 142
Action, 55
density, 53
self-, 68
Adjoint actions, 20
Automorphisms of PFBs, 46
Base manifold, 26
Bianchi identity, 39
first, 115
second, 115
Broken Hessian, 157
Canonical 1-form, 78
Characteristic classes, 164
Charge density, 16
Chart, 7
Chern-Weil homomorphism, 162
Choice of gauge, 26
Clifford algebra, 82
Closed form, 15
Codifferential, 14
Compact subset, 11
Connection, 29-31
Conservation law
charge, 17, 67
energy-momentum, 127
external, 132
Continuity equation, 17, 67
Contraction of tensors, 109
Coordinate system, 7
Coordinate vector fields, 8
Covariant codifferential, 58
Covariant derivative, 37, 111, 142
Critical point, 18
Current, 65-66
Current for E-M, 16
Curvature
of connection, 37
as field strength, 37, 39
see also Riemann-Christoffel
curvature tensor
Curve, 8
de Rham cohomology space, 15
Diffeomorphism, 7
Differential of map, 8
Dirac equation
free, 87
nonfree, 98
Dirac matrices, 81
Divergence, 125, 127
Duality for 2-forms, 168
Einstein field equation, 127, 134
Einstein tensor, 127
Electron field, 83, 96
Electric field, 16, 145
Electromagnetic field, 33, 145
Exact form, 15
177
178
Index
Exponential map, 19, 48
Exterior derivative, 11
covariant, 37, 44
Fiber, 26
Fiber derivative, 156
Field equation
Einstein, 127, 134
homogeneous, 39
inhomogeneous, 68, 94, 98, 103-104
Field strength, 37, 39
Forms
equivariant, 44
real-valued, 10
vector-valued, 17
Frame bundle, 28, 78
Fundamental field, 30
Gauge
algebra, 48
choice of, 26
orbit, 151
potentials, 33
transformations, 46
unitary, 157
Gauss-Bonnet theorem, 126
Geodesic, 142
Global
section, 27
trivialization, 27
Goldstone bosons, 156
Graded Lie algebra, 36
Gradient of metric functional, 123-124
Hessian, 18
broken, 157
Higgs fields, 156
Horizontal lift
of curve, 141
of vector field, 37
Instantons, 168-169
Integration of forms, 11-12
Isospin current, 101-102
Jacobi identity, 8
Jets, 50, 93, 148
L-tensors, 107
components, 108-109
contraction, 109
covariant differentiation of, 111
raising and lowering indices, 112
tensor product, 109
Lagrange's equation, 61, 94
Lagrangian,
connection, 148
curvature, 152
electron, 83, 96
G-invariant, 51
nucleon, 99-100
particle field, 50
spin-zero, 62
Left-invariant vector field, 18
Levi-Civita connection, 77, 110
Lie algebra, 18
Lie derivative, 9, 131
Lie group, 18
Lie subgroup, 19
Local section, 27
Local trivialization, 26
Lorentz group, 73
Magnetic monopoles, 166-168
Magnetic field, 16, 145
Manifold, 7
Map, 7
Mass of Higgs field, 156
Massive vector bosons, 158-160
Maxwell's equations, 16-17, 63
Metric
on manifold, 14
on vector space, 3
Minkowski space, 16
Nucleon field, 99
Nucleon field equation, 103-105
One-parameter group, 9
Open subset, 6
Open covering, 11
Orientation, 3, 11, 81
Orthonormal frame bundle, 78
Parallel translation, 142
Particle fields, 43
Principal fiber bundles, 26
Index
179
Principle of least action for
metrics, 123
with connections, 134
particle fields, 56
with connections, 68
Product manifold, 17
Product bundle, 26-27
Projected support, 56
Pull-back, 11, 127
Representation, 43
Ricci identity, 119
Ricci tensor, 116
Riemann-Christoffel curvature tensor,
114
identities, 115
infinitesimal changes, 118-119
Scalar curvature, 116
Scalar mesons, 156
Self-Action, 68
Shifted field, 156
Source 1-form, 16
see also Current
Special unitary group, 21-22
Spin structure, 81
Spin-zero electrodynamics, 62, 70
Spinors, 76-77
Spliced bundles, 90
Spontaneous symmetry breaking, 154-
160
Standard horizontal fields, 110
Star operators, 4, 14, 56
Stationary
metric, 123
with connection, 134
particle fields, 56
with connection, 68
Stokes' theorem, 12
Structure constants, 21
Structural equations, 37, 80
Submanifold, 16
Submersion, 149
Support, 11
Tangent vector, 8
Tensor field, 10
Tensor product, 109
Torsion form, 78
Total space, 26
Transition function, 27
Trivial bundle, 27
Twisted metric, 82
Unbroken subgroup, 154
Unitary gauge, 157
Unitary set, 157
Utiyama's theorem, 153
Vacuum, 154
Vector field, 8
Vector-valued forms, 17
Vertical subspace, 29
Volume element, 3, 14
Wedge product, 3
Weil algebra, 161
Weil polynomial, 153, 161
Yang-Mills equation, 65, 134